# Language infrastructure and training in fieldwork and theoretical linguistics

> **NSF 01002526DB NSF RESEARCH & RELATED ACTIVIT** · Georgetown University (DC) · $249,993

## Abstract

This project examines language data to advance language infrastructure, knowledge, and theory. The research provides empirical data for the development and testing of theories of language, which are important for understanding human cognition, and trains students in these methods. The data are also important for demonstrating how different languages package information, knowledge that can improve the quality and usability of technologies like artificial intelligence. The project provides language and educational materials and fosters engagement with language speakers.

The investigators focus on developing language infrastructure through training students in language documentation, data collection, management, language maintenance, and linguistic description and theory. The linguistic analysis focuses on language structures that are typologically unusual, such as a vowel harmony process that interacts with nasality, grammatical tone, and serial verb constructions. The team uses these data for producing grammatical sketches and a dictionary for the language, and project data are made available in a public language archive. This corpus of language documentation materials serves as a permanent record that can be used by linguists, lexicographers, and scholars in other fields that study and apply linguistic data.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

## Key facts

- **NSF award ID:** 2450715
- **Awardee organization:** Georgetown University (DC)
- **SAM.gov UEI:** TF2CMKY1HMX9
- **PI:** Michael Obiri-Yeboah
- **Primary program:** 01002526DB NSF RESEARCH & RELATED ACTIVIT
- **All programs:** Artificial Intelligence (AI), LINGUISTICS, DLI-Dyn Language Infrastructure, Translational Research, GRADUATE INVOLVEMENT, SCIENCE, MATH, ENG & TECH EDUCATION
- **Estimated total:** $249,993
- **Funds obligated:** $249,993
- **Transaction type:** Standard Grant
- **Period:** 09/01/2025 → 08/31/2028

## Primary source

NSF Award Search: https://www.nsf.gov/awardsearch/showAward?AWD_ID=2450715

## Citation

> US National Science Foundation, Award 2450715, Language infrastructure and training in fieldwork and theoretical linguistics. Retrieved via AI Analytics 2026-06-08 from https://api.ai-analytics.org/grant/nsf/2450715. Licensed CC0.

---

*[NSF Awards dataset](/datasets/nsf-awards) · CC0 1.0*
