This role is a crucial part of shaping the capabilities of large language models (LLMs) and NLP-based systems.
* Create linguistically-rich content including grammar guides, syntactic analyses, usage explanations, or examples for NLP pipelines.
* Identify and resolve issues related to ambiguity, bias, and grammaticality in linguistic data.
* Evaluate model behavior, error patterns, and generalization issues by applying linguistic knowledge.
Key Responsibilities:
* Linguistic Data Curation:
o Create and edit high-quality linguistic content for NLP pipelines.
* Model Evaluation:
o Annotate linguistic datasets with syntactic, semantic, or pragmatic labels.
o Evaluate model outputs for fluency, tone, factual accuracy, and language appropriateness.
* Supporting Teams:
o Conduct linguistic research and summarize findings to support internal teams.