Language Data Specialist
We are seeking a highly analytical and detail-oriented linguist to support AI training initiatives and linguistic content creation. The ideal candidate will have a strong academic background in linguistics and a passion for language, technology, and clear communication.
This role involves shaping the capabilities of large language models and NLP-based systems through high-quality linguistic data curation, annotation, and evaluation.
Key responsibilities include creating or editing linguistically-rich content, identifying and resolving issues related to ambiguity, bias, and grammaticality, and performing quality assurance on model outputs for fluency, tone, factual accuracy, and language appropriateness.
The successful candidate will also apply linguistic knowledge to evaluate model behavior, error patterns, and generalization issues, as well as conduct linguistic research and summarize findings to support internal teams.
A deep understanding of linguistic theory and language structure is essential, along with strong writing, editing, and communication skills.
English proficiency at the AC1 or C2 level is required, and experience with computational linguistics, corpus analysis, language data annotation, or LLM training is a plus.