Innodata is a leading data engineering company with over 2,000 customers in 13 cities worldwide.
We combine advanced technologies such as machine learning and artificial intelligence to deliver AI solutions.
Our team of subject matter experts supports global operations with high-security infrastructure.
About the role:
We seek an analytical linguist to support AI training initiatives and linguistic content creation.
You will play a crucial role in shaping large language models through high-quality data curation, annotation, and evaluation.
Key Responsibilities:
* Create or edit linguistically rich content for NLP pipelines.
* Identify and resolve issues related to ambiguity, bias, and grammaticality.
* Perform quality assurance on model outputs for fluency, tone, factual accuracy, and language appropriateness.
* Annotate linguistic datasets with syntactic, semantic, or pragmatic labels.
* Support internal teams by conducting linguistic research and summarizing findings.
* Apply linguistic knowledge to evaluate model behavior, error patterns, and generalization issues.
Qualifications:
* Deep understanding of linguistic theory and language structure.
* Experience with computational linguistics, corpus analysis, language data annotation, or LLM training is a plus.
* Strong writing, editing, and communication skills are essential.