Innodata is a leading data engineering company with over 2,000 customers worldwide. We are an AI technology solutions provider-of-choice for top tech companies and others across financial services, insurance, technology, law, and medicine.
By combining advanced machine learning and artificial intelligence technologies, a global workforce of subject matter experts, and a high-security infrastructure, we're helping usher in the promise of AI. Innodata offers a powerful combination of digital data solutions and easy-to-use platforms.
Our global workforce includes employees in the United States, Canada, the United Kingdom, the Philippines, India, Sri Lanka, Israel, and Germany.
About the Role:
We are seeking a highly analytical linguist to support AI training initiatives and linguistic content creation. This role is ideal for someone with a strong academic background in linguistics and a passion for language, technology, and clear communication. You will play a crucial role in shaping the capabilities of large language models through high-quality linguistic data curation, annotation, and evaluation.
Key Responsibilities:
* Create or edit linguistically-rich content including grammar guides, syntactic analyses, usage explanations, or examples for NLP pipelines.
* Identify and resolve issues related to ambiguity, bias, and grammaticality.
* Perform quality assurance on model outputs for fluency, tone, factual accuracy, and language appropriateness.
* Annotate linguistic datasets with syntactic, semantic, or pragmatic labels.
* Support internal teams by conducting linguistic research and summarizing findings.
* Apply linguistic knowledge to evaluate model behavior, error patterns, and generalization issues.
Qualifications:
* Deep understanding of linguistic theory and language structure.
* Experience with one or more of the following is a plus: computational linguistics, corpus analysis, language data annotation, LLM training.