Innodata is a leading data engineering company with over 2,000 customers and operations in 13 cities around the world. We combine advanced machine learning and artificial intelligence technologies with a global workforce of subject matter experts to shape the capabilities of large language models (LLMs) and NLP-based systems.
As a linguistics expert, you will play a crucial part in this journey. You will create or edit linguistically rich content for NLP pipelines; identify and resolve issues related to ambiguity, bias, and grammaticality; perform quality assurance on model outputs for fluency, tone, factual accuracy, and language appropriateness; annotate linguistic datasets with syntactic, semantic, or pragmatic labels; and support internal teams by conducting linguistic research and summarizing findings.
To be successful in this position, you should have a deep understanding of linguistic theory and language structure. Experience with computational linguistics, corpus analysis, language data annotation, and LLM training is a plus. Strong writing, editing, and communication skills are also essential.