Senior software engineer - large language model evaluator

Santa Cruz do Sul

beBee Careers

Modelista

Anunciada dia 14 junho

Descrição

About Turing's Project: Curating Software Engineering Challenges for Large Language Models We're building high-quality evaluation and training datasets to improve how LLMs interact with realistic software engineering tasks. Our goal is to curate verifiable software engineering challenges from public GitHub repository histories using a human-in-the-loop process. Project Overview * Collaborate directly with AI researchers shaping the future of AI-powered software development. * Work with high-impact open-source projects and evaluate how LLMs perform on real bugs, issues, and developer tasks. * Influence dataset design that will train and benchmark next-gen LLMs. A day in this role might look like:

Se candidatar

Criar um alerta

Salvar