About the Role
We are seeking a seasoned software engineer to contribute to a project focused on building and evaluating large language models (LLMs) in realistic software engineering scenarios. The ideal candidate will have experience working with high-quality public GitHub repositories and can help us expand our dataset coverage across different types of tasks, including programming languages, difficulty levels, and more.
The role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality. You should be comfortable navigating complex codebases and have experience running, modifying, and testing real-world projects locally.