Job Description
We are seeking a skilled software engineer to contribute to our LLM evaluation and training datasets project. This project aims to train LLMs to work on realistic software engineering problems.
The successful candidate will have experience working with high-quality public GitHub repositories and will apply that experience in contributing to this project. You should be familiar with well-maintained, widely used repos with 500+ stars.
This role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality.
About the Role
You will analyze and triage GitHub issues across trending open-source libraries, set up and configure code repositories, and evaluate unit test coverage and quality. Additionally, you will modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
You will collaborate with researchers to design and identify repositories and issues that are challenging for LLMs. There are also opportunities to lead a team of junior engineers on collaborative projects.
Required Skills
* Strong experience with at least one of the following languages: Python, JavaScript, Java, Go, Rust, C/C++, C#, or Ruby.
* Experience working with well-maintained, widely used repositories with 500+ stars.
* Proficiency with Git, Docker, and basic software pipeline setup.
* Ability to understand and navigate complex codebases.
* Comfort with running, modifying, and testing real-world projects locally.
What We Offer
* Work in a fully remote environment.
* Opportunity to work on cutting-edge AI projects with leading LLM companies.