We are seeking an experienced software engineer to help build LLM evaluation and training datasets. The successful candidate will analyze and triage GitHub issues, set up and configure code repositories, and evaluate unit test coverage and quality.
The ideal candidate will have strong experience with at least one of the following languages: Python, JavaScript, Java, Go, Rust, C/C++, C#, or Ruby, along with proficiency in Git, Docker, and basic software pipeline setup. Experience working with well-maintained, widely used repositories with 500+ stars is required.
This is a hands-on software engineering role that includes automating development environments, triaging issues, and evaluating test coverage and quality. The successful candidate will also collaborate with researchers to identify repositories and issues that are challenging for LLMs.
Key Responsibilities:
* Analyze and triage GitHub issues across trending open-source libraries.
* Set up and configure code repositories, including Dockerization and environment setup.
* Evaluate unit test coverage and quality.
Requirements:
* Strong experience with at least one of the following languages: Python, JavaScript, Java, Go, Rust, C/C++, C#, or Ruby.
* Proficiency with Git, Docker, and basic software pipeline setup.
* Experience working with well-maintained, widely used repositories with 500+ stars.
Benefits:
* Opportunity to work on cutting-edge AI projects.
* Fully remote work environment.