About This Opportunity
We are seeking a Remote Senior Software Engineer to join our team. In this role, you will work with AI researchers to evaluate and improve how Large Language Models handle real-world software development tasks.
Project Overview
We are building high-quality evaluation and training datasets to improve how Large Language Models (LLMs) interact with realistic software engineering tasks. A key focus of this project is curating verifiable software engineering challenges from public GitHub repository histories using a human-in-the-loop process.
Key Responsibilities
* Collaborate directly with AI researchers to evaluate LLM-generated code responses for correctness, code quality, style, and efficiency.
* Evaluate code diffs for maintainability and consistency.
* Provide clear, detailed rationales explaining the reasoning behind each evaluation or ranking decision.
* Maintain high consistency and objectivity across evaluations.
* Collaborate with the team to identify edge cases and ambiguities in model behavior.
Requirements
* 7+ years of professional software engineering experience, ideally at top-tier product companies.
* Strong fundamentals in software design, coding best practices, and debugging.
* Excellent ability to assess code quality, correctness, and maintainability.
* Proficiency with code review processes and with reading diffs in real-world repositories.
* Exceptional written communication skills to articulate evaluation rationale clearly.
Engagement Details
* Commitment: ~20 hours/week (partial PST overlap required)
* Type: Contractor (no medical benefits or paid leave)
* Duration: 1 month (potential extensions based on performance and fit)