AI/ML Engineer - Web Data Quality - Remote

Rio de Janeiro (RJ)
Zyte
Web
Posted on 11 December
Description

About Zyte

At Zyte, we eat data for breakfast, and you can eat your breakfast anywhere and work for Zyte. Founded in 2010, we are a globally distributed team of over 250 Zytans working from over 28 countries, on a mission to enable our customers to extract the data they need to continue to innovate and grow their businesses. We believe that all businesses deserve a smooth pathway to data.
For more than a decade, Zyte has led the way in building powerful, easy-to-use tools to collect, format, and deliver web data quickly, dependably, and at scale. The data we extract helps thousands of organizations make smarter business decisions, secure competitive advantage, and drive sustainable growth. Today, over 3,000 companies and 1 million developers rely on our tools and services to get the data they need from the web.
Data QA is an important function within Zyte. The Data QA team works to ensure that the quality and usability of the data scraped by our web scrapers meet and exceed the expectations of our enterprise clients.
Are you passionate about data quality and integrity? Do you enjoy using Python and AI to analyze and manipulate data, detect data quality issues, and visualize your findings? Are you highly customer-focused, with excellent attention to detail?
Owing to growing business and the need for ever more sophisticated Data QA, we are looking for a talented Data Scientist to join our team. As a Zyte Engineer, you will work on AI-based data wrangling, data manipulation, and data visualisation techniques and apply them to verifying and validating the quality of data extracted from the web.
Responsibilities

Design and implement AI-driven quality checks: build models to detect anomalies, identify schema drift, and classify data errors in real time
Automate and scale QA: replace manual and rule-based validation with ML-powered solutions that continuously improve
Leverage GenAI for validation: use embedding models, LLMs, and prompt-driven pipelines to perform semantic checks on scraped data
Develop monitoring & alerting pipelines: quantify data quality via KPIs, dashboards, and automated reports for stakeholders
Experiment & innovate: research and prototype new AI techniques for QA, e.g. using embeddings, synthetic data, and reinforcement learning to stress-test scrapers
Collaborate cross-functionally: work with developers, product managers, and account teams to integrate AI-based QA into production workflows
Communicate insights: present findings with clear visualizations, metrics, and evidence-based recommendations to technical and non-technical audiences
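
To give a concrete sense of the kind of automated check described in the list above, here is a minimal sketch using only the Python standard library. It is an illustration, not Zyte's actual tooling: the record fields in EXPECTED_SCHEMA, the function names, and the 3.5 threshold are all assumptions chosen for the example. It flags schema drift on a single scraped record and finds numeric outliers with a modified z-score based on the median absolute deviation.

```python
from statistics import median

# Hypothetical expected schema for scraped product records (an assumption for this example).
EXPECTED_SCHEMA = {"name": str, "price": float, "url": str}


def schema_drift(record, expected=EXPECTED_SCHEMA):
    """Return a list of human-readable schema problems found in one record."""
    problems = []
    for field, expected_type in expected.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], expected_type):
            problems.append(f"wrong type for {field}: {type(record[field]).__name__}")
    # Fields present in the record but absent from the schema, in a stable order.
    for field in sorted(record.keys() - expected.keys()):
        problems.append(f"unexpected field: {field}")
    return problems


def price_outliers(prices, threshold=3.5):
    """Return indices of values whose modified z-score exceeds the threshold."""
    med = median(prices)
    mad = median([abs(p - med) for p in prices])  # median absolute deviation
    if mad == 0:
        return []  # no spread to measure against, so nothing is flagged
    return [i for i, p in enumerate(prices) if 0.6745 * abs(p - med) / mad > threshold]


if __name__ == "__main__":
    print(schema_drift({"name": "Widget", "price": "10", "url": "https://example.com"}))
    print(price_outliers([10.0, 11.0, 9.5, 10.5, 500.0]))
```

A rule-based check like this is the sort of baseline the role aims to extend or replace with ML-powered validation; in practice the same checks would run over much larger datasets with tools such as pandas or Spark.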

Requirements

Proficiency in Python & PyData stack (NumPy, pandas, scikit-learn, PyTorch/TensorFlow preferred)
3+ years in a data science, applied ML, or data engineering role (ideally with exposure to QA or data validation at scale)
Hands-on experience with GenAI tools: LLM APIs (OpenAI, Anthropic, Google), prompt engineering, cost/token optimization
Strong ML fundamentals: anomaly detection, classification, clustering, embeddings, evaluation metrics
Experience with big data frameworks (Spark, BigQuery, or similar)
Ability to work with very large datasets (millions+ of records)
Version control skills (GitHub/Bitbucket)
Excellent communication in English, both technical and non-technical

Desired Skills

Prior experience in data quality automation or web data QA
Familiarity with LangChain, MCP, Marvin, or similar orchestration frameworks
Experience building QA dashboards or visualization layers
Background in statistics or applied mathematics
Previous remote/distributed work experience

Benefits

As a new Zytan, you will become part of a self-motivated, progressive, multi-cultural team.
Have the freedom and flexibility to work from where you do your best work.
Attend conferences and meet with team members from across the globe.
Work with cutting-edge open source technologies and tools.

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Other

Industries

IT Services and IT Consulting

