Machine Learning Engineer Opportunity
We are seeking a seasoned Machine Learning Engineer to join our team. In this role, you will be responsible for defining and proposing an infrastructure management stack that drives business objectives.
Lateral Group is a profitable, award-winning design and technology company with over 20 years of experience launching bold ventures and transforming businesses. A globally distributed team of experts united by the pursuit of quality.
We work with speed, focus, and integrity, delivering high-quality work and continuous improvement.
Key Responsibilities:
 * Identify and mitigate AI infrastructure issues and improve model training speed on specific hardware.
 * Evaluate and implement new AI training and development platforms.
 * Automate model training and checkpointing using MLOps tools; maintain containerization tools (Docker, Singularity) for reproducibility.
 * Transfer and replicate models from R&D to production; manage model lifecycle, tracking, and ensure compatibility with evolving training packages (e.g., CUDA, PyTorch, drivers).
Requirements:
 * 5+ years of hands-on experience with ML Ops tools such as SLURM, MLflow, Kubeflow, SageMaker, or Vertex AI.
 * Experience managing Kubernetes clusters and distributed training workloads at scale.
 * Proficiency with containerization (Docker, Singularity) and reproducible ML environments.
 * Familiarity with popular deep learning frameworks (PyTorch, TensorFlow) and how they operate at infra level.
 * Strong understanding of model lifecycle best practices (training, validation, deployment, tracking).
 * Strong scripting and automation skills in Python, Bash, or similar.
 * Comfort working closely with ML researchers to translate needs into scalable, production-grade systems.
Bonus Points:
 * Experience with multi-node, hardware-optimized training setups (e.g., GPU clusters, TPUs).
 * Contributions to internal tools or open-source projects in the ML Infra space.
 * Prior experience helping bring ML systems through regulatory, safety, or quality review stages.
About the Opportunity:
 * Real Impact: meaningful products across healthcare, commerce, sustainability, and next-gen tech.
 * Remote-First, Office Friendly: work from anywhere; offices available for in-person collaboration if desired.
 * Async collaboration that respects time zones and outcomes over hours.
 * Outstanding Team: talented, kind professionals who care about craft and each other.
 * Growth: opportunities to grow your craft and take on greater responsibility at your pace.
 * Culture of Excellence: high-quality work delivered sustainably with no burnout or crunch.
 * Variety & Stability: profitable, independent, and over a decade of experience with new challenges in each project.