Data Pipeline & AI Infrastructure Developer
We're looking for an experienced machine learning and data engineer to build the systems that power our embodied AI research and production. In this role, you'll own the build-out of critical components of our data pipelines and compute infrastructure, ensuring our research team has reliable, high-performance platforms to train and deploy advanced robotics models. Data PipelinesYou'll build and maintain large-scale data ingestion systems that capture multimodal robotics data (video, point clouds, proprioception, and action trajectories), handling the end-to-end flow from ingestion through transformation, quality assurance, and delivery to training systems.You'll ensure data reliability, versioning, and reproducibility across terabytes of embodied data while building observability and dataset management tooling. Your work directly determines the quality and scale of data our AI systems learn from. AI Cluster InfrastructureYou'll architect and operate our training infrastructure—Kubernetes-based HPC clusters, GPU orchestration, distributed training, and model deployment—optimizing resource allocation, monitoring cluster health, and ensuring high availability.You'll build automation and tooling that makes research code production-ready, enables efficient multi-tenant experiments, and lets the team move fast. Your infrastructure enables breakthroughs in robotic intelligence. What you bringYou're fluent in Python and comfortable with systems languages (C, C++, Rust, or Go). You have deep experience building data pipelines or infrastructure at scale. You know Kubernetes, distributed systems, and HPC environments well. You've worked with large-scale data storage, workflow orchestration, and compute resource management.You understand Linux systems, networking, and real-time constraints. You bridge the gap between research and production. You debug across layers and value reliability, observability, and clean abstractions. You're excited to work in a fast-moving environment where your infrastructure directly enables cutting-edge AI research and real-world robotic deployments. Apply tot his job