Data Pipeline & AI Infrastructure Developer

Remote, USA Full-time
We're looking for an experienced machine learning and data engineer to build the systems that power our embodied AI research and production. In this role, you'll own the build-out of critical components of our data pipelines and compute infrastructure, ensuring our research team has reliable, high-performance platforms to train and deploy advanced robotics models. Data PipelinesYou'll build and maintain large-scale data ingestion systems that capture multimodal robotics data (video, point clouds, proprioception, and action trajectories), handling the end-to-end flow from ingestion through transformation, quality assurance, and delivery to training systems.You'll ensure data reliability, versioning, and reproducibility across terabytes of embodied data while building observability and dataset management tooling. Your work directly determines the quality and scale of data our AI systems learn from. AI Cluster InfrastructureYou'll architect and operate our training infrastructure—Kubernetes-based HPC clusters, GPU orchestration, distributed training, and model deployment—optimizing resource allocation, monitoring cluster health, and ensuring high availability.You'll build automation and tooling that makes research code production-ready, enables efficient multi-tenant experiments, and lets the team move fast. Your infrastructure enables breakthroughs in robotic intelligence. What you bringYou're fluent in Python and comfortable with systems languages (C, C++, Rust, or Go). You have deep experience building data pipelines or infrastructure at scale. You know Kubernetes, distributed systems, and HPC environments well. You've worked with large-scale data storage, workflow orchestration, and compute resource management.You understand Linux systems, networking, and real-time constraints. You bridge the gap between research and production. You debug across layers and value reliability, observability, and clean abstractions. You're excited to work in a fast-moving environment where your infrastructure directly enables cutting-edge AI research and real-world robotic deployments. Apply tot his job
Apply Now

Similar Jobs

Data Engineer (Cloud Data Architecture / Pipelines / API Integration / Python)

Remote, USA Full-time

Senior Data Pipeline Architect - Apache Beam & bolthires Cloud Platform

Remote, USA Full-time

Senior Software Engineer- Streaming Pipelines

Remote, USA Full-time

Software Engineer (.NET, C#, Python) - Real-Time Data Pipelines & AWS Cloud

Remote, USA Full-time

Data Integration Engineer, Provider Data

Remote, USA Full-time

AI Engineer (Synthetic Data Pipelines)

Remote, USA Full-time

Data Engineering Lead: Source ETL & Analytics

Remote, USA Full-time

Senior Data Engineer IS (DataOps) *Virtual*

Remote, USA Full-time

Immediate Hiring: Senior Real Time Pipeline Engineer (PH)

Remote, USA Full-time

Data Engineer (For OPT/CPT Candidates)- Immediate Hiring

Remote, USA Full-time

BLUE CROSS BLUE SHIELD INSURANCE CLERK

Remote, USA Full-time

Senior Software Engineer - Machine Learning Feature Store

Remote, USA Full-time

Design and Layout Guide to Create Trademark Samples for Clients of Trademark Attorney

Remote, USA Full-time

Research Analyst - Product Launchpad Program

Remote, USA Full-time

Executive Assistant (Remote, part-time or full-time)

Remote, USA Full-time

Senior Creative Content Strategist - Splendid

Remote, USA Full-time

Vendor Operations Coordinator ( Virtual Remote) in Tempe, AZ in US Foods

Remote, USA Full-time

IT & Security Controls Manager - Long-term Contract - Remote

Remote, USA Full-time

Regulatory Compliance Analyst

Remote, USA Full-time

Compliance Monitoring and Testing Officer

Remote, USA Full-time
Back to Home