Data Pipeline & AI Infrastructure Developer

Remote Full-time
We're looking for an experienced machine learning and data engineer to build the systems that power our embodied AI research and production. In this role, you'll own the build-out of critical components of our data pipelines and compute infrastructure, ensuring our research team has reliable, high-performance platforms to train and deploy advanced robotics models. Data Pipelines You'll build and maintain large-scale data ingestion systems that capture multimodal robotics data (video, point clouds, proprioception, and action trajectories), handling the end-to-end flow from ingestion through transformation, quality assurance, and delivery to training systems. You'll ensure data reliability, versioning, and reproducibility across terabytes of embodied data while building observability and dataset management tooling. Your work directly determines the quality and scale of data our AI systems learn from. AI Cluster Infrastructure You'll architect and operate our training infrastructure—Kubernetes-based HPC clusters, GPU orchestration, distributed training, and model deployment—optimizing resource allocation, monitoring cluster health, and ensuring high availability. You'll build automation and tooling that makes research code production-ready, enables efficient multi-tenant experiments, and lets the team move fast. Your infrastructure enables breakthroughs in robotic intelligence. What you bring You're fluent in Python and comfortable with systems languages (C, C++, Rust, or Go). You have deep experience building data pipelines or infrastructure at scale. You know Kubernetes, distributed systems, and HPC environments well. You've worked with large-scale data storage, workflow orchestration, and compute resource management. You understand Linux systems, networking, and real-time constraints. You bridge the gap between research and production. You debug across layers and value reliability, observability, and clean abstractions. You're excited to work in a fast-moving environment where your infrastructure directly enables cutting-edge AI research and real-world robotic deployments. Apply tot his job
Apply Now

Similar Opportunities

Software Engineer, Data Platform-Slack (Senior SWE/Staff SWE)

Remote

Data Platform Support Engineer

Remote

Analytics Platform Engineer Associate

Remote

Senior Software Engineer (Data Platform)

Remote

Senior/Staff Software Engineer, Data

Remote

Data Platform Engineer

Remote

Senior Privacy Analyst, FedRAMP

Remote

Data Loss Prevention (DLP) Analyst

Remote

Cyber Security Analyst @ Texas Remote in USA

Remote

IT Security Analyst 3 - IS - Data Security - FT - Day - Remote SoCal

Remote

Remote Work From Home Administrative Assistant Admin – Part Time Panelists Needed

Remote

**Experienced Content Creator – Disney Fandom Storytelling and Digital Content Development at blithequark**

Remote

Assistant Editor, New Beauty job at MJH Life Sciences in US National

Remote

Experienced Remote Data Entry Specialist – Work from Home Opportunity with Competitive Hourly Rate and Professional Growth at arenaflex

Remote

Experienced Remote Call Center Agent for Federal Assistance and Customer Support – Work from Home Opportunity with Competitive Salary and Growth Prospects

Remote

**Experienced Full Stack Customer Support Specialist – Remote Live Chat Support**

Remote

REMOTE WORK FROM HOME DATA ENTRY CLERK - PART TIME & FULL TIME - Now Hiring

Remote

**Experienced Customer Service Administrator – Education and Certification Programs Support**

Remote

Experienced Customer Service Associate and Cashier for Retail Excellence – Delivering Exceptional Shopping Experiences at blithequark

Remote

Experienced Gymnastics Instructor for Children's Development Programs in San Antonio, TX

Remote
← Back to Home