[Remote] Distinguished Engineer – ML Infrastructure & Data Platforms
Note: This is a remote position open to candidates in the USA.

Work Vista is hiring for a mission-critical leadership role within the ML Infrastructure organization. The Distinguished Engineer will lead the technical vision and architecture for systems that support the full machine learning lifecycle, with a focus on building scalable and reliable ML infrastructure.

Responsibilities
• Own and evolve the architecture of enterprise-scale ML infrastructure, enabling scalable storage, curation, and access for 100+ engineers and researchers
• Design infrastructure supporting petabyte-scale ML workflows, including multimodal perception data, simulation outputs, synthetic data, and continuous training pipelines
• Architect high-throughput distributed training systems on large GPU clusters, improving utilization, throughput, and job efficiency
• Establish robust data governance, observability, lineage, and retention strategies to ensure compliance, reproducibility, and long-term usability
• Collaborate cross-functionally with ML engineers, data engineers, platform teams, and DevOps to align infrastructure tightly with user workflows
• Define and drive the technical roadmap and long-term strategy for ML infrastructure, incorporating industry best practices and open-source innovation
• Mentor and influence engineers across teams, promoting excellence in distributed systems, ML platforms, and large-scale data management

Skills
• 15+ years of meaningful software engineering experience, including architecture-level ownership of ML or data infrastructure
• Proven experience designing and operating ML platforms supporting large-scale training and inference workloads
• Deep expertise in distributed storage systems, high-volume data pipelines, and ML-oriented data compression strategies
• Strong proficiency with Linux systems, Python, and C++ or other performance-oriented languages
• Experience operating in hybrid environments, including bare metal, HPC, and public cloud platforms (AWS, GCP, or Azure)
• Demonstrated ability to lead cross-organization initiatives and influence system-level design across platform and ML teams
• Prior experience in robotics, autonomous systems, or safety-critical domains is strongly preferred
• Experience building or leading infrastructure at a top-tier ML, AI, or autonomous systems organization
• Contributions to open-source ML or data infrastructure projects

Benefits
• Medical
• Dental
• Vision
• Retirement plans
• Paid time off
• Additional competitive offerings

Company Overview
• JobLift Media is a performance-driven recruitment-marketing studio helping companies amplify their talent reach through creative media, precision targeting, and employer-brand storytelling. It has a workforce of 11-50 employees.