[Remote] Kubernetes GPU Engineer
Note: The job is a remote job and is open to candidates in USA. SemiAnalysis is an independent research and analysis firm specializing in the Semiconductor and AI industries. They are seeking a highly motivated & skilled Member of Technical Staff to join their growing engineering team, focusing on developing GPU Cloud benchmarks and writing technical research reports. Responsibilities • Lead the development of the next generation of our industry leading ClusterMAX™ GPU Cloud benchmark • Develop & operate dozens of GPU clusters including GB200 NVL72, TPUv7, H200, Mi355, etc • Develop GPU Cloud benchmarks such as Storage IO & bandwidth benchmarks, PyTorch MLPerf benchmarks, etc • Author detailed technical research reports analyzing benchmark results, GPU Cloud performance, scalability, & efficiency • Establish and maintain strategic partnerships & collaborations with over 50 leading neocloud providers & AI chip manufacturers, including AMD, NVIDIA, and other industry stakeholders Skills • Experience working at a hyperscaler or a GPU cloud • 1-2 years using GPU or TPU clusters and/or running a multi-tenant GPU cluster • Solid understanding of SLURM, & Kubernetes • Practical experience in InfiniBand, NCCL, Fabric Manager, PyTorch, etc • Strong research skills and the ability to synthesize information from various sources to draw insights Benefits • Generous PTO • Office stipends • Competitive healthcare (medical, dental, vision) • Support for conferences and ongoing learning Company Overview • SemiAnalysis offers AI and semiconductor research, consulting, and hosts tech events like Nvidia Blackwell GPU Hackathon. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 11-50 employees. Its website is Company H1B Sponsorship • SemiAnalysis has a track record of offering H1B sponsorships, with 1 in 2025. Please note that this does not guarantee sponsorship for this specific role. Apply tot his job