Machine Learning Researcher
MatX is focused on creating the best AI models with efficient performance. They are seeking a Machine Learning Researcher to train and optimize large language models, build distributed infrastructure, and provide hardware architecture advice from an ML perspective.

Responsibilities
Train and optimize LLMs for our hardware
Run quality evaluations
Build and set up distributed infrastructure for training and inference
Advise on the hardware architecture from an ML perspective

Skills
Excellent software engineering skills
Experience training and tweaking neural networks, ideally LLMs
Experience optimizing neural networks for hardware efficiency, for example regarding FLOPs, memory bandwidth, communication bandwidth, precision, parallel layout, and batch sizes

Benefits
A Stake in Our Success: A cash/equity mix that fits your needs, with the option to exercise early
Health & Wellness: Company-subsidized health, dental, vision, and life insurance; pre-tax Health Savings Accounts with a generous company contribution (even if you don’t contribute)
Time To Recharge: 4 weeks paid time off (accrued), 12 company holidays, and 3 weeks remote/flexible work per year
Support to Parents: Up to 12 weeks of paid parental leave, regardless of your path to parenthood
Learning & Development: $1,500 yearly towards your professional development, e.g. conferences, courses, and other learning opportunities
Team Connection: Team lunches, quarterly off-sites, and regular town halls
Financial Wellbeing: 401(k) and/or Roth IRA, with 5% company contribution, even if you don’t contribute
Flexible Spending Accounts: Pre-tax spending accounts for medical, dental/vision, dependent care, parking, and transit expenses
Commute On Us: For those commuting up to 1 hour, put your rideshare cost on our company card and reclaim the drive time to get work done
MatX E[x]tras: $50 per month to use on the perks you care about most
Remote Perks: We work remotely Monday & Friday, supported by home-tech setup and remote wifi expense reimbursement

Company Overview
MatX is an AI chip startup that designs chips that support large language models. It was founded in 2022, and is headquartered in Mountain View, California, USA, with a workforce of 11-50 employees. Its website is