[Remote] AI Research Lead (Multimodal & Video Foundation Model)
Note: The job is a remote job and is open to candidates in USA. Tether.io is pioneering a global financial revolution by building cutting-edge blockchain solutions. The AI Research Lead will drive the technical directions and build multimodal foundation models for image, video, and 3D generation, while collaborating with world-class engineers and researchers to advance open source development and the global AI community. Responsibilities • Lead the research, design, and development of state-of-the-art image, video, and 3D generation models, including multimodal foundation models. • Lead high-impact, specialized projects focused on innovative text, images, audio and video applications. • Define and drive the technical roadmap for multimodal AI initiatives, aligning research goals with business and product objectives. • Provide technical leadership and mentorship to teams of AI researchers and engineers, fostering innovation and skill development. • Oversee the end-to-end lifecycle of multimodal model development, from dataset curation and model training to deployment and performance evaluation. • Lead large-scale multi-node GPU model training, ensuring scalability, efficiency, and reproducibility of experiments. • Collaborate closely with cross-functional teams, including product, design, and engineering, to integrate AI solutions into production systems. • Drive applied research initiatives in image/video/3D generation, editing, animation, and other related domains. • Monitor advancements in AI research and multimodal technologies, and incorporate novel techniques to improve model capabilities and performance. • Contribute to the AI research community, including publications, open-source contributions, and participation in conferences. • Establish best practices and standards for coding, model evaluation, and experimentation within the team. • Lead and manage complex projects, ensuring timely delivery, quality outcomes, and alignment with strategic objectives. • Communicate technical insights and updates effectively to executive leadership, stakeholders, and external collaborators. • Promote a culture of collaboration, innovation, and excellence, maintaining high team morale and accountability. Skills • PhD, MS or equivalent experience • Hands on experience in building Image/Video/3D generation and multimodal foundation models building from scratch • 5+ years of experience in managing or leading 10+ research & engineer teams • Excellent communication and interpersonal skills • Excellent understanding of an AI-based product lifecycle. • Hands-on experience in building end-to-end multimodal foundation models on thousands of multi-node GPUs. • Proficiency in modern deep learning and diffusion frameworks & libraries. • Demonstrated expertise in computer vision, video generation foundation model and/or multimodal research especially building them from scratch. • Strong history of delivering innovation in the space of multimodal & video. • Ability to develop a long-term vision and execute strategies at scale while maintaining a grasp of technical details for better decision-making. • Experience with VP-level presentations and reporting. • Publications at leading AI conferences such as CVPR, ICCV, ECCV, ICML, ICLR, NeurIPS etc. Company Overview • Tether has evolved to meet global needs with agility and vision. It was founded in 2014, and is headquartered in Seattle, Washington, USA, with a workforce of 11-50 employees. Its website is Apply tot his job Apply tot his job