Senior Deep Learning Algorithm Engineer

Remote Full-time
Job Description:
• Optimize deep learning models for low-latency, high-throughput inference.
• Convert and deploy models using frameworks such as TensorRT and TensorRT-LLM.
• Understand, analyze, profile, and optimize performance of deep learning workloads on state-of-the-art hardware and software platforms.
• Collaborate with internal and external researchers to ensure seamless integration of models from training to deployment.

Requirements:
• Master’s or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).
• 4+ years of professional experience in deep learning or applied machine learning.
• Strong foundation in deep learning algorithms, including hands-on experience with LLMs and VLMs.
• Deep understanding of transformer architectures, attention mechanisms, and inference bottlenecks.
• Proficient in building and deploying models using PyTorch or TensorFlow in production-grade environments.
• Solid programming skills in Python and C++.
• Proven experience deploying LLMs or VLMs at scale in real-world applications.
• Hands-on experience with model optimization and serving frameworks such as TensorRT, TensorRT-LLM, vLLM, and SGLang.

Benefits:
• Eligible for equity and benefits.