[Remote] Student Researcher [Seed Vision – AI Platform] – 2026 Start (PhD)
Note: This is a remote position open to candidates in the USA.

ByteDance is a technology company dedicated to pioneering advanced AI foundation models. The Student Researcher contributes to the Seed Vision AI Platform team by designing data processing pipelines and conducting research to improve model training and evaluation.

Responsibilities
- Design and optimize data processing pipelines for large-scale image, video, and multimodal datasets used in model pretraining and fine-tuning
- Conduct research on data deduplication, filtering, and quality evaluation to maximize training signal efficiency
- Collaborate with model teams to close the loop between data characteristics and downstream performance
- Explore data-centric machine learning methods, including synthetic data generation, dataset pruning, and active data selection
- Build high-throughput systems for dataset tracking, versioning, and feedback-based iteration

Skills
- Currently pursuing a PhD in Computer Vision, Machine Learning, Systems, or a related field
- Research experience in data-centric ML, vision data pipelines, or training dataset optimization
- Familiarity with deep learning frameworks (e.g., PyTorch, TensorFlow) and data processing stacks (e.g., Spark, Ray, DALI)
- Strong engineering skills in Python and/or distributed data systems
- Experience working with large-scale visual datasets (e.g., LAION, WebVid, ImageNet, Ego4D)
- Background in data evaluation, synthetic data curation, or auto-labeling systems
- Familiarity with vision foundation model pretraining workflows (e.g., CLIP, DINO, EVA, InternImage)
- Understanding of data–model alignment loops and evaluation-driven dataset iteration

Benefits
- Day-one access to health insurance
- Life insurance
- Wellbeing benefits
- 10 paid holidays per year
- Paid sick time (56 hours if hired in the first half of the year, 40 hours if hired in the second half)
- Housing allowance

Company Overview
ByteDance is a technology company that develops content creation platforms and services.
It was founded in 2012 and is headquartered in Beijing, China, with a workforce of more than 10,000 employees.

Company H1B Sponsorship
ByteDance has a track record of offering H1B sponsorships: 1,350 in 2025, 1,123 in 2024, 775 in 2023, 487 in 2022, 417 in 2021, and 245 in 2020. Please note that this does not guarantee sponsorship for this specific role.