MLOps/Data Scientist -- LLM - REMOTE WORK -- 66966

Remote Full-time

MLOps/Data Scientist - LLM - REMOTE WORK - 66966 Pay Range - $65 - $70/hr One of our clients is looking for a MLOps/Data Scientist - LLM to join their team remotely. TECH STACK Python, LangChain, LlamaIndex, MLflow, Svelte/SvelteKit/TypeScript, MongoDB, Qdrant, FastAPI, Kubernetes, Terraform, AWS (EKS, Lambda, S3, Bedrock, etc.), Azure Cognitive Services, REST, GraphQL, OpenAI and HuggingFace APIs, Anthropic Claude API, scikit-learn, pandas, numpy, prompt engineering frameworks, evaluation libraries, A/B testing tools, statistical analysis tools. RESPONSIBILITIES Design evaluation strategies and roadmaps aligned with development priorities Create rigorous experiments to test prompt variations, hyperparameters, and agentic tooling configurations Define success criteria and quality gates for AI features before development begins Interpret evaluation results and identify systematic patterns in failures and successes Make data-driven go/no-go decisions on feature readiness Drive prompt engineering improvements based on systematic testing and iteration Recommend specific changes to cognitive functions: prompt adjustments, parameter tuning, tool selection Provide statistical rigor to experiment design (sample sizes, significance testing, holdout sets) Transform metrics into actionable insights with clear next steps for developers Lead weekly evaluation standups and present findings to stakeholders Mentor team members on evaluation best practices and ML principles Document evaluation frameworks and build institutional knowledge QUALIFICATIONS Advanced degree in Computer Science, Machine Learning, Statistics, or related field (MS/PhD preferred) 3+ years of hands-on experience with large language models and prompt engineering Strong background in applied machine learning, particularly NLP or generative AI Deep understanding of evaluation methodologies: metrics selection, dataset design, statistical testing Experience with experiment design: A/B testing, hyperparameter optimization, systematic variation testing Proficiency with LLM APIs and prompt engineering frameworks Strong programming skills in Python and ML libraries Practical experience optimizing AI systems for production use cases Understanding of agentic AI architectures: tool use, function calling, multi-step reasoning Proven ability to translate technical findings into clear, actionable recommendations Strong decision-making skills: comfortable making go/no-go calls based on data Excellent communication and cross-functional collaboration abilities Self-directed and proactive problem solver Experience building evaluation infrastructure or MLOps tooling desirable Background in RLHF or constitutional AI desirable Published research or blog posts on LLM evaluation or prompt engineering desirable For immediate consideration: Neetu PRIMUS Global Services Direct Desk: Ext. 419 Email: Apply tot his job

Apply Now

MLOps/Data Scientist -- LLM - REMOTE WORK -- 66966

Similar Opportunities

Product Manager, Platforms and Mobile Apps

Remote Excel Data Entry Jobs - Work Online From Anywhere

Military OneSource Counselor

Military OneSource Call Center Support Navigator Customer Excellence (Remote) in USA in Leidos

Military OneSource - Event Coordinator, MOS at Magellan Health, Inc. Frisco, TX

Engineering Manager, ML Security and Research

DevOps + MLOps Engineer (GPU Workloads, AWS, Production Pipelines)

App Developer - FlutterFlow Specialist

Mobile Application Developer – iOS

Mobile Application Developer – Android

Experienced Full Stack Customer Service Chat Agent – Web & Cloud Application Development at blithequark

Experienced Part-Time Evening Remote Data Entry Specialist – Flexible Work Schedule and Competitive Hourly Rates

Regional Named Account Executive Pennsylvania

Product Operations Specialist | Identity & Fraud

Student Pastor (Lakeside Campus)

Experienced Customer Service Representative for Manufacturing or Aerospace Industry - Remote Opportunity at blithequark

Experienced Customer Service Liaison Per Diem - Weekend - Varied Hours at blithequark

Experienced Remote Customer Support Agent – Travel & Marketing Expert

Installation Planner (Hybrid, Seattle)

Customer Support Engineer – Endpoint/MTD (Device) & Cybersecurity (Dallas Based – Hybrid)

MLOps/Data Scientist -- LLM - REMOTE WORK -- 66966

Similar Opportunities

Product Manager, Platforms and Mobile Apps

Remote Excel Data Entry Jobs - Work Online From Anywhere

Military OneSource Counselor

Military OneSource Call Center Support Navigator Customer Excellence (Remote) in USA in Leidos

Military OneSource - Event Coordinator, MOS at Magellan Health, Inc. Frisco, TX

Engineering Manager, ML Security and Research

DevOps + MLOps Engineer (GPU Workloads, AWS, Production Pipelines)

App Developer - FlutterFlow Specialist

Mobile Application Developer – iOS

Mobile Application Developer – Android

**Experienced Full Stack Customer Service Chat Agent – Web & Cloud Application Development at blithequark**

**Experienced Part-Time Evening Remote Data Entry Specialist – Flexible Work Schedule and Competitive Hourly Rates**

Regional Named Account Executive Pennsylvania

Product Operations Specialist | Identity & Fraud

Student Pastor (Lakeside Campus)

Experienced Customer Service Representative for Manufacturing or Aerospace Industry - Remote Opportunity at blithequark

**Experienced Customer Service Liaison Per Diem - Weekend - Varied Hours at blithequark**

**Experienced Remote Customer Support Agent – Travel & Marketing Expert**

Installation Planner (Hybrid, Seattle)

Customer Support Engineer – Endpoint/MTD (Device) & Cybersecurity (Dallas Based – Hybrid)

Experienced Full Stack Customer Service Chat Agent – Web & Cloud Application Development at blithequark

Experienced Part-Time Evening Remote Data Entry Specialist – Flexible Work Schedule and Competitive Hourly Rates

Experienced Customer Service Liaison Per Diem - Weekend - Varied Hours at blithequark

Experienced Remote Customer Support Agent – Travel & Marketing Expert