[Remote] Senior Machine Learning Operations Engineer

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. 1st Central is a market-leading insurance company utilising smart data and technology. They are seeking an experienced Senior Machine Learning Operations Engineer to join their Data Function, where the role involves designing and implementing machine learning model engineering frameworks and collaborating with various teams to ensure efficient operation of Data Science models. Responsibilities • You’ll contribute to the design and implementation of Machine Learning Engineering standards and frameworks • You’ll support model development, with an emphasis on auditability, versioning, and data security • You’ll implement automated data science model testing and validation • You’ll assist in the optimisation of deployed ML model scoring code in production services • You’ll assist in the design and implementation of data pipelines and engineering infrastructure to embed scaled machine learning solutions • You’ll use CI/CD pipelines, manage the deployment and version management of large numbers of data science models (Azure DevOps) • You’ll support the implementation of Machine Learning Ops on cloud (Azure & Azure ML. Experience with Databricks is advantageous.) • You’ll protect against model degradation and operational performance issues through the development and continual automated monitoring of model execution and model quality • You’ll manage automatic model retraining within a production environment • You’ll engage in group discussions on system design and architecture, sharing knowledge with the wider engineering community • You’ll collaborate closely with data scientists, data engineers, architects, and the software development team • You’ll liaise with stakeholders across the business to ensure ML is being used to improve strategic business decisions and identify new areas for improvements • You’ll adhere to the Group Code of Conduct and Fitness and Propriety policies, Company Policies, Values, guidelines, and other relevant standards/ regulations at all times Skills • Comprehensive knowledge of Databricks, PySpark, Microsoft Azure (Azure ML, Azure Stream Analytics, Cognitive Services, Event Hubs, Synapse, Data Factory). • Fluency in Python and modelling frameworks such as PyTorch and TensorFlow. • Skilled in deploying and managing Machine Learning Models within a production environment. • Excellent problem-solving and analytical skills, with the ability to diagnose and troubleshoot problems quickly. • Strong time management and organisational abilities, experience working to tight deadlines. • Excellent communication skills, both verbal and written, ability to collaborate effectively with cross-functional teams. • Experience in developing and maintaining production ML systems, including automatic model retraining and monitoring of production models. • Deploying Infrastructure as Code (IAC) across various environments such as dev, uat and prod. • Handling large volumes of data in various stages of the data pipeline, from ingestion to processing. • Proven experience with feature stores, using them for both offline model development and online production usage. • Building integrations between cloud-based systems using APIs, specifically within the Azure environment. • Practical knowledge of agile methodologies applied in a data science and machine learning environment. • Designing, implementing, and maintaining data software development lifecycles, with a focus on continuous integration and deployment (CI/CD). • Demonstratable expertise in machine learning methodology, best practices, and frameworks. • Understanding of microservices architecture, RESTful API design, development, and integration. • Basic understanding of networking concepts within Azure. • Strong understanding of Microsoft Azure, (Azure ML, Azure Stream Analytics, Cognitive services, Event Hubs, Synapse, and Data Factory). • Fluency in common data science coding capabilities such as Python and modelling frameworks such as Pytorch, Tensorflow etc. • Skilled in application of MLOps frameworks within a production environment. • Excellent communication skills, both verbal and written. • Strong time management and organisation skills. • Ability to diagnose and troubleshoot problems quickly. • Excellent problem-solving and analytic skills. • Embrace, embed and incorporate the company values. • Self-motivated and enthusiastic. • An organised and proactive approach. • Strong stakeholder management. • Ability to work on own initiative and as part of a team. • A flexible approach and positive attitude. • Strives to drive business improvements to contribute to the success of the business. • Experience within financial/insurance services industry is advantageous. • Familiarity with Docker and Kubernetes is advantageous. • Experience with AzureML and Databricks is advantageous. Benefits • Extraordinary working environment • Energetic, inspirational, supportive workplaces Company Overview • 1st CENTRAL offer a 5 Star Defaqto rated car insurance product. It was founded in 2008, and is headquartered in Haywards Heath, West Sussex, GBR, with a workforce of 1001-5000 employees. Its website is Apply tot his job
Apply Now
← Back to Home