[Remote] Senior MLOps Engineer, vLLM Inference

Red Hat

Note: The job is a remote job and is open to candidates in USA. Red Hat is the world’s leading provider of enterprise open source software solutions, and they are seeking an experienced ML Ops engineer to work closely with their product and research teams to scale state-of-the-art deep learning products. The role involves building and releasing AI inference runtimes, managing training and deployment pipelines, and continuously improving processes and tooling used by the DevOps team.

Responsibilities

  • Collaborate with research and product development teams to scale machine learning products for internal and external applications
  • Create and manage model training and deployment pipelines
  • Actively contribute to managing and releasing upstream and midstream product builds
  • Test to ensure correctness, responsiveness, and efficiency
  • Troubleshoot, debug and upgrade Dev & Test pipelines
  • Identifying and deploying cybersecurity measures by continuously performing vulnerability assessment and risk management
  • Collaborate with a cross-functional team about market requirements and best practices
  • Keep abreast of the latest technologies and standards in the field

Skills

  • 2+ years of experience in MLOps, DevOps, Automation and modern Software Deployment practices
  • Experience evaluating LLMs for performance on accelerators and accuracy (think HellaSwag, MMLU, Chatbot Arena, TruthfulQA, etc.)
  • Being super comfortable with Python and PyTest is a must
  • Strong experience with Git, Github Actions including self-hosted runners, Terraform, Jenkins, Ansible, and common technologies for automation and monitoring
  • Highly experienced with administering Kubernetes/Openshift
  • Familiar with Agile development methodology
  • Experience with Cloud Computing using at least one of the following Cloud infrastructures: AWS, GCP, Azure, or IBM Cloud
  • Solid programming skills especially in Python
  • Solid troubleshooting skills
  • Ability to interact comfortably with the other members of a large, geographically dispersed team
  • Experience maintaining an infrastructure and ensuring stability
  • While a Bachelor’s degree or higher in computer science, mathematics, or a related discipline is valued, we prioritize technical prowess, initiative, problem solving, and practical experience
  • Familiarity with contributing to the vLLM CI community is a big plus

Benefits

  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!

Company Overview

  • Red Hat is a software company that offers enterprise open-source software solutions. It is a sub-organization of IBM. It was founded in 1993, and is headquartered in Raleigh, North Carolina, USA, with a workforce of 10001+ employees. Its website is http://www.redhat.com.

Company H1B Sponsorship

  • Red Hat has a track record of offering H1B sponsorships, with 128 in 2025, 149 in 2024, 156 in 2023, 181 in 2022, 154 in 2021, 106 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Job Alerts

Get notified when new positions matching your interests become available at {organizationName}.

Need Help?

Questions about our hiring process or want to learn more about working with us?