Opens kla.wd1.myworkdayjobs.com in a new tab

Nice to Have

  • We are seeking a highly skilled and motivated MLOps Site Reliability Engineer (SRE) to join our team.
  • In this role, you will be responsible for ensuring the reliability, scalability, and performance of our machine learning infrastructure.
  • You will work closely with data scientists, machine learning engineers, and software developers to build and maintain robust and efficient systems that support our machine learning workflows.
  • This position offers an exciting opportunity to work on cutting-edge technologies and make a significant impact on our organization's success.

What You'll Do

  • Design, implement, and maintain scalable and reliable machine learning infrastructure.
  • Collaborate with data scientists and machine learning engineers to deploy and manage machine learning models in production.
  • Develop and maintain CI/CD pipelines for machine learning workflows.
  • Monitor and optimize the performance of machine learning systems and infrastructure.
  • Implement and manage automated testing and validation processes for machine learning models.
  • Ensure the security and compliance of machine learning systems and data.
  • Troubleshoot and resolve issues related to machine learning infrastructure and workflows.
  • Document processes, procedures, and best practices for machine learning operations.
  • Stay up-to-date with the latest developments in MLOps and related technologies.
  • Required Qualifications: Bachelor's degree in Computer Science, Engineering, or a related field.
  • Proven experience as a Site Reliability Engineer (SRE) or in a similar role.
  • Strong knowledge of machine learning concepts and workflows.
  • Proficiency in programming languages such as Python, Java, or Go.
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Familiarity with containerization technologies like Docker and Kubernetes.
  • Experience with CI/CD tools such as Jenkins, GitLab CI, or CircleCI.
  • Strong problem-solving skills and the ability to troubleshoot complex issues.
  • Excellent communication and collaboration skills.

Requirements

  • Master's Level Degree or Bachelor's Level Degree and related work experience of 2 years We offer a competitive, family friendly total rewards package.
  • We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees.
  • KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment.
  • Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA .
  • Please ensure that you have searched KLA’s Careers website for legitimate job postings.
  • KLA follows a recruiting process that involves multiple interviews in person or on video conferencing with our hiring managers.
  • If you are concerned that a communication, an interview, an offer of employment, or that an employee is not legitimate, please send an email to talent.acquisition@kla.com to confirm the person you are communicating with is an employee.
  • We take your privacy very seriously and confidentially handle your information.

Tools & Skills

Languages

Sourced directly from KLA Corporation’s career page

Your application goes straight to KLA Corporation.

KLA Corporation logo

KLA Corporation

Chennai, India

Specialisation
Open roles at KLA Corporation
114 positions
Job ID
/job/Chennai-India/DevOps-Engineer_2528750

Get matched to roles like this

Upload your resume once. We’ll notify you when matching roles open up.

Join talent pool — free

Similar Other roles