Infrastructure Team Manager

Opens nvidia.wd5.myworkdayjobs.com in a new tab

Overview

  • We are seeking an experienced IT/Lab Manager to lead the planning, deployment, and operations of our physical lab environment and IT systems.
  • This role will focus on building and maintaining scalable, reliable, and secure environments to support engineering teams involved in research, quality assurance, validation, and related activities.
  • It will also support internal collaborators.
  • You will have an outstanding opportunity to drive innovation in a multidimensional, technology-focused company that is crafting the future of data-center and lab technologies.
  • If you bring perfection and creative thinking while solving issues as they arise, and enjoy working with distributed teams – your place is with us! What You’ll Be Doing: Own day-to-day operations, planning, and roadmap for the engineering lab and IT infrastructure (servers, storage, networking, and related services).
  • Lead and mentor an IT/Lab team, driving guidelines, standards, and a culture of ownership, partnership, and continuous improvement.
  • Collaborate closely with R&D, QE, Verification, and other engineering teams to design, provision, and maintain environments that meet their performance, reliability, and security needs.
  • Lead all aspects of running data center and lab operations, including rack layout, cabling, power and cooling, hardware lifecycle, and resource availability.
  • Lead procurement and vendor management for hardware, software, and services, including evaluation, negotiation, and ongoing relationship management.
  • Implement and maintain automation for system provisioning, configuration, and operations using tools such as shell/Perl/Ansible.
  • Design and maintain monitoring, logging, and alerting for servers, network, and storage systems to ensure high availability and rapid incident response.
  • Investigate and resolve sophisticated infrastructure issues across OS, networking, storage, virtualization, and application layers.
  • What we need to see: B.Sc.
  • or BA in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • At least 10 years of overall experience in IT / systems administration, including extensive hands-on work with Linux/Unix environments.
  • At least 3 years of experience in a managerial or team-lead position within IT, lab, or infrastructure teams.
  • Vast experience with Linux/Unix system administration, including installation, configuration, troubleshooting, and performance tuning.
  • Demonstrable experience collaborating with engineering organizations (R&D, QE, Verification, etc.) and supporting their infrastructure needs.
  • Solid experience with data center and lab management, including server, network, and storage equipment deployment and lifecycle.
  • Demonstrated experience in procurement and vendor management for infrastructure hardware and software.
  • Proficiency in automation and scripting (e.g., shell, Perl, Ansible) for provisioning, configuration, and operational tasks.
  • Hands-on experience with monitoring and alerting solutions for infrastructure and services.
  • Strong debugging skills and experience resolving complex, cross-domain technical issues.
  • Ways To Stand Out From The Crowd: Experience with Kubernetes (K8s) in on-prem or hybrid environments.
  • Hands-on work with Slurm, HPC clusters, and large-scale compute environments.
  • Background in HPC, large-scale Linux clusters, or performance-sensitive engineering environments.

Sourced directly from NVIDIA’s career page

Your application goes straight to NVIDIA.

NVIDIA logo

NVIDIA

Israel, Raanana

Specialisation
Open roles at NVIDIA
2000 positions
Job ID
/job/Israel-Raanana/Lab-Manager_JR2010787

Get matched to roles like this

Upload your resume once. We’ll notify you when matching roles open up.

Join talent pool — free

Similar Other roles