NVIDIA Logo

NVIDIA

Senior DevOps Engineer

Posted Yesterday
Be an Early Applicant
In-Office
Pune, Maharashtra, IND
Senior level
In-Office
Pune, Maharashtra, IND
Senior level
Lead design, deployment and scaling of on-prem Kubernetes container infrastructure and CI/CD platforms. Build automation, observability, and analytics tooling; maintain OSS CI/CD (GitLab/GitHub/Jenkins); develop integrations, monitor metrics, and support data-center compute environments for Windows and Linux.
The summary above was generated by AI

NVIDIA is looking for an outstanding engineering lead to join its Software Infrastructure and Operations team. The position will be part of a fast-paced crew that develops and maintains sophisticated Kubernetes based development, compute and test environments for a multitude of platforms including Windows and Linux using OSS CICD tools GitHub, GitLab, Jenkins. You will be working with a team of passionate and skilled engineers that are continuously working to provide better tools to build and manage this infrastructure. With your help we would forge the next generation of compute infrastructure multiplying the power of the CPU, GPU and DPU for the age of AI. We need a motivated, hardworking and focused individual who has a real passion for operational excellence, Infrastructure services, and automation.

What you’ll be doing:

  • Architect the scaling operation in our data centers. Deploy and Support end-to-end container management solutions with Kubernetes, Docker, containerd. Design solutions with service discovery, networking, monitoring, logging, scheduling in Kubernetes.

  • Manage end to end OSS CICD tools GitLab/GitHub/Jenkins in on-prim Kubernetes environment. Design and develop tools needed for automating CICD & Developers workflow.

  • Design and build sophisticated automations and AI powered applications.

  • Use your depth in algorithms and system software background!

  • Work in teams to deploy new data center infrastructure.

  • Plan and implement critical metrics tracking using various data analytics mining methods and dashboards.

  • Reuse AI techniques to extract useful signals about machines and jobs from the data generated!

  • Take part in prototyping, crafting and developing cloud infrastructure for Nvidia.

What we need to see:

  • Strong Kubernetes understanding and background especially on-premises setup and extensive experience with Kubernetes components & subsystems.

  • Experience of maintaining large scale on-prim infrastructure applications & OSS CICD tools using Kubernetes.

  • Proven programming background in python/Golang/java and/or relevant scripting languages

  • Excellent debugging and analytical skills and experience in Databases both SQL (MySQL ) and NoSQL (Elastic Search /MongoDB)

  • Proficient with configuration management tools like Ansible, Chef, Puppet and strong experience with Jenkins and/or other CI systems.

  • Hands-on experience with VMs, Dockers, Kubernetes Cluster.

  • Experience with analytics/visualization tools like Kibana, Grafana, Splunk etc. and experience with monitoring systems such as Zabbix and/or Nagios is nice to have

  • 5+ years of proven experience

  • Bachelors or Master's Degree or equivalent experience in CS, Software Engineering, or related field.

Ways to stand out from the crowd:

  • Previous experience with DevOps/SRE teams

  • Thrives in a multi-tasking environment with constantly evolving priorities and documents work well

  • Outstanding collaboration skills across organizational boundaries, experience with using and improving data centers and with computer algorithms and ability to choose the best possible algorithms to meet the scaling challenge

  • Ability to divide complex problems into simple sub problems and then reuse available solutions to implement most of those

  • Experience with designing simple systems that can work reliably without needing much support

NVIDIA Pune, Mahārāshtra, IND Office

Survey No.144 145, Commerzone No.5, Off, Airport Rd, Yerawada, Pune, Maharashtra, India, 411006

Similar Jobs

8 Days Ago
Hybrid
Senior level
Senior level
Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services
As a Senior DevOps Engineer, you'll support CI/CD deployment activities, improve release processes, troubleshoot issues, and guide junior members.
Top Skills: Aws LambdaBitbucketCi/CdDockerGitGradleGroovyHarnessJenkinsLight LlmLinuxMavenNexusPythonSonarqubeSonatypeWindows Shell
23 Days Ago
Hybrid
Senior level
Senior level
Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services
As a Senior DevOps Engineer, you'll optimize and monitor cloud infrastructure on Azure, automate processes, and enhance system reliability.
Top Skills: AngularAnsibleArm TemplatesAzureGitJenkinsMicroservicesMicrosoft .NetPowershell
3 Days Ago
In-Office or Remote
India
Senior level
Senior level
Marketing Tech • Cryptocurrency
Operate and scale low-latency, high-availability trading infrastructure; automate and manage IaC; monitor and respond to incidents; optimize system and network performance; implement secure CI/CD and IAM practices; collaborate with trading, engineering, and security teams; participate in rotational on-call support.
Top Skills: AnsibleAWSBashCi/CdGrafanaIamKubernetesLinuxPrometheusPythonSIEMTerraform

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account