NVIDIA

Senior DGX AI Cloud Performance Analysis Tools Engineer

Posted 2 Days Ago
In-Office
3 Locations
Senior level
The role involves developing AI performance tools for large-scale systems, conducting performance studies, and automating performance analysis for optimization.
The summary above was generated by AI

Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing the efficiency and resiliency of AI workloads, as well as developing scalable AI and data infrastructure tools and services. Our objective is to deliver a stable, scalable environment for AI researchers, providing them with the necessary resources and scale to foster innovation. We are seeking excellent Software Engineers to design and develop tools for AI application performance analysis. Your work will enable AI researchers to work efficiently with a wide variety of DGXC cloud AI systems as they seek out opportunities for performance optimization and continuously deliver high-quality AI products. Join our technically diverse team of AI infrastructure experts to unlock unprecedented AI performance in every domain.

What you'll be doing:

  • Develop AI performance tools for large-scale AI systems that provide real-time insight into application performance and system bottlenecks

  • Conduct in-depth hardware-software performance studies

  • Define performance and efficiency evaluation methodologies

  • Automate performance data analysis and visualization to convert profiling data into actionable optimizations

  • Support deep learning software engineers and GPU architects in their performance analysis efforts

  • Work with various teams at NVIDIA to incorporate and influence the latest technologies for GPU performance analysis

What we need to see:

  • 8+ years of experience in software infrastructure and tools

  • BS or higher degree in computer science or similar (or equivalent experience)

  • Strong programming skills in multiple languages, including C++ and Python

  • Solid foundation in operating systems and computer architecture

  • Outstanding ability to understand users, prioritize among many contending requests, and build consensus

  • Passion for “it just works” automation, eliminating repetitive tasks, and enabling team members

Ways to stand out from the crowd:

  • Experience working with large-scale AI clusters

  • Experience with CUDA and GPU computing systems

  • Hands-on experience with deep learning frameworks (TensorFlow, PyTorch, JAX/XLA, etc.)

  • Deep understanding of the software performance analysis and optimization process

NVIDIA leads the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for exceptional people like you to help us accelerate the next wave of artificial intelligence.

Top Skills

C++
CUDA
JAX/XLA
Python
PyTorch
TensorFlow

NVIDIA Pune, Mahārāshtra, IND Office

Survey No.144 145, Commerzone No.5, Off, Airport Rd, Yerawada, Pune, Maharashtra, India, 411006


