NVIDIA

Senior Software Engineer - Conversational AI

Posted 3 Days Ago
2 Locations
Senior level
As a Senior Software Engineer at NVIDIA, you will architect and optimize low-latency conversational AI systems, design dialog workflows, analyze system accuracy, collaborate on product improvements, and develop code and design documents, all while leveraging your extensive experience in speech technologies and LLM applications.
The summary above was generated by AI

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars and robotics to intelligent assistants. Come join the team and see how you can make a lasting impact on the world! We're looking to grow our company and build our teams with the smartest people in the world. Join us at the forefront of technological advancement.

NVIDIA is looking for a highly experienced Senior Software Engineer to build next-generation Multimodal Conversational AI systems, driven by world-class, high-performance Speech and LLM models and orchestrated by Multimodal AI Agents, creating seamless experiences for our Digital Human solutions. If you're creative and passionate about solving real-world Conversational AI problems, come join us. You can check https://build.nvidia.com/nvidia/digital-humans-for-customer-service for a glimpse of what you could be working on.

What you’ll be doing:

  • Architect, implement and optimize reliable, low-latency, full-duplex conversation pipelines and dialog systems that excel across various application areas and challenging environments.

  • Build and benchmark cascaded and unified speech-to-speech models and systems that reflect real human conversations.

  • Design, implement and test domain-specific agents and workflows, along with a framework that can support multi-turn, multi-modal, multi-user conversations with LLM-driven agents.

  • Analyze the end-to-end accuracy and limitations of RAG and conversational AI agents, and recommend next steps and improvements.

  • Characterize performance and quality metrics across platforms for various AI and system components

  • Collaborate with various teams on new product features and improvements to existing products. Customize and integrate the conversational AI framework with other NVIDIA products.

  • Participate in developing and reviewing code, design documents, use cases, and test plans, and help innovate, identify problems, recommend solutions and perform triage in a collaborative team environment.

What we need to see:

  • Bachelor's degree or Master’s degree (or equivalent experience) in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math

  • 10+ years of experience, with strong hands-on exposure to building solutions that span Speech, LLM, RAG and Agents.

  • Excellent programming skills in Python and/or C++, with the ability to debug complex asynchronous systems

  • Deep understanding of various Speech technologies like VAD, ASR, TTS, Translation, End-to-End Speech Models, etc. to build conversation systems.

  • Experience working with RAG and LLM-based applications as a key part of building Dialog and Q&A systems. Additional exposure to LLM function calling, Information Retrieval, Vector Databases, Embedding and Rerank models, autonomous agents, etc. is welcome.

  • Understanding of scalable deployment of multiple microservices involving Speech components and LLM-driven RAG and Agent applications in a production environment

  • Experience working with protocols and transports like HTTP REST, gRPC, WebSockets, WebRTC, etc.

  • Hands-on experience with building microservices and client-server applications.

  • Familiarity with Docker, Helm, Kubernetes, etc.

  • Experience working on the end-to-end software lifecycle, release packaging and CI/CD pipelines

  • General background in version control and code review tools like Git, Gerrit and GitLab.

Ways to stand out from the crowd:

  • Strong fundamentals in Programming, Optimizations and Software design

  • Experience working with open source frameworks like LangChain and LlamaIndex for building LLM-driven applications

  • Strong knowledge of ML/DL techniques, algorithms and tools with exposure to Speech and Language Models

  • Familiarity with GPU-based technologies like CUDA, cuDNN and TensorRT

  • Background in deploying machine learning models on data center, cloud, and embedded systems

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C++
Python

NVIDIA Pune, Maharashtra, IND Office

Survey No.144 145, Commerzone No.5, Off, Airport Rd, Yerawada, Pune, Maharashtra, India, 411006
