The Staff Engineer will design and build backend services for AI/ML solutions, focusing on performance, reliability, and collaboration with ML teams. Responsibilities include implementing retrieval systems, optimizing ML workflows, and ensuring observability.
Company Description
👋🏼We're Nagarro
we are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale across all devices and digital mediums, and our people exist everywhere in the world (17700+ experts across 39 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!
Job DescriptionREQUIREMENTS:
- Total experience of 5.5 years+
- Strong expertise in Python and backend engineering with experience building scalable, distributed microservices.
- Hands-on experience designing and delivering end-to-end RAG (Retrieval-Augmented Generation) workflows in production systems.
- Solid understanding of ML solution design, including embeddings, retrieval, ranking, feature engineering, and evaluation strategies.
- Experience with vector databases (FAISS, Pinecone, Milvus, Weaviate) and implementing chunking, indexing, vector search, re-ranking, caching, and memory patterns.
- Knowledge of LLM/NLP engineering, including prompt engineering, model integration, orchestration tools (LangChain/LlamaIndex), and evaluation instrumentation.
- Experience productionizing ML systems with observability, online/offline parity, and performance optimization across latency, throughput, and cost.
- Strong backend integration skills using REST/gRPC APIs, Docker, Kubernetes, CI/CD, and cloud platforms (AWS/GCP/Azure).
- Ability to independently design, ship, and operate reliable, scalable, and cost-efficient ML-backed backend systems with strong ownership mindset.
RESPONSIBILITIES:
- Design and build core backend services powering AI/ML runtime including orchestration, session/state management, and tools/services integration.
- Implement end-to-end retrieval and memory systems covering ingestion, embeddings, indexing, vector search, ranking, caching, and lifecycle management.
- Productionize ML workflows with feature/metadata services, model integration contracts, and evaluation hooks.
- Drive performance, reliability, and cost optimization with strong SLO ownership and observability practices (logs, metrics, tracing, guardrails).
- Collaborate with applied ML teams on model routing, prompts/tools, evaluation datasets, and safe releases.
- Translate business requirements into scalable technical designs, define NFR benchmarks, and review architecture for extensibility and best practices.
- Lead troubleshooting, root-cause analysis, and POCs to validate technology and design decisions.
Bachelor’s or master’s degree in computer science, Information Technology, or a related field.
Top Skills
AWS
Azure
Ci/Cd
Docker
Faiss
GCP
Grpc
Kubernetes
Milvus
Pinecone
Python
Rest Apis
Weaviate
Similar Jobs
Big Data • Cloud • Software • Database
The Staff Engineer will design and implement a platform for the Application Modernisation, focusing on distributed systems and infrastructure, mentoring engineers, and ensuring security compliance.
Top Skills:
Api GatewaysContainerizationOrchestrationPersistent Storage SolutionsSoftware Development
Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services
Join the Data Services team as a Software Developer/Data Scientist to deliver custom reports, manage data corrections, and automate processes using Python and AWS.
Top Skills:
Aws AthenaAws AuroraExcelPythonSQL
Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services
As an Associate Quant Analyst, you'll automate data analysis processes, develop workflow optimization tools, and assist in quantitative analysis for credit ratings, collaborating with internal and external teams.
Top Skills:
AnacondaJupyterExcelMssqlNumpyPandasPythonSQLVBA
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.


