Develop and deploy production-grade Generative AI services using LLMs, RAG, embeddings and vector retrieval. Build APIs and microservices with FastAPI, integrate models and enterprise workflows, optimize model accuracy and latency, implement monitoring/logging/evaluation frameworks, and support cloud/Docker deployments.
Job Summary
We are seeking a Generative AI Engineer with strong Python expertise to build enterprise AI applications leveraging LLMs, RAG, embeddings, and intelligent automation. The role focuses on developing production-grade AI services, integrating foundation models, and delivering business solutions powered by Generative AI.
Key Responsibilities- Build AI-powered applications using OpenAI, Claude, Gemini, Llama, and Hugging Face models
- Develop RAG pipelines using LlamaIndex and LangChain
- Implement prompt engineering, embeddings, semantic search, and vector retrieval
- Build APIs and microservices using FastAPI and Python
- Integrate AI solutions with enterprise applications and workflows
- Evaluate and optimize model accuracy, latency, and reliability
- Develop monitoring, logging, and evaluation frameworks for AI applications
- Support deployment and production operations of AI workloads
- 3+ years of Python development experience
- Experience with OpenAI, Claude, Gemini, or Hugging Face
- Experience with LangChain or LlamaIndex
- Experience with Vector Databases and Embeddings
- Experience building APIs using FastAPI
- Understanding of RAG architectures
- Docker and Cloud deployment experience
- Strong prompt engineering skills
- LangGraph
- Agentic AI concepts
- Fine-tuning and PEFT techniques
- AWS, Azure, or GCP
- LangSmith / LangWatch
- MLflow or experiment tracking
Candidates should be able to explain:
- AI applications delivered
- RAG architecture used
- Embedding and retrieval strategy
- Prompt optimization techniques
- Production deployment approach
- Monitoring and evaluation mechanisms
- Business challenges solved using GenAI
Similar Jobs
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Manage a portfolio of mid-market customers in India: develop and execute account/territory plans, identify and qualify opportunities, expand usage and cross-sell, run demos, build strong client relationships, coordinate internal teams and channel partners, report progress, provide product feedback, and travel occasionally for client and team events.
Top Skills:
ConfluenceCRMJira Service ManagementJira Software
Cloud • Information Technology • Security • Software
As a Staff Software Engineer, you will lead the development of an AI-driven security platform, focusing on backend systems, stream processing, and integration of authentication standards while guiding the ISPM feature set from planning to operational excellence.
Top Skills:
Apache FlinkAWSDynamoDBEcsEksGoGoKafkaKinesisLambdaPostgresRedisSpark StreamingSqsTerrraform
Cloud • Information Technology • Security • Software
The Staff Product Manager will lead the strategy for JumpCloud's SSO and Identity Integrations, focusing on passwordless innovation, AI integration, and strategic partnerships. Responsibilities include overseeing automated provisioning, enhancing IT operations, and representing JumpCloud in global identity standards forums.
Top Skills:
FidoOidcSAMLScimWebauthn
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.


