Sia Partners

Generative AI Engineer

Posted 8 Days Ago
In-Office
Mumbai, Maharashtra
Senior level
The Generative AI Engineer will implement solutions using LLMs and oversee the integration of AI into applications, ensuring compliance and performance.
The summary above was generated by AI
Company Description

Sia is a next-generation, global management consulting group. Founded in 1999, we were born digital. Today our strategy and management capabilities are augmented by data science, enhanced by creativity and driven by responsibility. We’re optimists for change and we help clients initiate, navigate and benefit from transformation. We believe optimism is a force multiplier, helping clients to mitigate downside and maximize opportunity. With expertise across a broad range of sectors and services, our consultants serve clients worldwide. Our expertise delivers results. Our optimism transforms outcomes. 

Heka.ai is the independent brand of Sia Partners dedicated to AI solutions. We host many AI-powered SaaS solutions that can be combined with consulting services or used independently, to provide our customers with solutions at scale.  

Job Description

We are seeking a skilled Generative AI Engineer to join our team where you will harness model capabilities to implement cutting-edge algorithms and solutions across myriad industries.   

You will serve as a pivotal link between Data Scientists, ML Engineers, and Platform Engineers to unleash the potential of Generative AI technology by implementing business-centric solutions. You will help customers choose the appropriate level of sophistication, from semantic search to RAG, agents, and ultimately fine-tuning, so that they reach their value delivery threshold in the most cost-effective way.

Beyond crafting prompts, you will be responsible for designing and building robust, scalable products: benchmarking candidate foundation models (FMs) through targeted requests, rapidly iterating on prototypes, and validating product ideas. Your expertise in orchestrating the entire AI workflow will ensure the seamless integration of advanced model capabilities into applications, optimizing performance, security, compliance, scalability, and efficiency. You will competently navigate between prompts, chains, and agents while mastering the underlying infrastructure challenges.

We invest in your success through comprehensive training, combining internal programs with resources from our technology partners.  

Join us if you are passionate about pushing the boundaries of AI technology and making a significant impact in enabling our customers to create GenAI-powered applications with confidence and a fast time to market. 

Key Responsibilities 

You are part of a cross-functional consulting team that drives the adoption of Generative AI across sectors, working step by step with customers to understand their business requirements, then designing and building bespoke GenAI solutions.

  • Build applications powered by LLMs (OpenAI, Claude, Mistral, etc.) using LangChain, LlamaIndex, and related GenAI frameworks. 
  • Implement RAG pipelines with vector DBs (Pinecone, FAISS, pgvector, ChromaDB) for grounding LLM responses with internal knowledge.
  • Develop multimodal AI solutions (text, audio, image) and build autonomous agents where relevant. 
  • Drive MLOps excellence: CI/CD (ML pipelines), drift detection, canary releases, retraining schedules. 
  • Design robust and reusable prompt templates using CoT, ReAct, Graph-of-Thought, and Agent flows. 
  • Continuously improve model reliability, relevance, and UX by tuning prompt flows.
  • Deploy GenAI models on AWS/GCP/Azure using services like SageMaker, Bedrock, and Vertex AI.
  • Ensure performance observability, security guardrails, and compliance (GDPR, Responsible AI).
  • Work with DevOps teams to integrate GenAI solutions into microservices and APIs (FastAPI/Flask).
  • Benchmark open-source and commercial LLMs for use-case fit and cost-performance tradeoffs.
  • Evaluate fine-tuning strategies (PEFT, LoRA, RLHF) where applicable for proprietary use cases.
  • Support solution architects and cross-functional teams in delivering PoCs and enterprise-grade rollouts.
  • Document frameworks, best practices, risks, and learnings for future scaling.

Qualifications

  • Education: Bachelor's or master's degree in computer science, AI, or a related field.
  • Experience: 5+ years of experience in NLP/ML/AI, with at least 3 years of hands-on experience in GenAI.

Skills

  • Strong coding skills in Python with frameworks like PyTorch, Hugging Face, LangChain, and LlamaIndex. 
  • Proven experience with cloud-based AI services (AWS/GCP/Azure) and APIs (OpenAI, Anthropic, Hugging Face). 
  • Experience with vector databases: Qdrant, pgvector, Pinecone, FAISS, Milvus, or Weaviate. 
  • Familiarity with prompt engineering, transformer architectures, and embedding techniques. 
  • Excellent communication skills, with the ability to convey complex technical concepts to both highly technical and non-technical stakeholders.
  • Sharp problem-solving skills. 
  • Ability to collaborate with diverse teams. 

Additional Information

What We Offer 

  • Opportunity to lead cutting-edge AI projects in a global consulting environment. 

  • Leadership development programs and training sessions at our global centers. 

  • A dynamic and collaborative team environment with diverse projects. 

Position based in Mumbai (hybrid)

Sia is an equal opportunity employer. All aspects of employment, including hiring, promotion, remuneration, or discipline, are based solely on performance, competence, conduct, or business needs. 

Top Skills

Anthropic
AWS
Azure
ChromaDB
FAISS
FastAPI
Flask
GCP
Hugging Face
LangChain
LlamaIndex
OpenAI
Pinecone
Python
PyTorch
