The Model & Intelligence Engineer is responsible for designing, developing, and improving agents and Deep Research Agent through advanced retrieval architectures, model evaluation, and self-improving feedback mechanisms.
Join our Team
About the opportunity :
Raises the intelligence ceiling of the platform. Responsible for making agents measurably smarter over time through rigorous evaluation, advanced retrieval architecture, and a self-improving feedback loop.
What you will do :
▸ Own the end-to-end design, development, and continuous improvement of the Deep Research Agent
▸ Design and maintain the RAG pipeline: chunking strategy, embedding models, retrieval, and re-ranking
▸ Implement and optimise context compression to reduce overhead on long-horizon, multi-hop queries
▸ Build and operate the model evaluation harness: benchmark design, regression tracking, and A/B testing
▸ Lead the agent self-improvement loop: prompt proposal pipeline and benchmark-gated merge governance
▸ Track frontier model research and assess production applicability for the platform intelligence roadmap
▸ Advise on fine-tuning, prompt optimisation, and model selection strategy across model generations
The Skills you bring:
• Deep expertise in LLMs: transformer architecture, fine-tuning (LoRA/QLoRA), RLHF, and alignment techniques• RAG system design: vector databases (Pinecone, Weaviate, pgvector), embedding models, hybrid search strategies• ML experimentation tooling: MLflow, Weights & Biases, Vertex AI Experiments, or equivalent platforms• Python ML stack: PyTorch or JAX, HuggingFace Transformers, LangChain or equivalent orchestration libraries• Statistical evaluation methods: benchmark design, significance testing, and evaluation dataset curation• Context compression and KV cache optimisation techniques, quantisation basics (GPTQ, AWQ)
Why join Ericsson?At Ericsson, you'll have an outstanding opportunity. The chance to use your skills and imagination to push the boundaries of what's possible. To build solutions never seen before to some of the world's toughest problems. You'll be challenged, but you won't be alone. You'll be joining a team of diverse innovators, all driven to go beyond the status quo to craft what comes next.
What happens once you apply? Click Here to find all you need to know about what our typical hiring process looks like.Encouraging a diverse and inclusive organization is core to our values at Ericsson, that's why we champion it in everything we do. We truly believe that by collaborating with people with different experiences we drive innovation, which is essential for our future growth. We encourage people from all backgrounds to apply and realize their full potential as part of our Ericsson team. Ericsson is proud to be an Equal Opportunity Employer. learn more.
Primary country and city: India (IN) || Bangalore
Req ID: 784074
About the opportunity :
Raises the intelligence ceiling of the platform. Responsible for making agents measurably smarter over time through rigorous evaluation, advanced retrieval architecture, and a self-improving feedback loop.
What you will do :
▸ Own the end-to-end design, development, and continuous improvement of the Deep Research Agent
▸ Design and maintain the RAG pipeline: chunking strategy, embedding models, retrieval, and re-ranking
▸ Implement and optimise context compression to reduce overhead on long-horizon, multi-hop queries
▸ Build and operate the model evaluation harness: benchmark design, regression tracking, and A/B testing
▸ Lead the agent self-improvement loop: prompt proposal pipeline and benchmark-gated merge governance
▸ Track frontier model research and assess production applicability for the platform intelligence roadmap
▸ Advise on fine-tuning, prompt optimisation, and model selection strategy across model generations
The Skills you bring:
• Deep expertise in LLMs: transformer architecture, fine-tuning (LoRA/QLoRA), RLHF, and alignment techniques• RAG system design: vector databases (Pinecone, Weaviate, pgvector), embedding models, hybrid search strategies• ML experimentation tooling: MLflow, Weights & Biases, Vertex AI Experiments, or equivalent platforms• Python ML stack: PyTorch or JAX, HuggingFace Transformers, LangChain or equivalent orchestration libraries• Statistical evaluation methods: benchmark design, significance testing, and evaluation dataset curation• Context compression and KV cache optimisation techniques, quantisation basics (GPTQ, AWQ)
Why join Ericsson?At Ericsson, you'll have an outstanding opportunity. The chance to use your skills and imagination to push the boundaries of what's possible. To build solutions never seen before to some of the world's toughest problems. You'll be challenged, but you won't be alone. You'll be joining a team of diverse innovators, all driven to go beyond the status quo to craft what comes next.
What happens once you apply? Click Here to find all you need to know about what our typical hiring process looks like.Encouraging a diverse and inclusive organization is core to our values at Ericsson, that's why we champion it in everything we do. We truly believe that by collaborating with people with different experiences we drive innovation, which is essential for our future growth. We encourage people from all backgrounds to apply and realize their full potential as part of our Ericsson team. Ericsson is proud to be an Equal Opportunity Employer. learn more.
Primary country and city: India (IN) || Bangalore
Req ID: 784074
Top Skills
Alignment Techniques
Fine-Tuning
Huggingface Transformers
Jax
Langchain
Llms
Mlflow
Pgvector
Pinecone
Python
PyTorch
Rlhf
Transformer Architecture
Vector Databases
Vertex Ai Experiments
Weaviate
Weights & Biases
Ericsson Pune, Mahārāshtra, IND Office
Ericsson Pune Hub Office
Established IT zone near Viman Nagar with good airport access. Surrounded by cafes, housing and retail, offering a comfortable city lifestyle with a slightly slower pace than larger metros.
Similar Jobs at Ericsson
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
The Automation Delivery Manager will lead and guide the automation team to drive automation solutions across Managed Services, ensuring project delivery aligns with business goals while fostering innovation.
Top Skills:
AICi/CdEnableJavaJavaScriptLinuxMateMlRpaWindows
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
The Senior Software Architect - AI will design scalable AI platforms, manage architectural decisions for enterprise solutions, and guide teams on AI model lifecycles.
Top Skills:
ArizeAsyncioChromaDockerElasticsearchFastapiKubernetesMilvusPineconePythonPyTorchRabbitMQRedisTensorFlowTerraform
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
Lead the architecture and delivery of full-stack solutions, focusing on UI, microservices, and cloud-native platforms while mentoring engineers and overseeing technical governance.
Top Skills:
AngularCi/CdDockerGitGitlabHelmJavaJavaScriptJunitKafkaKubernetesReactSpring BootTypescript
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

