Architect and lead development of scalable GenAI application layers: design microservices for LLM requests and streaming (SSE), manage context, integrate React/Next.js frontends with Python/FastAPI backends, oversee cloud deployments (AWS/Azure/GCP), API gateways, and CI/CD for AI components.
Role Overview: Architect and lead the development of the application layer for enterprise GenAI
solutions. Connect LLM backends to scalable frontends while managing API gateways and cloud
deployments.
Key Responsibilities
Application Architecture: Design scalable microservices that handle LLM requests, streaming
responses (Server-Sent Events), and context management.
Cloud & DevOps: Oversee the deployment of AI applications on AWS, Azure, or GCP.
Integrate CI/CD pipelines for AI software components.
Frontend & Backend Integration: Ensure seamless, low-latency integration between modern
frontends (React/Next.js) and Python/FastAPI backends running AI models.
Required Skills & Qualifications
- Tech Stack: Strong Python programming, familiarity with APIs (OpenAI, Anthropic), basic LangChain/LlamaIndex usage.
- Qualifications: Bachelor’s in CS or related field; 3–4 years software or data engineering experience with demonstrated exposure to LLM APIs.
EXL Pune, Mahārāshtra, IND Office
Pune, India
Similar Jobs
Cloud • Security • Software • Cybersecurity
The Solutions Architect II will design and implement complex web and security solutions, provide technical consulting, and optimize customer engagement with Akamai's services.
Top Skills:
AWSAzureDevOpsGCPJavaScriptPerlPythonWeb Development
Automotive • Hardware • Robotics • Software • Transportation • Manufacturing
Lead design and implementation of ML tools for scenario classification, anomaly detection, sensor data quality, behavior analytics, and automated reporting. Build scalable pipelines and automated data curation for camera, radar, and LiDAR; create SIL/HIL replay interfaces, ground-truthing and annotation tools, and offline benchmarking. Integrate ML modules into simulation, HIL/SIL, and CI/CD ecosystems, implement reproducible workflows (Docker, CI/CD, artifact versioning, automated tests), monitor model/data quality and mentor junior engineers while collaborating with verification and perception teams.
Top Skills:
Artifact VersioningAutomated TestsCameraCi/CdDockerHilLidarRadarSensor FusionSilSimulation Platforms
Automotive • Hardware • Robotics • Software • Transportation • Manufacturing
Design, implement, and optimize high-performance embedded inference systems for automotive ADAS. Develop and integrate ML runtimes on edge SoCs, ensure low-latency concurrency and I/O, integrate with ROS2/sensor frameworks, deploy to SIL/HIL and CI pipelines, and perform profiling and debugging of perception stacks (camera/radar/LiDAR) under real-time constraints.
Top Skills:
C++17C++20CameraCudaEmbedded Ml RuntimesGitlab CiHilJenkinsLidarNvidia OrinNvidia XavierNxp S32PythonQualcomm Sa8XxxRadarRenesas R-CarRos2Sensor FrameworksSilTi Tda4
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.


