The DevOps Engineer will manage GCP infrastructure, build AI deployment pipelines, implement security measures, and optimize costs while ensuring system observability.
Responsibilities:
- Infrastructure Ownership: Own Helpshift production services, ensure complete monitoring coverage, and troubleshoot and fix production issues.
- Infrastructure as Code (IaC): Design and maintain scalable GCP infrastructure using Terraform o
- AI Orchestration & LLMOps: Build deployment pipelines for AI agents, managing vector databases (e.g., Vertex AI Search, Pinecone, Weaviate, ElasticSearch) and model endpoints.
- Security (DevSecOps): Implement "Security-by-Design," including IAM least-privilege access, secret management (Secret Manager), and automated vulnerability scanning for AI workloads.
- CI/CD Excellence: Architect high-velocity pipelines for both traditional microservices and AI model prompts/configurations. Design, implement, and maintain secure CI/CD pipelines for automating deployment, configuration, and testing processes.
- Observability: Set up comprehensive monitoring for system health and LLM-specific metrics (latency, token usage, and cost).
- Cloud Governance: Optimise GCP costs and manage resource quotas, especially for GPU/TPU-intensive AI tasks.
- Cross-Cloud Deployment: Establish and optimise connectivity between apps deployed in different cloud environments (AWS <> GCP).
Requirements:
- 6+ years of relevant experience
- Expert-level Google Cloud Platform (GCP) administration skills: GKE, Cloud Run, Vertex AI, GCS, NEG, etc.
- Experience deploying vector databases (Pinecone, Weaviate, ElasticSearch, or Vertex AI Search) and managing API rate limits/throttling for LLM providers.
- Experience setting up Cloud Monitoring/Logging specifically for AI metrics: token consumption, inference latency, and model error rates.
- In-depth knowledge of running/managing UNIX-like operating systems (we use Ubuntu)
- Strong knowledge of networking protocols, security architectures, and identity and access management (IAM) principles.
- Experience with containerisation technologies (e.g., Docker, Kubernetes) and securing containerised environments.
- Proficiency in Python and Bash
- Experience in designing and building solutions that are highly scalable, fault tolerant and cost-effective
- Experience with IaC tools such as Ansible and Terraform.
- Ability to analyse architectural bottlenecks and debug issues quickly through to resolution
- An automation mindset and the ability to reason about and work with complex systems.
- Excellent communication and documentation skills
- A quick learner and a good mentor for junior team members


