DeepIntent Logo

DeepIntent

Site Reliability Engineer

Reposted 14 Days Ago
Be an Early Applicant
In-Office
Pune, Maharashtra
Mid level
In-Office
Pune, Maharashtra
Mid level
The Senior Site Reliability Engineer ensures system stability and efficiency through managing production systems, automating deployments, and monitoring infrastructure performance.
The summary above was generated by AI

DeepIntent is leading the healthcare advertising industry with data-driven solutions built for the future. From day one, our mission has been to improve patient outcomes through the artful use of advertising, data science, and real-world clinical data. For more information visit, www.DeepIntent.com or find us on LinkedIn. 

We are seeking a skilled and experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a minimum of 3 years of hands-on experience in managing and maintaining production systems, with a focus on reliability, scalability, and performance. As an SRE at [Company Name], you will play a crucial role in ensuring the stability and efficiency of our infrastructure, as well as contributing to the development of automation and monitoring tools.Responsibilities:
  • Deploy, configure, and maintain Kubernetes clusters for our microservices architecture.
  • Utilize Git and Helm for version control and deployment management.
  • Implement and manage monitoring solutions using Prometheus and Grafana.
  • Work on continuous integration and continuous deployment (CI/CD) pipelines.
  • Containerize applications using Docker and manage orchestration.
  • Manage and optimize AWS services, including but not limited to EC2, S3, RDS, and AWS CDN.
  • Maintain and optimize MySQL databases, Airflow, and Redis instances.
  • Write automation scripts in Bash or Python for system administration tasks.
  • Perform Linux administration tasks and troubleshoot system issues.
  • Utilize Ansible and Terraform for configuration management and infrastructure as code.
  • Demonstrate knowledge of networking and load-balancing principles.
  • Collaborate with development teams to ensure applications meet reliability and performance standards.
Additional Skills (Good to Know):
  • Familiarity with ClickHouse and Druid for data storage and analytics.
  • Experience with Jenkins for continuous integration.
  • Basic understanding of Google Cloud Platform (GCP) and data center operations.
Qualifications:
  • Minimum 3 years of experience in a Site Reliability Engineer role or similar.
  • Proven experience with Kubernetes, Git, Helm, Prometheus, Grafana, CI/CD, Docker, and microservices architecture.
  • Strong knowledge of AWS services, MySQL, Airflow, Redis, AWS CDN.
  • Proficient in scripting languages such as Bash or Python.
  • Hands-on experience with Linux administration.
  • Familiarity with Ansible and Terraform for infrastructure management.
  • Understanding of networking principles and load balancing.
Education:
Bachelor's degree in Computer Science, Information Technology, or a related field.

We believe great work starts with great support. That’s why DeepIntent offers a competitive, holistic benefits package designed to empower you both professionally and personally. Here’s what you can expect when you join our team:

Competitive base salary plus performance based bonus or commission, comprehensive medical, dental, and vision coverage, 401K match program, generous PTO policy and paid holidays, remote friendly culture with flexible work options, career development and advanced education support, WFH and internet stipends, plus many more perks and benefits! 

DeepIntent is committed to bringing together individuals from different backgrounds and perspectives. We strive to create an inclusive environment where everyone can thrive, feel a sense of belonging, and do great work together.

DeepIntent is an Equal Opportunity Employer, providing equal employment and advancement opportunities to all individuals. We recruit, hire and promote into all job levels the most qualified applicants without regard to race, color, creed, national origin, religion, sex (including pregnancy, childbirth and related medical conditions), parental status, age, disability, genetic information, citizenship status, veteran status, gender identity or expression, transgender status, sexual orientation, marital, family or partnership status, political affiliation or activities, military service, immigration status, or any other status protected under applicable federal, state and local laws. If you have a disability or special need that requires accommodation, please let us know in advance.

DeepIntent’s commitment to providing equal employment opportunities extends to all aspects of employment, including job assignment, compensation, discipline and access to benefits and training.

Top Skills

Airflow
Ansible
AWS
Bash
Ci/Cd
Docker
Git
Google Cloud Platform
Grafana
Helm
Kubernetes
MySQL
Prometheus
Python
Redis
Terraform

DeepIntent Pune, Mahārāshtra, IND Office

52/A Shivaji Housing Society, SB Road, Shivaji Nagar, , Pune, Maharashtra, India, 411016

Similar Jobs

10 Hours Ago
Hybrid
Pune, Maharashtra, IND
Senior level
Senior level
Fintech • Information Technology • Logistics • Payments • Analytics • Business Intelligence • Generative AI
The Lead SRE will manage Coupa's cloud applications' reliability, scalability, and performance, enhancing automation and incident response while collaborating across teams.
Top Skills: AksAnsibleAWSAzureBashChefDatadogEksItilJenkinsJIRAKubernetesLinuxMs Sql ServerNew RelicOctopusPowershellPythonRundeckSplunkTerraformWindows
5 Days Ago
Hybrid
Mumbai, Maharashtra, IND
Junior
Junior
Financial Services
The Site Reliability Engineer II role focuses on ensuring system reliability, solving business problems through coding, and participating in incident resolution while improving monitoring solutions.
Top Skills: DatadogDynatraceGitlabGrafanaJenkinsLinuxPrometheusSplunkTerraformWindows
6 Days Ago
Hybrid
Navi Mumbai, Thane, Maharashtra, IND
Entry level
Entry level
Enterprise Web • Fintech • Financial Services
As a Site Reliability Engineer, you'll enhance system availability, automate operations, monitor performance, and drive problem resolution across data platforms.
Top Skills: AWSCloudwatchDynamoDBGlueJenkinsLake FormationLinuxNew RelicPythonS3SnsSparkSplunkSQLSqsTerraformVictorops

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account