The Senior SRE Engineer will design and maintain scalable systems, manage IaC and CI/CD pipelines, implement monitoring solutions, and ensure security in software delivery.
We are looking for an experienced SRE Engineer to join our engineering team. This role is critical in ensuring the reliability, scalability, and security of our cloud infrastructure. Experience with Google Cloud Platform (GCP) or Oracle Cloud Infrastructure (OCI) is highly advantageous.
Key Responsibilities:
- Design, build, and maintain reliable, automated, and scalable systems.
- Develop and manage Infrastructure-as-Code (IaC) and CI/CD pipelines to support efficient and secure software delivery.
- Implement robust monitoring, logging, and distributed tracing to ensure visibility and observability across environments.
- Ensure high availability, performance, and operational excellence across all environments.
- Collaborate with development, QA, and security teams to integrate security best practices throughout the delivery pipeline.
- Proactively identify issues, perform root cause analysis, and drive resolution to improve system performance and reliability.
Requirements:
- Proven experience in an SRE, DevOps, or DevSecOps role.
- Hands-on experience with cloud platforms, particularly GCP or OCI.
- Strong understanding of CI/CD tools, infrastructure automation (e.g., Terraform, Ansible), and container orchestration (e.g., Kubernetes).
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK, Stackdriver).
- Knowledge of security best practices and experience integrating security into DevOps pipelines.
- Excellent problem-solving skills, with a proactive and analytical mindset.
- Strong collaboration and communication skills; ability to work effectively across teams.
- Certifications in GCP, OCI, or relevant DevOps/Security disciplines (Preferred).
Top Skills
Ansible
ELK
Google Cloud Platform
Grafana
Kubernetes
Oracle Cloud Infrastructure
Prometheus
Stackdriver
Terraform