The role involves monitoring critical services, developing solutions for reliability, resolving technical issues, enhancing tools for incident management, and leading incident resolution efforts.
SRE Support and Automation Engineers
India (Bangalore)
Job Summary
The Site Reliability Engineering (SRE) team bridges the gap between software development and operations. Our mission is to build systems, tools, and platforms that keep our services fast, available, and reliable at global scale. The SRE team works closely with product engineering teams to design, build, and operate resilient applications that power the commerce experiences of millions.
Astreya is looking for Software Engineers with a passion for reliability, scalability, and performance who bring both a developer's mindset and a systems-thinking approach.
Key Responsibilities:
- Proactive Monitoring: Continuously monitor the health of eBay's critical services to identify and address potential issues before they escalate.
- Solution Development: Collaborate with Architecture, Engineering, and Operations teams to develop solutions that ensure high site availability, reliability, and performance.
- Collaborative Problem Solving: Work closely with partner teams to resolve recurring technical issues, onboard new alerts, and develop high-quality Standard Operating Procedures (SOPs).
- Enhance Monitoring Tools: Build and improve tools for monitoring and mitigating site incidents, and conduct reliability audits and tests to strengthen eBay’s reliability and incident management capabilities.
- Incident Management: Act as Incident Commander to drive resolution of major incidents, manage alarms, and ensure effective communication with leadership and partner teams.
Qualifications/Skills:
- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
- 4+ years of professional experience in software engineering, ideally in backend or platform teams.
- Proficiency in one or more programming languages (e.g., Java, Go, Python).
- Experience writing scripts to automate repetitive manual tasks.
- Strong incident management and leadership skills, with excellent technical triage and troubleshooting abilities, especially during crises.
- Familiarity with cloud platforms, container orchestration (e.g., Kubernetes), and infrastructure-as-code tools.
- Experience with observability stacks (e.g., Prometheus, Grafana, ELK, OpenTelemetry).
- Strong interpersonal and communication skills to thrive in fast-paced, dynamic environments.
Top Skills
ELK
Go
Grafana
Java
Kubernetes
OpenTelemetry
Prometheus
Python