Flex Logo

Flex

Devops + Site Reliability Engineer (SRE) – IT

Posted 6 Days Ago
Be an Early Applicant
In-Office
Chennai, Tamil Nadu
Senior level
In-Office
Chennai, Tamil Nadu
Senior level
As a Senior DevOps Engineer and Site Reliability Engineer, you will lead a team to enhance platform reliability through automation and performance improvements, architectural design, incident management, and cross-team collaboration.
The summary above was generated by AI
Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world.A career at Flex offers the opportunity to make a difference and invest in your growth in a respectful, inclusive, and collaborative environment. If you are excited about a role but don't meet every bullet point, we encourage you to apply and join us to create the extraordinary.

Job Description

To support our extraordinary teams who build great products and contribute to our growth, we’re looking to add a Senior Devops Engineer + Site Reliability Engineer (SRE)– IT located in Chennai location.

As a Sr. Site Reliability Engineer (SRE) on the Factory Applications team, you will guide the reliability strategy for “Brix” – a cloud-native, containerized, microservices-based platform powering global shop floor systems. You’ll lead a team of SREs, drive automation and performance initiatives, and collaborate cross-functionally to ensure scalable, resilient, and secure operations.

Report to the Senior Manager, and the role involves:

What a typical day looks like:

  • Leadership & Strategy

    • Lead and mentor a team of SREs, fostering a culture of ownership, learning, and continuous improvement.

    • Define and drive the SRE roadmap aligned with business and technical goals.

    • Champion best practices in reliability engineering across development and operations teams.

  • Technical Execution

    • Architect and implement scalable infrastructure using Infrastructure as Code.

    • Oversee monitoring, alerting, and observability systems to ensure platform health and performance.

    • Lead incident response and postmortem processes, ensuring root cause analysis and long-term fixes.

    • Collaborate with developers to integrate automated testing and CI/CD pipelines.

    • Optimize system performance through metric analysis and proactive tuning.

  • Collaboration & Communication

    • Act as a liaison between engineering, operations, and business stakeholders.

    • Maintain clear documentation and knowledge sharing across teams.

    • Support global teams, including rotational night shift coverage as needed.

The experience we’re looking to add to our team:

  • Bachelor’s or master’s degree in computer science, Information Technology, or related field (or equivalent work experience).

  • 7 - 12+ years of experience in Information Technology or related field.

  • Proven experience in DevOps and SRE methodologies.

  • Expertise in Docker, Kubernetes, and cloud-native architecture.

  • Solid programming skills in C#, TypeScript, Python, or Go.

  • Proficiency in Unix/Linux environments and shell scripting.

  • Experience with monitoring tools (Prometheus, Grafana) and test automation frameworks.

  • Strong analytical and problem-solving abilities.

  • Demonstrated leadership and project management capabilities.

Good to have:

  • Advanced knowledge of CI/CD pipelines and Git workflows.

  • Familiarity with configuration formats (YAML, JSON).

  • Experience leading technical teams and driving cross-functional initiatives.

What you will get for the great work you provide:

  • Health Insurance

  • PTO

NK99

Job Category
IT

Flex pays for all costs associated with the application, interview or offer process, a candidate will not be asked for any payment related to these costs.

Flex does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. Flex is an Equal Opportunity Employer and employment selection decisions are based on merit, qualifications, and abilities. Flex does not discriminate in employment opportunities or practices based on: age, race, religion, color, sex, national origin, marital status, sexual orientation, gender identity, veteran status, disability, pregnancy status or any other status protected by law. Flex provides reasonable accommodation so that qualified applicants with a disability may participate in the selection process. Please advise us of any accommodations you request to express interest in a position by e-mailing: [email protected]. Please state your request for assistance in your message. Only reasonable accommodation requests related to applying for a specific position within Flex will be reviewed at the e-mail address. Flex will contact you if it is determined that your background is a match to the required skills required for this position. Thank you for considering a career with Flex.

Top Skills

C#
Cloud-Native Architecture
Docker
Go
Grafana
Kubernetes
Prometheus
Python
Shell Scripting
Typescript
Unix/Linux

Similar Jobs

10 Hours Ago
Hybrid
Chennai, Tamil Nadu, IND
Mid level
Mid level
Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
The Specialist, Service Demand Planning coordinates demand processes, ensures criteria compliance, manages backlog, and facilitates governance forums for operations and engineering.
Top Skills: Azure DevopsJIRAServicenowSharepoint
10 Hours Ago
Hybrid
Chennai, Tamil Nadu, IND
Senior level
Senior level
Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
The Senior Support Engineer ensures network performance through monitoring, troubleshooting, and customer support while managing network changes and escalations.
Top Skills: AsaBgpCisco AsrCompassDataminerEigrpEx SeriesHelixJuniper MxMicrosoft ApplicationsMplsOspfSatnmsSciencelogicSdwanService NowVrrpZenoss
Yesterday
Hybrid
Chennai, Tamil Nadu, IND
Junior
Junior
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Global Command Center Operator monitors logistics operations, responds to temperature deviations, and manages claims. They provide 24/7 support, ensuring timely communication and resolution of incidents while maintaining system functionality and analytics.
Top Skills: Ms Applications

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account