Akamai Technologies Logo

Akamai Technologies

Senior II Site Reliability Engineer Lead

Posted Yesterday
Be an Early Applicant
In-Office or Remote
Hiring Remotely in India
Senior level
In-Office or Remote
Hiring Remotely in India
Senior level
Lead SRE efforts to ensure availability, reliability, and scalability of Compute services. Provide technical leadership and mentorship, define operational requirements, build automation, troubleshoot complex issues, manage identity & access platforms, participate in on-call rotations, and improve observability and incident response tooling.
The summary above was generated by AI

Do you like collaborating across teams to solve complex problems?

Do you have a passion for cutting edge technologies and tackling system problems?

Join our highly-skilled Site Reliability team!

Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We create solutions that manage our Compute platform, focusing on cloud interfaces - Compute Portals and APIs. We do this while maintaining Akamai's mission to make life better for billions of people, billions of times a day.

Partner with the best

In this role, you'll ensure the operation and uptime of our Compute services and infrastructure. You'll supervise and maintain our critical infrastructure. You'll collaborate with cross-functional teams to create tooling and software that monitors and improves the reliability of our systems. You'll work with various technologies as we release brand new applications and modernize our existing tooling.

As a Senior II Site Reliability Engineer Lead, you will be responsible for:

  • Providing technical leadership, mentorship, and support to SRE and project teams, fostering collaboration and motivation
  • Defining requirements during the product lifecycle to influence design, standards, and operational readiness.
  • Partnering with engineering, operations, and support teams to ensure availability, reliability, scalability, and usability of platforms.
  • Developing and enhancing automation tools to streamline daily operations, reduce manual effort (toil), and improve performance.
  • Troubleshooting and resolve complex system issues through proactive investigation, automation, and systems programming
  • Managing and improving Compute identity & access management platforms to accelerate issue detection and remediation.
  • Participating in on-call rotations, leading incident resolution, and contributing to robust, stable code delivery alongside other teams.

Do what you love

To be successful in this role you will:

  • Have a Bachelor's degree in Computer Science or equivalent, with relevant hands-on experience in infrastructure and software architecture at scale.
  • Be experienced in infrastructure automation tools like SaltStack, Terraform, and Ansible, and CI/CD tools such as Jenkins or CloudBees.
  • Have expertise in Linux administration, Docker-based environments, and Kubernetes; skilled in optimizing performance using tools like Redis.
  • Be familiar with observability tools, Prometheus, Grafana, Loki, Sentry, NewRelic, and web proxies such as Nginx/Envoy/HAProxy
  • Have understanding of SLOs and system reliability principles.

About us

At Akamai, we make life better for billions of people, trillions of times a day.
Whether you're streaming live events, scrolling social media, watching your favorite series, or managing your savings, we're the engine behind the scenes. We provide the world's most distributed platform from Cloud to Edge to help the giants of the digital world work faster and stay more secure, making the internet a better experience for everyone.
Our focus is simple:
Cloud and Edge: Running apps closer to users for instant performance.
Security: Neutralizing threats before they ever reach your data.
Content Delivery: Scaling the world's biggest moments without a glitch.
AI: Enabling our customers to build, secure, and scale AI apps on the world's most distributed cloud platform.
At Akamai, we don't just support the internet; we power and protect it, because behind every great digital experience is a massive hidden challenge. And we're the ones who solve it. When millions of people hit play or pay, Akamai ensures it just works.

Benefits at Akamai: We support your health, well-being, finances, and life beyond work. See our benefits.

FlexBase adapts to your job's needs

Akamai's FlexBase program is yet another way we show our commitment to providing employees with an exceptional workplace experience. It's not about telling employees where to work; it's about supporting employees to do their best work.
We trust our incredible employees to work in ways that suit them best: at home, in an office, or a combination of both.

Connect with us on social and see what life at Akamai is like!

Similar Jobs

Yesterday
In-Office or Remote
India
Senior level
Senior level
Cloud • Security • Software • Cybersecurity
The Solutions Architect II will design and implement complex web and security solutions, provide technical consulting, and optimize customer engagement with Akamai's services.
Top Skills: AWSAzureDevOpsGCPJavaScriptPerlPythonWeb Development
Yesterday
Remote or Hybrid
Maharashtra, IND
Senior level
Senior level
Automotive • Hardware • Robotics • Software • Transportation • Manufacturing
Lead design and implementation of ML tools for scenario classification, anomaly detection, sensor data quality, behavior analytics, and automated reporting. Build scalable pipelines and automated data curation for camera, radar, and LiDAR; create SIL/HIL replay interfaces, ground-truthing and annotation tools, and offline benchmarking. Integrate ML modules into simulation, HIL/SIL, and CI/CD ecosystems, implement reproducible workflows (Docker, CI/CD, artifact versioning, automated tests), monitor model/data quality and mentor junior engineers while collaborating with verification and perception teams.
Top Skills: Artifact VersioningAutomated TestsCameraCi/CdDockerHilLidarRadarSensor FusionSilSimulation Platforms
Yesterday
Remote or Hybrid
Maharashtra, IND
Senior level
Senior level
Automotive • Hardware • Robotics • Software • Transportation • Manufacturing
Design, implement, and optimize high-performance embedded inference systems for automotive ADAS. Develop and integrate ML runtimes on edge SoCs, ensure low-latency concurrency and I/O, integrate with ROS2/sensor frameworks, deploy to SIL/HIL and CI pipelines, and perform profiling and debugging of perception stacks (camera/radar/LiDAR) under real-time constraints.
Top Skills: C++17C++20CameraCudaEmbedded Ml RuntimesGitlab CiHilJenkinsLidarNvidia OrinNvidia XavierNxp S32PythonQualcomm Sa8XxxRadarRenesas R-CarRos2Sensor FrameworksSilTi Tda4

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account