Cloudflare Logo

Cloudflare

Systems Engineer, Metrics and Alerting

Posted Yesterday
Be an Early Applicant
2 Locations
Mid level
2 Locations
Mid level
The Systems Engineer will design and operate software to enhance Cloudflare's observability, resolve scaling bottlenecks, work within distributed systems, participate in knowledge sharing, and contribute to open-source efforts, while being part of an on-call rotation for team services.
The summary above was generated by AI

Available Locations: London or Lisbon
About the Department
Production Engineering is responsible for the world's most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.
Our external customers rely on us to provide seamless and predictable incident, traffic, policy management, resulting in the fastest and safest network services in the world.
We are accountable for the overall performance of internal and external facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behaviour is and we are capable of determining and exposing anomalous behaviour.
The Cloudflare network makes it possible to solve challenges at massive scale and efficiency which would be impossible for almost any other organization.
In this role, you can expect to:

  • Design, deliver, and operate software that progresses Cloudflare's Observability competency
  • Solve scaling bottlenecks in critical services in our Metrics & Alerting pipeline
  • Work on highly distributed and scalable systems
  • Participate in the constant cycle of knowledge sharing and mentoring
  • Participate in the global on-call rotation for the services your team owns
  • Research and introduce cutting-edge technologies
  • Contribute to open-source


We are a small team, well-funded, growing and focused on building an extraordinary company. This is a systems engineering role and is a superb opportunity to be part of a high performing team to help to support Cloudflare's mission and help build a better internet.
You may be a good fit for our team if you have:

  • Proficiency in distributed Linux environments
  • Proficiency in designing high-scale distributed systems
  • Proficiency in high-level programming languages (e.g., Golang)
  • Proficiency in Prometheus, Alertmanager, Thanos
  • Proficiency in networking protocols Layer 2-7 of the OSI model
  • Experience working in a fast, high-growth environment
  • Experience working in a 24/7/365 service environment
  • Exquisite written and verbal communication skills
  • Familiarity with Internetworking and BGP
  • Strong bias for action


Bonus points if you have:

  • Experience with high-bandwidth transit Internetworking and routing
  • Passion for code simplicity and performance

Top Skills

Alertmanager
Go
Linux
Prometheus
Thanos

Similar Jobs at Cloudflare

4 Hours Ago
Hybrid
2 Locations
Senior level
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
As a Senior System Engineer at Cloudflare, you will enhance network resilience and reduce operational toil by developing software solutions. You will work on existing and new infrastructure, solving complex problems with scalable tools and services.
14 Hours Ago
2 Locations
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
As a Software Engineer, you will collaborate with engineers and product managers to enhance the scalability and performance of Cloudflare's Network Services. You will work on technologies such as Linux kernel-based networking and design distributed systems.
Top Skills: EbpfGoLarge-Scale Distributed SystemsLinuxNetfilterNftablesRestful ApisRustTc
15 Hours Ago
Hybrid
Lisbon, PRT
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
As a Solutions Engineer, you will serve as the technical advocate for customers, collaborating closely with sales and support teams to design effective solutions based on Cloudflare's offerings. You will leverage your technical knowledge and communication skills to help customers overcome challenges and maximize their use of Cloudflare's services, while managing project deadlines and priorities.

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account