Senior Site Reliability Engineer

Posted 23 Days Ago
Be an Early Applicant
Pune, Maharashtra
Mid level
Information Technology • Security • Cybersecurity
The Role
The Senior Site Reliability Engineer - Incident Management is responsible for monitoring and managing Qualys infrastructure, ensuring service availability, troubleshooting incidents, and automating tasks. This role involves collaborating with technical teams, documenting incidents, and managing incident tickets to resolve issues effectively.
Summary Generated by Built In

Come work at a place where innovation and teamwork come together to support the most exciting missions in the world!


Job Description
Come work at a place where innovation and teamwork come together to support the most exciting missions in the world!

The Site Reliability Engineer - Incident Management, has the responsibility of monitoring, maintaining and managing entire Qualys infrastructure and services installed at different data-centers. When there is any malfunction in Product/Services, the Site Reliability Engineer- Incident Management technician Monitor, troubleshoots, repairs and gets the Service/system back up as quickly as possible. Ensure maximum possible service availability and performance, provide support services for Engineering and other technical teams and to collaborate for quicker resolution. End to end Incident management, Documentations and task Automation are also part of responsibility. 

Responsibilities: 

Monitor the performance and capacity of computer systems using a variety of tools. When an issue is identified, Site Reliability Engineer- Incident Management works to determine the cause of the problem. Responsible for basic troubleshooting platform/product issues to isolate the problems and take appropriate action to resolve. Check performance with Splunk/Grafana/Kibana. Manage PagerDuty. Also help in task automation wherever possible/applicable. Ensure creation and timely resolution to incident tickets tracking and resolution of the incident. When a problem impacts Product (SaaS) or Any (IT) services, Site Reliability Engineer- Incident Management works to triage or troubleshoot the problem, 

Site Reliability Engineer- Incident Management must carefully track and document all issues and resolutions in detail on the ticketing tool / documentation tools. This increases the knowledge base of the Site Reliability Engineer- Incident Management and is a record of the health of the system. When problems are too large or complex for quick troubleshooting, Site Reliability Engineer- Incident Management must escalate the issue to management, other IT resources or 3rd party vendors for assistance in reaching a resolution. Site Reliability Engineer- Incident Management maintain ongoing communication within the team and externally, to keep all stakeholders aware of relevant info, known issues and the steps being taken in summary format. Site Reliability Engineer- Incident Management team will operate 24*7*365 days. Monthly shift rotation basis (*depend on requirement). 

Required Skills

3-6 years IT Operations (Infra/System admin/Linux) or equivalent experience/certification. 

Knowledge or familiarity of Monitoring and other integration tools like Splunk, Prometheus, Grafana, Kibana, PagerDuty, Runscope (good to have any of the knowledge) and Jira /ServiceNow tool for Incident Management. 

Good experience (or familiarity) with ITSM main functions and usage of tools. 

Very good understanding of Incident Management (IM) processes and ability to drive Incident process (IM ticket). 

Strong interpersonal skills and have the ability to interact with all levels of employees in a professional manner. 

Certifications is highly recommended with a strong knowledge of computer functionality. Any technical certification on Linux, System Admin, VMware, IT Security or certification in the area of ITSM/ ITIL will be an added advantage. Knowledge of DevOps/SRE (basics) , Python, Cloud will be also good to have

Top Skills

Linux
Python
The Company
Pune, Maharashtra
2,736 Employees
On-site Workplace
Year Founded: 1999

What We Do

Qualys, Inc. (NASDAQ: QLYS) is a pioneer and leading provider of disruptive cloud-based security, compliance and IT solutions with more than 10,000 subscription customers worldwide, including a majority of the Forbes Global 100 and Fortune 100. Qualys helps organizations streamline and automate their security and compliance solutions onto a single platform for greater agility, better business outcomes, and substantial cost savings.
The Qualys Cloud Platform leverages a single agent to continuously deliver critical security intelligence while enabling enterprises to automate the full spectrum of vulnerability detection, compliance, and protection for IT systems, workloads and web applications across on premises, endpoints, servers, public and private clouds, containers, and mobile devices. Founded in 1999 as one of the first SaaS security companies, Qualys has strategic partnerships and seamlessly integrates its vulnerability management capabilities into security offerings from cloud service providers, including Amazon Web Services, the Google Cloud Platform and Microsoft Azure, along with a number of leading managed service providers and global consulting organizations. For more information, please visit http://www.qualys.com

Jobs at Similar Companies

Cloudflare Logo Cloudflare

Senior/Principal Systems Engineer - Workers AI (Platform)

Cloud • Information Technology • Security • Software • Cybersecurity
Remote
Hybrid
Austin, TX, USA
3900 Employees

Cloudflare Logo Cloudflare

Senior/Principal Systems Engineer - Workers AI (AI/ML)

Cloud • Information Technology • Security • Software • Cybersecurity
Remote
United States
3900 Employees

Cloudflare Logo Cloudflare

Senior/Principal Systems Engineer - Workers AI (Platform)

Cloud • Information Technology • Security • Software • Cybersecurity
Remote
United States
3900 Employees

Cloudflare Logo Cloudflare

Engineering Manager - Workers AI (Platform)

Cloud • Information Technology • Security • Software • Cybersecurity
Remote
Hybrid
London, OH, USA
3900 Employees

Similar Companies Hiring

LogicMonitor Thumbnail
Software • Machine Learning • Information Technology • Cloud • Artificial Intelligence
Santa Barbara, CA
1100 Employees
Zocdoc Thumbnail
Telehealth • Software • Information Technology • Healthtech
New York, NY
715 Employees
Cymulate Thumbnail
Software • Sales • Information Technology • Cybersecurity
New York City, NY
200 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account