NICE Logo

NICE

Senior Specialist Cloud SRE Engineer, Actimize

Posted 2 Days Ago
Be an Early Applicant
Pune, Maharashtra
Senior level
Pune, Maharashtra
Senior level
The Senior Specialist Cloud SRE Engineer will enhance CloudOps monitoring, designing and implementing monitoring systems using various tools. Responsibilities include troubleshooting issues, automating tasks with scripting, and maintaining system metrics and logs. The role requires extensive experience in cloud operations, Linux, AWS, and scripting languages, as well as problem-solving skills for root cause analysis and self-healing capabilities.
The summary above was generated by AI

At NICE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.


So, what’s the role all about?

 

NICE Actimize Premier is seeking Application and System monitoring Engineer to take our existing CloudOps monitoring to the next level. In this position You will be working with multitude of modern tools and technologies to properly and efficiently build next generation of monitoring system as well as troubleshoot and resolve issues in our development, test and production environments.The ideal candidate has to have the ability to work in a dynamic and complex software build environment and will also be an energetic self-starter with a passion to build, innovate and achieve excellence.

 

What you will be doing?

 

  • Ability to design, implement and improve Grafana, Prometheus, Loki, Promtail, node exporter.
  • Log parsing and management.
  • Configuration of alerting, push notifications to VictorOps (now Splunk) and Email notifications.
  • Architect, design and Implement Icinga 2 monitoring and alerting.
  • Ability to monitor system metrics and log parsing.
  • Ability to automate tasks using bash and / or Python scripting.
  • Predictive monitoring of systems and applications.
  • Familiarity with JVM internals and using of JMX and REST for monitoring.
  • Familiarity with AWS infrastructure.
  • Deep understanding of Java applications, TLS, Apache.
  • Automated checks of performance of system metrics in Grafana.
  • Automated checks of performance of Web Applications.
  • Problem-solving and troubleshooting, including performing root cause analysis to design preventative activities.
  • Crafting and maintaining dashboards and reports, pulling together monitoring data across multiple platforms within the same tool as well as across multiple tools.
  • Assisting with writing scripts and queries that can provide environment self-healing capabilities.


Have you got what it takes?

 

  • Experience with using monitoring tools in a production environment.
  • 5+ years of production cloud operations experience
  • 5+ years expertise in Linux command line.
  • 5+ years of using Terraform in AWS for automation. Hands on with automation and seeking out opportunities to automate manual processes.
  • 5+ years of strong, hands-on experience building production services in AWS.
  • 4+ years of experience with scripting using Python and Bash
  • Ability to participate in on-call rotation
  • Considerable knowledge of IT equipment and diagnostic tools.
  • Considerable knowledge of principles and techniques of systems analysis, design, development and programming.
  • Considerable knowledge of principles of information systems.
  • Cnsiderable knowledge of capabilities of computer technology.
  • Knowledge of methods and procedures used to conduct detailed analysis and design of computer systems.
  • Knowledge of practices and issues of systems’ security and disaster recovery
  • Knowledge of computer operating systems.


What’s in it for you?

Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr!

Enjoy NICE-FLEX!

At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere.

Requisition ID: 5664
Reporting into: 
Tech Manager
Role Type: Individual Contributor

About NICE

NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NICE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions.

Known as an innovation powerhouse that excels in AI, cloud and digital, NICE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries.

NICE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.


Top Skills

Bash
Java
Python

NICE Pune, Mahārāshtra, IND Office

8th Floor in Wing A & B Block Rhine Rajiv Gandhi Infotech Park-Phase II, Hinjewadi , Pune, India, 411057

Similar Jobs

Yesterday
Hybrid
Navi Mumbai, Thane, Maharashtra, IND
Senior level
Senior level
Enterprise Web • Fintech • Financial Services
As a Lead Site Reliability Engineer, you'll design and implement system enhancements to boost performance and reliability. You will lead a skilled team, improve deployment processes, and optimize cloud solutions while ensuring system visibility and customer satisfaction.
Top Skills: DockerSQLTerraform
6 Days Ago
Hybrid
Pune, Maharashtra, IND
Mid level
Mid level
Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
As a Senior Cloud Site Reliability Engineer, you will analyze, maintain, and nurture Cloud solutions/products. You will coordinate emergency responses, conduct root cause analysis, and identify improvements for system performance. You are expected to promote industry best practices, troubleshoot across infrastructure and software stacks, and collaborate with teams to enhance the quality and reliability of cloud services.
Top Skills: Python
11 Days Ago
Hybrid
Navi Mumbai, Thane, Maharashtra, IND
Mid level
Mid level
Enterprise Web • Fintech • Financial Services
The Site Reliability Engineer will onboard users to observability platforms, ensure best practices are followed, collaborate with teams to educate on observability features, assist with anomaly analysis, automate tasks, and maintain operational documentation.
Top Skills: BashPowershellPython

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account