The Senior Database Reliability Engineer will improve database operations by applying software engineering techniques, ensuring database performance and reliability through optimization, automation, and observability using tools like Dynatrace.
We are seeking a highly skilled and proactive Senior Database Reliability Engineer to join our Infrastructure team. In this role, you will be responsible for the health, performance, and scalability of our diverse database estate. You will move beyond traditional administration by applying software engineering principles to database operations, leveraging Dynatrace for deep observability and driving efficiency through advanced automation.
The ideal candidate is a polyglot database expert who is equally comfortable tuning a complex PostgreSQL query, managing MongoDB replicasets, and writing Python or Ansible scripts to automate failover processes.
ResponsibilitiesKey Responsibilities1. Database Administration & Optimization- PostgreSQL: Manage large-scale PostgreSQL instances. Perform deep-dive performance tuning (vacuuming, indexing strategies, query plan analysis) and manage high-availability configurations (e.g., Patroni, Repmgr).
- MongoDB: Architect, deploy, and maintain MongoDB replica sets. Optimize aggregation pipelines and ensure data consistency and scaling.
- Redis: Manage Redis deployments for caching and real-time data needs. Implement Redis Sentinel or Cluster for high availability and advise development teams on data structure best practices.
- Lifecycle Management: Handle migrations, version upgrades, patching, and backups across all platforms to ensure zero-loss recovery point objectives (RPO).
- Dynatrace Integration: Utilize Dynatrace to monitor database health, create custom dashboards, and set up intelligent alerting.
- Root Cause Analysis: Use Dynatrace’s AI-driven insights to identify bottlenecks, deadlocks, and slow-running queries before they impact the end-user.
- Capacity Planning: Analyze trends in data growth and resource utilization to proactively scale infrastructure.
- Infrastructure as Code (IaC): Automate the provisioning and configuration of database environments using tools like Terraform, Ansible, or CloudFormation.
- CI/CD Integration: Work with DevOps teams to integrate database schema changes and deployments into automated pipelines.
- Self-Healing: Develop scripts (Python, Bash, or Go) to automate routine maintenance tasks and implement self-healing capabilities for common failure modes.
- PostgreSQL: 5+ years of experience with advanced knowledge of MVCC, WAL, and extension management.
- MongoDB: 3+ years of experience managing production NoSQL environments, including sharding and security hardening.
- Redis: Proven experience managing Redis at scale (persistence settings, memory management, and eviction policies).
- Monitoring/Observability: Hands-on experience with Dynatrace (preferred) or similar APM tools (Datadog, New Relic) to monitor database tiers.
- Automation: Proficiency in Ansible or Terraform and strong scripting skills in Python or Bash.
- Linux/Unix: Expert-level command-line proficiency and understanding of system-level performance tuning (kernel parameters, I/O scheduling).
- Understanding of containerization (Docker, Kubernetes) and managing databases within orchestrated environments.
- Relevant certifications (e.g., PostgreSQL Professional, MongoDB Certified DBA, or Dynatrace Associate).
- Experience with cloud-native database services (Cloud SQL, MongoDB Atlas) is a plus.
- Analytical Thinking: Ability to solve complex, multi-dimensional problems under pressure.
- Collaboration: Strong communication skills to act as a bridge between AppDev, Infrastructure, and Security teams.
- Precision: A meticulous approach to data integrity and system security.
Similar Jobs
Cloud • Information Technology • Security • Software • Cybersecurity
Manage and mentor a team to develop high-quality technical training content, focusing on cybersecurity and AI, while ensuring effective learning experiences.
Top Skills:
AICybersecurityInstructional Design
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Advisory AI Foundry Architect will guide revenue through technical consulting in AI integration and support product sales while achieving sales goals.
Top Skills:
AICloud Software SolutionsServicenow Platform
Cloud • Information Technology • Security • Software • Cybersecurity
Support the APJC Public Policy team with research, policy analysis, engaging stakeholders, and developing AI tools for policy tracking.
Top Skills:
Ai ToolsData AnalysisWorkflow Optimization
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.



