Blink Health

Senior Cloud Resilience Architect

Posted 16 Days Ago
Remote
Hiring Remotely in India
Senior level

Company Overview:

Blink Health is the fastest-growing healthcare technology company that builds products to make prescriptions accessible and affordable to everybody. Our two primary products – BlinkRx and Quick Save – remove traditional roadblocks within the current prescription supply chain, resulting in better access to critical medications and improved health outcomes for patients.
BlinkRx is the world’s first pharma-to-patient cloud that offers a digital concierge service for patients who are prescribed branded medications. Patients benefit from transparent low prices, free home delivery, and world-class support on this first-of-its-kind centralized platform. With BlinkRx, never again will a patient show up at the pharmacy only to discover that they can’t afford their medication, their doctor needs to fill out a form for them, or the pharmacy doesn’t have the medication in stock. 
We are a highly collaborative team of builders and operators who invent new ways of working in an industry that historically has resisted innovation. Join us!



Responsibilities
  • Evaluate and mature the organization’s disaster recovery posture, including recovery objectives (RTO/RPO), dependency mapping, and failure domain analysis across applications, data, and infrastructure.
  • Define, document, and establish disaster recovery standards and best practices across cloud infrastructure, platforms, and application architectures.
  • Partner with SRE, platform, security, and product engineering teams to design and implement resilient, fault-tolerant systems, progressing from backup-based recovery to multi-region and active-active architectures.
  • Lead the disaster recovery roadmap, balancing technical feasibility, cost, risk, and business priorities.
  • Design and recommend reference architectures for disaster recovery patterns, including pilot-light, warm standby, hot standby, and active-active.
  • Drive adoption of active-active disaster recovery for critical systems, including traffic management, data replication, consistency models, and automated failover.
  • Define and operationalize testing strategies for DR, including game days, chaos testing, and regular recovery exercises.
  • Establish clear documentation, runbooks, and escalation paths to ensure recoverability is well understood and not dependent on individuals.
  • Evaluate and recommend platform upgrades, cloud services, and tooling that improve resilience, recovery speed, and reliability.
  • Serve as a technical authority and advisor on disaster recovery and resilience for leadership and engineering teams.
  • Provide architectural guidance, design reviews, and mentorship to engineers implementing DR-related changes.
  • Partner with security and compliance teams to ensure DR strategies meet regulatory, audit, and data protection requirements.
Desired Experience
  • Bachelor’s or Master’s degree in Computer Science or equivalent practical experience.
  • 8+ years of experience in cloud infrastructure, platform engineering, SRE, or reliability-focused architecture roles.
Disaster Recovery & Resilience
  • Deep understanding of disaster recovery concepts including RTO/RPO, blast radius reduction, failure domains, and dependency isolation.
  • Proven experience designing and implementing multi-region and multi-availability zone architectures.
  • Hands-on experience moving systems toward active-active or highly available architectures.
  • Strong grasp of data replication strategies, consistency tradeoffs, and recovery patterns for databases and stateful systems.
Cloud & Platform Engineering
  • Extensive experience with major cloud providers (AWS preferred, GCP/Azure acceptable).
  • Strong understanding of managed cloud services and their DR characteristics and limitations.
  • Experience with Kubernetes-based platforms, including regional failover, workload portability, and cluster recovery strategies.
  • Familiarity with global traffic management, DNS, load balancing, and service mesh patterns.
Automation & Infrastructure as Code
  • Experience designing and maintaining Infrastructure as Code using tools such as Terraform, Pulumi, CloudFormation, or Ansible.
  • Strong focus on automation for recovery workflows, failover testing, and environment provisioning.
  • Ability to eliminate manual recovery steps and reduce time-to-recovery through software.
Operational Excellence
  • Experience defining and running DR tests, game days, and failure simulations.
  • Comfortable working across organizational boundaries to influence priorities and standards.
  • Strong documentation and communication skills, with the ability to translate complex technical risk into business impact.

Why Join Us:

It is rare to have a company that both deeply impacts its customers and is able to provide its services across a massive population. At Blink, we have a huge impact on people when they are most vulnerable: at the intersection of their healthcare and finances. We are also the fastest-growing healthcare company in the country and are driving that impact across millions of new patients every year. Our business model not only helps people but also drives economics that allow us to build a generational company. We are a relentlessly learning, constantly curious, and aggressively collaborative cross-functional team dedicated to inventing new ways to improve the lives of our customers.
We are an equal opportunity employer and value diversity of all kinds. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Applicants who provide their phone number and consent to receive text messages may receive SMS or MMS updates from Blink Health regarding their application.

