ZainTECH Logo

ZainTECH

Data & AI Operations Specialist

Posted 6 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Mid level
Remote
Hiring Remotely in India
Mid level
The Data & AI Operations Specialist leads technical operations for AI infrastructure, manages data pipelines, and oversees MLOps across multi-cloud environments, ensuring compliance and performance optimization.
The summary above was generated by AI

The Data & Operations AI Specialist serves as the Level 3 technical lead for Artificial Intelligence and Data Platform estate. You will be responsible for the architecture, engineering, and advanced troubleshooting of AI infrastructure, data pipelines, and MLOps lifecycles across a multi-cloud environment (Azure and OCI).

Responsibilities:

AI Infrastructure & Platform Engineering

  • Design & Architecture: Maintain the monitoring architecture for AI/ML platforms and configure advanced dashboards in Grafana and Azure Monitor.
  • Environment Governance: Manage Azure Machine Learning (AML) workspace configurations, compute targets, and Databricks cluster lifecycles (including runtime versions and platform patching).
  • Resource Optimization: Oversee GPU resource allocation, reserved capacity, and cost-performance optimization to align with FinOps goals.
  • Security Integration: Ensure all AI services utilize private endpoints, VNET integration, and RBAC controls to protect sensitive citizen data.

Data Pipeline & ETL Management

  • Pipeline Engineering: Own the design, optimization, and remediation of Azure Data Factory (ADF) and Synapse pipelines.
  • Advanced Troubleshooting: Resolve complex bottlenecks related to authentication failures, data format changes, and ETL performance.
  • SOP Leadership: Author step-by-step Standard Operating Procedures (SOPs) for the L1 NOC team to handle routine monitoring and first-line triage.

MLOps & Model Lifecycle

  • Automation: Implement CI/CD pipelines for model training, testing, and deployment to AML endpoints.
  • Model Reliability: Configure data drift detection thresholds and automated retraining triggers.
  • Recovery Operations: Develop self-healing scripts and automated recovery runbooks for critical AI workflows.

Governance & Compliance

  • Audit Management: Implement and maintain audit logging for all AI decisions and model outputs, ensuring logs flow to the SIEM/vSOC.
  • Regulatory Alignment: Conduct quarterly AI governance reviews to ensure compliance with NESA standards and data privacy guidelines.

Requirements
  • AI/ML Platforms: Deep expertise in Azure Machine Learning and Databricks.
  • Data Integration: Proficiency in Azure Data Factory and Synapse.
  • Infrastructure-as-Code (IaC): Experience with Terraform or ARM Templates for reproducible deployments.
  • Observability: Ability to use Dynatrace, Grafana, and Azure Monitor for deep-tier diagnostics.
  • Containerization: Knowledge of AKS, Istio Service Mesh, and KEDA.
  • ITIL Mastery: Strong understanding of ITIL-aligned Incident, Change, and Problem management.
  • Security Mindset: Familiarity with NESA standards and UAE data residency requirements.
  • Technical Writing: Ability to draft complex SOPs and Root Cause Analysis (RCA) documents within 48 hours of an incident.
  • Certifications: Microsoft Azure Data Scientist Associate or Azure AI Engineer Associate is highly preferred.

Top Skills

Aks
Arm Templates
Azure Data Factory
Azure Machine Learning
Azure Monitor
Databricks
Dynatrace
Grafana
Istio Service Mesh
Keda
Synapse
Terraform

Similar Jobs

3 Hours Ago
Remote
India
Senior level
Senior level
Cloud • Information Technology • Productivity • Software • Automation
As a Senior Software Quality Engineer, design and implement testing strategies for backend infrastructure and AI features ensuring reliability and quality across applications.
Top Skills: Ci/CdConfluenceGitJavaJIRALinuxPlaywrightPytestPythonSeleniumUnittestUnix
Mid level
Cloud • Information Technology • Productivity • Software • Automation
As a Customer Journey Marketing Specialist, you will build and manage email campaigns, analyze performance data, and support customer lifecycle initiatives, while collaborating with cross-functional teams.
Top Skills: AsanaMarketoSalesforce
9 Hours Ago
In-Office or Remote
Mid level
Mid level
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Manage influencer marketing campaigns across various platforms, oversee operations, collaborate with creators, ensure quality, and report on campaign metrics.
Top Skills: InstagramLinkedInTiktokYoutube

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account