Design and deploy machine learning models, manage lifecycle on GCP, optimize performance, and ensure reliability for AI applications in production.
We are looking for a Machine Leaning Engineer (MLE) to design, build, and optimize our machine learning operation. You will play a crucial role in scaling AI models from research to production, ensuring smooth model deployment, monitoring, and lifecycle management across our Google Cloud Platform (GCP) infrastructure. You'll work closely with data scientists, ML Ops, and data engineers to automate workflows, improve model performance, and ensure reliability for our AI that serves millions of players worldwide.
What You'll Do
- Design, develop, and deploy machine learning models and solutions, leveraging tools such as LangGraph and MLflow for orchestration and lifecycle management.
- Collaborate on building and maintaining scalable data and feature pipeline infrastructure for real time and batch processing using tools like BigQuery, BigTable, Dataflow, Composer(Airflow), PubSub, and Cloud Run to support ML model training and inference.
- Develop and implement robust strategies for model monitoring and observability to detect model drift, bias, and performance degradation, leveraging tools like Vertex AI Model Monitoring and custom dashboards.
- Optimize ML model inference performance to improve latency and cost-efficiency of AI applications.
- Ensure the overall reliability, performance, and scalability of the ML models and data infrastructure platform, including proactive identification and resolution of issues related to model performance and data quality.
- Troubleshoot and resolve complex issues impacting ML models, data pipelines, and production AI systems.
- Ensure AI/ML models and workflows meet data governance, security, and compliance requirements, specifically for real-money gaming.
What We're Looking For
- 1+ years of experience as an ML Engineer, with a focus on developing and deploying machine learning models in production environments.
- Strong experience in Google Cloud Platform (GCP), including services relevant to ML and data infrastructure such as BigQuery, Dataflow, Vertex AI, Cloud Run, and Pub/Sub and Composer (Airflow).
- Solid grasp of containerization (Docker, Kubernetes) and experience with Kubernetes orchestration platforms like GKE for deploying ML services.
- Experience building and deploying scalable data pipelines and machine learning models in production environments.
- Understanding of model monitoring, logging, and observability best practices for ML models and applications.
- Experience in Python and ML frameworks (e.g., TensorFlow, PyTorch, scikit-learn).
- Familiarity with AI orchestration concepts using tools like LangGraph or LangChain is a bonus.
- Bonus experience includes working in gaming, real-time fraud detection, or AI personalization systems and Agentic workflows.
Top Skills
BigQuery
Cloud Run
Composer
Dataflow
Docker
Google Cloud Platform
Kubernetes
Langgraph
Pub/Sub
PyTorch
Scikit-Learn
TensorFlow
Vertex Ai
Similar Jobs
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Team Leader manages Group Life claims, ensuring compliance and customer satisfaction by overseeing claim evaluations, mentoring staff, and addressing customer issues.
Top Skills:
AccurintBiosCalligoCdfEdcsGlif ProductionGroupfactsExcelMicrosoft WordNetviewWindowsWorkdesk
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
SailPoint is seeking a Senior Software Engineer to build Python SDKs and frameworks for their big data platform. Responsibilities include designing, delivering, and testing backend services while collaborating with teammates and engaging in product demos and customer support.
Top Skills:
AWSDockerEksGrafanaJIRAKafkaKibanaNoSQLPrometheusPythonRedisSQL
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Lead development, scaling, governance, and adoption of enterprise process capabilities (BPM, BPI, BPR). Manage tools, training, standards, integrations, and reusable assets to enable transformation, collaborate with cross-functional stakeholders, and drive delivery enablement and capability maturity.
Top Skills:
Business Process Intelligence (Bpi)Business Process Management (Bpm)Business Process Reengineering (Bpr)Performance AnalyticsProcess Modeling ToolsSignavio
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.



