AI Data Scientist
Locations: Pune, India
Buildings are getting smarter with connected technologies. With more connectivity, there is access to more data from sensors installed in buildings. Johnson Controls is leading the way in providing AI enabled enterprise solutions that contribute to optimized energy utilization, auto- generation of building insights and enable predictive maintenance for installed devices. Our Data Strategy & Intelligence team is looking for a Data Scientist to join our growing team. You will play a critical role in developing and deploying machine learning/Generative AI and Timeseries analysis models in production.
The Role
To be successful in this role, the Data Scientist should have a deep knowledge of machine learning concepts, Large Language Models (LLM) including their training , optimization and deployment, time series models as well as hands on experience in developing and deploying ML/Generative AI/ time series models in production.
What you will do
As an AI Data Scientist at Johnson Controls, you will help develop and maintain the AI algorithms and capabilities within our digital products. These applications will use data from commercial buildings, apply machine learning, GenAI or other advanced algorithms to provide value in the following ways:
- Optimize building energy consumption, occupancy, reduce CO2 emissions, enhance users’ comfort, etc.
- Generate actionable insights to improve building operations
- Translate data into direct recommendations for various stakeholders
Your efforts will ensure that our AI solutions deliver robust and repeatable outcomes through well-designed algorithms and well-written software.
To be successful in this role, the AI Data Scientist should be comfortable applying machine-learning concepts to practical applications while handling the inherent challenges of real-world datasets.
How you will do it
- Contribute as a member of the AI team with assigned tasks
- Collaborate with product managers to design new AI capabilities
- Explore and analyze available datasets for potential applications
- Design and develop ML/Generative AI/Timeseries Prediction solutions understanding complex business requirements
- Collaborate with cross-functional teams to integrate AI capabilities into existing products
- Research and implement state-of-the-art techniques in generative AI solutions
- Develop and maintain AI model integration pipelines
- Pre-train and finetune ML over GPU clusters while optimizing for trade-offs.
What we look for
- Bachelor's / Master’s degree in Computer Science, Statistics, Mathematics, or related field.
- 7+ years of experience of developing and deploying ML Models with a proven record of delivering production ready ML models.
- Expert level proficiency with Python and standard ML libraries viz. PyTorch, Tensorflow, Keras, NumPy, Pandas, scikit-Learn, Matplotlib, Transformers, FastAPI / Django.
- Strong understanding of ML algorithms and techniques viz. Regression, Classification, Clustering, Deep Learning, NLP / Transformer models, LLMs and Time-series prediction models.
- Proficiency in developing SOA LLM frameworks and models(Azure OpenAI, Meta LLAMA,etc), advanced prompt engineering techniques, LLMs fine-tuning/training and retrieval-augmented generation (RAG) systems.
- Proficiency in working with engineering teams for deploying ML/ LLM models on production and experience with model observability and monitoring
- Experience on working with cloud (AWS / GCP / Azure) based ML/GenAI model development / deployment.
Preferred
- Prior Domain experience in Smart buildings and Energy optimization
- Experience on working with Azure Cloud.