As a Senior Data Scientist, you will improve data quality through metrics definition, root cause analysis, and development of ML/AI models, while mentoring junior colleagues and managing multiple projects.
Career Area:
Technology, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do - but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here - we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
JOB PURPOSE: Cat Digital is the digital and technology arm of Caterpillar Inc., responsible for bringing world class digital capabilities to our products and services. With over one million connected assets worldwide, we're focused on using IoT and other data, technology, advanced analytics and AI capabilities to help our customers build a better world.
Cat Digital Data Analytics & engineering team is looking for a talented and motivated Senior Data Scientist to help improve platform data quality by defining and implementing quality metrics, performing root cause analysis of data quality problems, and developing algorithms as well as ML/AI models to address the most challenging data quality issues. In this role, you will apply machine learning and other analytics techniques on a very large set of diverse data from IoT-connected assets and our integrated network of dealers. You will also use analytics and visualization methods to solve problems for Caterpillar's internal customers.
Top candidates will have prior experience in developing ML/AI solutions and business intelligence, be proficient in SQL and Python, and have experience with cloud computing and dashboard design and also experienced to :
JOB DUTIES:
Basic Qualifications:
Top candidates will also have:
Posting Dates:
June 10, 2025 - June 23, 2025
Caterpillar is an Equal Opportunity Employer.
Not ready to apply? Join our Talent Community.
Technology, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do - but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here - we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
JOB PURPOSE: Cat Digital is the digital and technology arm of Caterpillar Inc., responsible for bringing world class digital capabilities to our products and services. With over one million connected assets worldwide, we're focused on using IoT and other data, technology, advanced analytics and AI capabilities to help our customers build a better world.
Cat Digital Data Analytics & engineering team is looking for a talented and motivated Senior Data Scientist to help improve platform data quality by defining and implementing quality metrics, performing root cause analysis of data quality problems, and developing algorithms as well as ML/AI models to address the most challenging data quality issues. In this role, you will apply machine learning and other analytics techniques on a very large set of diverse data from IoT-connected assets and our integrated network of dealers. You will also use analytics and visualization methods to solve problems for Caterpillar's internal customers.
Top candidates will have prior experience in developing ML/AI solutions and business intelligence, be proficient in SQL and Python, and have experience with cloud computing and dashboard design and also experienced to :
- Drive, Design and maintain AI & Machine Learning models & solutions.
- Provide analytics support to high profile Helios Data Division Projects
- Use analytics methods to make recommendations to Designers, Product Owners and Managers
- Work independently without close supervision on medium to high-complexity projects
- Work on 2-3 projects concurrently.
JOB DUTIES:
- As a Senior Data scientist, you will contribute to the design, solution development, strategy & solution development
- Competent in performing all programming, project management, and development assignments without close supervision; normally assigned the more complex aspects of systems work.
- Works directly on complex application/technical problem identification and resolution, including responding to off-shift and weekend support calls.
- Works independently on complex systems or infrastructure components that may be used by one or more applications or systems.
- Drives application development focused on delivering business valuable features
- Mentor and assist data scientists, providing technical assistance and direction as needed
- Maintains high standards of software quality within the team by establishing good practices and habits
- Identifies and encourage areas for growth and improvement within the team
- Guide the team to develop a structured application/interface code, new program documentation, operations documentation and user guides in a casual, flexible environment
- Communicate with end users and internal customers to help direct the development, debugging, and testing of application software for accuracy, integrity, interoperability, and completeness
- Performs integrated testing and customer acceptance testing of components that requires careful planning and execution to ensure timely, quality results.
- The employee is also responsible for performing other job duties as assigned by Caterpillar management from time to time.
Basic Qualifications:
- BS or MS degree in a quantitative discipline such as data science, data analytics, computer science, engineering, statistics, mathematics, finance, or other related degree
- Overall experience of 7+ years of professional experience
- 5+ years of proven experience as a Data scientist in designing and implementing data processing and machine learning frameworks.
- Recent 4+ years of experience with Python, SQL
- Hands-on experience in Agents, Large Language Models, Small Language Models, Retrieval Augmented Generation, Generative AI, Natural Language Processing, and Deep Learning
Top candidates will also have:
- MS degree in a quantitative discipline such as data science, data analytics, computer science, engineering, statistics, mathematics, finance, or other related degree
- Proven experience in some of the following:
- Compiling and standardizing diverse, non-sanitized datasets.
- Working with structured and unstructured data.
- Developing classification and regression models.
- Unsupervised learning algorithms.
- Experience integrating analytical models with existing data pipelines.
- Proven experience as a Lead Data scientist for a Data science team.
- Solid knowledge of statistical approaches, quantitative analytic methods, data management techniques, and/or related digital technologies, and the ability to handle complex issues.
- Proven experience with AWS full-stack development and services such as Athena, Glue, DynamoDB, EC2, EMR, RDS, S3, SageMaker.
- Experience with Snowflake data warehouse
- Experience visualizing data using BI software such as Tableau and MS Power BI
- Good organizational skills and an aptitude for complex analytical and detailed work; ability to prioritize among multiple concurrent projects to meet deadlines promptly.
- Experience gathering information systematically
- Ability to consider a broad range of issues or factors, grasp complexities and perceive relationships among problems or issues, and use accurate logic in analysis
- Exceptional verbal and written communication skills and ability to engage effectively at all levels of the organization, to both technical and non-technical audiences.
- Ability to work independently or collaboratively in a complex, rapidly changing, and culturally diverse environment.
- Ability to learn and comply with company policies and procedures
- Passion for technology and an eagerness to contribute to a team-orientated environment
Posting Dates:
June 10, 2025 - June 23, 2025
Caterpillar is an Equal Opportunity Employer.
Not ready to apply? Join our Talent Community.
Top Skills
AWS
Power BI
Python
SQL
Tableau
Similar Jobs at Caterpillar
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
The Data Scientist will leverage quantitative analysis, data management, and modeling skills to solve business problems, mentor junior data scientists, and communicate complex insights to stakeholders.
Top Skills:
Agile FrameworkCloudData Visualization ToolsDevOpsGitlabPower BIPythonRSASSnowflakeSQL
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
The Data Scientist will analyze large datasets to drive business decisions using quantitative methods, develop predictive models, and collaborate with teams to deliver insights.
Top Skills:
AWSPower BIPythonRSnowflakeSparkSQLTableau
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
As a Lead Software Engineer, you will provide technical leadership in design and deployment of digital platforms, focusing on Adobe Experience Manager and related technologies, while coordinating development teams and optimizing solutions.
Top Skills:
Adobe Experience ManagerAemApacheCi/CdCSSGitHTMLJavaJenkinsJavaScriptJSONMicroservicesRest
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.