Puma Energy Logo

Puma Energy

Senior Analyst - Data Engineer

Posted 25 Days Ago
Mumbai, Maharashtra
Mid level
Mumbai, Maharashtra
Mid level
The Senior Analyst - Data Engineer will design and maintain data pipelines, ensuring data integrity and quality for business insights, while enhancing data infrastructure and governance using Databricks and various cloud technologies.
The summary above was generated by AI
Main Purpose:Main Purpose
▪Collaborate with data scientists and business stakeholders to design, develop, and maintain efficient data pipelines feeding into the organization's data lake.

Maintain the integrity and quality of the data lake, enabling accurate and actionable insights for data scientists and informed decision-making for business stakeholders.
▪Utilize extensive knowledge of data engineering and cloud technologies to enhance the organization’s data infrastructure, promoting a culture of data-driven decision-making.

Apply data engineering expertise to define and optimize data pipelines using advanced concepts to improve the efficiency and accessibility of data storage.
▪Own the development of an extensive data catalog, ensuring robust data governance and facilitating effective data access and utilization across the organization.Knowledge Skills and Abilities, Key Responsibilities:

Key Responsibilities

•Contribute to the development of scalable and performant data pipelines on Databricks, leveraging Delta Lake, Delta Live Tables (DLT), and other core Databricks components.

•Develop data lakes/warehouses designed for optimized storage, querying, and real-time updates using Delta Lake.

•Implement effective data ingestion strategies from various sources (streaming, batch, API-based), ensuring seamless integration with Databricks.

•Ensure the integrity, security, quality, and governance of data across our Databricks-centric platforms.

•Collaborate with stakeholders (data scientists, analysts, product teams) to translate business requirements into Databricks-native data solutions.

•Build and maintain ETL/ELT processes, heavily utilizing Databricks, Spark (Scala or Python), SQL, and Delta Lake for transformations.

•Experience with CI/CD and DevOps practices specifically tailored for the Databricks environment.

•Monitor and optimize the cost-efficiency of data operations on Databricks, ensuring optimal resource utilization.
•Utilize a range of Databricks tools, including the Databricks CLI and REST API, alongside Apache Spark™, to develop, manage, and optimize data engineering solutions.

Work Experience:

•5 years of overall experience & at least 3 years of relevant experience
•3 years of experience working with Azure or any cloud platform & Databricks

Skills:
• Proficiency in Spark, Delta Lake, Structured Streaming, and other Azure Databricks functionalities for sophisticated data pipeline construction.
• Strong capability in diagnosing and optimizing Spark applications and Databricks workloads, including strategic cluster sizing and configuration.
• Expertise in sharing data solutions that leverage Azure Databricks ecosystem technologies for enhanced data management and processing efficiency.
• Profound knowledge of data governance, data security, coupled with an understanding of large-scale distributed systems and cloud architecture design.
• Experience with a variety of data sources and BI tools

Key Relationships and Department Overview:

•Internal – Data Engineering Manager
•Developers across various departments, Managers of Departments in other regional hubs of Puma Energy
•External – Platform providers

Top Skills

Azure
Ci/Cd
Databricks
Delta Lake
Delta Live Tables
Python
Rest Api
Scala
Spark
SQL

Similar Jobs

25 Days Ago
Mumbai, Maharashtra, IND
Senior level
Senior level
Energy
The Senior Analyst - Data Engineer will design and maintain data pipelines for the organization, ensuring data integrity and supporting decision-making through collaboration with data scientists and stakeholders.
Top Skills: DatabricksDatabricks CliDelta LakePythonRest ApiScalaSparkSQL
12 Hours Ago
Mumbai, Maharashtra, IND
Senior level
Senior level
Artificial Intelligence • Automotive • Computer Vision • Information Technology • Internet of Things • Logistics • Software
As a Lead Data Scientist, you will research algorithms, apply machine learning, visualize data, and improve analytics tools while collaborating within an Agile team.
Top Skills: Machine LearningPandasPython
12 Hours Ago
Mumbai, Maharashtra, IND
Senior level
Senior level
Artificial Intelligence • Automotive • Computer Vision • Information Technology • Internet of Things • Logistics • Software
The Lead Data Scientist will analyze vehicle sensor data to identify hazards, develop algorithms, and create predictive models, collaborating with engineering teams.
Top Skills: Aws Ai/MlClusteringForecastingMachine LearningPythonRegression

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account