The Bigdata Engineer will design and develop ETL/Hadoop and analytics components, ensuring performance optimization and adherence to architectural standards, while managing distributed computing tasks and data ingestion workflows.
This is a remote position.
Responsibilities:
- As a developer, possess excellent Knowledge of distributed computing architecture, core hadoop component (HDFS, Spark, Yarn, Map-Reduce, H base, HIVE, Impala) and related technologies.
- Technical Design and development of ETL/Hadoop and Analytics services /components
- Contribute in end to end architecture and process flow
- Understand Business requirement and publish reusable designs
- Result oriented approach with ability to provide apt solutions.
- Proficient in performance improvement & fine-tuning ETL and Hadoop implementations
- Conduct code reviews across projects. Takes responsibility for ensuring that build and code adhere to architectural and quality standards and policies.
- Can work independently with minimum supervision.
- Strong analytical and problem solving skills
- Experience/Exposure to SQL, advanced SQL skills
Requirements
Skills Set:
- Strong understanding of distributed computing architecture, core hadoop component (HDFS, Spark, Yarn, Map-Reuduce, H base, HIVE, Impala) and related technologies.
- Hands on experience with batch data ingestion (Sqoop)
- Expert level understanding of relational data structure and RDBMS as well as NoSQL databases (Cassandra, MongoDB, Elasticsearch)
- Experience with automation/Scheduling of workflows/jobs (via shell-scripting, Tivoli)
- Solid Grasp of data storage formats (Parquet, Avro, HBase, Cassandra)
- Understanding of Agile methodologies as well as SDLC life-cycles and processes.
- Strong Understanding of Data warehousing and lakes
Similar Jobs
Cloud • Information Technology • Security • Software
Lead quality efforts for major product areas, define test strategies, write and maintain complex automated tests, enable developers to shift testing left, participate in incident reviews and root-cause analysis, mentor junior QEs, and ensure releases meet reliability, performance, and security standards while partnering with architects and development leads.
Top Skills:
Ci/CdJavaScriptPlaywrightPytestPythonTypescript
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Lead clinical trial disclosure strategy and operations for Pfizer-sponsored interventional trials. Ensure timely, compliant posting of protocols, SAPs, CSRs, and clinical summaries per EMA Policy 70 and other regulations. Manage vendors, develop processes and technical solutions, represent Medical Writing on governance committees, and maintain regulatory knowledge and best practices to drive quality and consistency across disclosures.
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Own and roadmap fleet management product experiences for dispatchers, operators, and drivers. Translate customer pain into web and mobile workflows, define metrics, run betas, work with data scientists to build AI features from sensor data, and collaborate cross-functionally while gathering field feedback.
Top Skills:
AIAutomotive SystemsIndustrial IotSQLTelematics
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.



