About Fusemachines
Fusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from underserved communities. With a robust presence in four countries and a dedicated team of over 400 full-time employees, we are committed to fostering AI transformation journeys for businesses worldwide. At Fusemachines, we not only bridge the gap between AI advancement and its global impact but also strive to deliver the most advanced technology solutions to the world.
About the Role:
Location: Remote | Full-time
We are seeking an experienced and motivated Data Scientist to spearhead our data-driven initiatives. The ideal candidate will be responsible for architecting scalable solutions, and applying advanced analytical techniques to solve complex business problems. You will be instrumental in transforming raw data into actionable insights that drive strategy and operational improvements.
Key Responsibilities
Drive the design, development, and testing of big data applications to ensure the timely delivery of product goals.
Proactively identifying and implementing code and design optimizations.
Collaborate closely with data engineers, analysts, and business teams to analyze requirements and ensure data-driven solutions are effectively implemented.
Develop end-to-end data solutions, from data collection and cleaning to building and deploying predictive machine learning models.
Conduct exploratory data analysis using statistical methods to uncover trends, patterns, and actionable insights.
Create compelling data visualizations, dashboards, and reports to communicate complex findings to both technical and non-technical stakeholders.
Provide data-backed recommendations to support key business decisions and improve strategies.
Learn and integrate with a wide variety of internal and external systems, APIs, and platforms.
Required Skills & Qualifications
Minimum of 3+ years of hands-on experience in data science or big data development.
Proven track record of successfully guiding development projects.
Expertise in Python and PySpark, including tools like Jupyter Notebooks and environment controllers (e.g., Poetry, PipEnv).
Hands-on experience with the Databricks platform and Apache Spark.
Proficiency with relational databases (e.g., PostgreSQL, SQL Server, Oracle) and SQL.
Strong practical knowledge of data cleansing, transformation, and validation techniques.
Experience with code versioning tools like Git (GitHub, Azure DevOps, Bitbucket).
Excellent written and verbal communication skills, with the ability to articulate complex technical concepts clearly.
Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws