Fractal Logo

Fractal

Lead Gen AI Data Scientist - GenAI

Posted 4 Days Ago
Be an Early Applicant
5 Locations
Expert/Leader
5 Locations
Expert/Leader
The Lead Gen AI Data Scientist will design and implement solutions using Large Language Models (LLMs), conduct research on generative AI, maintain code libraries, engage in the software development lifecycle, and collaborate with cross-functional teams to enhance decision-making processes.
The summary above was generated by AI

It's fun to work in a company where people truly BELIEVE in what they are doing!

We're committed to bringing passion and customer focus to the business.

Job Description

About Fractal

What makes Fractal a GREAT fit for you? When you join Fractal, you’ll be part of a fast-growing team that helps our clients leverage AI together with the power of behavioural sciences to make better decisions. We’re a strategic analytics partner to most admired fortune 500 companies globally, we help them power every human decision in the enterprise by bringing analytics, AI and behavioural science to the decision.

Our people enjoy a collaborative work environment, exceptional training and career development — as well as unlimited growth opportunities. We have a Glassdoor rating of 4 / 5 and achieve customer NPS of 9/ 10. If you like working with a curious, supportive, high-performing team, Fractal is the place for you. close.

Responsibilities:

  • Design and implement advanced solutions utilizing Large Language Models (LLMs).
  • Demonstrate self-driven initiative by taking ownership and creating end-to-end solutions.
  • Conduct research and stay informed about the latest developments in generative AI and LLMs.
  • Develop and maintain code libraries, tools, and frameworks to support generative AI development.
  • Participate in code reviews and contribute to maintaining high code quality standards.
  • Engage in the entire software development lifecycle, from design and testing to deployment and maintenance.
  • Collaborate closely with cross-functional teams to align messaging, contribute to roadmaps, and integrate software into different repositories for core system compatibility.
  • Possess strong analytical and problem-solving skills.
  • Demonstrate excellent communication skills and the ability to work effectively in a team environment.

Primary Skills:

  • Natural Language Processing (NLP): Hands-on experience in use case classification, topic modeling, Q&A and chatbots, search, Document AI, summarization, and content generation.
  • Computer Vision and Audio: Hands-on experience in image classification, object detection, segmentation, image generation, audio, and video analysis.
  • Generative AI: Proficiency with SaaS LLMs, including Lang chain, llama index, vector databases, Prompt engineering (COT, TOT, ReAct, agents). Experience with Azure OpenAI, Google Vertex AI, AWS Bedrock for text/audio/image/video modalities.
  • Familiarity with Open-source LLMs, including tools like TensorFlow/Pytorch and huggingface. Techniques such as quantization, LLM finetuning using PEFT, RLHF, data annotation workflow, and GPU utilization.
  • Cloud: Hands-on experience with cloud platforms such as Azure, AWS, and GCP. Cloud certification is preferred.
  • Application Development: Proficiency in Python, Docker, FastAPI/Django/Flask, and Git.

Tech Skills (10+ Years’ Experience):

Machine Learning (ML) & Deep Learning:

   - Solid understanding of supervised and unsupervised learning.

   - Proficiency with deep learning architectures like Transformers, LSTMs, RNNs, etc.

2. Generative AI:

   - Hands-on experience with models such as OpenAI GPT4, Anthropic Claude, LLama etc.

   - Knowledge of fine-tuning and optimizing large language models (LLMs) for specific tasks.

3. Natural Language Processing (NLP):

   - Expertise in NLP techniques, including text preprocessing, tokenization, embeddings, and sentiment analysis.

   - Familiarity with NLP tasks such as text classification, summarization, translation, and question-answering.

4. Retrieval-Augmented Generation (RAG):

   - In-depth understanding of RAG pipelines, including knowledge retrieval techniques like dense/sparse retrieval.

   - Experience integrating generative models with external knowledge bases or databases to augment responses.

5. Data Engineering:

   - Ability to build, manage, and optimize data pipelines for feeding large-scale data into AI models.

6. Search and Retrieval Systems:

   - Experience with building or integrating search and retrieval systems, leveraging knowledge of Elasticsearch, AI Search, ChromaDB, PGVector etc.

7. Prompt Engineering:

   - Expertise in crafting, fine-tuning, and optimizing prompts to improve model output quality and ensure desired results.

   - Understanding how to guide large language models (LLMs) to achieve specific outcomes by using different prompt formats, strategies, and constraints.

   - Knowledge of techniques like few-shot, zero-shot, and one-shot prompting, as well as using system and user prompts for enhanced model performance.

8. Programming & Libraries:

   - Proficiency in Python and libraries such as PyTorch, Hugging Face, etc.

   - Knowledge of version control (Git), cloud platforms (AWS, GCP, Azure), and MLOps tools.

9. Database Management:

   - Experience working with SQL and NoSQL databases, as well as vector databases

10. APIs & Integration:

   - Ability to work with RESTful APIs and integrate generative models into applications.

11. Evaluation & Benchmarking:

   - Strong understanding of metrics and evaluation techniques for generative models.

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Not the right fit?  Let us know you're interested in a future opportunity by clicking Introduce Yourself in the top-right corner of the page or create an account to set up email alerts as new job postings become available that meet your interest!

Top Skills

Python

Similar Jobs

20 Hours Ago
Pune, Maharashtra, IND
Mid level
Mid level
eCommerce • Logistics • Software • Analytics
As a Senior Data and Applied Scientist, you will design and maintain machine learning models to optimize advertising campaigns for ecommerce. Responsibilities include analyzing data, enhancing model performance, collaborating with teams, and dedicating 20% of your time to MLOps.
Top Skills: PythonSQL
4 Days Ago
Pune, Maharashtra, IND
Mid level
Mid level
eCommerce • Logistics • Software • Analytics
As a Senior Data and Applied Scientist – NLP, you will develop solutions for processing e-commerce data using NLP. Responsibilities include designing datasets, deploying models, optimizing data pipelines, and staying updated on AI advancements while reporting progress to management.
Top Skills: Data ScienceMachine LearningNatural Language Processing
8 Hours Ago
Hybrid
Mumbai, Maharashtra, IND
Mid level
Mid level
Financial Services
As a Regulatory Reporting Analyst, you will ensure compliance with regulations, monitor controls, and resolve issues promptly, while working closely with multiple teams. Your role involves understanding OTC derivative instruments, implementing control frameworks, conducting UAT testing, and managing key projects in a dynamic environment.

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account