Lead end-to-end LLM pipeline for customer service scheduling: data prep, prompt design, RAG systems, multi-agent architectures, multi-GPU deployment, evaluation pipelines, and chatbot integration to improve model quality and decision-making.
Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 250 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.
We are seeking a highly skilled professional to join our team, focusing on advancing customer service scheduling optimization through innovative AI solutions. This role involves researching and implementing cutting-edge algorithms to enhance scheduling systems, leveraging business domain knowledge to elevate the impact of AI products. The successful candidate will develop and refine Large Language Models (LLMs) to extract actionable insights, improve business decision-making, and optimize prompt design for more accurate outputs. Additionally, the role includes creating scalable and robust LLM/RAG frameworks tailored to customer service scheduling, fostering innovation and maintaining a competitive market edge.
Responsibilities:
- Own the full LLM pipeline from data preparation to production real case usage.
- Design, iterate and optimize prompts (zero-/few-shot, chain-of-thought, tool-calling, etc.) to maximize model utility and safety across products and languages.
- Build and maintain Retrieval-Augmented Generation (RAG) QA/search systems that connect to multi-source knowledge bases.
- Familiar with vLLM/SGLang inference architectures and have proven experience deploying and operating LLM services on multi‑GPU or cluster environments.
- Design, implement and operate multi‑agent LLM architectures (e.g. LangGraph, CrewAI, AutoGen) including task decomposition, agent orchestration, memory sharing and tool‑calling workflows.
- Develop evaluation pipelines (automatic metrics & human feedback) to measure prompt and model quality, bias, and hallucination rates.
- Collaborate with product and CS teams to integrate AI models into conversational Chatbot in different scenarios.
- Track cutting-edge research, author tech blogs, and keep improve current architecture.
Qualifications:
- Master’s degree or higher in Computer Science, Data Science or related field..
- 2+ years of deep-learning/NLP experience, including 1+ year practical LLM work (SFT, DPO, RAG, quantization, inference optimization, etc.).
- Demonstrated prompt engineering & tuning expertise (few-shot design, structured prompting, prefix-/p-tuning, reward re-ranking, safety filtering).
- Practical experience building and deploying multi‑agent LLM workflows, with understanding of agent‑orchestrator patterns, shared memory, long‑horizon planning and guard‑rail design.
- Clean coding practices, good English communication skills, and a passion for rapid learning.
- Excellent self-driven and ownership with good deliverables.
- Eager to learn, be curious about AI new technologies
- Good communication and collaboration skills
Similar Jobs
Other
The Senior Account Manager will manage and grow key wholesale telecom accounts in Asia, ME, and Africa, focusing on margin growth, strategic partnerships, and performance optimization through effective negotiation and market analysis.
Artificial Intelligence • Marketing Tech • Sales • Software
Lead and scale the engineering organization for a regulated digital-asset custody platform. Own hiring, org design, engineering operations (on-call, incidents, release hygiene), delivery against roadmaps, audit and regulator readiness (SOC 2, SAMA, ISO 27001), and cross-functional alignment. Amplify an existing small, specialized team to meet regulatory and institutional requirements while preserving culture and retention.
Top Skills:
Aurora PostgresAWSBitcoinClickhouseEthereumGithub ActionsGoHsmKubernetesMpc/TssRustSolanaTemporalTerraformThreshold Signing ProtocolsTypescript
Artificial Intelligence • Marketing Tech • Sales • Software
Own the cryptographic core: select and evolve MPC/TSS schemes, author protocol reviews, implement and review Rust/Go production crypto, lead audit responses, prepare incident fixes, publish and represent externally, and teach/mentor engineers while scaling the crypto function.
Top Skills:
Bls ThresholdCggmp21Dkls23FrostGg18Gg20GoLattice-Based ThresholdMpcMulti-Party-EcdsaRaccoonRustSparkleTss
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.


