The LLM Algorithm Engineer will advance post-training of large language models, manage context protocols, optimize distributed training, and build evaluation pipelines.
Job Responsibilities:
1. Advanced post-training of large language models (e.g. SFT, RLHF/RLAIF, continual pretraining).
2. Aligning models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Experience in distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, mixed precision, and performance tuning on vLLM or Triton clusters.
5. Build offline and live eval pipelines for alignment, factuality, grounding, and hallucinations.
Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.
Top Skills
Deepspeed
Fsdp
Lora
Python
PyTorch
Qlora
TensorFlow
Triton
Vllm
Similar Jobs
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
The Engineering Manager will lead a team of Automotive Engineers in managing fleet telematics projects, focusing on data accuracy and diagnostics across vehicle systems.
Top Skills:
Can CommunicationGitSoftware Development
Cloud • Security • Software • Cybersecurity • Automation
As a Backend Engineer, you will develop features for the Switchboard product, collaborate with stakeholders, ensure code quality, and enhance product performance while following best practices.
Top Skills:
Ci/CdDevOpsGitlabRuby On RailsVue
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
As a Staff Frontend Software Engineer, you'll build and enhance component libraries for Square's web applications, focusing on accessibility and collaboration with UI teams while mentoring other engineers.
Top Skills:
AWSCSS3ElasticsearchEsbuildHTML5JavaJavaScriptMemcachedMySQLNode.jsPostgresReactRedisTypescriptViteWebpack
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.