Distributed LLM Training & Inference Engineer

Scale AI

Join a leading AI company to develop and enhance a machine learning framework for large language models, requiring expertise in system optimization and software engineering.

Last checked on June 14, 2026. We may earn a commission when you click through.

This role offers a unique chance to contribute to innovative AI developments, especially for those passionate about large language models.

✓ Work with a leading AI company ✓ Innovative projects in AI technology ✓ Strong emphasis on collaboration

Distributed LLM Training & Inference Engineer

Scale AI

Updated 20 days ago

Apply now →

You'll be redirected to talent.com

New York

This role offers a unique chance to contribute to innovative AI developments, especially for those passionate about large language models.

About this role

Join a leading AI company to develop and enhance a machine learning framework for large language models, requiring expertise in system optimization and software engineering.

About the Company

Scale AI is a prominent AI technology firm based in San Francisco, focused on advancing AI solutions across various industries.

Key Highlights

✓ Opportunity to work with cutting-edge AI technology
✓ Focus on optimizing machine learning frameworks
✓ Requires expertise in CUDA and PyTorch
✓ Collaborative work environment
✓ Based in New York City

💡 Honest Take: Ideal for candidates with strong system optimization skills, but may not suit those without experience in CUDA or PyTorch.

Pros

✓ Work with a leading AI company
✓ Innovative projects in AI technology
✓ Strong emphasis on collaboration
✓ Located in a vibrant tech hub

Cons

✗ No remote work options available
✗ Requires specific technical skills
✗ High-pressure environment typical of tech firms

Best For: Targeted at professionals with a solid background in software engineering and system optimization.

Watch Out: Candidates lacking experience in CUDA or PyTorch may find the technical demands challenging.

Apply for this position →

You'll be redirected to talent.com

What Customers Say

Feedback from employees highlights a collaborative culture but notes the high expectations and pressure typical in tech roles.

Expert Review

Working as an ML Systems Engineer at Scale AI means diving into the heart of distributed large language models. The role demands proficiency in technologies like CUDA and PyTorch, making it crucial for candidates to have a solid engineering background. Those without these skills may struggle to keep pace with the technical requirements.

The position is located in New York City, a thriving hub for tech professionals, offering unique networking opportunities. However, the role is not remote, which could be a limiting factor for many talented candidates seeking flexibility in their work environment.

In terms of company culture, Scale AI promotes a collaborative atmosphere, essential for innovative AI projects. This is a significant plus for team-oriented individuals who thrive in dynamic settings. The potential for high-pressure situations typical of fast-paced tech companies should also be considered.

Overall, this role presents a compelling opportunity for engineers eager to push the boundaries of AI technology. For more details, visit the official listing at Scale AI's page.