Distributed LLM Inference Engineer at Anyscale

Anyscale logo
Anyscale

Distributed LLM Inference Engineer

us flag
United States

170k - 247k USD

On-site

Full Time

#Engineering

#Deep Learning

#Distributed Systems

#PyTorch

#Data

Anyscale is looking for a Distributed LLM Inference Engineer

Sign up to unlock quick summaries and profile fit assessments

At Anyscale, we are on a mission to make distributed computing accessible to every developer. We are the team behind Ray, the popular open-source framework that powers scalable machine learning for industry leaders like OpenAI, Uber, Spotify, and Cruise. With over 250 million dollars in funding from top-tier investors like Andreessen Horowitz, we are building the definitive platform to help teams scale AI applications from a single laptop to massive clusters. We want you to help us push the boundaries of what is possible in AI infrastructure.

Role at a glance

We are looking for a Senior Distributed LLM Inference Engineer to join our team on a full-time, on-site basis in the United States. In this role, you will be at the heart of our efforts to deliver market-leading performance and cost-efficiency for large-scale AI inference.

Your impact

  • Collaborate with product teams to rapidly ship end-to-end batch and online inference solutions that serve our customers at scale.
  • Optimize the full stack by integrating Ray Data and our LLM engine to ensure high performance and low costs.
  • Engage with the open-source community, specifically by integrating tools like vLLM and contributing your own improvements back to the ecosystem.

What you'll need

To be successful in this role, you should be comfortable working in English and possess the following qualifications:

  • Strong experience running machine learning inference at scale with high throughput.
  • Deep familiarity with deep learning concepts and frameworks, particularly PyTorch.
  • A solid grasp of distributed systems and the unique challenges associated with ML inference.
  • A passion for staying current with state-of-the-art research and implementing industry best practices.

Perks and compensation

We believe in a transparent, data-driven approach to compensation. The target salary for this position ranges from $170,112 to $247,000 USD. In addition to your salary, you will be eligible for a comprehensive benefits package, including:

  • Stock options.
  • Medical insurance.
  • 401k retirement plan.
  • Commuter benefits.
Anyscale logo

Anyscale

3 views

0 applied

Social Media

Visit Anyscale
Share this job
Copy Permalink
Open roles at Anyscale
Anyscale logo
Anyscale

Developer Content Engineer

us flag
United States

Hybrid

Full Time

#Engineering

#Machine Learning

#Deep Learning

#Python

#MLOps

#DevOps

Anyscale logo
Anyscale

Distributed LLM Inference Engineer

us flag
United States

170k - 247k USD

On-site

Full Time

#Engineering

#Deep Learning

#Distributed Systems

#PyTorch

#Data

Anyscale logo
Anyscale

Deep Learning Performance Engineer

us flag
United States

170k - 237k USD

On-site

Full Time

#Engineering

#Machine Learning

#Artificial Intelligence

#CUDA

#Deep Learning

#PyTorch

#Systems

#Networking

#TensorFlow

#Learning

#Ray

Anyscale logo
Anyscale

Deep Learning Performance Engineer

us flag
United States

170k - 237k USD

On-site

Full Time

#Engineering

#AI

#Machine Learning

#CUDA

#Systems

#Networking

#Deep Learning

#PyTorch

#Learning

#Ray

#TensorRT

Discover similar jobs
Gauntlet logo
Gauntlet

Infrastructure Engineer

150k - 175k USD

Remote

Full Time

#Engineering

#Infrastructure

#Blockchain

#GCP

#Kubernetes

#Terraform

#GitHub Actions

#Python

#Helm

#Dagster

#IAM

#Observability

A
Astronomer

Staff Software Engineer, Platform Infrastructure

215k - 250k USD

Remote

Full Time

#Engineering

#Infrastructure

#Go

#Kubernetes

#Distributed Systems

#AWS

#GCP

#Azure

#Cloud

M
Miter

Senior Software Engineer

Remote

Full Time

#Engineering

#Software

#React

#React Native

#Node

#Express

#MongoDB

#TypeScript

#Stripe

#API Development

T
Testlio

Principal Software Architect

Remote

Full Time

#Software

#Testing

#SaaS

#AWS

#Distributed Systems

#Event Driven Design

#Database

#CI CD

#AI

#LLM

#Frontend Frameworks

A
Arbor

Data Engineer

Remote

Full Time

#Engineering

#Analytics

#DBT

#SQL

#Snowflake

#Python

#GCP

#Fivetran

V
Vic.ai

QA Engineer

es flag
Spain

Remote

Full Time

#Engineering

#Quality Assurance

#Test Automation

#API Testing

#Testing

#Python

#JavaScript

#TypeScript

#Playwright

#Cypress

#Selenium

J
Jimdo.com

Data Engineer

Remote

Full Time

#Engineering

#Data

#SQL

#DBT

#Python

#Snowflake

#Airflow

#AWS

#Git

TheEverywhereOffice logo
TheEverywhereOffice

Full Stack Developer

Remote

Full Time

#Engineering

#PropTech

#Python

#Flask

#Django

#Laravel

#Vue

#React

R
Rad AI

Data Engineer

Remote

Full Time

#Engineering

#Healthcare

#Analytics

#Metaflow

#Spark

#AWS

#EMR

#Docker

#Kubernetes

#SQL

#NoSQL

#DynamoDB

#Elasticsearch

The Browser Company logo
The Browser Company

Software Engineer, Compiler

us flag
US, CA

295k - 350k USD

Remote

Full Time

#Engineering

#Compiler

#Open Source

#Swift

#LLVM

#C++

#Windows

#Android

#Build Systems

#Tooling

#Design

Homebound logo
Homebound

Technical Lead Manager

Remote

Full Time

#Engineering

#Construction

#TypeScript

#Node

#React

#GraphQL

#PostgreSQL

#AWS

#AI

Flower logo
Flower

Founding Research Engineer in the Flower Frontier Model Team

Remote

Full Time

#Engineering

#Artificial Intelligence

#PyTorch

#Jax

#Transformers

#Optimization

#Training

#Docker

#Git

#Linux

K
Kraken.com

Senior Software Engineer - Frontend - Pro

Remote

Full Time

#Engineering

#Fintech

#React

#JavaScript

#TypeScript

#Next.js

#WebSockets

#API Design

#Testing

#UI UX

Prosper logo
Prosper

Sr. GRC Analyst

Remote

Full Time

#Technology

#Engineering

#GRC

#PCI DSS

#NIST

#SOC

#AWS

#Azure

#GCP

#Python

#BASH

#PowerShell

Versapay logo
Versapay

Principal .NET Software Engineer

Remote

Full Time

#Engineering

#Payments

#C#

#.NET

#SQL

#AWS

#Azure

#GitHub Actions

#RESTful APIs

#ISO 8583

B
Blockworks

Senior Data Engineer

160k - 200k USD

Remote

Full Time

#Engineering

#Cryptocurrency

#Python

#Go

#Rust

#TypeScript

#SQL

#Parquet

#Postgres

#Clickhouse

#Docker

#Kubernetes

#AWS

#GCP

#Airflow

#Dagster

#DBT

Wallarm logo
Wallarm

Senior Rust Developer

Remote

Full Time

#Engineering

#Cyber Security

#Rust

#Kubernetes

#Helm

#Terraform

#Backend Systems

#Distributed Systems

S
SecondDinner

Senior Director, Engineering

270k - 300k USD

Remote

Full Time

#Engineering

#Game Development

#Unity

#AWS

#Git

#.NET

#Technical Leadership

CareMessage logo
CareMessage

Senior Product Manager - Data & Interoperability

Remote

Full Time

#Product Development

#Product

#Data

#FHIR

#HL7

#Electronic Health Records

#Product Management

#B2B SaaS

#Technology

Ethena Labs logo
Ethena Labs

Staff Security Engineer

Remote

Full Time

#Security

#DeFi

#Engineering

#Solidity

#EVM

#Foundry

#SAFe

Your dream job awaits.

Explore exciting opportunities, connect with top employers, and ignite your career.