Deep Learning Performance Engineer
170k - 237k USD
On-site
Full Time
#Engineering
#Machine Learning
#Artificial Intelligence
#CUDA
#Deep Learning
#PyTorch
#Systems
#Networking
#TensorFlow
#Learning
#Ray
At Anyscale, we are on a mission to democratize distributed computing, making it accessible to developers regardless of their technical background. We are the team behind Ray, the open-source project powering scalable machine learning for industry leaders like OpenAI, Uber, Spotify, and Cruise. By building the premier platform to run Ray, we enable data scientists and developers to scale their applications from a single laptop to massive clusters with ease. With over 250 million dollars in funding from partners like Andreessen Horowitz, NEA, and Addition, we are growing rapidly and looking for passionate individuals to help us shape the future of AI infrastructure.
The role
We are seeking a Senior Deep Learning Performance Engineer to join our team on a full-time, on-site basis in the United States. You will play a critical role in our mission by developing systems and optimizations that push the boundaries of performance for cutting-edge machine learning models. This position is essential to maintaining our market-leading performance and cost-efficiency, and you will be expected to work from our office three days per week.
Core responsibilities
- Collaborate rapidly with our product teams to deliver the latest performance optimizations to the Anyscale platform, Anyscale Endpoints, and our open-source offerings.
- Partner closely with research teams on advanced LLM engines such as vLLM and TensorRT-LLM.
- Stay at the forefront of the research community and open-source landscape, implementing and extending best practices to keep our technology ahead of the curve.
Skills and experience
To be successful in this role, you should have a strong technical foundation and a passion for high-performance computing. We are looking for the following qualifications:
- Proven experience working with GPUs and CUDA.
- A solid grasp of operating systems or networking fundamentals, including practical experience with system-level optimizations.
- Familiarity with deep learning frameworks, particularly PyTorch.
- Bonus points if you possess knowledge of ML systems, experience training deep learning models, or have contributed to frameworks like TensorFlow, compilers like Triton, TVM, or MLIR, and the Ray ecosystem.
Compensation and benefits
We believe in a transparent, data-driven, and consistent approach to compensation. The target salary for this position ranges from $170,112 to $237,000 USD, which may be adjusted based on evolving market data. In addition to your salary, you will be eligible for a comprehensive benefits package that includes:
- Stock options.
- 401k retirement plan.
- Healthcare plans with 99% of premiums covered.
- Wellness and education stipends.
- Paid parental leave and flexible time off.
- Commute reimbursement and 100% of in-office meals provided.
How to apply
If you are excited about building the infrastructure that powers the next generation of AI, we would love to hear from you. Please submit your application to join our team, and we will review your background to see how your skills align with our mission.





