AI/ML Infrastructure Engineer
120k - 150k USD
Remote
Full Time
#Engineering
#Linux
#Python
#BASH
#PHP
#Machine Learning
#Metal
#Automation
#Networking
At Vultr, we are on a mission to make high-performance cloud computing accessible, affordable, and easy to use for developers and businesses across the globe. As the world’s largest privately held cloud computing company, we have successfully scaled our operations without outside equity financing. We currently support over 1.5 million customers in 185 countries through our network of 32 data centers. We are looking for passionate individuals to help us continue building the future of cloud infrastructure.
The role
We are seeking a Senior AI/ML Infrastructure Engineer to join our team on a full-time, remote basis. In this position, you will play a critical role in the growth of our business by owning the setup and provisioning of our GPU-based and bare metal systems. You will be instrumental in ensuring our infrastructure remains fast, stable, and performant for our global user base.
Core responsibilities
- Develop and maintain infrastructure within both bare metal and containerized environments.
- Collaborate with our networking team to architect and support scalable GPU clusters.
- Build and maintain test automation for GPU-based products to guarantee reliable and rapid provisioning.
Skills and experience
To be successful in this role, you should possess the following qualifications:
- Hands-on experience with high-performance GPUs and related NVIDIA technologies such as NVLink, InfiniBand, and vGPU drivers.
- Deep technical expertise in automating bare metal internals, including BMC, BIOS, firmware, NICs, PCIe, and Redfish/IPMI.
- Proficiency in Linux, including package management and device driver configuration.
- Strong programming skills in Python, Bash, and PHP.
- Familiarity with machine learning software and rail optimization across various architectures.
- Experience working with commercial firmware and managing vendor relationships for hardware and software troubleshooting.
Compensation and benefits
The salary range for this position is $120,000 to $150,000. Our comprehensive benefits package includes:
- A fully remote work environment with a virtual company get-together.
- A 401(k) plan with a 100% match up to 4% and immediate vesting.
- An annual professional development reimbursement of $2,500.
- Generous paid time off, including 11 holidays, a rollover plan, and a paid day off for your birthday.
- Long-term tenure rewards, such as increased PTO after three years and a one-month sabbatical after five years.
- Financial support for your home office, including a first-year setup allowance, annual equipment stipends, and monthly internet reimbursement.
- A monthly allowance for gym memberships.
How to apply
If you are ready to make a significant impact on the future of cloud infrastructure and contribute to a mission-driven team, we encourage you to submit your application. We look forward to reviewing your experience and discussing how you can help Vultr continue to innovate.