ML Ops Engineer at Radical AI

Radical AI logo
Radical AI

ML Ops Engineer

us flag
United States

175k - 275k USD

On-site

Full Time

#Engineering

#AI

#Kubernetes

#Terraform

#Cloud Services

#Distributed Training

#VPC

#Data

#Learning

#DevOps

#Parallel Computing

Radical AI is looking for a ML Ops Engineer

Sign up to unlock quick summaries and profile fit assessments

Radical AI, Inc. is an artificial intelligence company that is accelerating scientific research & development. We are at the forefront of innovation in the field of materials R&D, a critical driver for advancing our most cutting-edge industries and shaping the future. Breaking away from the traditionally slow and costly R&D process, Radical AI leverages artificial intelligence and machine learning to pioneer generative materials science. This innovative field blends AI, engineering, and materials science, revolutionizing how materials are created and discovered. Radical AI's approach speeds up R&D and addresses global challenges, setting new benchmarks in technology and sustainability.

The opportunity

As an ML Ops Engineer, you’ll be joining our AI Research and Development team. Reporting to the Vice President of Research, this role involves playing a key role in developing our ML and data platform from the ground up by designing, implementing, and maintaining scalable ML infrastructure and pipelines to support the development, training, and deployment of machine learning models for materials research applications. The successful candidate will play a crucial role in advancing our AI capabilities and contributing to groundbreaking projects in materials science.

Mission

  • Deploy and manage advanced machine learning models, with a focus on generative models for materials discovery by employing Kubernetes, Terraform, and cloud services (Lambda) to deploy and scale models efficiently, ensuring their adaptability to high-demand scenarios.
  • Optimize computing infrastructure by focusing on enhancing GPU utilization, distributed training, bandwidth efficiency between machines, and VPC connections to maximize system performance.
  • Work closely with the AI research team and cross-functional teams, including engineering, to ensure effective model deployment and integration into production systems.
  • Stay abreast of the latest developments in machine learning and data infrastructure, applying new techniques and methodologies to ongoing projects.
  • Handle large datasets, perform data preprocessing, and extract meaningful insights relevant to materials science.
  • Run, monitor and maintain business-critical systems.
  • Conduct rigorous testing and validation of machine learning models and data pipelines to ensure accuracy, efficiency, and scalability.
  • Maintain comprehensive documentation of models, pipelines, algorithms, and experiments.
  • Troubleshoot and optimize machine learning models and data infrastructure, addressing technical challenges and improving overall performance.
  • Promote engineering best practices throughout the team.
  • Ensure adherence to ethical AI standards and best practices in all aspects of work.

About you

  • Solid experience with DevOps, cloud infrastructure, and deploying machine learning models. Expertise in network optimization and parallel computing is crucial.
  • Experience with Kubernetes, Terraform, and cloud computing platforms for scalable AI model deployment.
  • The ability to navigate complex challenges, strategically manage resources, and improve system efficiency.
  • Basic ML knowledge, with experience in training generative models at scale.
  • Experience working with and scaling model training across GPU clusters.
  • Experience in building data pipelines and managing data infrastructure.
  • Excellent written and verbal communication skills, with the ability to clearly convey complex technical information.
  • Ability to work effectively in a collaborative team environment.

Pluses

  • Master’s or PhD in Computer Science, AI, Data Science, or related field.
  • Experience with Lambda cloud.
  • Experience integrating RAG technologies.

Compensation

$175K – $275K + Equity + Benefits; base pay offered may vary depending on job-related knowledge, skills, and experience.

What we offer

A competitive compensation package also includes the best in benefits:

  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • Unlimited PTO and 14+ company holidays per year
  • 401K 
  • Work closely with a team on the cutting edge of AI research.
  • A mission: an opportunity to fundamentally change the way humanity makes progress through materials science discovery.

Radical AI is committed to equal employment opportunity regardless of race, color, ancestry, national origin, religion, sex, age, sexual orientation, gender identity and expression, marital status, disability, or veteran status.

Radical AI logo

Radical AI

6 views

0 applied
Visit Radical AI
Share this job
Copy Permalink
Open roles at Radical AI
Radical AI logo
Radical AI

ML/Dev Ops Engineer

us flag
United States

175k - 275k USD

On-site

Full Time

#Engineering

#Slurm

#Kubernetes

#Terraform

#Ansible

#Python

Discover similar jobs
W
Wrapbook

Senior Software Engineer

ca flag
Canada

Remote

Full Time

#Software Engineering

#Ruby on Rails

#Fintech

#PostgreSQL

#SQL

#Redis

#Sidekiq

#Kubernetes

#StimulusJS

#Backend Development

L
Liquibase

Customer Success Account Executive

Remote

Full Time

#Customer Success

#Account Executive

#Sales

#Technical Sales

#Pipeline Management

#Relationship Building

#DevOps

#Database Management

#HubSpot

#Revenue Forecasting

Sauce logo
Sauce

AI Operations Engineer

Remote

Full Time

#Engineering

#Operations

#OpenAI

#Node.Js

#React

#PostgreSQL

#REST API

#Cloud

P
Prolific

Application Security Lead

Remote

Full Time

#Application Security

#Engineering

#AI

#OWASP Top 10

#Code Review

#Python

#Burp suite

#SSDLC

#SAST

#DAST

#Vulnerability Management

#ISO 27001

CKSource logo
CKSource

QA Engineer

54k - 83k USD

Remote

Full Time

#QA Engineering

#Cloud Services

#Developer Tools

#JavaScript

#TypeScript

#Cypress

#Playwright

#API Testing

#Docker

#Node.Js

#AWS

#Testing

Ethena Labs logo
Ethena Labs

Head of Platform Engineering

Remote

Full Time

#Platform Engineering

#DevOps

#Cryptocurrency

#AWS

#GCP

#Terraform

#Kubernetes

#Prometheus

#Datadog

#DevSecOps

#Infrastructure as Code

Allata logo
Allata

Ascend Program - Data

Remote

Full Time

#Data

#Data Engineering

#Software Development

#Data Analysis

#AI

#Agile

#Jira

#Git

#Cloud Platforms

Tebra logo
Tebra

Security Architect

179k - 204k USD

Remote

Full Time

#Security

#Cloud Security

#Healthcare

#Cloudflare

#GCP

#Kubernetes

#Terraform

#Python

#DevSecOps

#Vertex AI

#BigQuery

#Helm

#Workato

M
Maze

Full Stack Software Engineer

Remote

Full Time

#User Research

#Product Engineering

#Full Stack

#Node.Js

#React

#PostgreSQL

#Next.js

#NestJS

#GraphQL

#TypeScript

#AWS

#Kubernetes

S
Snackpass

Software Engineer, Fullstack

Remote

Full Time

#Engineering

#Payments

#Analytics

#Tooling

#Mobile Apps

#Scalable Systems

OpenVPN logo
OpenVPN

AI Platform Engineer

140k - 150k USD

Remote

Full Time

#AI

#DevOps

#Cloud Infrastructure

#Vertex AI

#Terraform

#GCP

#Compliance

#ISO 27001

#Pipelines

#Kubernetes

U
Union

Sales Engineer

Remote

Full Time

#AI

#Sales

#Machine Learning

#MLOps

#PyTorch

#TensorFlow

#Spark

#Kubernetes

#Docker

#AWS

#Terraform

#MEDDIC

N
NewPage Solutions Inc

Python Developer

Remote

Contractor

#Technology

#Digital Health

#Continuous Delivery

#Python

#AWS Lambda

#AWS ECS

#Automated Testing

#Agile Methodologies

#Terraform

#Drupal

#PHP

#S3

#DynamoDB

D
Deepgram

Pre-Sales Solutions Engineer

Remote

Full Time

#AI

#Solutions Engineering

#Python

#JavaScript

#API Integration

#Speech Recognition

#NLP

#Cloud Platforms

#Docker

#Kubernetes

#Sales Methodologies

Volksbyte logo
Volksbyte

DevOps Engineer

Remote

Full Time

#Technology

#DevOps

#Software Development

#Pipelines

#Linux

#Ansible

#Terraform

#Apache

#Nginx

#PHP

#Node

#PostgreSQL

U
Unit4

Senior Cloud Infrastructure Engineer

pl flag
Poland

Remote

Full Time

#Cloud Infrastructure

#Engineering

#Microsoft Azure

#Infrastructure Engineering

L
Lightdash

Head of Engineering

Remote

Full Time

#Engineering Leadership

#AI

#Developer Experience

#TypeScript

#React

#Node.Js

#SQL

#Docker

#Kubernetes

#GCP

#Architecture

#Security

saas.group logo
saas.group

Applied Research Scientist

Remote

Full Time

#AI

#Research

#SQL

#Python

#Data Analysis

#Experiment Design

#Data Pipelines

#Validation

#AI Tools

#Research Methodology

P
Pinecone

Staff/Principal Product Manager, Database

Remote

Full Time

#Product Management

#AI

#Database

#SaaS Products

#Cloud Infrastructure

#Data Analysis

#User Research

#Roadmap Planning

#Collaboration

#Technical Products

Dataiku logo
Dataiku

Fullstack Software Engineer

Remote

Full Time

#Engineering

#AI

#Solutions

#Vue.Js

#React

#Angular

#Python

#fastAPI

#Flask

#RESTful API

#Data

Your dream job awaits.

Explore exciting opportunities, connect with top employers, and ignite your career.