ML Ops Engineer at Radical AI

Radical AI logo
Radical AI

ML Ops Engineer

us flag
United States

175k - 275k USD

On-site

Full Time

#Engineering

#AI

#Kubernetes

#Terraform

#Cloud Services

#Distributed Training

#VPC

#Data

#Learning

#DevOps

#Parallel Computing

Radical AI is looking for a ML Ops Engineer

Sign up to unlock quick summaries and profile fit assessments

Radical AI, Inc. is an artificial intelligence company that is accelerating scientific research & development. We are at the forefront of innovation in the field of materials R&D, a critical driver for advancing our most cutting-edge industries and shaping the future. Breaking away from the traditionally slow and costly R&D process, Radical AI leverages artificial intelligence and machine learning to pioneer generative materials science. This innovative field blends AI, engineering, and materials science, revolutionizing how materials are created and discovered. Radical AI's approach speeds up R&D and addresses global challenges, setting new benchmarks in technology and sustainability.

The opportunity

As an ML Ops Engineer, you’ll be joining our AI Research and Development team. Reporting to the Vice President of Research, this role involves playing a key role in developing our ML and data platform from the ground up by designing, implementing, and maintaining scalable ML infrastructure and pipelines to support the development, training, and deployment of machine learning models for materials research applications. The successful candidate will play a crucial role in advancing our AI capabilities and contributing to groundbreaking projects in materials science.

Mission

  • Deploy and manage advanced machine learning models, with a focus on generative models for materials discovery by employing Kubernetes, Terraform, and cloud services (Lambda) to deploy and scale models efficiently, ensuring their adaptability to high-demand scenarios.
  • Optimize computing infrastructure by focusing on enhancing GPU utilization, distributed training, bandwidth efficiency between machines, and VPC connections to maximize system performance.
  • Work closely with the AI research team and cross-functional teams, including engineering, to ensure effective model deployment and integration into production systems.
  • Stay abreast of the latest developments in machine learning and data infrastructure, applying new techniques and methodologies to ongoing projects.
  • Handle large datasets, perform data preprocessing, and extract meaningful insights relevant to materials science.
  • Run, monitor and maintain business-critical systems.
  • Conduct rigorous testing and validation of machine learning models and data pipelines to ensure accuracy, efficiency, and scalability.
  • Maintain comprehensive documentation of models, pipelines, algorithms, and experiments.
  • Troubleshoot and optimize machine learning models and data infrastructure, addressing technical challenges and improving overall performance.
  • Promote engineering best practices throughout the team.
  • Ensure adherence to ethical AI standards and best practices in all aspects of work.

About you

  • Solid experience with DevOps, cloud infrastructure, and deploying machine learning models. Expertise in network optimization and parallel computing is crucial.
  • Experience with Kubernetes, Terraform, and cloud computing platforms for scalable AI model deployment.
  • The ability to navigate complex challenges, strategically manage resources, and improve system efficiency.
  • Basic ML knowledge, with experience in training generative models at scale.
  • Experience working with and scaling model training across GPU clusters.
  • Experience in building data pipelines and managing data infrastructure.
  • Excellent written and verbal communication skills, with the ability to clearly convey complex technical information.
  • Ability to work effectively in a collaborative team environment.

Pluses

  • Master’s or PhD in Computer Science, AI, Data Science, or related field.
  • Experience with Lambda cloud.
  • Experience integrating RAG technologies.

Compensation

$175K – $275K + Equity + Benefits; base pay offered may vary depending on job-related knowledge, skills, and experience.

What we offer

A competitive compensation package also includes the best in benefits:

  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • Unlimited PTO and 14+ company holidays per year
  • 401K 
  • Work closely with a team on the cutting edge of AI research.
  • A mission: an opportunity to fundamentally change the way humanity makes progress through materials science discovery.

Radical AI is committed to equal employment opportunity regardless of race, color, ancestry, national origin, religion, sex, age, sexual orientation, gender identity and expression, marital status, disability, or veteran status.

Radical AI logo

Radical AI

6 views

0 applied
Visit Radical AI
Share this job
Copy Permalink
Open roles at Radical AI
Radical AI logo
Radical AI

ML/Dev Ops Engineer

us flag
United States

175k - 275k USD

On-site

Full Time

#Engineering

#Slurm

#Kubernetes

#Terraform

#Ansible

#Python

Discover similar jobs
AppXite logo
AppXite

Microsoft CSP Support Specialist

Remote

Full Time

#Technology

#Cloud Services

#Microsoft

#Licensing

#Technical Troubleshooting

#Escalation Management

#Customer Advocacy

#Documentation

#Cross Functional Collaboration

A
Advocate

Product Engineer, Tech Ops

Remote

Full Time

#Technology

#Artificial Intelligence

#TypeScript

#React

#Next.js

#Node.Js

#GraphQL

#PostgreSQL

#AWS

#Terraform

#Docker

#Python

The Browser Company logo
The Browser Company

Software Engineer, Compiler

us flag
US, CA

295k - 350k USD

Remote

Full Time

#Engineering

#Compiler

#Open Source

#Swift

#LLVM

#C++

#Windows

#Android

#Build Systems

#Tooling

#Design

Homebound logo
Homebound

Technical Lead Manager

Remote

Full Time

#Engineering

#Construction

#TypeScript

#Node

#React

#GraphQL

#PostgreSQL

#AWS

#AI

Upwave logo
Upwave

DevOps Security Contractor

us flag
United States

Remote

Contractor

#Product

#DevOps

#Security

#AWS

#Infrastructure Security

#IAM

#Incident Response

#SOC 2

#Cloud Security

Flower logo
Flower

Founding Research Engineer in the Flower Frontier Model Team

Remote

Full Time

#Engineering

#Artificial Intelligence

#PyTorch

#Jax

#Transformers

#Optimization

#Training

#Docker

#Git

#Linux

Fullscript logo
Fullscript

Cloud Security Engineer

73k - 80k USD

Remote

Full Time

#Security

#Cloud

#AWS

#Google Cloud

#Terraform

#Python

#Go

#IAM

Arize AI logo
Arize AI

AI Application Engineer

sg flag
Singapore

Remote

Full Time

#AI

#Software Engineering

#Observability

#Python

#Golang

#JavaScript

#TypeScript

#OpenTelemetry

K
Kraken.com

Senior Software Engineer - Frontend - Pro

Remote

Full Time

#Engineering

#Fintech

#React

#JavaScript

#TypeScript

#Next.js

#WebSockets

#API Design

#Testing

#UI UX

Prosper logo
Prosper

Sr. GRC Analyst

Remote

Full Time

#Technology

#Engineering

#GRC

#PCI DSS

#NIST

#SOC

#AWS

#Azure

#GCP

#Python

#BASH

#PowerShell

Versapay logo
Versapay

Principal .NET Software Engineer

Remote

Full Time

#Engineering

#Payments

#C#

#.NET

#SQL

#AWS

#Azure

#GitHub Actions

#RESTful APIs

#ISO 8583

B
Blockworks

Senior Data Engineer

160k - 200k USD

Remote

Full Time

#Engineering

#Cryptocurrency

#Python

#Go

#Rust

#TypeScript

#SQL

#Parquet

#Postgres

#Clickhouse

#Docker

#Kubernetes

#AWS

#GCP

#Airflow

#Dagster

#DBT

B
Banyan Software

AI Director

250k - 300k USD

Remote

Full Time

#Technology

#Software

#AI

#Cloud Native

#CI CD

#DevSecOps

#Microservices

#Infrastructure as Code

#AWS

#Azure

Wallarm logo
Wallarm

Senior Rust Developer

Remote

Full Time

#Engineering

#Cyber Security

#Rust

#Kubernetes

#Helm

#Terraform

#Backend Systems

#Distributed Systems

S
SecondDinner

Senior Director, Engineering

270k - 300k USD

Remote

Full Time

#Engineering

#Game Development

#Unity

#AWS

#Git

#.NET

#Technical Leadership

Adthena logo
Adthena

Senior Python Scraping / Anti-Bot Engineer

gb flag
United Kingdom

Remote

Full Time

#Technology

#Search

#Business Intelligence

#Python

#Playwright

#Selenium

#Puppeteer

#Docker

#Kubernetes

#JavaScript

#HTML

#HTTP

#TLS

CareMessage logo
CareMessage

Senior Product Manager - Data & Interoperability

Remote

Full Time

#Product Development

#Product

#Data

#FHIR

#HL7

#Electronic Health Records

#Product Management

#B2B SaaS

#Technology

Ethena Labs logo
Ethena Labs

Staff Security Engineer

Remote

Full Time

#Security

#DeFi

#Engineering

#Solidity

#EVM

#Foundry

#SAFe

Waveapps logo
Waveapps

Machine Learning Engineer II

Remote

Full Time

#AI

#Machine Learning

#AWS

#Sagemaker

#Airflow

#Terraform

#MLFlow

#Kubeflow

#Snowflake

#Databricks

#Redshift

#MLOps

Sakurafinetekeureop logo
Sakurafinetekeureop

Manager Field Service Engineer

Remote

Full Time

#Engineering

#People Management

#Coaching

#Performance Management

#Commercial Awareness

#Stakeholder Management

#Regulatory Compliance

Your dream job awaits.

Explore exciting opportunities, connect with top employers, and ignite your career.