High Performance Computing Engineer at Boson AI

Boson AI

High Performance Computing Engineer

United States

Hybrid

Full Time

#Engineering

#Problem Solving

#Slurm

#Ceph

#Networking

#Python

#Linux

Boson AI is looking for a High Performance Computing Engineer


Boson AI is a startup building large language tools for everyone to use. Our founders (Alex Smola and Mu Li) and a team of deep learning, optimization, NLP, AutoML, and statistics scientists and engineers are working on high-quality generative AI models for language, audio, and entertainment.
About The Role
We are looking for a Senior High Performance Computing Engineer to help us operate the GPUs, network, and filesystem in our datacenter deployment in Toronto. The ideal candidate has strong problem-solving skills and the ability to learn new tools. Experience with Slurm, MAAS, Ceph, InfiniBand, NVIDIA DeepOps, Ethernet networking, and related tools is a big plus. You should be comfortable performing some amount of hardware configuration.
You will have the opportunity to work with NVIDIA H100 and A100 GPUs, over 20 PB of storage, terabit networking, and hundreds of computers. You will be responsible for deploying and operating a broad range of infrastructure technologies and hardware systems.

A day in the life:
  • Manage large, private, high-end GPU clusters
  • Own the full lifecycle of physical systems, including deployment of new hardware, operations, triage, and troubleshooting
  • Configure and maintain network switches (Tomahawk Ethernet, Mellanox InfiniBand)
  • Configure and maintain MAAS, Ceph, Slurm and Kubernetes
  • Configure and automate on-premises Linux-based systems at scale using infrastructure-as-code practices
  • Configure and maintain the network, e.g. Layer 3 networking
  • Learn about new tools and deploy them


You might be a great fit if you have:
  • Strong background in high performance computing
  • Experience with on-premises data center operations and technologies
  • Experience in managing a large hardware cluster
  • Proficiency in at least one programming language (e.g. Python) and ability to write clean, maintainable code
  • Experience in designing, deploying, and maintaining production-grade machine learning systems at scale
  • Familiarity with GPU utilization for machine learning workloads and optimization techniques
  • Experience managing firmware and system updates, e.g. on Supermicro hardware


The ability to solve problems and to learn new techniques is key.