Member of Technical Staff, Data Pipeline at Boson AI

Boson AI

Member of Technical Staff, Data Pipeline

United States

On-site

Full Time

#Engineering

#Machine Learning

#Data Processing

#Python

#PyTorch

#Data Labeling

#Database Management

#Cloud Platforms

#Data Privacy

#Data Collection

Boson AI is looking for a Member of Technical Staff, Data Pipeline

Boson AI is an early-stage startup building large language tools for everyone to use. Our founders (Alex Smola and Mu Li) and a team of deep learning, optimization, NLP, AutoML, and statistics scientists and engineers are working on high-quality generative AI models for language and beyond.
We are seeking machine learning engineers to join our team full-time in our Santa Clara office. In this role, you will help us build pipelines for data collection, data extraction, data filtering/synthetic data generation, and data analysis, which will help us build more lifelike AI models. You will work closely with other scientists and engineers to power our next generation of large multimodal models.

Responsibilities:
  • Design and develop data processing pipelines, including data extraction, data filtering, and data labeling.
  • Implement machine learning models to improve the quality and diversity of data (especially in the data extraction stage), e.g., quality classifiers, document layout models, and speech transcription models.


You may be a good fit if you have:
  • Experience with machine learning projects in audio, text, or vision, e.g., having trained machine learning models to tackle a specific problem.
  • Strong proficiency in building large-scale data processing pipelines and familiarity with distributed workloads (e.g., multiprocessing, Ray, Docker, Kubernetes).
  • Proficiency in at least one programming language commonly used in machine learning, such as Python, and the ability to write clean, maintainable code.
  • Proficiency in at least one deep learning framework, such as PyTorch.
  • A PhD or Master's degree in computer science or equivalent.
  • Excellent problem-solving skills and attention to detail, especially when handling data anomalies and biases to further improve data quality.


Strong candidates may also have:
  • Active GitHub contributions (a big plus).
  • Experience in building large-scale datasets.
  • Familiarity with at least one tool for data labeling (e.g., LabelStudio), data collection (e.g., VPNs, Selenium), or data processing (e.g., Hadoop, Datasketch).
  • Proficiency in database management.
  • Hands-on experience with a cloud platform such as AWS, Azure, or GCP.
  • Multilingual ability, which contributes to the language diversity crucial for robust model training.
  • Experience with fairness, toxicity, data privacy regulations, and compliance considerations.