Member of Technical Staff, Data Pipeline at Boson AI

Boson AI logo
Boson AI

Member of Technical Staff, Data Pipeline

us flag
United States

On-site

Full Time

#Engineering

#Machine Learning

#Data Processing

#Python

#PyTorch

#Data Labeling

#Database Management

#Cloud Platforms

#Data Privacy

#Data Collection

Boson AI is looking for a Member of Technical Staff, Data Pipeline

Sign up to unlock quick summaries and profile fit assessments

Boson AI is an early-stage startup building large language tools for everyone to use. Our founders (Alex Smola, Mu Li), and a team of Deep Learning, Optimization, NLP, AutoML and Statistics scientists and engineers are working on high quality generative AI models for language and beyond.
We are seeking machine learning engineers to join our team full-time in our Santa Clara office. As part of your role, you will help us build pipelines of data collection, data extraction, data filtering/synthetic data generation and data analysis. This will help us build more lifelike AI models. You will work closely with other scientists and engineers to empower our next generation of large multimodal model. 

Responsibilities:
  • Design and develop data processing pipelines, including data extraction, data filtering, data labeling, etc. 
  • Implement machine learning models to improve the quality and diversity of data (especially in the data extraction stage), e.g., quality classifier, document layout model, speech transcribe model, etc.


  • You may be a good fit if you have:
  • Experience in machine learning projects in audio or text or vision, e.g., has trained machine learning models to tackle a specific problem.
  • Strong proficiency in building large-scale data processing pipelines, familiar with distributed workload (e.g., multiprocessing, Ray, Docker, Kubernetes).
  • Proficiency in at least one programming language commonly used in machine learning, such as Python and ability to write clean, maintainable code.
  • Proficiency in at least one deep learning framework, such as PyTorch.
  • PhD or Master's degree in computer science or equivalent.
  • Excellent problem-solving skills and attention to detail, especially when handling data anomalies and biases to further improve data quality.


  • Strong candidates may also have:
  • Active Github contributions are a big plus.
  • Experience in building large-scale datasets. 
  • Familiar with at least one of the following tools for data labeling (e.g., LabelStudio), data collection (e.g., VPNs, Selenium), data processing (e.g., Hadoop, Datasketch). 
  • Proficiency in database management.
  • Hands-on experience in the cloud, like AWS, Azure or GCP.
  • Multilingual which contributes to enriching the language diversity crucial for robust model training. 
  • Experience with fairness, toxicity, data privacy regulations and compliance considerations.
  • Boson AI logo

    Boson AI

    4 views

    0 applied

    Social Media

    Visit Boson AI
    Share this job
    Copy Permalink
    Open roles at Boson AI
    Boson AI logo
    Boson AI

    High Performance Computing Engineer

    us flag
    United States

    Hybrid

    Full Time

    #Engineering

    #Problem Solving

    #Slurm

    #Ceph

    #Networking

    #Python

    #Linux

    Discover similar jobs
    A
    Arbor

    Data Engineer

    Remote

    Full Time

    #Engineering

    #Analytics

    #DBT

    #SQL

    #Snowflake

    #Python

    #GCP

    #Fivetran

    V
    Vic.ai

    QA Engineer

    es flag
    Spain

    Remote

    Full Time

    #Engineering

    #Quality Assurance

    #Test Automation

    #API Testing

    #Testing

    #Python

    #JavaScript

    #TypeScript

    #Playwright

    #Cypress

    #Selenium

    J
    Jimdo.com

    Data Engineer

    Remote

    Full Time

    #Engineering

    #Data

    #SQL

    #DBT

    #Python

    #Snowflake

    #Airflow

    #AWS

    #Git

    TheEverywhereOffice logo
    TheEverywhereOffice

    Full Stack Developer

    Remote

    Full Time

    #Engineering

    #PropTech

    #Python

    #Flask

    #Django

    #Laravel

    #Vue

    #React

    R
    Rad AI

    Data Engineer

    Remote

    Full Time

    #Engineering

    #Healthcare

    #Analytics

    #Metaflow

    #Spark

    #AWS

    #EMR

    #Docker

    #Kubernetes

    #SQL

    #NoSQL

    #DynamoDB

    #Elasticsearch

    Jellyvision logo
    Jellyvision

    Senior Data Platform Engineer II

    175k - 195k USD

    Remote

    Full Time

    #Technology

    #Data Engineering

    #Apache Airflow

    #Python

    #SQL

    #Snowflake

    #Databricks

    #Terraform

    #AWS

    #Apache Spark

    #DBT

    #Kafka

    T
    Techpartnerships

    NodeJs DEV

    Remote

    Full Time

    #Engineering

    H
    Helpscout

    Sr. Product Analyst

    Remote

    Full Time

    #Business Operations

    #SaaS

    #Analytics

    #SQL

    #Mixpanel

    #Testing

    #BigQuery

    #DBT

    #Python

    #Product Analytics

    PadSplit logo
    PadSplit

    Vice President, Data & AI Strategy

    Remote

    Full Time

    #Data Science

    #Data Analytics

    #Snowflake

    #DBT

    #Machine Learning

    #Analytics

    #Data Governance

    #Strategy

    #Predictive Analytics

    A
    Advocate

    Product Engineer, Tech Ops

    Remote

    Full Time

    #Technology

    #Artificial Intelligence

    #TypeScript

    #React

    #Next.js

    #Node.Js

    #GraphQL

    #PostgreSQL

    #AWS

    #Terraform

    #Docker

    #Python

    The Browser Company logo
    The Browser Company

    Software Engineer, Compiler

    us flag
    US, CA

    295k - 350k USD

    Remote

    Full Time

    #Engineering

    #Compiler

    #Open Source

    #Swift

    #LLVM

    #C++

    #Windows

    #Android

    #Build Systems

    #Tooling

    #Design

    Homebound logo
    Homebound

    Technical Lead Manager

    Remote

    Full Time

    #Engineering

    #Construction

    #TypeScript

    #Node

    #React

    #GraphQL

    #PostgreSQL

    #AWS

    #AI

    Flower logo
    Flower

    Founding Research Engineer in the Flower Frontier Model Team

    Remote

    Full Time

    #Engineering

    #Artificial Intelligence

    #PyTorch

    #Jax

    #Transformers

    #Optimization

    #Training

    #Docker

    #Git

    #Linux

    Fullscript logo
    Fullscript

    Cloud Security Engineer

    73k - 80k USD

    Remote

    Full Time

    #Security

    #Cloud

    #AWS

    #Google Cloud

    #Terraform

    #Python

    #Go

    #IAM

    Arize AI logo
    Arize AI

    AI Application Engineer

    sg flag
    Singapore

    Remote

    Full Time

    #AI

    #Software Engineering

    #Observability

    #Python

    #Golang

    #JavaScript

    #TypeScript

    #OpenTelemetry

    K
    Kraken.com

    Senior Software Engineer - Frontend - Pro

    Remote

    Full Time

    #Engineering

    #Fintech

    #React

    #JavaScript

    #TypeScript

    #Next.js

    #WebSockets

    #API Design

    #Testing

    #UI UX

    Prosper logo
    Prosper

    Sr. GRC Analyst

    Remote

    Full Time

    #Technology

    #Engineering

    #GRC

    #PCI DSS

    #NIST

    #SOC

    #AWS

    #Azure

    #GCP

    #Python

    #BASH

    #PowerShell

    Versapay logo
    Versapay

    Principal .NET Software Engineer

    Remote

    Full Time

    #Engineering

    #Payments

    #C#

    #.NET

    #SQL

    #AWS

    #Azure

    #GitHub Actions

    #RESTful APIs

    #ISO 8583

    B
    Blockworks

    Senior Data Engineer

    160k - 200k USD

    Remote

    Full Time

    #Engineering

    #Cryptocurrency

    #Python

    #Go

    #Rust

    #TypeScript

    #SQL

    #Parquet

    #Postgres

    #Clickhouse

    #Docker

    #Kubernetes

    #AWS

    #GCP

    #Airflow

    #Dagster

    #DBT

    LetsGetChecked logo
    LetsGetChecked

    Graduate Software Engineer

    76k - 95k USD

    Remote

    Full Time

    #Technology

    #Healthcare

    #Python

    #C#

    #JavaScript

    #AWS

    #Azure

    #GCP

    #Splunk

    #Datadog

    Your dream job awaits.

    Explore exciting opportunities, connect with top employers, and ignite your career.