Data Collection Engineer at Thewire-media

Thewire-media

Data Collection Engineer

United States

Hybrid

Full Time

#Engineering

#Python

#SQL

#Data Ingestion

#ETL

#Backend Engineering

Thewire-media is looking for a Data Collection Engineer

Your Role: Data Collection Engineer
As a Data Collection Engineer, you'll play a critical role in acquiring and structuring high-value external data that powers our core products. Your work will fuel our knowledge graph of millions of entities and directly support our mission to deliver transparency and insight into complex global networks.
You’ll work closely with engineering, research, and product teams to identify new data sources, develop reliable pipelines to gather, ingest, and structure that data, and continuously improve our ability to scale and adapt. You'll have ownership over how information flows into our platform — from design and architecture to reliability and performance — and help shape the systems that underpin our next generation of features and products.

What you'll do
  • Design and implement systems to collect, extract, and normalize external data from a variety of sources.
  • Collaborate with researchers and analysts to identify new sources of valuable company data and define integration strategies.
  • Build robust, scalable pipelines that ingest structured and semi-structured data into our database.
  • Ensure high levels of accuracy, coverage, and freshness across incoming data streams.
  • Contribute to the evolution of our data platform and internal tooling.
  • Improve system reliability, observability, and performance over time.


Who you are
  • 3+ years of experience as a backend or full-stack software engineer, ideally working with data ingestion or ETL systems.
  • Intimate knowledge of how to crawl the internet at scale.
  • Strong programming skills, especially in Python.
  • Experience working with structured and unstructured data from diverse external systems.
  • Comfortable debugging complex issues involving networking, content rendering, or inconsistent source data.
  • Proficient with SQL and relational databases.
  • A clear communicator who collaborates effectively with both technical and non-technical teammates.
  • Passionate about turning raw data into meaningful insight, and eager to work on technically nuanced challenges.


Ideally you'll have
  • Familiarity with headless browser automation or techniques for collecting data from dynamic content sources.
  • Expertise in the architecture, technologies, and tools that run the modern internet, such as DNS, networking, CDNs, WAFs, proxies, and reverse proxies.
  • Experience with event-driven architecture.
  • Eagerness to incorporate new technologies and validate their usefulness using structured experiments and thorough testing.
  • Experience building health monitoring and observability tools for consumption by automated tools, engineers, and non-technical stakeholders.