Principal Data Engineer at Onapsis

Onapsis

Principal Data Engineer

United States

Hybrid

Full Time

#Engineering

#Cybersecurity

#Cloud

#AWS

#Azure

#ETL

#Apache Spark

#Kafka

#Python

#SQL

#Data Governance

#CI CD

#Apache Airflow

About the job

The world’s most critical (and at-risk) business applications have been neglected for far too long. Onapsis eliminates this blind spot by providing cybersecurity solutions dedicated to business-critical applications. Whether running on premises, in the cloud, or in a hybrid environment, Onapsis helps nearly 30% of the Forbes Global 100 understand the threats and risks across their SAP and Oracle landscapes.

We are seeking a Senior Data Engineer to join our mission-driven team. This role is ideal for experienced data engineers with a proven track record in architecting scalable data pipelines, leveraging cloud technologies, and contributing to high-impact cybersecurity solutions. You will be responsible for building high-performance ETL frameworks, optimizing data platforms, and contributing directly to the enhancement of our customers' threat detection, response, and remediation capabilities.

What you will be doing, your legacy: 

You will work directly with the company's Principal Engineers to evaluate, scope, propose, and build features that fulfill business solution requirements and protect our customers. You will also work with Engineering and DevOps to deliver high-quality products and services, and closely with security and IT professionals to ensure that security best practices are followed.

Responsibilities:

  • Architect and Design Scalable Data Solutions: Design, develop, and maintain highly-scalable ETL/ELT pipelines across diverse data domains using cloud technologies like AWS (Glue, Redshift, Lambda, EMR, S3) and Azure (Data Factory, Synapse, Databricks).
  • Data Pipeline Development: Implement data models and data processing frameworks (Spark, Kafka, Snowflake) to ingest, transform, and load large datasets (100+ TB), ensuring high availability and reliability of data.
  • Advanced Data Integration: Develop solutions that integrate multiple data sources into Snowflake or similar data warehouses to enable real-time analytics and reporting across dashboards.
  • AI/ML Integration: Collaborate with cross-functional teams to co-develop AI-driven features like text summarization and chatbot functionalities using AWS Bedrock, SageMaker, or similar AI/ML technologies, reducing response times and enhancing decision-making capabilities.
  • Compliance and Security: Ensure compliance with industry standards (SOX, SOC 1/2) and security best practices by implementing data governance frameworks, monitoring data pipelines, and optimizing cloud database architectures to protect sensitive information.
  • Stakeholder Collaboration: Work closely with stakeholders, including analysts, engineers, and product managers, to understand their data needs, propose solutions, and support data-driven decision-making by delivering actionable insights.
  • Data Infrastructure Monitoring: Continuously monitor, troubleshoot, and enhance data pipelines, leveraging CI/CD tools (Docker, Jenkins, GitHub Actions) and orchestrating workflows using Apache Airflow to maintain operational efficiency.
  • Leadership and Mentorship: Provide technical leadership within the data platform organization, leading the implementation of cutting-edge cloud technologies and mentoring junior data engineers in best practices and advanced data management techniques.
  • Cloud Migration: Lead large-scale database migrations from on-premises environments (Oracle, SQL Server) to cloud-based solutions like Snowflake and AWS, improving query performance and reducing technical debt.
  • Documentation and Governance: Establish comprehensive documentation for data architecture, governance, and processes to ensure scalability, compliance, and security.

Qualifications:

  • 5+ years of proven experience as a Data Engineer or in a similar role with a deep understanding of data architecture and cloud-based ETL/ELT frameworks.
  • Strong experience with AWS and/or Azure cloud services, particularly with Glue, Redshift, Lambda, Step Functions, Databricks, Synapse, and Snowflake.
  • Proficiency in big data technologies such as Apache Spark, Kafka, Hadoop, and Databricks for distributed data processing.
  • Strong programming skills in Python and SQL, with experience in advanced data modeling (star and snowflake schemas) and partitioning techniques.
  • Hands-on experience in building real-time data processing and AI/ML-driven analytics solutions (SageMaker, Bedrock, NLP, Power BI).
  • Proven ability to architect and manage data warehouse solutions (e.g., Snowflake, Redshift) for enterprise-grade performance and reliability.
  • Familiarity with compliance and audit requirements (SOX, SOC 1/2, GDPR) and implementing data governance and security frameworks.
  • Strong problem-solving skills with a focus on data integrity, scalability, and performance optimization.
  • Experience with CI/CD tools (Jenkins, GitHub Actions, Docker) and data orchestration platforms (Apache Airflow).

Preferred Qualifications:

  • Experience with advanced data architecture principles (medallion architecture, materialized views, task scheduling).
  • Proven track record of successful cloud migrations for large datasets and optimizing query performance in Snowflake or similar platforms.
  • Familiarity with real-time analytics using Tableau, Power BI, and other BI tools to drive decision-making and reduce reporting lag.
  • Leadership experience, including mentoring junior engineers and leading technical projects.

Location: Dallas, TX, US. This is a hybrid role. 

About Onapsis:

Onapsis protects the business applications that run the global economy. The Onapsis Platform delivers vulnerability management, change assurance, and continuous compliance for business applications from leading vendors such as SAP, Oracle, and others. The Onapsis Platform is powered by the Onapsis Research Labs, the team responsible for the discovery and mitigation of more than 1,000 zero-day vulnerabilities in business applications.

Onapsis is headquartered in Boston, MA, with offices in Heidelberg, Germany and Buenos Aires, Argentina, and proudly serves hundreds of the world’s leading brands, including close to 30% of the Forbes Global 100, six of the top 10 automotive companies, five of the top 10 chemical companies, four of the top 10 technology companies, and three of the top 10 oil and gas companies.

For more information, connect with Onapsis on LinkedIn or visit https://www.onapsis.com.

Company Size

251-500

Markets

Security
Cloud Data Services
