Principal Data Engineer
Hybrid
Full Time
#Engineering
#Cybersecurity
#Cloud
#AWS
#Azure
#ETL
#Apache Spark
#Kafka
#Python
#SQL
#Data Governance
#CI CD
#Apache Airflow
At Onapsis, we are dedicated to securing the world's most critical business applications. Many organizations leave their core systems vulnerable, but we bridge that gap by providing specialized cybersecurity solutions for platforms like SAP and Oracle. Our work impacts nearly 30% of the Forbes Global 100, helping them navigate complex threat landscapes across on-premises, cloud, and hybrid environments. We are looking for a Principal Data Engineer to join our mission-driven team and help us build the next generation of our data infrastructure.
Key outcomes
- Architect and maintain highly scalable ETL and ELT pipelines using cloud-native tools across AWS and Azure environments.
- Develop and optimize data processing frameworks using technologies like Apache Spark and Kafka to handle large-scale datasets.
- Collaborate with cross-functional teams to integrate AI and machine learning features, such as text summarization and automated response tools, into our platform.
- Ensure the integrity and security of our data by implementing robust governance frameworks and maintaining compliance with standards like SOX and SOC 1/2.
- Lead large-scale database migrations from legacy on-premises systems to modern cloud data warehouses to improve performance and reduce technical debt.
- Provide technical leadership and mentorship to junior engineers while fostering a culture of best practices in data management and CI/CD workflows.
- Monitor and troubleshoot data infrastructure to ensure operational efficiency, utilizing tools like Apache Airflow for workflow orchestration.
Requirements
- At least 5 years of professional experience in data engineering, with a strong focus on cloud-based ETL/ELT architecture.
- Advanced proficiency in Python and SQL, including complex data modeling techniques.
- Deep expertise in cloud services, specifically within the AWS or Azure ecosystems.
- Practical experience with big data technologies, including Apache Spark, Kafka, and Databricks.
- Proven ability to architect and manage enterprise-grade data warehouses like Snowflake or Redshift.
- Familiarity with CI/CD tools such as Docker, Jenkins, or GitHub Actions.
- A strong understanding of data security, compliance requirements, and governance frameworks.
- Excellent problem-solving skills and the ability to communicate technical solutions to diverse stakeholders.
- Fluency in English.
Preferred qualifications
- Experience with advanced architectural principles, such as medallion architecture and materialized views.
- A successful track record of executing large-scale cloud migrations and optimizing query performance.
- Experience with business intelligence tools like Tableau or Power BI to drive data-informed decision-making.
- Prior experience in a leadership or mentorship capacity within an engineering team.
Compensation
We offer a hybrid work environment that provides flexibility for our team members based in the United States.
How to apply
If you are passionate about cybersecurity and building scalable data platforms, we would love to hear from you. Please submit your application through our official careers portal to begin the conversation about how you can contribute to our mission.





