Helix is the leading population genomics and viral surveillance company.
Consumer Genetics • Next Generation Sequencing (NGS) • Bioinformatics • Consumer Insights • Big Data
July 31
🏡 Remote – Anywhere in California
Airflow
Amazon Redshift
AWS
Cloud
Distributed Systems
ETL
Java
JavaScript
Node.js
Python
React
Scala
Spark
Terraform
TypeScript
Vue.js
Go
Helix is the leading population genomics and viral surveillance company.
Consumer Genetics • Next Generation Sequencing (NGS) • Bioinformatics • Consumer Insights • Big Data
• The Helix Data Engineering team plays a pivotal role in Helix’s efforts to provide a first-in-class clinicogenomics research dataset that serves our internal research team, provides operational insights back to health systems, and is a valuable asset in our growing Life Science business. • Working closely with Research, Bioinformatics, and other Engineering teams, we are responsible for maintaining infrastructure that enables secure analysis of this quickly growing dataset. • The patient is top of mind in everything we do, and your contributions here have the opportunity to improve the real world outcomes for everyone. • Maintain and evolve infrastructure that allows scientists to process and analyze Helix-produced clinicogenomics datasets. • Drive data infrastructure and data management strategy for clinical and genomic data, contributing to platform components and pipelines that increase the value and usability of these key assets. • Collaborate and work well cross functionally with product managers, bioinformaticians, scientists, other engineers, and business leaders. • Establish and maintain strong engineering best practices. • Own systems and services from development to production. • Mentor other team members to reinforce a culture of learning and teaching.
• Reside in the US, Canada, Mexico, Chile or Colombia • Bachelor's/Master's degree in Computer Science, Bioinformatics, Engineering, Mathematics, or a related field with 7+ years of experience; or PhD with 2+ years of experience • Proven experience in data engineering • Proficiency in Python, Go, Java, Scala, or similar • Proficiency with distributed systems built on cloud infrastructure — AWS or similar • Experience with infrastructure-as-code tooling/frameworks (e.g., Terraform, Cloudformation, AWS CDK) • Experience with authentication protocols such as OAuth, OIDC, SAML, and JWT • Proficiency in managing Identity and Access Management (IAM) configurations • Expertise with distributed compute frameworks such as Spark, Dask, EMR, Databricks, or similar • Expertise with ETL pipeline automation and workflow management tools such as Airflow, AWS Glue, AWS Step Functions, and CI/CD • Familiarity with database design, data manipulation, and data quality techniques • Adaptable in a fast-paced startup environment where priorities may change quickly and frequently • Demonstrated willingness to learn new domains (e.g. genomics, healthcare) and associated technologies
• Comprehensive Health Insurance with Date of Hire eligibility • Above average employer paid premium coverage • 12 weeks Helix Paid Parental Leave option • 401(k) with employer matching of up to 3% and 100% Vesting on the Date of Hire • Comprehensive Well-Being Benefits • 18 well-being programs covering financial, legal and wellness solutions • Flexible PTO • Remote options for many roles and a home office stipend
Apply Now