Senior Data Engineer, Platform & Pipelines

Remote · USA · Full-time

This is a fully remote job. The offer is available from: United States

About the Role

We are seeking a Senior Data Engineer to join Natera’s Therapeutics & Innovations group, which focuses on leveraging Natera’s multimodal data assets to enable therapeutic development and scientific innovation. The group works with large-scale biomedical datasets to support therapeutic development, biomarker discovery, and translational research, and is in the process of building shared data foundations to unify and scale these efforts. This role is part of a broader initiative to develop a shared, platform-level data system that spans multi-modal data ingestion, backend services, AI-enabled data access, and web interfaces. The initial focus of the role is on designing and implementing robust data ingestion and transformation pipelines, with scope expanding over time into backend APIs, data-access layers, and LLM-driven analysis tools as the platform matures.

Responsibilities

  • Architect, implement, and maintain data ingestion and transformation pipelines using modern workflow orchestration tools (e.g., Dagster)
  • Identify, catalog, and integrate internal and external data sources used across research efforts
  • Operationalize bioinformatics pipelines that support large-scale batch processing, incremental updates, and backfills within AWS
  • Normalize and structure heterogeneous data into consistent, reusable representations that support downstream analysis, modeling, and querying
  • Populate and maintain patient-centric data models in shared storage systems (e.g., graph and relational databases)
  • Collaborate with backend and AI engineers to design data-access patterns that support analytics applications and AI-driven interactions
  • Contribute to backend services and APIs that expose integrated data to internal tools and applications
  • Participate in the evolution of AI-enabled analysis workflows, including tooling that supports LLM- or agent-based interactions with data
  • Contribute to system-level design decisions around data flow, service boundaries, reliability, and scalability
  • Write clean, tested, and well-documented Python code that meets production software engineering standards
  • Debug and resolve complex data quality, pipeline, backend, and infrastructure issues in a distributed environment

Required Qualifications

  • BS in Computer Science, Bioinformatics, Computational Biology, or a related field; MS preferred
  • 4+ years of experience in production data engineering or software engineering
  • Ability to independently drive technical solutions from high-level goals, exercising judgment in system design, implementation, and tradeoff evaluation
  • Strong proficiency in Python, with experience writing maintainable, production-quality code across data and backend contexts
  • Extensive experience with software engineering fundamentals, design patterns, version control, CI/CD, Docker, and automated testing
  • Experience designing and operating workflow orchestration systems (Dagster preferred; Airflow, Prefect, or similar acceptable)
  • Experience building or contributing to backend services (e.g., FastAPI or similar frameworks)
  • Hands-on experience with AWS services commonly used in data and backend systems (e.g., S3, ECS, Batch, Lambda)
  • Experience deploying and operating large-scale data or bioinformatics pipelines in AWS, including managing throughput, cost, and operational reliability
  • Experience with relational databases (Postgres, MySQL) and/or graph databases (Neo4j), including schema and query design
  • Experience contributing to system-level architecture, including data modeling, service boundaries, and operational robustness
  • Ability to work effectively with scientists, bioinformaticians, and ML practitioners in an R&D environment

Preferred Qualifications

  • Experience integrating machine-learning inference outputs into data pipelines
  • Familiarity with LLM-based agents and associated frameworks such as LangChain
  • Familiarity with bioinformatics data formats and pipelines (e.g., FASTQ, BAM/CRAM, VCF, RNAseq, WES/WGS)
  • Experience with infrastructure as code (Terraform)
  • Experience with DNAnexus
  • Understanding of genomics, proteomics, or other omics data types and their downstream analytical use cases
  • Ability to evaluate build-vs-buy tradeoffs in fast-paced environments

The pay range is listed below, and actual compensation packages are based on a wide array of factors unique to each candidate, including but not limited to skill set, years and depth of experience, certifications, and specific office location. This may differ in other locations due to cost of labor considerations.

Remote USA: $125,000–$155,000 USD

OUR OPPORTUNITY

Natera™ is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health. Our aim is to make personalized genetic testing and diagnostics part of the standard of care to protect health and enable earlier and more targeted interventions that lead to longer, healthier lives.

The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers, and many other professionals from world-class institutions, who care deeply for our work and each other. When you join Natera, you’ll work hard and grow quickly. Working alongside the elite of the industry, you’ll be stretched and challenged, and take pride in being part of a company that is changing the landscape of genetic disease management.

WHAT WE OFFER

Competitive Benefits: Employee benefits include comprehensive medical, dental, vision, life, and disability plans for eligible employees and their dependents. Additionally, Natera employees and their immediate families receive free testing in addition to fertility care benefits. Other benefits include pregnancy and baby bonding leave, 401k benefits, commuter benefits, and much more. We also offer a generous employee referral program!

For more information, visit www.natera.com.

Natera is proud to be an Equal Opportunity Employer. We are committed to ensuring a diverse and inclusive workplace environment, and welcome people of different backgrounds, experiences, abilities, and perspectives. Inclusive collaboration benefits our employees, our community, and our patients, and is critical to our mission of changing the management of disease worldwide.

All qualified applicants are encouraged to apply, and will be considered without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, age, veteran status, disability, or any other legally protected status. We also consider qualified applicants regardless of criminal histories, consistent with applicable laws.

If you are based in California, we encourage you to read this important information for California residents. Link: https://www.natera.com/notice-of-data-collection-california-residents/

Please be advised that Natera will reach out to candidates from an @natera.com email domain ONLY. Email communications from all other domain names are not from Natera or its employees and are fraudulent. Natera does not request interviews via text messages and does not ask for personal information until a candidate has engaged with the company and has spoken to a recruiter and the hiring team. Natera takes cyber crimes seriously, and will collaborate with law enforcement authorities to prosecute any related cyber crimes.

For more information:

  • BBB announcement on job scams
  • FBI Cyber Crime resource page

This offer from Natera has been enriched by Jobgether.com and received a 72% flex score.
