All roles

[Remote] AI-Enabled Data Engineer

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. TechTorch is building the future of intelligent work by helping companies design, build, and deploy AI agents to automate complex workflows. The AI-Enabled Data Engineer will focus on creating scalable data pipelines, managing data quality, and integrating AI capabilities into data engineering processes.

Responsibilities

  • Design, build, and maintain scalable data pipelines and ETL/ELT workflows across cloud and on-prem environments
  • Work with Snowflake, Databricks, and Delta Lake as primary data platforms — handling ingestion, transformation, storage optimization, and access patterns
  • Model data with dbt: write modular SQL transformations, manage dependencies, enforce data contracts, and maintain documentation
  • Build and maintain semantic layers that serve consistent, governed metrics to downstream consumers
  • Design data warehouse schemas and data lake structures that balance performance, cost, and queryability
  • Implement data quality frameworks — testing, validation, alerting, and lineage — as first-class citizens in every pipeline
  • Orchestrate workflows across Airflow, Dagster/Prefect, Azure Data Factory, and Databricks Workflows — choosing the right tool for each job
  • Apply DataOps practices: CI/CD for data pipelines, environment promotion, infrastructure as code, and observability
  • Own the reliability of data products end-to-end — monitoring, alerting, incident response, and root cause analysis
  • Work across AWS and Azure cloud services (S3, Glue, ADLS, ADF, Synapse, Redshift) to design cost-effective, scalable architectures
  • Build data pipelines that feed AI systems — including RAG ingestion workflows, vector store loading, document chunking, and embedding pipelines
  • Use LLMs as active components in ETL logic: classification, entity extraction, enrichment, and data quality remediation in-flight
  • Expose data infrastructure as consumable tools for AI agents via MCP or similar agent-integration patterns
  • Use AI-paired programming (Claude Code or equivalent) as a daily productivity layer — not just autocomplete, but genuine workflow acceleration
  • Stay current on how AI tooling changes the data engineering workflow and bring those patterns back to the team

Skills

  • ETL/ELT Design
  • Data Modeling
  • Data Quality & Testing
  • Data Lineage
  • Batch & Incremental Loads
  • Snowflake
  • Databricks
  • Apache Spark / PySpark
  • Delta Lake
  • Data Warehouses
  • Data Lakes
  • Dbt Core / dbt Cloud
  • SQL (advanced)
  • Semantic Layer
  • Dimensional Modeling
  • Apache Airflow
  • Dagster / Prefect
  • Azure Data Factory
  • Databricks Workflows
  • RAG & Vector Store Pipelines
  • AI-Augmented ETL
  • MCP / Agent Data Tools
  • AI-Paired Programming
  • LLM Integration in Pipelines
  • AWS (S3, Glue, Redshift)
  • Azure (ADLS, ADF, Synapse)
  • CI/CD for Data
  • Infrastructure as Code
  • Python
  • Experience with streaming architectures: Kafka, Spark Streaming, or Flink
  • Exposure to feature stores (Feast, Tecton) or ML platform data pipelines
  • Hands-on with vector databases: Pinecone, Weaviate, Qdrant, or pgvector
  • Familiarity with data mesh or data product ownership models
  • Experience with Snowpark or Databricks AI/BI tooling
  • Building or contributing to internal data tooling, frameworks, or accelerators

Company Overview

  • TechTorch is a AI powered Tech Consulting company It was founded in 2021, and is headquartered in San Mateo, California, USA, with a workforce of 51-200 employees. Its website is https://www.techtorch.io/.
  • Company H1B Sponsorship

  • TechTorch has a track record of offering H1B sponsorships, with 4 in 2025. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Related roles

    [Remote] Account Executive - Northeast

    Remote · USA Full-time

    [Remote] Principal Data Scientist Consultant- R programmer (Remote)

    Remote · USA Full-time

    [Remote] Partner Account Executive

    Remote · USA Full-time

    [Remote] (Global) Director, Clinical Deployment & Growth - Lunit SCOPE

    Remote · USA Full-time

    [Remote] Product Manager, Document Management Platform

    Remote · USA Full-time

    [Remote] Senior Software Engineer

    Remote · USA Full-time

    [Remote] Cloud Consulting Director - Fusion Finance Solution Architect

    Remote · USA Full-time

    [Remote] Channel Account Manager - US

    Remote · USA Full-time

    [Remote] Manager, Software Engineering - Contact Center Pro

    Remote · USA Full-time

    [Remote] Training Content & Curriculum Lead (Applied Epic)

    Remote · USA Full-time

    Public Safety Liaison (San Francisco, CA)

    Remote · USA Full-time

    Remote Customer Service Representative – Pet‑Lovers Edition – Deliver Exceptional Support for arenaflex’s Online Pet Marketplace (Hollywood, FL)

    Remote · USA Full-time

    Experienced Customer Experience Consultant – Agile Applications Support & Customer Service

    Remote · USA Full-time

    [Remote] Staff Technical Program Manager, Queryable Encryption

    Remote · USA Full-time

    Sales Development Representative

    Remote · USA Full-time

    Collaborating Psychiatrist: Indiana Licensed

    Remote · USA Full-time

    Experienced Medical Data Entry Associate – Healthcare Information Management Specialist

    Remote · USA Full-time

    [Remote] Recruiting Sourcer (Contractor)

    Remote · USA Full-time

    Bilingual Temporary Customer Service Representative – Remote, 90‑Day Assignment Supporting English‑Spanish Support for arenaflex Contact Center

    Remote · USA Full-time

    Regional Gift Planner

    Remote · USA Full-time