All roles

Data Engineer (AI Enablement)

Remote · USA Full-time New today

THE JOB / Data Engineer (AI Enablement) STRATEGY / Responsible for building and operating the data foundations that power Octagon’s AI solutions and enterprise search.

  • **Our headquarters are in Stamford, CT, but the location of this position can be 100% remote for qualified candidates.

You’re a systems-minded builder who turns messy, multi-source data into reliable, searchable, and governed knowledge. Your mission is to stand up the pipelines, vector search, and metadata standards that make AI tools accurate, fast, and safe. You’ll partner closely with the Solutions Engineer (peer role) to take prototypes and ship durable infrastructure—ingestion, embeddings, indexing, and APIs—so teams can find and use what they need. You’ll report to the Director, Data Strategy and work across departments to reduce manual effort, improve data quality, and enable AI-powered workflows at scale. THE WORK YOU’LL DO

  • Data foundations: Design and operate the vector database/search layer (e.g., FAISS/pgvector/Milvus) and document-chunking/embedding pipelines that make Octagon’s content discoverable and auditable.
  • Scalable pipelines for AI/ML/LLM: Implement and maintain ELT/ETL to support downstream workflows such as data labeling, classification, and document parsing; build robust validations, lineage, and observability.
  • Retrieval APIs: Expose governed retrieval endpoints that respect permissions (ACLs), support metadata filters, and return source snippets/IDs for grounding and citations.
  • Data structuring & manipulation: Normalize, transform, and move JSON and other structured payloads cleanly through workflows to ensure reliable handoffs and automation outputs.
  • Align & collaborate: Align product peers, design, data science, engineering, and commercial teams around a unified roadmap and shared data contracts.
  • Operationalize prototypes: Take MVPs from the Solutions Engineer and productionize with CI/CD, telemetry, cost/usage guardrails, and pilot → rollout gating.
  • Reliability & security: Build monitoring (freshness, re-index SLAs, retrieval quality), secrets management, access controls, and audit logging aligned with enterprise governance.
  • Flexibility and willingness to travel and work weekends or holidays as needed. Anticipated travel level: Low (0–15%).

THE BIGGER TEAM YOU’LL JOIN Recognized as one of the “Best Places to Work in Sports”, Octagon is the global sports, entertainment, and experiential marketing arm of the Interpublic Group. We take pride in being Playmakers – finding insightful, bold ways to create play in our work, our lives, and in the world. We believe in the power of play to create big ideas and unlock potential for our clients and talent. We can put ourselves in the shoes of fans because we ARE fans – of sports, entertainment, and culture at large. This expertise allows us to continually evolve the fan experience across sports and entertainment alongside some of the biggest brands and talent in the world. The world needs play more than ever. Are you a Playmaker? WHO WE’RE LOOKING FOR

  • 3+ years (or equivalent portfolio) building data systems: data modeling, ELT/ETL, Python + SQL; experience with cloud object storage and relational databases.
  • Hands-on with embeddings and vector databases (e.g., FAISS/pgvector/Milvus) and document processing pipelines for RAG-style retrieval.
  • Scalable pipeline experience supporting AI/ML/LLM use cases (labeling, classification, doc parsing) and partnering closely with Data Science and Data Labeling teams.
  • Data structuring & manipulation expertise: cleanly normalizing and transforming JSON/Parquet/CSV payloads; designing resilient data contracts and schemas.
  • Orchestration/ops: Airflow/Prefect (or similar), CI/CD, structured logging/monitoring, cost/usage guardrails; secure secrets management.
  • Strong collaboration and communication skills; proven ability to align product/design/engineering/commercial stakeholders around a unified roadmap.

Nice-To-Haves

  • Enterprise connectors and productivity stacks (e.g., Microsoft 365/SharePoint/Teams/Graph, Copilot or Copilot Studio/Power Automate; Google Workspace; Salesforce; DAMs).
  • Experience implementing LLM inference patterns, similarity search, guardrails, and memory; familiarity with agent frameworks or custom orchestration.
  • Additional languages for systems work (e.g., C++, C#, Java, or Go).
  • Containers (Docker), GitHub Actions, IaC; lightweight internal UIs (Streamlit or R Shiny) to expose services.
  • Familiarity with marketing/media-measurement datasets and associated normalization/quality checks.

The base range for this position is $90,000 – $100,000. Where an employee or prospective employee is paid within this range will depend on, among other factors, actual ranges for current/former employees in the subject position; market considerations; budgetary considerations; tenure and standing with the company (applicable to current employees); as well as the employee’s/applicant’s background pertinent experience, and qualifications We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, sexual orientation, age, disability, gender identity, marital or veteran status, or any other protected class. Apply tot his job Apply To this Job

Related roles

Lead Engineer, Big Data (AI/Azure Data Services) - REMOTE

Remote · USA Full-time

Customer Experience (CX) Automation Engineer, Mid Level (Remote)

Remote · USA Full-time

Generative AI Automation Engineer - Remote Job - Now Hiring

Remote · USA Full-time

AI Developer - Microsoft Copilot Studio, Power Platform - REMOTE WORK - 67398

Remote · USA Full-time

Developer Advocate - AI and Open Source

Remote · USA Full-time

Platform & AI Product Manager

Remote · USA Full-time

AI Developer II

Remote · USA Full-time

Senior Product Manager – AI & Analytics

Remote · USA Full-time

Healthcare AI Engineer

Remote · USA Full-time

Copy of Technical Project Manager (Voice AI)

Remote · USA Full-time

Experienced Remote Data Entry Specialist – Detail-Oriented Professional for Accurate and Efficient Data Management

Remote · USA Full-time

Advanced Manufacturing Engineer (AME) (Remote Eligible, U.S.)

Remote · USA Full-time

Growth Marketing Manager

Remote · USA Full-time

Data Entry Operator - Accurate Data Management for Recruitment Excellence at The Elite Job, Mumbai

Remote · USA Full-time

Experienced Full Stack Customer Support Agent – Live Chat and CRM Expert

Remote · USA Full-time

Senior Data Engineer - Data Platform

Remote · USA Full-time

Experienced Chat Brand Representative – Online Gaming Community Support

Remote · USA Full-time

Multi-Line Representative - State Farm Agent Team Member

Remote · USA Full-time

Experienced Night Shift Customer Service Representative – Work From Home Opportunity at arenaflex

Remote · USA Full-time

Product Manager/Product Owner, Data Science Storm Insights- REMOTE

Remote · USA Full-time