All roles

Data Science - AI Document Understanding, Co-op

Remote · USA Full-time New today

About Ancestry: When you join Ancestry, you join a human-centered company where every person’s story is important. Ancestry®, the global leader in family history, connects everyone with their past so they can discover, preserve, and share their unique family stories. With our unparalleled collection of more than 65 billion records, over 3.5 million subscribers, and over 27 million people in our growing DNA network, customers can discover their family story and gain a new level of understanding about their lives. Over the past 40 years, we’ve built trusted relationships with millions of people who have chosen us as the platform for discovering, preserving, and sharing the most important information about themselves and their families. We are committed to our location flexible work approach, allowing you to choose to work in the nearest office, from your home, or a hybrid of both (subject to location restrictions and roles that are required to be in the office- see the full list of eligible US locations HERE). We will continue to hire and promote beyond the boundaries of our office locations, to enable broadened possibilities for employee diversity. Together, we work every day to foster a work environment that's inclusive as well as diverse, and where our people can be themselves. Every idea and perspective is valued so that our products and services reflect the global and diverse clients we serve. Ancestry encourages applications from minorities, women, the disabled, protected veterans and all other qualified applicants. Passionate about dedicating your work to enriching people’s lives? Join the curious. Ancestry is seeking an exceptional and highly motivated AI Engineer / Data Science Co-op to join our AI Applied Science Content team. You’ll play a vital role in the design and implementation of AI Native agentic systems that extract and organize text and image information from billions of historical and genealogical records, enabling customers to discover, share, and connect with their family history. The work will focus on building autonomous, multi-agent workflows capable of complex reasoning, tool use, analysis, and self-correction. You will also work closely with engineering teams to train, optimize, and deploy solutions that promote product development, customer success, and content creation across our Family History business. This is a part-time, work-study-based opportunity designed for active master's and PhD students continuing their education in the fall. What you will do: Innovate with State-of-the-Art AI: Implement cutting-edge AI solutions for key Document Understanding tasks such as OCR/HTR, transcription, Named Entity Recognition (NER), Relation Extraction (RE), Coreference Resolution, Summarization, and Knowledge Graphs working with diverse genealogical and historical collections spanning newspapers, city directories, family history books, and vital records (i.e., birth, marriage, & death records). Analyze and Optimize Multi-Modal Models: Evaluate the performance of multi-modal models in zero-shot and few-shot learning scenarios for comprehensive document understanding. Architect Agentic Systems: Design and implement multi-agent workflows using frameworks like LangChain, LangGraph, CrewAI, or AutoGen to automate complex multi-step reasoning tasks in historical document analysis. Evaluation & Observability: Establish "LLM-as-a-Judge" frameworks and use tools like Arize Phoenix, DeepEval, or RAGAS to monitor for hallucination, drift, and bias. Collaborate on Cloud Deployment: Partner closely with ML Ops and Data Science Engineers to seamlessly deploy datasets, models, and pipelines in cloud environments. Communicate Insights Effectively: Clearly and confidently present your findings, deliverables, and proposed solutions to technical and non-technical audiences, including teams, stakeholders, and executives. Who You Are: Currently pursuing an advanced degree (Master's or PhD preferred) in Computer Science, Data Science, Statistics, Mathematics, Linguistics, Engineering or related quantitative field with a strong data focus. Specialization in AI & LLMs including familiarity with foundational models such as GPT, Gemini, Qwen, Llama, Claude, etc. Experience with inference optimization, vLLM, LoRA, QLoRA, quantization, etc. Familiar with embeddings, vector databases, transformer models, with software development experience. Strong proficiency in Python and relevant tools and libraries, including transformer models, multi-modal models, and general NLP (e.g., Hugging Face Transformers, agentic frameworks andworkflows, LangChain, LangGraph, CrewAI, AgentCore). Familiarity with cloud platforms and related AI/ML services such as Google Cloud Platform, GCP, Gemini API, Vertex AI, AWS EC2, S3, SageMaker, Model Registry, and Bedrock is a plus. Additional Information: Ancestry is an Equal Opportunity Employer that makes employment decisions without regard to race, color, religious creed, national origin, ancestry, sex, pregnancy, sexual orientation, gender, gender identity, gender expression, age, mental or physical disability, medical condition, military or veteran status, citizenship, marital status, genetic information, or any other characteristic protected by applicable law. In addition, Ancestry will provide reasonable accommodations for qualified individuals with disabilities. All job offers are contingent on a background check screen that complies with applicable law. For candidates who live in San Francisco, CA, pursuant to the San Francisco Fair Chance Ordinance, Ancestry will consider for employment qualified applicants with arrest and conviction records. Ancestry is not accepting unsolicited assistance from search firms for this employment opportunity. All resumes submitted by search firms to any employee at Ancestry via-email, the Internet or in any form and/or method without a valid written search agreement in place for this position will be deemed the sole property of Ancestry. No fee will be paid in the event the candidate is hired by Ancestry as a result of the referral or through other means. Apply To This Job

Related roles

Sales Manager, Google (Corporate GTM) SADA

Remote · USA Full-time

Senior Staff, Dx Platform Enterprise Architect

Remote · USA Full-time

Specialty Services Legionella Program Manager

Remote · USA Full-time

Associate Director, Market Access Portfolio Strategy

Remote · USA Full-time

Director, Digital Marketing

Remote · USA Full-time

Coordinador (a) de Crédito y Cobro

Remote · USA Full-time

Regional Sales Manager

Remote · USA Full-time

Independent Sales Representative - Aesthetic Device Sales

Remote · USA Full-time

Inside Sales Representative

Remote · USA Full-time

eCommerce Content Manager

Remote · USA Full-time

Remote Live Chat Support Representative – Work From Home Customer Service Specialist (No Phone Calls, Entry Level)

Remote · USA Full-time

Experienced Full Stack Data Entry Specialist – Disney Data Management

Remote · USA Full-time

Experienced Data Entry Clerk – Remote Work Opportunity with arenaflex

Remote · USA Full-time

Remote Entry-Level Data Entry Specialist – Precision Data Management & Reporting Role at arenaflex (Canada)

Remote · USA Full-time

Sr. Workday HRIS Analyst (Payroll & Benefits)

Remote · USA Full-time

Customer Success Manager-Northeast

Remote · USA Full-time

Experienced Customer Support Advisor – Delivering Exceptional Service and Driving Business Growth at arenaflex

Remote · USA Full-time

Senior Integrated Project Manager

Remote · USA Full-time

Experienced Customer Service Associate – Delivering Exceptional Experiences in Largo, FL at arenaflex

Remote · USA Full-time

Senior Research Scientist Environmental Chemistry job at College of William and Mary in Gloucester Point, VA

Remote · USA Full-time