All roles

LLM - Applied AI Research Scientist (USA & LATAM Remote)

Remote · USA Full-time New today

This a Full Remote job, the offer is available from: United States, Latin America Job Description LLM - Applied AI Research Scientist

  • Location: Remote, LATAM & USA Only
  • Start Date: Immediately
  • Availability: A minimum of 4 hours of mandatory overlap with PST (12 PM–6 PM PST)
  • ​​​Employment Type: Contractor assignment (no medical/paid leave)
  • Contract Duration: 3–6 months (expected start date: next week)
  • Rate Range: Competitve, TBD Company Overview: Based in San Francisco, California, our client is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. they supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L Role Overview: We are seeking highly skilled Applied AI Research Scientists with deep expertise in Computer Engineering and hardware-centric systems with an MS or Ph.D. in a relevant technical field to design and execute expert-level evaluation tasks that probe the limits of state-of-the-art AI systems. In this role, you will create headroom-level, rigorously verifiable evaluation questions rooted in hardware, architecture, and low-level systems reasoning. Your work will focus on exposing model limitations in areas that require deep technical correctness, precise reasoning, and graduate-level understanding of computing systems—well beyond surface-level explanations. You will work closely with a collaborative, cross-functional team and are expected to be highly detail-oriented, reliable, and committed to accuracy and quality. Roles & Responsibilities:
  • Design graduate- and research-level evaluation questions grounded in hardware and computer engineering domains.
  • Create tasks that require precise, step-by-step technical reasoning with objectively verifiable ground-truth answers.
  • Develop multimodal prompts, including accurate block diagrams, timing diagrams, microarchitecture diagrams, or circuit-level visuals when appropriate.
  • Evaluate state-of-the-art AI models on hardware- and systems-heavy reasoning tasks and perform structured side-by-side comparisons.
  • Identify and document model failure modes related to architectural correctness, performance reasoning, or low-level system behavior.
  • Provide authoritative solutions and explanations for each evaluation task.
  • Maintain detailed and accurate records of prompts, expected answers, and evaluation outcomes in shared tracking systems.
  • Collaborate with reviewers and researchers to refine evaluation qualiMS or Ph.D. in Computer Engineering, Electrical Engineering, Computer Science, or a closely related field.
  • Strong expertise in at least two of the below hardware- and systems-focused domains:
  • * Computer architecture (pipelines, memory hierarchies, cache coherence, ISA-level reasoning)
  • Hardware systems and performance analysis
  • VLSI design, digital logic, or ASIC/FPGA fundamentals
  • Embedded systems and low-level firmware
  • Operating systems (especially memory management, scheduling, and hardware–software interfaces)
  • Compilers or systems programming with hardware awareness
  • Proven experience in technical research, evaluation, or rigorous problem formulation in academic, lab, or production-oriented environments.
  • Strong programming skills (especially Python, C or C++) for analysis, verification, and evaluation workflows.
  • Excellent written communication skills and a strong attention to technical detail. Evaluation Process:
  • Round 1: Take home assessment Offline assessment to be completed and submitted for review.
  • Round 2: Delivery Interview (60 minutes) A combined technical and cultural discussion with the Delivery Team. Additional Information All your information will be kept confidential according to EEO guidelines. This offer from "Confisa International Group" has been enriched by Jobgether.com and got a 77% flex score. Apply tot his job Apply tot his job

Apply tot his job Apply To this Job

Related roles

AI Solution Architect (Insurance) - Agentic AI & Azure (Remote, TX, US)

Remote · USA Full-time

Senior Systems Engineer - Azure and AI

Remote · USA Full-time

Senior AI Systems Engineer

Remote · USA Full-time

Staff AI Engineer (Systems), Remote Job

Remote · USA Full-time

Sr Consultant - Engineering Excellence and AI Enablement

Remote · USA Full-time

Enterprise Sales Executive - New Logos

Remote · USA Full-time

Staff Platform Manager- Payments, Evaluations and Automation

Remote · USA Full-time

Sr. Regulatory Ops. Mgr. (contract)

Remote · USA Full-time

Algo Trading Quant

Remote · USA Full-time

Operations Lead, Engagement

Remote · USA Full-time

Experienced Customer Service Representative - Sports Entertainment - Hybrid Overnight Shift

Remote · USA Full-time

Staff Software Engineer

Remote · USA Full-time

(Virtual Assistant Remote Jobs) Data Entry Disney Career Job (Remote) – WFH

Remote · USA Full-time

Immediate Hiring: Dynamic Walgreens Retail Sales Associate and Pharmacy Support Specialist – Customer-Focused Role with Comprehensive Training and Growth Opportunities

Remote · USA Full-time

Senior NetSuite Engineer - Analytics Solutions

Remote · USA Full-time

Seasonal Coordinator, Customer Chat (Remote-Ohio)

Remote · USA Full-time

SBIR Principal Investigator Li-ion Battery Engineer

Remote · USA Full-time

Sr. Director, Project Services

Remote · USA Full-time

Experienced Part-Time Online Customer Service Representative – Amazon Customer Chat Support

Remote · USA Full-time

Experienced Customer Onboarding Consultant – Delivering Personalized Integration Experiences for Diverse Client Base at arenaflex

Remote · USA Full-time