Turing

Ai Benchmark Engineer (Data Analysis) - 75113

📍 Location

cuiabá, mato grosso

⏰ Job Type

Full-time

📅 Posted

June 01, 2026

Apply Now

Job Description

Role Overview We are seeking experienced Engineers — Data Analysis to design and develop high-quality multi-agent benchmark tasks that evaluate the analytical reasoning, coordination, and execution capabilities of advanced AI systems. 
In this role, you will build realistic benchmark tasks that require AI agents to analyze large, messy, multi-source datasets, decompose work across specialist sub-agents, and arrive at specific, verifiable conclusions. These tasks may involve structured and semi-structured data such as CSVs, JSON files, logs, reports, survey results, vendor assessments, or financial and operational documents. 
Your work will help measure how effectively AI systems perform complex analytical workflows involving cross-referencing, contradiction detection, anomaly identification, and statistical reasoning across multiple data sources. 
What does day-to-day look like Design and author multi-agent benchmark tasks centered o...
    

Ai Benchmark Engineer (Data Analysis) - 75113

Job Description

Role Overview

What does day-to-day look like

Start Your Week Right!