Lifted, an Upwork Company

Copy of Senior Python Developer (AI Evaluation & Benchmarking)

📍 Location
São Paulo, State of São Paulo
⏰ Job Type
Full-time
📅 Posted
July 02, 2026
Apply Now

Job Description

Job Description

This opportunity is ideal for senior software engineers with strong Python expertise who enjoy writing high-quality code, reviewing technical solutions, and working on AI-related projects.

What You'll Do:

  • Design and develop coding benchmarks used to evaluate frontier AI models.
  • Analyze AI-generated code for correctness, reliability, efficiency, and edge cases.Build and maintain scalable data pipelines that support AI evaluation workflows.
  • Create structured programming scenarios to test reasoning, debugging, and code quality.
  • Work with large codebases and multi-language software environments.
  • Collaborate with teams focused on improving how AI models understand, generate, and evaluate software.
  • Write clean, maintainable, and well-tested Python code following software engineering best practices.

Qualifications

Requirements:

  • 4+ years of professional software engineering ...

Start Your Week Right!

Apply now and make every Monday exciting with Lifted, an Upwork Company

Apply for this Position