What will you delegate today?

From question to deliverable in minutes

Esan is an AI agent that researches the web, reads files you upload, executes code in isolated sandboxes, and connects to your apps (Gmail, Google Drive, Calendar, GitHub, Slack, Notion and more) to deliver real outcomes — not just answers.

Esan doesn't spit out loose text. It investigates, executes, and delivers a structured report you can audit step by step.

01

You ask

Plain language. Attach files, URLs, or images. No special syntax.

02

Esan investigates

Browses the web, reads documents, runs code, calls APIs. Every session runs in an isolated microVM.

03

You get the deliverable

Polished markdown with inline citations, HTML dashboards, and generated files — all downloadable.

GAIA benchmark · validation set · 165 questions

Above human level on the strictest agent benchmark

GAIA measures an agent's ability to solve real tasks: web research, running code, reading PDFs, analyzing images. Esan beats the human baseline on Level 1 and Level 2, and comes within 0.6 percentage points of it on Level 3.

GAIA Benchmark Performance: Esan multi-agent system, validation set (165 questions)

Level 1: Esan 96.2%, human baseline 94.0%, Manus 86.5% (+2.2 pp vs human)
Level 2: Esan 94.4%, human baseline 92.0%, Manus 70.1% (+2.4 pp vs human)
Level 3: Esan 86.4%, human baseline 87.0%, Manus 57.7% (−0.6 pp vs human)
Overall: Esan 93.8%, human baseline 92.0%, Manus 75.1% (+1.8 pp vs human)

Source: GAIA (Mialon et al., 2023). Manus reference per public reports. Run conducted April 2026.

Adjusted mean: 92.3%. Weighted overall: 93.8%. Twelve questions were excluded as IMPOSSIBLE_TASK_IDS (dead web pages, ground-truth bugs, anti-bot drift); the exclusions are documented in the scorecard.
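For readers who want to check the arithmetic, here is a minimal sketch of how two such averages can differ. It assumes "adjusted mean" is the unweighted mean of the three level scores (which does reproduce 92.3) and that "weighted overall" weights each level by its question count. The counts used are those of the public GAIA validation split before any exclusions. They are an assumption, since this page does not state post-exclusion counts, so the weighted result need not match 93.8 exactly. The function names are illustrative only.

```python
# Per-level accuracy (percent) as reported on this page.
esan = {"Level 1": 96.2, "Level 2": 94.4, "Level 3": 86.4}
human = {"Level 1": 94.0, "Level 2": 92.0, "Level 3": 87.0}

# ASSUMPTION: per-level question counts of the public GAIA validation split,
# before the 12 exclusions. Post-exclusion counts are not given on this page.
counts = {"Level 1": 53, "Level 2": 86, "Level 3": 26}


def macro_mean(scores):
    """Unweighted mean: every level counts equally."""
    return sum(scores.values()) / len(scores)


def weighted_mean(scores, weights):
    """Mean weighted by the number of questions in each level."""
    total = sum(weights[level] for level in scores)
    return sum(scores[level] * weights[level] for level in scores) / total


def gaps(a, b):
    """Per-level difference a - b in percentage points."""
    return {level: round(a[level] - b[level], 1) for level in a}


print(round(macro_mean(esan), 1))             # 92.3
print(round(weighted_mean(esan, counts), 1))  # 93.7 under the assumed counts
print(gaps(esan, human))
```

The weighted figure sits above the unweighted one because Level 3, the lowest-scoring level, has the fewest questions.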

What Esan already does

Each example is a single prompt. The agent decomposes, researches, and delivers.

Financial analysis

Analyze NVIDIA’s last 5 quarterly letters, compare with AMD, Intel, and Broadcom, and give me a buy/sell call.

report.md + dashboard.html

Scientific research

Find the 10 most cited papers on room-temperature superconductivity since 2024.

table.md with DOIs

Product comparison

Compare Manus, Suna, and Cognition Devin on price, features, and benchmarks.

table.md

Live data

Measure sentiment on r/wallstreetbets about TSLA today and give me the 5 most upvoted threads.

sentiment.md

Technical document

Summarize this 80-page paper and extract the 3 main formulas.

summary.md

Code + data

Run a linear regression on these CSVs and generate a chart.

analysis.py + chart.png

Try it with your first task

Your first 10 queries each day are free, with no card required. Sign in with a magic link; no password needed.