DRBench
is a first-of-its-kind benchmark designed to evaluate deep research agents on complex, open-ended enterprise research tasks.
It tests an agent’s ability to conduct multi-hop, insight-driven research across public and private data sources, just like a real enterprise analyst.
🔎 Real Deep Research Tasks
Not simple fact lookups: tasks such as "What changes should we make to our product roadmap to ensure compliance?" require multi-step reasoning, synthesis, and reporting.
🏢 Enterprise Context Grounding
Each task is rooted in a realistic user persona (e.g., Product Developer) and organizational setting (e.g., ServiceNow), requiring deep understanding and contextual awareness.
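To make the task format concrete, here is a purely illustrative Python sketch of what such a task specification could look like. The `DRTask` fields and example values are assumptions based on the description above, not the released DRBench schema.

```python
from dataclasses import dataclass, field


@dataclass
class DRTask:
    """Illustrative task record; the real DRBench schema is not yet public."""
    question: str                 # open-ended deep research question
    persona: str                  # user persona the agent acts on behalf of
    organization: str             # enterprise context the task is grounded in
    data_sources: list[str] = field(default_factory=list)  # where evidence may live


# Hypothetical example mirroring the compliance task described above.
example_task = DRTask(
    question="What changes should we make to our product roadmap to ensure compliance?",
    persona="Product Developer",
    organization="ServiceNow",
    data_sources=["chat", "cloud_files", "spreadsheets", "pdfs", "web", "email"],
)

if __name__ == "__main__":
    print(example_task)
```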
🧩 Multi-Modal, Multi-Source Reasoning
Agents must search, retrieve, and reason across all of the following (a minimal interface sketch follows the list):
- Internal chat logs 💬
- Cloud file systems 📂
- Spreadsheets 📊
- PDFs 📄
- Websites 🌐
- Emails 📧
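As an illustration of what reasoning over such a heterogeneous stack might involve, here is a minimal Python sketch of a unified source interface. The names (`Source`, `ChatLogSource`, `gather_evidence`) are assumptions for illustration only, not part of the DRBench release.

```python
from abc import ABC, abstractmethod


class Source(ABC):
    """One searchable enterprise source (chat, files, email, web, ...).

    Hypothetical interface: DRBench's actual connector API is not yet released.
    """

    name: str

    @abstractmethod
    def search(self, query: str) -> list[str]:
        """Return text snippets relevant to the query."""


class ChatLogSource(Source):
    name = "chat"

    def __init__(self, messages: list[str]):
        self.messages = messages

    def search(self, query: str) -> list[str]:
        # Naive keyword match as a stand-in for real retrieval.
        return [m for m in self.messages if query.lower() in m.lower()]


def gather_evidence(sources: list[Source], query: str) -> dict[str, list[str]]:
    """Query every source and keep provenance so findings can be cited later."""
    return {src.name: src.search(query) for src in sources}


# Usage with one toy source; email, PDF, web, etc. would follow the same interface.
logs = ChatLogSource(["Legal flagged GDPR gaps in the roadmap", "Lunch at noon?"])
print(gather_evidence([logs], "roadmap"))
```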
🧠 Insight-Centric Evaluation
Reports are scored on whether agents extract the most critical insights and properly cite their sources.
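As a toy illustration of insight-centric scoring, the sketch below checks how many reference insights a report covers and whether the required sources are cited. The `score_report` function and its inputs are hypothetical; they are not the actual DRBench evaluation mechanism, which has not been released yet.

```python
def score_report(report: str, key_insights: list[str], required_sources: list[str]) -> dict:
    """Toy scorer: fraction of reference insights mentioned in the report, and
    fraction of required sources that are actually cited. Illustrative only."""
    covered = [ins for ins in key_insights if ins.lower() in report.lower()]
    cited = [src for src in required_sources if src in report]
    return {
        "insight_recall": len(covered) / max(len(key_insights), 1),
        "citation_recall": len(cited) / max(len(required_sources), 1),
    }


# Example: a short report checked against two reference insights and one source.
report = "GDPR consent gaps block the roadmap (see chat/legal-2024.txt)."
print(score_report(report, ["GDPR consent gaps", "data residency"], ["chat/legal-2024.txt"]))
```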
✅ The first benchmark for deep research across hybrid enterprise environments
✅ A suite of real-world tasks across enterprise use cases such as CRM
✅ A realistic simulated enterprise stack (chat, docs, email, web, etc.)
✅ A task generation framework blending web-based facts and local context
✅ A lightweight, scalable evaluation mechanism for insightfulness and citation
We’re putting the final polish on the benchmark, evaluation tools, and baseline agents.
Public release coming soon!
Interested in early access, collaboration, or feedback?
- Reach out via [email protected]
- Join our Discord channel: https://discord.gg/9rQ6HgBbkd
- Tianyi Chen – [email protected]
- Miguel Muñoz – [email protected]
- Amirhossein Abaskohi – [email protected]
- Curtis Fox – [email protected]
- Alex Drouin – [email protected]
- Issam Laradji – [email protected]