Independent Benchmark for In-house Legal Work

Project Snapshot

AI adoption in legal departments is accelerating, but clarity on real-world performance, risks, and accuracy remains limited. This project offers an independent, vendor-agnostic benchmark of AI tools based on real in-house legal use cases.

Following our phase 1 report dedicated to information extraction, phase 2 focuses on evaluating AI tools for contract drafting tasks in English, specifically:

  1. Basic drafting: generating basic clauses that follow widely accepted language
  2. Template-based drafting: adapting an existing contract template based on provided facts
  3. Bespoke drafting: drafting custom clauses/agreements from scratch based on unique commercial arrangements

Redlining or markup of existing text is excluded from this phase. Phase 2 focuses on the generative drafting capabilities of AI tools.

Legal AI Solutions in Scope

The following legal AI solutions are included in this phase 2 evaluation

Wordsmith AI
Wordsmith AI
GC AI
GC AI
InstaSpace
InstaSpace
SimpleDocs
SimpleDocs
Brackets AI
Brackets AI
Vecflow
Vecflow

Phase 2 Goals

  • Evaluate contract drafting performance across Legal AI solutions, general-purpose LLMs (e.g., OpenAI GPT, Alibaba Qwen, Anthropic Claude, Google Gemini), and Microsoft Copilot, using human lawyers as the reference baseline.
  • Identify the strengths and recurring failure patterns in AI-generated drafts.
  • Uncover best practices for using AI Solutions in contract drafting, grounded in evaluation findings.

Phase 2 Timeline

  • Phase 2 Launch: Week of June 23 (target)
  • Vendor and Contributor Sign-Up: Late June
  • Task Collection and Evaluation: Early July
  • Data Analysis and Report Drafting: Mid–late July
  • Phase 2 Report Publication: Late July to early August

Roles

Core Working Group (Co-Authors)

Leads task collection, evaluation, and report writing. Anna Guo, Arthur Souza Rodrigues & Mohamed Al Mamari (In-house Counsel), Marc Astbury (AI Product Expert), Sakshi Udeshi (AI Trust & Safety Expert, PhD in ML) & Gabriel Saunders (Legal Ops Expert)

Advisors

Provide expert guidance on evaluation design and quality standards. Nada Alnajafi, Nate Kostelnik (Senior Contract Experts). Jason Tamara Widjaja (Executive Director of AI, global healthcare company)

Contributors

350+ members from the legal tech community.