Pricing

Self-service probing is free forever, and your key lives only in your local environment variables — never uploaded. There are only two things we charge for: ① you don't want to burn your own tokens / set up the environment, so we run the expensive jobs on our own infra (128K long context, cross-checks against the official API, model-identity fingerprinting); ② you want reports privately hosted, branded, re-tested on a schedule, with change alerts. Benchmark dimensions only — we never touch your business data.

TiersFrom free self-testing to enterprise hosting, billed by benchmark usage

Free · Forever
Self-Service Probing
¥0
  • Run gwbench compare / longcontext locally, unlimited
  • Use your own key — it never leaves your machine
  • Portable reports: report.html / report.json
  • Browse all public reports in the Reports gallery
  • Share your reports publicly to the gallery (via PR)
Start testing now →
Most Popular
Managed Probing
Per-run / metered
  • We run it on our own infra + keys, so you don't burn tokens
  • All the expensive jobs included: 128K long-context needle, cross-checks against the official API, model-identity fingerprinting
  • Multi-region probes (including a domestic direct-connect view)
  • One-click reports, no environment setup
  • Publish results to the gallery or keep them private
Join the waitlist →
Subscription
Pro Report Hosting
Monthly
  • Private reports with your branding / watermark
  • Scheduled re-tests (daily / weekly), trend retention
  • Alerts on substitution / truncation / rate-limiting changes
  • Team sharing and permissions
  • Export PDF / share short links
Join the waitlist →
Enterprise / Developer
Unified Benchmark Client
License / usage
  • Wraps each vendor's SDK — one interface to benchmark many
  • Friendly for domestic users: integrate once, run across mainstream gateways
  • Embed the gwbench probe into your own CI
  • Private deployment and custom metrics
  • SLA and technical support
Contact sales →

What you pay for is us running the probes for you + hosting reports, not locking up the open-source capabilities — self-service probing and public reports are free forever and reproducible. Billing and payment channels are coming soon.

Join the Waitlist / ContactBefore payment channels go live, register your needs for priority access

Want managed probing or Pro report hosting? Tell us now which gateways and which dimensions you want tested. We'll notify you as soon as payment and self-service ordering are available.

About This PlatformThe benchmark is the entry point, not the destination

This toolkit and reporting platform does one thing — make gateway/model benchmarking solid and open. If it helped you pick the right gateway, come see the fuller capabilities in the main project.