Pricing · Managed Probing / Report Hosting

Tiers

From free self-testing to enterprise hosting, billed by benchmark usage

Free · Forever

Self-Service Probing

¥0

Run gwbench compare / longcontext locally, unlimited
Use your own key — it never leaves your machine
Portable reports: report.html / report.json
Browse all public reports in the Reports gallery
Share your reports publicly to the gallery (via PR)

Start testing now →

Most Popular

Managed Probing

Per-run / metered

We run the probes on our own infrastructure and keys, so you don't burn your own tokens
All the expensive jobs included: 128K long-context needle, cross-checks against the official API, model-identity fingerprinting
Multi-region probes (including a domestic direct-connect view)
One-click reports, no environment setup
Publish results to the gallery or keep them private

Join the waitlist →

Subscription

Pro Report Hosting

Monthly

Private reports with your branding / watermark
Scheduled re-tests (daily / weekly), trend retention
Alerts on substitution / truncation / rate-limiting changes
Team sharing and permissions
Export PDF / share short links

Join the waitlist →

Enterprise / Developer

Unified Benchmark Client

License / usage

Wraps each vendor's SDK — one interface to benchmark many
Friendly for domestic users: integrate once, run across mainstream gateways
Embed the gwbench probe into your own CI
Private deployment and custom metrics
SLA and technical support

Contact sales →

You pay for us to run the probes and host your reports, not for access to the open-source capabilities: self-service probing and public reports stay free forever and remain reproducible. Billing and payment channels are coming soon.

Join the Waitlist / Contact

Before payment channels go live, register your needs for priority access

Want managed probing or Pro report hosting? Tell us now which gateways and which dimensions you want tested. We'll notify you as soon as payment and self-service ordering are available.

About This Platform

The benchmark is the entry point, not the destination

This toolkit and reporting platform does one thing: make gateway and model benchmarking solid and open. If it helped you pick the right gateway, come see the fuller capabilities in the main project.

Learn about the main project →

Back to the public leaderboard