💎 Transparent Pricing

Pay only for what you use.
Route smarter, spend less.

No hidden fees. No vendor lock-in. NexToken automatically routes your requests to the optimal provider.

Billing period: Monthly or Annual (save 20%)
Developer
$0
Free forever. Start building immediately.
✓ $5 one-time credit · no card needed
✓ 100 RPM rate limit
✓ 3 API keys
✓ Standard routing
✓ REST API access
✗ Custom routing rules
✗ SLA guarantee
✗ Priority support
Pro
$29/mo
For indie developers and growing projects.
✓ $500 monthly credit included
✓ 1,000 RPM rate limit
✓ 20 API keys
✓ Smart routing + fallback
✓ Streaming support
✓ Usage analytics
✗ SLA guarantee
✗ Dedicated support
Enterprise
Custom
Negotiated pricing for large-scale deployments.
✓ Unlimited credits (prepaid)
✓ Custom RPM limits
✓ Dedicated routing cluster
✓ Volume discounts negotiable
✓ 99.99% uptime SLA
✓ Dedicated Slack channel
✓ Custom invoicing / PO
★ On-premise option
Platform fee + pass-through model costs. Subscription fees cover your NexToken platform access, rate limits, and features. Model usage is billed separately at cost-plus-markup rates shown in the table below. All prices in USD. Singapore users are charged 9% GST on platform fees per IRAS regulations.

Model Token Pricing

NexToken charges the provider cost plus a small routing markup. Prices are in USD per 1,000,000 tokens (1M tokens); input and output tokens are priced separately.

OpenAI
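| Model | Context | Input ($/1M) | Output ($/1M) | Markup | Capabilities |
| --- | --- | --- | --- | --- | --- |
| gpt-4o | 128k | $2.60 | $10.40 | +8% | stream, vision, tools |
| gpt-4o-mini | 128k | $0.16 | $0.65 | +8% | stream, vision, tools |
| o1 | 200k | $16.20 | $64.80 | +8% | tools |
| o3-mini | 200k | $1.13 | $4.40 | +8% | stream, tools |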
Anthropic
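| Model | Context | Input ($/1M) | Output ($/1M) | Markup | Capabilities |
| --- | --- | --- | --- | --- | --- |
| claude-opus-4 | 200k | $15.00 | $75.00 | +7% | stream, vision, tools |
| claude-sonnet-4 | 200k | $3.00 | $15.00 | +7% | stream, vision, tools |
| claude-haiku-4 | 200k | $0.80 | $4.00 | +7% | stream, vision, tools |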
Google DeepMind
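| Model | Context | Input ($/1M) | Output ($/1M) | Markup | Capabilities |
| --- | --- | --- | --- | --- | --- |
| gemini-2.5-pro | 1M | $1.25 | $10.00 | +8% | stream, vision, tools |
| gemini-2.5-flash | 1M | $0.075 | $0.30 | +8% | stream, vision, tools |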
Meta / DeepSeek / Mistral
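| Model | Context | Input ($/1M) | Output ($/1M) | Markup | Capabilities |
| --- | --- | --- | --- | --- | --- |
| llama-3.3-70b | 128k | $0.59 | $0.79 | +10% | stream, tools |
| deepseek-v3 | 64k | $0.27 | $1.10 | +10% | stream, tools |
| mistral-large-2 | 128k | $2.00 | $6.00 | +10% | stream, tools |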
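
To make the per-1M pricing concrete, here is a minimal sketch of how a single request's wallet debit could be estimated. It assumes the listed rates already include the routing markup (the page describes them as cost-plus-markup rates); the function name `request_cost` and the `RATES_PER_1M` dictionary are illustrative and not part of any NexToken API.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens, copied from the tables above as
# (input, output). Only a few models are included for brevity.
# Assumption: these rates already include the routing markup.
RATES_PER_1M = {
    "gpt-4o": (Decimal("2.60"), Decimal("10.40")),
    "claude-sonnet-4": (Decimal("3.00"), Decimal("15.00")),
    "gemini-2.5-flash": (Decimal("0.075"), Decimal("0.30")),
}

TOKENS_PER_UNIT = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    input_rate, output_rate = RATES_PER_1M[model]
    total = input_rate * input_tokens + output_rate * output_tokens
    return total / TOKENS_PER_UNIT


cost = request_cost("gpt-4o", input_tokens=12_000, output_tokens=1_500)
print(f"${cost:.4f}")  # prints $0.0468
```

Using `Decimal` rather than floats keeps sub-cent amounts exact when many small requests are summed.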

Volume Billing Tiers

The higher your monthly token spend, the lower your effective markup. Tiers reset on the 1st of each month.

| Tier | Monthly Token Spend | Markup Rate | Effective Saving | Unlocks |
| --- | --- | --- | --- | --- |
| Developer | $0 – $500 | Standard | - | 3 keys, 100 RPM |
| Pro | $500 – $5,000 | −1% | Up to $50/mo | 1,000 RPM, analytics |
| Business | $5,000 – $50,000 | −2.5% | Up to $1,250/mo | Custom routing, SLA |
| Enterprise | $50,000+ | Negotiated | Up to 15%+ | Dedicated cluster, custom terms |

* Billing tiers are independent from Loyalty tiers. Billing tiers reflect monthly spend volume; Loyalty tiers reflect cumulative top-up history.
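
The tier arithmetic can be sketched as follows. This is an illustration under stated assumptions, not the billing engine: it assumes the markup reduction is a percentage of the whole month's token spend (which matches "Up to $50/mo" at $5,000 × 1%), and that each tier's lower bound is inclusive, which the table does not specify. The names `TIERS`, `billing_tier`, and `estimated_saving` are hypothetical.

```python
from decimal import Decimal

# (tier, lower bound of monthly token spend in USD, markup reduction in %),
# ordered from highest to lowest. Enterprise reductions are negotiated,
# so they are represented as None.
TIERS = [
    ("Enterprise", Decimal("50000"), None),
    ("Business", Decimal("5000"), Decimal("2.5")),
    ("Pro", Decimal("500"), Decimal("1")),
    ("Developer", Decimal("0"), Decimal("0")),
]


def billing_tier(monthly_spend: Decimal) -> tuple:
    """Return (tier name, markup reduction in %) for a monthly spend."""
    for name, lower_bound, reduction in TIERS:
        if monthly_spend >= lower_bound:  # assumption: lower bound is inclusive
            return name, reduction
    raise ValueError("monthly spend must be non-negative")


def estimated_saving(monthly_spend: Decimal):
    """Return the estimated monthly saving in USD, or None if negotiated."""
    _name, reduction = billing_tier(monthly_spend)
    if reduction is None:
        return None
    return monthly_spend * reduction / Decimal(100)


print(billing_tier(Decimal("4000")), estimated_saving(Decimal("4000")))
```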

Loyalty Tiers

Cumulative top-up milestones unlock permanent wallet bonuses. Tiers do not reset; they track your total lifetime top-up.

| Tier | Cumulative top-up | Benefits | Top-up bonus |
| --- | --- | --- | --- |
| 🥉 Bronze | ≥ SGD 500 | Bonus credits on every top-up | +3% |
| 🥈 Silver | ≥ SGD 2,000 | Bonus credits + priority routing queue | +5% |
| 🥇 Gold | ≥ SGD 10,000 | Bonus credits + dedicated routing + SLA | +8% |
| 💎 Platinum | ≥ SGD 50,000 | Maximum bonus + enterprise SLA + custom terms | +12% |

Loyalty bonus credits are applied to your wallet at the time of top-up. Example: a Gold user who tops up SGD 1,000 receives SGD 1,080 in wallet balance (+8% bonus). Bonuses do not stack with promotional codes.
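
The bonus calculation in the example above can be sketched like this. One detail is an assumption: the tier is looked up from the cumulative total before the current top-up, which the page does not state. The names `LOYALTY_TIERS`, `loyalty_bonus_percent`, and `credited_amount` are illustrative only.

```python
from decimal import Decimal

# (tier, minimum cumulative top-up in SGD, bonus in %), highest tier first.
LOYALTY_TIERS = [
    ("Platinum", Decimal("50000"), Decimal("12")),
    ("Gold", Decimal("10000"), Decimal("8")),
    ("Silver", Decimal("2000"), Decimal("5")),
    ("Bronze", Decimal("500"), Decimal("3")),
]


def loyalty_bonus_percent(cumulative_top_up: Decimal) -> Decimal:
    """Return the bonus percentage for a lifetime top-up total."""
    for _name, minimum, percent in LOYALTY_TIERS:
        if cumulative_top_up >= minimum:
            return percent
    return Decimal("0")  # below Bronze: no bonus


def credited_amount(top_up: Decimal, cumulative_before: Decimal) -> Decimal:
    """Return the wallet credit in SGD for a top-up, including the bonus."""
    percent = loyalty_bonus_percent(cumulative_before)  # assumption: tier before this top-up
    return top_up + top_up * percent / Decimal(100)


# A Gold user (SGD 12,000 lifetime) topping up SGD 1,000:
print(credited_amount(Decimal("1000"), Decimal("12000")))  # prints 1080
```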

Add-Ons

Optional extras available on Pro and Business plans.

🔒 Extended Audit Logs
$19 / month
Retain full request/response logs for 365 days. Export to S3 or GCS. Required for SOC 2 audits.
📊 Advanced Analytics
$29 / month
Cost attribution by team, project, or custom labels. CSV export, Grafana-compatible metrics endpoint.
🚨 Spend Alerts & PagerDuty
$9 / month
Multi-channel alerts (Slack, email, SMS, PagerDuty) with configurable thresholds and escalation policies.
🌐 Dedicated IP Egress
$49 / month
Route all traffic through a static IP pool for firewall whitelisting. Required for some financial and healthcare orgs.
🤝 Shared Slack Support
$99 / month
Join a shared Slack Connect channel with the Nex engineering team. <4h response time, Mon–Fri 9–6 SGT.
⚡ Higher Rate Limits
From $49 / month
Burst capacity packs: 5k, 20k, or 50k additional RPM. Stacks on top of your base plan limit.

Full Feature Comparison

Detailed breakdown of what's included in each plan.

| Feature | Developer | Pro Business | Enterprise |
| --- | --- | --- | --- |
| Limits & Access | | | |
| Monthly free credits | $5 one-time | $500 incl. | Custom |
| Requests per minute (RPM) | 100 | 1,000 | Custom |
| API keys | 3 | 20 | Unlimited |
| Sub-keys per key | ✕ | 5 | Unlimited |
| Context window support | Up to 128k | Up to 200k | Up to 1M+ |
| Routing & Intelligence | | | |
| Smart auto-routing | ✓ | ✓ | ✓ |
| Provider fallback | ✕ | ✓ | ✓ |
| Custom routing rules | ✕ | ✕ | ✓ |
| Dedicated routing cluster | ✕ | ✕ | ✓ |
| Streaming (SSE) | ✓ | ✓ | ✓ |
| Budget & Controls | | | |
| Per-key budget limits | ✕ | ✓ | ✓ |
| Auto top-up | ✕ | ✓ | ✓ |
| Spend alerts | Email only | ✓ | ✓ |
| Multi-wallet (teams) | ✕ | ✕ | ✓ |
| Observability | | | |
| Request logs retention | 7 days | 30 days | 365 days |
| Usage analytics dashboard | Basic | Standard | Advanced + export |
| Cost attribution labels | ✕ | ✕ | ✓ |
| SLA & Support | | | |
| Uptime SLA | ✕ | ✕ | 99.99% |
| Support channel | Community | Email | Dedicated Slack |
| Response time | Best effort | <48h | <2h |
| Custom invoicing / PO | ✕ | ✕ | ✓ |
🇸🇬 Singapore GST Notice (9%)

Cete Ventures Pte Ltd (UEN: 202421160G) is GST-registered in Singapore. Platform subscription fees are subject to 9% GST for Singapore-based customers. Model usage credits for SG GST-registered businesses with a valid GST number may qualify for an input tax claim; contact us to provide your registration number. Non-SG customers are invoiced without GST. A valid GST invoice is issued for every transaction.
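
As a worked illustration of the 9% GST on platform fees, the sketch below computes an invoice total for a subscription fee. The rounding rule (half-up to the nearest cent) is an assumption, as is the function name `platform_fee_with_gst`; the actual invoice may round differently.

```python
from decimal import Decimal, ROUND_HALF_UP

GST_RATE = Decimal("0.09")  # 9% Singapore GST on platform fees
CENT = Decimal("0.01")


def platform_fee_with_gst(fee_usd: Decimal, singapore_customer: bool) -> Decimal:
    """Return the platform fee in USD including GST where it applies."""
    if not singapore_customer:
        return fee_usd  # non-SG customers are invoiced without GST
    # Assumption: GST is rounded half-up to the nearest cent.
    gst = (fee_usd * GST_RATE).quantize(CENT, rounding=ROUND_HALF_UP)
    return fee_usd + gst


# Pro plan at $29/mo for a Singapore-based customer:
print(platform_fee_with_gst(Decimal("29.00"), singapore_customer=True))  # prints 31.61
```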

Frequently Asked Questions

Do I pay provider API costs separately?
No. NexToken handles all provider API relationships on your behalf. You top up your NexToken wallet and we pay the providers. Your wallet is debited at our cost-plus-markup rates shown in the pricing table above. You never need to sign up with OpenAI, Anthropic, or Google directly.
What happens if my wallet balance hits zero?
API calls will return HTTP 402 (Payment Required) immediately. No debt is accumulated, because NexToken operates on a strict prepaid model. You can enable auto top-up on Pro and Business plans to avoid interruptions. Existing in-flight streaming requests will complete (up to 120 seconds) before being terminated.
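
On the client side, a 402 is worth treating differently from other errors, since retrying cannot succeed until the wallet is topped up. The sketch below shows one way to do that. It is transport-agnostic: `send` is a hypothetical callable returning `(status_code, body)`, and `WalletEmptyError` and `call_with_balance_check` are illustrative names, not part of a NexToken SDK.

```python
from typing import Callable, Tuple


class WalletEmptyError(Exception):
    """Raised when the API reports HTTP 402 (Payment Required)."""


def call_with_balance_check(send: Callable[[], Tuple[int, str]]) -> str:
    """Run one request and surface an empty wallet as a distinct error."""
    status, body = send()
    if status == 402:
        # Prepaid model: do not retry; the caller should top up first.
        raise WalletEmptyError("wallet balance is zero; top up before retrying")
    if status >= 400:
        raise RuntimeError(f"request failed with HTTP {status}")
    return body


# Usage with a stand-in for a real HTTP call:
try:
    call_with_balance_check(lambda: (402, ""))
except WalletEmptyError as exc:
    print(f"paused: {exc}")
```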
How does smart routing decide which provider to use?
NexToken's routing engine evaluates real-time provider health scores, current latency, your requested model, and your configured routing preferences. On Business plans, you can pin specific providers per API key or set custom fallback chains. The router makes each routing decision in under 5 ms before proxying your request.
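
The actual routing algorithm is not published, so the following is only a toy sketch of how the named inputs (health, latency, requested model, pinned provider) could combine. Every name here, including `ProviderStatus`, `MIN_HEALTH`, the 0-to-1 health scale, and the lowest-latency tie-break, is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class ProviderStatus:
    name: str
    models: frozenset   # model identifiers this provider can serve
    health: float       # hypothetical scale: 0.0 (down) to 1.0 (healthy)
    latency_ms: float   # recently observed latency


MIN_HEALTH = 0.8  # hypothetical cutoff below which a provider is skipped


def choose_provider(
    model: str,
    providers: Sequence[ProviderStatus],
    pinned: Optional[str] = None,
) -> Optional[str]:
    """Pick a provider name for the model, or None if none qualifies."""
    candidates = [
        p for p in providers if model in p.models and p.health >= MIN_HEALTH
    ]
    if not candidates:
        return None
    # Honor a pinned provider when it is healthy; otherwise fall back.
    for p in candidates:
        if p.name == pinned:
            return p.name
    return min(candidates, key=lambda p: p.latency_ms).name


fleet = [
    ProviderStatus("a", frozenset({"m"}), health=0.95, latency_ms=300.0),
    ProviderStatus("b", frozenset({"m"}), health=0.90, latency_ms=120.0),
    ProviderStatus("c", frozenset({"m"}), health=0.50, latency_ms=50.0),
]
print(choose_provider("m", fleet))              # prints b
print(choose_provider("m", fleet, pinned="a"))  # prints a
```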
Can I get a refund on unused wallet credits?
Yes. Unused top-up credits (excluding free promotional credits) are refundable within 30 days of the top-up transaction. Refunds are processed back to your original payment method via Stripe within 5–10 business days. Loyalty bonus credits are non-refundable.
What's the difference between Billing Tiers and Loyalty Tiers?
Billing Tiers reflect your monthly token spend volume and reduce your per-token markup; they reset on the 1st of each month. Loyalty Tiers reflect your cumulative total top-up history and add bonus credits to your wallet when you top up; they never reset. The two systems are completely independent.
Is there a free trial for paid plans?
Every new account gets $5 in one-time free credits, with no credit card required. Simply sign up and start making API calls immediately. Pro and Business plans offer a 14-day money-back guarantee on the subscription fee. Enterprise plans are negotiated individually.
Do prices include streaming responses?
Yes. Streaming (SSE) is supported at no additional cost. Token pricing is identical for streaming and non-streaming requests. Token counts are calculated using the provider's reported usage field where available; otherwise NexToken uses tiktoken-based estimation.
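
The "reported usage first, estimation as fallback" rule can be sketched as below. The `total_tokens` key and the function name `count_tokens` are assumptions (providers name their usage fields differently), and the estimator is injected so the example runs without extra dependencies.

```python
from typing import Callable, Mapping, Optional


def count_tokens(
    usage: Optional[Mapping[str, int]],
    text: str,
    estimate: Callable[[str], int],
) -> int:
    """Prefer the provider-reported count; otherwise estimate from the text."""
    # Assumption: the reported usage exposes a "total_tokens" entry.
    if usage is not None and "total_tokens" in usage:
        return usage["total_tokens"]
    return estimate(text)


def rough_estimate(text: str) -> int:
    """Crude stand-in estimator that counts whitespace-separated words."""
    return len(text.split())


print(count_tokens({"total_tokens": 42}, "ignored", rough_estimate))  # prints 42
print(count_tokens(None, "route smarter spend less", rough_estimate))  # prints 4
```

In practice the estimator could wrap the tiktoken library, for example `len(tiktoken.get_encoding("cl100k_base").encode(text))`; which encoding NexToken uses is not stated.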

Start routing smarter today

Join hundreds of developers and teams using NexToken to reduce LLM costs and improve reliability.