Understand your costs

Understand how Chucky pricing works, what you’re charged for, and how to optimize costs.

Pricing Tiers

Plan	Monthly	Compute	Egress	Concurrent Sessions
Hobby	$5	~125 hours	10 GB	2
Starter	$29	~725 hours	50 GB	5
Pro	$99	~2,475 hours	200 GB	10
Enterprise	Custom	Custom	Custom	Custom

Overage Pricing

When you exceed plan limits:

Resource	Overage Rate
Compute	$0.04/hour
Egress	$0.03/GB

What You’re Charged For

1. AI Costs (Pass-Through)

AI costs are passed through at provider rates. You pay exactly what the provider charges.

Model	Input (per 1M tokens)	Output (per 1M tokens)
Claude Sonnet 4.5	$3.00	$15.00
Claude Opus 4.5	$15.00	$75.00
Claude 3.5 Haiku	$0.25	$1.25
GPT-4o	$2.50	$10.00
OpenRouter models	Varies	See pricing

Note: Use your own API keys for direct billing to providers. See Models & Providers for setup.

2. Compute Time

Time your sandbox is running, measured in seconds.

Starts when session begins
Ends when session completes
Includes tool execution, file operations, shell commands

3. Egress

Data transferred out of Chucky’s network:

Git bundle downloads
Large response payloads
File exports

Budget Enforcement

How It Works

Setting Budgets

const token = await createToken({
  userId: 'user-123',
  projectId: process.env.CHUCKY_PROJECT_ID,
  secret: process.env.CHUCKY_HMAC_SECRET,
  budget: createBudget({
    aiDollars: 5.00,        // Max AI spend
    computeHours: 1,        // Max compute time
    window: 'day',          // Reset period
  }),
});

Budget Windows

Window	Description
`hour`	Resets every hour
`day`	Resets every 24 hours
`week`	Resets every 7 days
`month`	Resets on billing anchor
`lifetime`	Never resets - one-time budget

Use lifetime for one-time tokens like trial credits or prepaid packages. Once consumed, the budget is exhausted permanently.

Budget Exceeded Response

When budget is exceeded:

import { BudgetExceededError } from '@chucky.cloud/sdk';

try {
  await session.send(message);
} catch (err) {
  if (err instanceof BudgetExceededError) {
    // Handle gracefully
    console.log('User budget exceeded');
    // Show upgrade prompt
  }
}

Cost Tracking

Per-Request Costs

Every response includes cost information:

for await (const msg of session.stream()) {
  if (msg.type === 'result') {
    console.log('Cost:', msg.total_cost_usd);
    console.log('Tokens:', msg.usage);
    // {
    //   input_tokens: 1500,
    //   output_tokens: 800,
    //   total_tokens: 2300
    // }
  }
}

Dashboard

View usage in the portal at app.chucky.cloud:

Total spend by period
Usage by project
Usage by user
Cost breakdown (AI vs compute)

Optimizing Costs

1. Choose the Right Model

Model	Speed	Cost	Best For
Haiku	Fastest	Lowest	Simple tasks, high volume
Sonnet	Fast	Medium	Most use cases
Opus	Slowest	Highest	Complex reasoning

// Use Haiku for simple tasks
const session = client.createSession({
  model: 'claude-3-5-haiku-20241022',
});

// Use Opus only when needed
const session = client.createSession({
  model: 'claude-opus-4-5-20251101',
});

2. Set Tight Budgets

// Start conservative
budget: createBudget({
  aiDollars: 0.50,  // 50 cents
  computeHours: 0.5, // 30 minutes
  window: 'hour',
})

3. Limit Turns

const session = client.createSession({
  maxTurns: 5,  // Prevent runaway conversations
});

4. Use Efficient Prompts

// Bad: Sends entire file every time
await session.send(`Here's my code:\n${entireFile}\n\nFix the bug.`);

// Good: Be specific
await session.send('Fix the null pointer exception on line 42 of api.ts');

5. Cache When Possible

For repeated queries, cache results:

const cacheKey = hash(prompt);
const cached = await cache.get(cacheKey);

if (cached) {
  return cached;
}

const result = await client.prompt(prompt);
await cache.set(cacheKey, result, { ttl: 3600 });
return result;

Billing for Your Users

Tiered Pricing

const plans = {
  free: { aiDollars: 0.50, computeHours: 0.5 },
  pro: { aiDollars: 10, computeHours: 5 },
  enterprise: { aiDollars: 100, computeHours: 50 },
};

const userPlan = plans[user.plan];

const token = await createToken({
  userId: user.id,
  budget: createBudget({
    aiDollars: userPlan.aiDollars,
    computeHours: userPlan.computeHours,
    window: 'month',
    windowStart: user.billingCycleStart,
  }),
});

Usage-Based Add-ons

// Track usage
const usage = await getMonthlyUsage(user.id);

// Calculate overage
const overage = Math.max(0, usage - plan.includedAmount);
const overageCharge = overage * 1.2; // 20% markup

// Add to invoice
await billing.addCharge(user.id, {
  description: 'AI Usage Overage',
  amount: overageCharge,
});

Alerts

Notify users before they hit limits:

const usage = await getUsage(user.id);
const percentUsed = (usage / budget) * 100;

if (percentUsed > 80) {
  await sendEmail(user, {
    subject: 'AI Budget Alert',
    body: `You've used ${percentUsed}% of your monthly AI budget.`,
  });
}

Custom Model Pricing

When using OpenRouter or other third-party models, configure custom pricing for accurate cost tracking.

Why Configure Pricing?

OpenRouter models have varying prices not in our built-in database
Free models should be billed at $0, not the default rate
Accurate user billing requires correct cost calculations

Setting Up Custom Pricing

Navigate to Billing → Custom Pricing in the portal:

Add Model Pattern

Enter the exact model name or use wildcards:

mistralai/mistral-large - Exact match
mistralai/* - All Mistral models

Set Prices

Enter price per 1 million tokens in USD:

Input: Cost for prompt tokens
Output: Cost for completion tokens

Save

Pricing is synced to all your projects automatically.

Pricing Lookup Order

Your custom pricing (exact match)
Your custom pricing (wildcard match)
Built-in pricing (Anthropic/OpenAI models)
Default fallback ( $3/$ 15 per 1M tokens - Sonnet-level)

Example Configurations

Model Pattern	Input	Output	Description
`mistralai/devstral-2512:free`	$0	$0	Free model
`mistralai/*`	$2	$6	All Mistral models
`deepseek/deepseek-chat`	$0.14	$0.28	DeepSeek Chat
`meta-llama/*`	$0.50	$0.75	All Llama models

Without custom pricing, unknown models are billed at Sonnet-level rates (

3/

15). This may overcharge for cheap models or undercharge for expensive ones.

For more details, see Models & Providers.

Billing FAQ

How am I billed?

Monthly subscription + any overage charges. Invoices generated on billing cycle date.

Can I set hard limits?

Yes, budget enforcement is hard. When exceeded, requests are rejected until the window resets or budget increases.

Do unused credits roll over?

No, budget windows reset on schedule. Plan for typical usage, not worst case.

Can I use my own Anthropic API key?

Yes, configure in project settings. AI costs go directly to your Anthropic account.

How is compute measured?

From session start to session end, in seconds. Idle time within a session counts.

What counts as egress?

Data leaving Chucky’s network: bundle downloads, large responses, file exports. Regular API responses don’t count.

Getting Help

Usage questions: [email protected]
Billing disputes: [email protected]
Enterprise pricing: [email protected]

Next Steps

Budget Management

Advanced budget strategies

Authentication

Token creation with budgets

White-Label

Billing your users

Dashboard

View your usage

Getting Started

Guides

CLI

Use Cases

Advanced

Concepts

​Pricing Tiers

​Overage Pricing

​What You’re Charged For

​1. AI Costs (Pass-Through)

​2. Compute Time

​3. Egress

​Budget Enforcement

​How It Works

​Setting Budgets

​Budget Windows

​Budget Exceeded Response

​Cost Tracking

​Per-Request Costs

​Dashboard

​Optimizing Costs

​1. Choose the Right Model

​2. Set Tight Budgets

​3. Limit Turns

​4. Use Efficient Prompts

​5. Cache When Possible

​Billing for Your Users

​Tiered Pricing

​Usage-Based Add-ons

​Alerts

​Custom Model Pricing

​Why Configure Pricing?

​Setting Up Custom Pricing

​Pricing Lookup Order

​Example Configurations

​Billing FAQ

​How am I billed?

​Can I set hard limits?

​Do unused credits roll over?

​Can I use my own Anthropic API key?

​How is compute measured?

​What counts as egress?

​Getting Help

​Next Steps

Budget Management

Authentication

White-Label

Dashboard

Pricing Tiers

Overage Pricing

What You’re Charged For

1. AI Costs (Pass-Through)

2. Compute Time

3. Egress

Budget Enforcement

How It Works

Setting Budgets

Budget Windows

Budget Exceeded Response

Cost Tracking

Per-Request Costs

Dashboard

Optimizing Costs

1. Choose the Right Model

2. Set Tight Budgets

3. Limit Turns

4. Use Efficient Prompts

5. Cache When Possible

Billing for Your Users

Tiered Pricing

Usage-Based Add-ons

Alerts

Custom Model Pricing

Why Configure Pricing?

Setting Up Custom Pricing

Pricing Lookup Order

Example Configurations

Billing FAQ

How am I billed?

Can I set hard limits?

Do unused credits roll over?

Can I use my own Anthropic API key?

How is compute measured?

What counts as egress?

Getting Help

Next Steps