# SuperPenguin

> AI cost management platform. Track spend per request, per customer, per feature, and per prompt version across supported LLM, speech, and gateway providers.

SuperPenguin is a cost management tool for teams building with large language models (LLMs) and voice AI. It provides per-request cost attribution, multi-provider dashboards, prompt version analytics, and billing reconciliation.

## What SuperPenguin Does

- **Per-request cost tracking**: Every LLM or voice API call is logged with cost, token usage, latency, and custom metadata (customer, feature, team, environment, prompt version).
- **Multi-provider dashboard**: Unified view of spend across supported providers, including OpenAI, Anthropic, Google Gemini, Vercel AI Gateway, Deepgram, ElevenLabs, LiveKit voice agent workflows, and LiteLLM. Includes daily trends, model breakdown, Slack alerts, and an AI spend forecast (Pro plan).
- **Cost attribution**: Slice spend by customer, feature, team, environment, prompt key, prompt version, model, and provider.
- **Prompt version analytics**: Compare cost and latency across prompt iterations to find the cheapest, fastest version.
- **Billing reconciliation**: Compare SDK-estimated costs against actual provider invoices. Catch discrepancies by provider, model, or day.
- **Python SDK**: `pip install superpenguin`. Wraps OpenAI, Anthropic, Google Gemini, Deepgram, ElevenLabs, and LiteLLM clients. No proxy required; logs cost metadata directly.
- **TypeScript SDK**: `npm install @superpenguin/js`. Wraps OpenAI, Anthropic, Google Gemini, Deepgram, and ElevenLabs clients, plus Vercel AI SDK helpers and OpenTelemetry span ingestion.
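
To make the attribution and reconciliation ideas above concrete, here is a minimal sketch in plain Python. It does not use the SuperPenguin SDK: `CostRecord`, `spend_by`, and `reconcile` are hypothetical names invented for illustration, the field layout is an assumption rather than SuperPenguin's schema, and the model names and amounts are made up.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class CostRecord:
    """One logged request. Field names are illustrative, not SuperPenguin's schema."""
    provider: str
    model: str
    day: str
    cost_usd: float
    tags: dict = field(default_factory=dict)


def spend_by(records, key):
    """Sum cost per value of a tag (e.g. "customer") or a record field (e.g. "model")."""
    totals = defaultdict(float)
    for r in records:
        value = r.tags.get(key, getattr(r, key, "untagged"))
        totals[value] += r.cost_usd
    return dict(totals)


def reconcile(records, invoiced, tolerance_usd=0.01):
    """Compare estimated spend per (provider, model, day) with invoiced amounts.

    Returns {key: invoiced - estimated} for every key whose gap exceeds the tolerance.
    """
    estimated = defaultdict(float)
    for r in records:
        estimated[(r.provider, r.model, r.day)] += r.cost_usd
    gaps = {}
    for key in set(estimated) | set(invoiced):
        diff = invoiced.get(key, 0.0) - estimated.get(key, 0.0)
        if abs(diff) > tolerance_usd:
            gaps[key] = round(diff, 6)
    return gaps


# Made-up sample data: model names and amounts are placeholders.
records = [
    CostRecord("openai", "model-a", "2025-01-01", 0.50, {"customer": "acme", "feature": "chat"}),
    CostRecord("openai", "model-a", "2025-01-01", 0.25, {"customer": "globex", "feature": "chat"}),
    CostRecord("anthropic", "model-b", "2025-01-01", 1.00, {"customer": "acme", "feature": "summarize"}),
]
invoiced = {
    ("openai", "model-a", "2025-01-01"): 0.80,
    ("anthropic", "model-b", "2025-01-01"): 1.00,
}

per_customer = spend_by(records, "customer")
per_provider = spend_by(records, "provider")
gaps = reconcile(records, invoiced)
```

Here `per_customer` groups by a tag, `per_provider` groups by a record field, and `gaps` contains only the OpenAI entry, because the invoiced amount differs from the estimate by more than the tolerance.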

## How It Works

1. Install the SuperPenguin SDK (`pip install superpenguin` or `npm install @superpenguin/js`).
2. Wrap your LLM or voice client with `sp.wrap(client)` (Python), `wrap(client)` (TypeScript), or the Vercel AI SDK helpers.
3. Attach optional metadata (`customer_id`, `feature`, `team`, `prompt_key`, `prompt_version`).
4. Open the SuperPenguin dashboard to see per-request cost data.

The SDKs are lightweight wrappers around native provider clients. Your API calls go directly to the provider; the SDK only captures cost-relevant metadata (tokens, model, latency, tags). No prompts or responses are stored.
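
The wrapper pattern described above can be sketched as follows. This is not the SuperPenguin SDK and does not show its API: `MeteredClient`, `FakeProviderClient`, the `complete` method, and the `sink` callback are all hypothetical, and the fake provider exists only so the example runs offline. The point is the shape of the idea: the call is forwarded unchanged, and only tokens, model, latency, and tags are recorded.

```python
import time
from types import SimpleNamespace


class MeteredClient:
    """Hypothetical stand-in for a cost-tracking wrapper; not the SuperPenguin SDK."""

    def __init__(self, client, sink, **tags):
        self._client = client
        self._sink = sink  # callable that receives one metadata dict per call
        self._tags = tags

    def complete(self, model, prompt):
        start = time.perf_counter()
        # The request is forwarded unchanged to the underlying client.
        response = self._client.complete(model=model, prompt=prompt)
        latency_ms = (time.perf_counter() - start) * 1000
        # Record only cost-relevant fields; prompt and response text are never copied.
        self._sink({
            "model": model,
            "input_tokens": response.usage.input_tokens,
            "output_tokens": response.usage.output_tokens,
            "latency_ms": latency_ms,
            "tags": dict(self._tags),
        })
        return response


class FakeProviderClient:
    """Toy provider so the sketch runs offline; counts whitespace-separated words as tokens."""

    def complete(self, model, prompt):
        text = "hello there"
        usage = SimpleNamespace(
            input_tokens=len(prompt.split()),
            output_tokens=len(text.split()),
        )
        return SimpleNamespace(text=text, usage=usage)


events = []
client = MeteredClient(
    FakeProviderClient(), events.append, customer_id="acme", prompt_version="v2"
)
reply = client.complete(model="example-model", prompt="Summarize this ticket")
```

After the call, `events` holds a single dict with the model name, token counts, latency, and tags, while `reply` is the provider's response passed through untouched.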

## Key Facts

- **Privacy**: No prompts or responses are stored. Only cost metadata (tokens, model, latency, tags).
- **Performance**: Under 10 ms of overhead on non-streaming calls and under 5 ms added to time-to-first-token on streaming calls. Telemetry is sent asynchronously.
- **Providers and SDK paths**: OpenAI, Anthropic, Google Gemini, Deepgram, ElevenLabs, LiveKit, Vercel AI Gateway, and LiteLLM, integrated through the Python SDK, the TypeScript SDK, or the Vercel AI SDK.
- **Pricing**: Free up to $2K managed spend. Growth at $30/mo (up to $5K). Pro at $200/mo (up to $20K). Enterprise custom pricing.
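
As a worked example of the pricing tiers above, the following sketch maps a managed-spend figure to a plan. `plan_for` is a hypothetical helper, not part of any SuperPenguin SDK. It assumes the "up to" limits are inclusive and that managed spend is measured over the same period as the plan price, neither of which is stated above.

```python
# Limits and prices taken from the pricing bullet; inclusive limits are an assumption.
PLAN_LIMITS_USD = [("Free", 2_000), ("Growth", 5_000), ("Pro", 20_000)]
PLAN_PRICE_USD_PER_MONTH = {"Free": 0, "Growth": 30, "Pro": 200}


def plan_for(managed_spend_usd):
    """Return the smallest plan whose limit covers the given managed spend."""
    if managed_spend_usd < 0:
        raise ValueError("managed spend cannot be negative")
    for name, limit in PLAN_LIMITS_USD:
        if managed_spend_usd <= limit:
            return name
    return "Enterprise"  # custom pricing, so no entry in PLAN_PRICE_USD_PER_MONTH


plan = plan_for(3_500)
price = PLAN_PRICE_USD_PER_MONTH.get(plan)
```

With $3,500 of managed spend, `plan` is `"Growth"` and `price` is `30`. Anything above $20K falls through to `"Enterprise"`, for which no fixed price is listed.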

## SDK Documentation

For detailed usage, API reference, and integration guides:

- **Python SDK docs**: https://superpenguin.ai/docs/sdk/python.md
- **TypeScript SDK docs**: https://superpenguin.ai/docs/sdk/typescript.md

## Links

- Website: https://superpenguin.ai
- Dashboard: https://app.superpenguin.ai
- Documentation: https://superpenguin.ai/docs
- PyPI: https://pypi.org/project/superpenguin/
- npm: https://www.npmjs.com/package/@superpenguin/js
- Twitter/X: https://x.com/superpenguin_ai