LLM Strategy & Pricing
Model pricing, per-operation recommendations, and cost optimization strategies
| Model | Provider | Input $/1M | Output $/1M | Context | Speed | Best For |
|---|---|---|---|---|---|---|

Tier 1 — Bulk/Fast: $0.005–$0.06
Tier 2 — Quality Generation: $0.10–$1.00
Tier 3 — Precision/Legal: $0.20–$0.40
Tier 4 — Research & Validation: $0.15–$0.25

Prompt Caching
Gemini offers a 90% discount on cached system-prompt tokens, and GPT-4o also bills cached input tokens at a reduced rate. Cache your system prompts once and reuse them across domains.
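The savings compound with call volume. A minimal sketch of the arithmetic, assuming the 90% cached-token discount above; the token counts and the $0.10/1M input price are illustrative, not real quotes:

```python
# Sketch: estimate savings from prompt caching, assuming a 90% discount
# on cached input tokens (Gemini's advertised rate; GPT-4o's differs).
# Prices and token counts are illustrative.

def input_cost(tokens: int, price_per_m: float) -> float:
    """Cost in dollars for `tokens` input tokens at `price_per_m` $/1M."""
    return tokens / 1_000_000 * price_per_m

def cached_run_cost(system_tokens: int, user_tokens: int,
                    price_per_m: float, calls: int,
                    cache_discount: float = 0.90) -> float:
    """First call pays full price for the system prompt; subsequent calls
    pay the discounted cached rate for it. User tokens are always full price."""
    first = input_cost(system_tokens + user_tokens, price_per_m)
    cached_system = input_cost(system_tokens, price_per_m) * (1 - cache_discount)
    rest = (calls - 1) * (cached_system + input_cost(user_tokens, price_per_m))
    return first + rest

# 50k-token system prompt reused across 100 domain analyses at $0.10/1M input
uncached = 100 * input_cost(50_000 + 2_000, 0.10)
cached = cached_run_cost(50_000, 2_000, 0.10, calls=100)
print(f"uncached ${uncached:.2f} vs cached ${cached:.2f}")
```

With a long shared system prompt and short per-domain user messages, nearly all input spend sits in the cacheable portion, so the discount applies to most of the bill.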
Batch API
Both Gemini and OpenAI offer a 50% discount for asynchronous batch processing. Perfect for bulk domain analysis where results aren't needed instantly.
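A batch submission is just a file of independent requests. The sketch below builds JSONL lines in the shape OpenAI's Batch API expects (one request per line, with a `custom_id` to match results back); the model name and prompts are placeholders:

```python
import json

def batch_line(custom_id: str, domain: str, model: str = "gpt-4o-mini") -> str:
    """One JSONL line in the OpenAI Batch API request format: each line
    names an endpoint plus a normal chat-completions body."""
    body = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You analyze domain names."},
            {"role": "user", "content": f"Analyze the domain: {domain}"},
        ],
    }
    return json.dumps({
        "custom_id": custom_id,
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": body,
    })

domains = ["example.com", "acme.io"]
jsonl = "\n".join(batch_line(f"req-{i}", d) for i, d in enumerate(domains))
# Write `jsonl` to a file, upload it, and create a batch with a completion
# window; results arrive asynchronously at roughly half the synchronous price.
```

Because each line is independent, a failed request doesn't block the rest of the batch.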
Model Fallback Chain
Try Flash first; if the quality score is low, auto-escalate to Pro or GPT-4o. Most requests succeed on cheaper models, so you only pay a premium when needed.
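The chain can be a short loop over models ordered by price. In this sketch, `call_model` and `quality_score` are stand-ins: in practice they would hit the provider API and run your own rubric (length checks, schema validation, an LLM judge, and so on):

```python
# Sketch of a cheap-first fallback chain. `call_model` and `quality_score`
# are stubs standing in for real API calls and a real quality rubric.

CHAIN = ["gemini-flash", "gemini-pro", "gpt-4o"]  # cheapest to priciest

def call_model(model: str, prompt: str) -> str:
    return f"[{model}] draft for: {prompt}"        # stub response

def quality_score(text: str) -> float:
    # Stub judge: pretend only the pricier models clear the bar.
    return 0.9 if "pro" in text or "4o" in text else 0.5

def generate(prompt: str, threshold: float = 0.8) -> tuple[str, str]:
    """Return (model, output), escalating until quality clears the bar."""
    for model in CHAIN:
        out = call_model(model, prompt)
        if quality_score(out) >= threshold:
            return model, out
    return CHAIN[-1], out   # last attempt stands even if below threshold

model, out = generate("business plan for acme.io")
```

The threshold is the cost dial: raise it and more traffic escalates to premium models, lower it and more stays on Flash.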
Context Reuse
GPT-4.1's 1M-token context window lets you feed the entire package into one call for a coherent, cross-referenced business plan instead of fragmented multi-call outputs.
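Packing the package into one prompt is mostly string assembly plus a budget check. A sketch under the assumption of a ~4-characters-per-token heuristic (a real tokenizer would be more accurate); the section names are placeholders:

```python
# Sketch: pack every document in the deliverable into a single prompt so
# the model can cross-reference sections, guarded by a rough token budget.
# The 1M limit matches GPT-4.1's context window; 4 chars/token is a crude
# heuristic, not a real tokenizer.

CONTEXT_LIMIT = 1_000_000

def estimate_tokens(text: str) -> int:
    return len(text) // 4

def build_single_call(sections: dict[str, str]) -> str:
    parts = [f"## {title}\n{body}" for title, body in sections.items()]
    prompt = "\n\n".join(parts)
    if estimate_tokens(prompt) > CONTEXT_LIMIT:
        raise ValueError("package exceeds context window; split or summarize")
    return prompt

package = {                      # placeholder section names and bodies
    "Market Research": "…",
    "Brand Guidelines": "…",
    "Financial Model": "…",
}
prompt = build_single_call(package)
```

Leave headroom below the limit for the model's output tokens and any system prompt you prepend.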
Perplexity for Validation
Use Perplexity to validate AI-generated business docs against real market data. Sourced citations add credibility and catch hallucinated claims.
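One way to wire this up is to extract the factual claims from a generated document and send them as a verification prompt. The sketch below only builds the request payload for Perplexity's OpenAI-style chat endpoint; the model name (`sonar-pro`), the verdict labels, and the sample claims are all assumptions for illustration:

```python
import json

# Sketch of a validation payload for Perplexity's chat-completions endpoint.
# The model name, verdict scheme, and claims below are illustrative.

claims = [  # illustrative claims pulled from a generated business doc
    "The global domain aftermarket exceeded $2B in 2023.",
    "Average .io resale price is above $3,000.",
]

payload = {
    "model": "sonar-pro",
    "messages": [
        {"role": "system",
         "content": "Verify each claim against current sources. Answer "
                    "SUPPORTED / CONTRADICTED / UNVERIFIABLE with citations."},
        {"role": "user", "content": "\n".join(f"- {c}" for c in claims)},
    ],
}
body = json.dumps(payload)
# POST `body` to the Perplexity chat-completions endpoint with your API key;
# the response's cited sources can be surfaced alongside the document.
```

Claims flagged CONTRADICTED or UNVERIFIABLE are candidates for removal or rewording before the document ships.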