Observability Cost Estimator Calculator

Estimator inputs

Currency

Used for display only.

Days per month

28–31 allowed.

Contingency buffer (%)

Adds headroom for bursts and incidents.

Discount (%)

Optional committed-use or contract discount.

Include logs

Logs ingestion (GB/day)

Average uncompressed daily logs.

Logs retention (days)

Hot + searchable retention window.

Compression ratio

Stored GB ≈ ingested GB / ratio.

Index/search factor

Multiplier for indexing and search overhead.

Logs ingest price (per GB)

Usage-based ingest charge.

Logs storage price (per GB-month)

Average stored volume billed monthly.

Query scan (GB/day)

Approximate scanned data across searches.

Query price (per GB scanned)

If your platform bills by query scan.

Include metrics

Active time series

Cardinality drives ingestion and retention.

Sample interval (seconds)

Lower intervals increase sample volume.

Metrics retention (days)

Total retention for metric samples.

Bytes per sample

Includes timestamp + value + overhead.

Ingest price (per million samples)

If billed by sample volume.

Storage price (per GB-month)

Based on retained samples size estimate.

Include traces

Spans per second

Raw span rate before sampling.

Sampling rate (%)

Applied to span rate for billing volume.

Average span size (bytes)

Bigger spans increase storage estimates.

Traces retention (days)

Shorter traces retention is common.

Ingest price (per million spans)

If billed by ingested spans.

Storage price (per GB-month)

Based on retained span payload size.

Platform extras

Optional fixed or seat-based fees often missed in usage-only estimates.

Hosts / agents (count)

Host price (per host-month)

Users (count)

User price (per user-month)

Synthetic checks (count)

Synthetic price (per check-month)

Alert rules (count)

Alerting price (per 100 rules-month)

Result appears above this form after submit.

Example data table

Sample scenarios to sanity-check your assumptions. Replace values with your telemetry reality.

Scenario	Logs (GB/day)	Metrics (series)	Traces (spans/sec, sampling)	Retention (logs/metrics/traces)	Typical use
Startup	2	80,000	120 @ 10%	7 / 30 / 3	Lean platform, focused debugging
Mid-size SaaS	15	350,000	600 @ 20%	14 / 60 / 7	Balanced observability with incident readiness
Enterprise	80	2,500,000	3,500 @ 30%	30 / 90 / 14	Large fleet, compliance retention, heavy analytics

Formula used

This estimator models common usage-based pricing patterns. Adjust unit prices to match your vendor or self-hosted costs.

Logs

IngestGB_month = IngestGB_day × Days
IngestCost = IngestGB_month × PriceGB × IndexFactor
AvgStoredGB = (IngestGB_day ÷ Compression) × RetentionDays
StorageCost = AvgStoredGB × StoragePriceGB_month
QueryCost = ScanGB_day × Days × QueryPriceGB

Metrics

Samples_month = Series × (SecondsMonth ÷ IntervalSec)
IngestCost = (Samples_month ÷ 1,000,000) × PricePerM
RetainedSamples = Series × (RetentionDays×86400 ÷ IntervalSec)
StoredGB = RetainedSamples × BytesPerSample ÷ 1024³
StorageCost = StoredGB × StoragePriceGB_month

Traces

Spans_month = SpansSec × SecondsMonth × (Sampling% ÷ 100)
IngestCost = (Spans_month ÷ 1,000,000) × PricePerM
StoredGB = SpansDay × RetentionDays × SpanBytes ÷ 1024³
StorageCost = StoredGB × StoragePriceGB_month
Total = Subtotal + Buffer − Discount

Tip: For self-hosted stacks, set ingest prices near zero and model storage + compute via “extras” (hosts/users) and your own per-GB storage rate.

How to use this calculator

Estimate daily telemetry: logs GB/day, active metric series, and spans/sec.
Choose retention per signal. Start with realistic “hot” retention.
Set unit prices from your provider quote or internal cost model.
Adjust sampling and query scans to reflect real investigation habits.
Add “extras” for hosts, seats, synthetics, and alerting add-ons.
Click Estimate Monthly Cost to see totals and breakdown.
Use Download CSV or Download PDF for sharing.

Signal Volume Benchmarks by Team Size

Small platforms often start near 1–3 GB/day of logs, 50k–150k active metric series, and 50–150 spans/sec. Mid-size SaaS commonly lands around 10–25 GB/day, 200k–600k series, and 300–900 spans/sec at 10–25% sampling. Enterprise fleets can exceed 50–150 GB/day, 1M–5M series, and 2k–6k spans/sec when tracing critical paths. Use these ranges to populate the example table and validate that your inputs are not off by an order of magnitude.

Retention Strategy: Hot vs Archive Days

Retention drives storage more than ingestion. A practical baseline is 7–14 hot log days for debugging, 30–60 metric days for trend review, and 3–7 trace days for incident replay. Compliance needs may push logs to 30–365 days; consider splitting tiers so only security/audit streams stay long. If you triple retention, storage roughly triples. Compression helps: improving log compression from 2× to 4× halves average stored GB for the same retention.

Cardinality Controls and Metric Sampling

Metric spend scales with series × samples. Dropping scrape frequency from 15s to 30s halves samples, reducing ingest costs directly. Cardinality is the bigger lever: eliminating high‑churn labels can cut series by 30–70%. Track top label sets and cap per‑tenant series budgets to prevent runaway costs during onboarding or misconfigured exporters. For planning, add 10–15% headroom for seasonal launches and new services.

Trace Sampling and Span Payload Hygiene

Tracing cost depends on spans/sec, sampling, and span size. Moving sampling from 20% to 10% halves ingest and stored volume. Keep average span payload under 600 bytes by trimming large attributes, limiting stack traces, and using exemplars selectively. For bursty traffic, add 10–20% buffer to cover incident-driven sampling increases. If you retain traces longer than a week, storage becomes the dominant line item.

Cost Governance: Budgets, Alerts, and Reviews

Treat observability like a product with quotas. Set budgets per environment and per team, then review unit prices quarterly. Alert when query scan volume is >10× ingestion for more than a week, since investigative queries can quietly dominate. Don’t forget fixed extras: agents at $8–$15 per host-month, seats at $5–$12 per user-month, and synthetics at $1–$3 per check-month are common. Export CSV/PDF monthly to support chargeback and vendor negotiations.

FAQs

How do I estimate daily log ingestion?

Start from average log size per request and requests per day, then add batch jobs. Many teams validate by sampling one hour of logs and scaling to 24 hours. Recheck after adding new services or verbose debug levels.

What should I enter for query scan volume?

Use recent platform usage if available. If not, a reasonable starting point is 2×–8× your log ingestion, higher during incidents. Dashboards with broad filters and ad‑hoc searches can push scans beyond 10×.

How do I approximate active metric series?

Count unique time series produced by exporters: metric name plus label combinations. Inventory your top dashboards and exporters, then estimate series per service. High-cardinality labels like userId or requestId can multiply series dramatically.

Does trace sampling affect both ingest and storage?

Yes. Sampling reduces the spans kept, so ingest cost and stored GB typically drop proportionally. If you use tail-based sampling, incidents can temporarily raise sampling, so keep a buffer for surge months.

Can this model self-hosted observability costs?

Yes. Set ingest unit prices close to zero, and use storage prices plus “extras” to represent compute, agents, and staffing. If you know your cloud storage rate, map it directly to the per‑GB‑month fields.

Why is my effective cost per GB ingested high?

Fixed extras and query scanning can dominate when telemetry volume is low. Check host/user fees, synthetics, and alerting add-ons, then compare query scans to ingestion. Reducing retention or sampling often improves the ratio quickly.