Character Error Rate Calculator

Analyze OCR and ASR output with CER metrics. Track substitutions, deletions, insertions, and accuracy trends. See graphs, exports, examples, and formulas for better evaluation.

Calculator Input

Use the responsive three, two, and one-column form layout below.

Normalization Options

Formula Used

Character error rate is based on edit operations over reference length.

CER = (S + D + I) / N

Character Error Rate measures how many character-level edits are needed to transform the hypothesis into the reference.

S is substitutions, D is deletions, I is insertions, and N is the number of characters in the normalized reference.

This file uses dynamic programming to compute the minimum edit distance, then backtracks through the matrix to count each operation precisely.

How to Use This Calculator

  1. Paste the ground truth text into the Reference Text field.
  2. Paste the predicted OCR, ASR, or model output into Hypothesis Text.
  3. Select any normalization rules that match your evaluation policy.
  4. Click the calculate button to generate CER, accuracy, and error counts.
  5. Review the chart, summary table, and alignment preview.
  6. Use the CSV or PDF buttons to export your result summary.

Example Data Table

Reference Text Hypothesis Text Expected Notes
character error rate charactr error rate One deletion likely produces a low CER.
machine learning model machien learning model One substitution and one insertion may appear.
speech recognition speech recogniton Missing character increases deletion count.
optical character recognition opticl charcter recognition Multiple deletions raise CER and lower accuracy.

Frequently Asked Questions

1. What does CER measure?

CER measures the proportion of character edits needed to match predicted text to the reference. Lower CER means better text recognition or transcription quality.

2. When is CER useful?

CER is useful for OCR, speech recognition, handwriting recognition, subtitle generation, and any system that outputs text one character at a time.

3. Why do normalization options matter?

Normalization changes the evaluation policy. Ignoring case, spaces, or punctuation can reduce penalties when those differences are irrelevant to your application.

4. What is a good CER score?

A good CER depends on the task. Near zero is excellent. Production systems often target very low CER for clean inputs and tolerate higher values for noisy data.

5. How is CER different from WER?

CER evaluates characters, while WER evaluates words. CER is more sensitive to spelling and small text changes, especially in short outputs.

6. Can CER be greater than 100 percent?

Yes. CER can exceed 100 percent when insertions are very high compared with the reference length, especially for short reference strings.

7. Why is CER undefined sometimes?

CER becomes undefined when the normalized reference is empty but the hypothesis contains characters. There is no valid denominator for the formula.

8. Does this calculator support multilingual text?

Yes. The calculator splits text with multibyte-safe character handling, so it works better with Unicode text than byte-based counting.

Related Calculators

real time factorspeech recognition accuracyframe length calculatorword error ratevoice activity detection

Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.