Analyze predicted and reference phoneme sequences with confidence. Track edit operations, accuracy, and distance easily. Visualize errors and export polished results for model reviews.
| Case | Reference Sequence | Predicted Sequence | Expected Observation |
|---|---|---|---|
| 1 | HH AH0 L OW1 | HH AH0 L AO1 | One substitution; PER equals 25.00%. |
| 2 | K AE1 T | K AE1 T S | One insertion; PER equals 33.33%. |
| 3 | B R IH1 JH | B IH1 JH | One deletion; PER equals 25.00%. |
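Phoneme Error Rate (PER) is the minimum number of edit operations needed to transform the predicted phoneme sequence into the reference sequence, divided by the number of phonemes in the reference.
PER = (S + D + I) / N
Here S, D, and I are the numbers of substitutions, deletions, and insertions in the minimum-cost alignment, and N is the number of phonemes in the reference sequence.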
The calculator uses dynamic programming alignment, similar to Levenshtein distance, to identify the minimum edit path and count each operation type.
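The alignment step can be sketched in Python as follows. This is a minimal illustration of a Levenshtein-style alignment with unit costs, not the calculator's actual code. The function names `align_phonemes` and `phoneme_error_rate` are hypothetical, and the tie-breaking order during backtracking (match or substitution first, then deletion, then insertion) is an assumption.

```python
from typing import Dict, List


def align_phonemes(reference: List[str], predicted: List[str]) -> Dict[str, int]:
    """Count substitutions (S), deletions (D), insertions (I), and matches (C)."""
    n, m = len(reference), len(predicted)
    # cost[i][j] = minimum edits between reference[:i] and predicted[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i  # every reference phoneme is deleted
    for j in range(1, m + 1):
        cost[0][j] = j  # every predicted phoneme is inserted
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diagonal = cost[i - 1][j - 1] + (reference[i - 1] != predicted[j - 1])
            cost[i][j] = min(diagonal, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Walk back from the bottom-right cell to recover one minimum-cost path.
    counts = {"S": 0, "D": 0, "I": 0, "C": 0}
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            mismatch = reference[i - 1] != predicted[j - 1]
            if cost[i][j] == cost[i - 1][j - 1] + mismatch:
                counts["S" if mismatch else "C"] += 1
                i, j = i - 1, j - 1
                continue
        if i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            counts["D"] += 1  # reference phoneme missing from the prediction
            i -= 1
        else:
            counts["I"] += 1  # extra phoneme in the prediction
            j -= 1
    return counts


def phoneme_error_rate(reference: str, predicted: str) -> float:
    """PER = (S + D + I) / N for space-separated phoneme strings."""
    ref, pred = reference.split(), predicted.split()
    if not ref:
        raise ValueError("reference sequence must not be empty")
    counts = align_phonemes(ref, pred)
    return (counts["S"] + counts["D"] + counts["I"]) / len(ref)


# The three cases from the example table.
print(f"{phoneme_error_rate('HH AH0 L OW1', 'HH AH0 L AO1'):.2%}")  # 25.00%
print(f"{phoneme_error_rate('K AE1 T', 'K AE1 T S'):.2%}")          # 33.33%
print(f"{phoneme_error_rate('B R IH1 JH', 'B IH1 JH'):.2%}")        # 25.00%
```

When several minimum-cost paths exist, the S, D, and I counts can differ between them while the total stays the same, so a different tie-breaking order may report a different breakdown for the same PER.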
PER measures how many phoneme edits are needed to match a prediction to the reference. Lower values indicate stronger speech or pronunciation model performance.
PER can exceed 100% when the prediction contains many extra phonemes. Insertions add to the edit count but not to the reference length N, so the total number of edits can be larger than N.
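A small sketch makes the over-100% case concrete. The predicted sequence below is a made-up example, and `edit_distance` is a hypothetical helper that computes a plain unit-cost Levenshtein distance over phoneme tokens.

```python
def edit_distance(reference, predicted):
    """Unit-cost Levenshtein distance between two token lists."""
    previous = list(range(len(predicted) + 1))
    for i, ref_token in enumerate(reference, start=1):
        current = [i]
        for j, pred_token in enumerate(predicted, start=1):
            current.append(min(
                previous[j - 1] + (ref_token != pred_token),  # match or substitution
                previous[j] + 1,                              # deletion
                current[j - 1] + 1,                           # insertion
            ))
        previous = current
    return previous[-1]


reference = "K AE1 T".split()             # N = 3
predicted = "K AE1 T S AH0 N D".split()   # hypothetical output with 4 extra phonemes

distance = edit_distance(reference, predicted)
per = distance / len(reference)
print(distance)        # 4
print(f"{per:.2%}")    # 133.33%
```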
A substitution occurs when a reference phoneme aligns with a different predicted phoneme. It means the model produced a phoneme at that position but chose the wrong one, rather than skipping a phoneme or adding an extra one.
PER focuses on total edit cost relative to the reference. Accuracy highlights how many reference phonemes were matched correctly after the optimal alignment is found.
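The difference between the two metrics can be illustrated from the edit counts alone. The sketch below assumes accuracy is defined as correctly matched reference phonemes divided by N, that is (N - S - D) / N. The page does not state the calculator's exact accuracy formula, so treat this definition and the name `per_and_accuracy` as assumptions.

```python
def per_and_accuracy(substitutions, deletions, insertions, reference_length):
    """Return (PER, accuracy) from edit counts; accuracy = (N - S - D) / N (assumed)."""
    if reference_length <= 0:
        raise ValueError("reference_length must be positive")
    per = (substitutions + deletions + insertions) / reference_length
    correct = reference_length - substitutions - deletions
    accuracy = correct / reference_length
    return per, accuracy


# Case 2 from the table: one insertion, no substitutions or deletions, N = 3.
per, accuracy = per_and_accuracy(0, 0, 1, 3)
print(f"PER {per:.2%}, accuracy {accuracy:.2%}")  # PER 33.33%, accuracy 100.00%
```

Under this definition an insertion raises PER without lowering accuracy, because every reference phoneme is still matched.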
Use whichever delimiter matches your dataset. The important part is keeping the same tokenization style for both the reference and predicted sequences.
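As a rough illustration of why consistent tokenization matters, the hypothetical `tokenize` helper below splits a sequence on a chosen delimiter. How the calculator itself parses input is not documented here, so this is only a sketch.

```python
from typing import List, Optional


def tokenize(sequence: str, delimiter: Optional[str] = None) -> List[str]:
    """Split on `delimiter` (None means any run of whitespace) and drop empty tokens."""
    return [part.strip() for part in sequence.split(delimiter) if part.strip()]


space_tokens = tokenize("HH AH0 L OW1")
comma_tokens = tokenize("HH, AH0, L, OW1", ",")
print(space_tokens == comma_tokens)  # True

# Splitting a comma-delimited string on whitespace yields a single token,
# so every comparison against a properly split reference would be an error.
print(tokenize("HH,AH0,L,OW1"))  # ['HH,AH0,L,OW1']
```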
The calculator works with any tokenized phoneme set, including ARPAbet, IPA, or custom symbols, as long as each phoneme is separated consistently.
Alignment shows exactly where substitutions, deletions, and insertions occur. This makes debugging easier when you want to inspect pronunciation or recognition failures.
Review substitution patterns when diagnosing systematic confusions, such as vowels being swapped repeatedly. Those patterns often reveal token mapping or acoustic modeling weaknesses.
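One way to surface such patterns is to tally substitution pairs across many aligned utterances. The sketch below assumes the alignment is available as (reference, predicted) pairs, with `None` marking a gap. That format, the sample data, and the name `substitution_confusions` are all illustrative assumptions.

```python
from collections import Counter


def substitution_confusions(aligned_pairs):
    """Count (reference, predicted) pairs where both sides exist but differ."""
    return Counter(
        (ref, pred)
        for ref, pred in aligned_pairs
        if ref is not None and pred is not None and ref != pred
    )


# Hypothetical aligned output pooled from several utterances.
aligned = [
    ("HH", "HH"), ("AH0", "AH0"), ("L", "L"), ("OW1", "AO1"),
    ("K", "K"), ("OW1", "AO1"), ("T", None),   # deletion, ignored
    (None, "S"),                               # insertion, ignored
    ("IH1", "IY1"),
]

confusions = substitution_confusions(aligned)
print(confusions.most_common(1))  # [(('OW1', 'AO1'), 2)]
```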