Genome Assembly N50 Calculator

Measure genome assembly continuity with sortable contig statistics. Track N50, N90, and optional NG50 accurately. Visualize lengths clearly and export findings for better review.

Calculator Input

Use plain numeric values only. Example: 900000, 750000, 600000.
Add an estimated genome size to compute NG50, NG90, and custom NGx. Leave it empty when only assembly-based Nx values are needed.

Example Data Table

This example uses the same default sequence list preloaded in the calculator.

Rank Sequence Length (bp) Cumulative Length (bp) Notes
1900,000900,000Largest sequence
2750,0001,650,000Still below 50% threshold
3600,0002,250,000N50 reached here
4500,0002,750,000NG50 reached if target is 5,000,000 bp
5420,0003,170,000Useful continuity support
6380,0003,550,000Median falls near this range
7250,0003,800,000Approaching N90 threshold
8200,0004,000,000N90 reached here
9150,0004,150,000Lower tail
10100,0004,250,000Lower tail
1150,0004,300,000Smallest retained sequence
Example assembly span
4.300 Mbp
Example N50 / L50
600,000 bp
L50 = 3
Example N90 / L90
200,000 bp
L90 = 8
Example NG50
500,000 bp
Target genome size = 5,000,000 bp

Formula Used

1. Assembly span = sum of all retained sequence lengths.

2. N50 = sequence length at which cumulative sorted length reaches 50% of assembly span.

3. L50 = number of longest sequences needed to reach the N50 threshold.

4. N90 and L90 use the same method, but with a 90% threshold.

5. NG50 = sequence length where cumulative sorted length reaches 50% of the estimated genome size.

6. auN = Σ(length²) ÷ Σ(length). It weighs long sequences more heavily than simple averages.

The calculator first sorts all retained lengths from largest to smallest. It then walks through cumulative totals until each requested threshold is reached.

How to Use This Calculator

  1. Paste sequence lengths into the text area.
  2. Select the unit used by your values.
  3. Optionally set a minimum length filter.
  4. Enter a custom Nx threshold if needed.
  5. Add an estimated genome size for NG metrics.
  6. Press the calculate button to view the result section.
  7. Review the metrics, graph, and processed ranking table.
  8. Export the results as CSV or PDF when needed.

FAQs

1. What does N50 measure in genome assembly?

N50 measures continuity. It identifies the sequence length where the cumulative total of sorted sequences reaches half of the retained assembly span.

2. Why is L50 reported with N50?

L50 shows how many longest sequences were required to hit the 50% threshold. Lower L50 values usually indicate a more continuous assembly.

3. What is the difference between N50 and NG50?

N50 uses the observed assembly span. NG50 uses an expected genome size. NG50 is helpful when assemblies have incomplete total span.

4. Should I filter short sequences before calculating?

Filtering can be useful when very short fragments are noise or artifacts. However, always report your chosen threshold because filtering changes continuity metrics.

5. Is a higher N50 always better?

Not always. A higher N50 suggests larger assembled pieces, but assembly accuracy, completeness, duplication, and contamination still matter.

6. What does auN add beyond N50?

auN gives more weight to long sequences across the full distribution. It is often more informative when comparing assemblies with different length patterns.

7. Can I use scaffold lengths instead of contig lengths?

Yes. The calculator works with any positive sequence lengths. Just stay consistent when comparing assemblies or reporting results.

8. Why might NG90 show “Not reached”?

That means the retained assembly span never reached 90% of the estimated genome size. The data are insufficient for that threshold.

Related Calculators

hardy weinberg calculatoralignment score calculatorsequence length calculatorpromoter region finderpcr cycle calculatorpairwise sequence identity

Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.