AI Stride Calculator

Calculator mode

Pick the task you need for layer design.

Dimensions

Use 1D for audio, text, or sequences.

Padding type

SAME computes before/after padding automatically.

Input height / length

Input width

Kernel height / length

Kernel width

Dilation height / length

Dilation width

Stride height / length

Stride width

Desired output height / length

Used to estimate integer stride.

Desired output width

Total padding height

Total = before + after.

Total padding width

Output padding height / length

Commonly 0..(stride-1).

Output padding width

Reset Form Download CSV Download PDF

Tip: SAME padding depends on stride and dilation, so it recomputes each run.

Example data table

Scenario	Input (H×W)	Kernel	Stride	Padding	Dilation	Output (H×W)
Downsample block	224×224	3×3	2×2	SAME	1×1	112×112
Feature refine	112×112	3×3	1×1	SAME	1×1	112×112
Valid convolution	32×32	5×5	1×1	VALID	1×1	28×28
Dilated context	64×64	3×3	1×1	SAME	2×2	64×64
Upsample step	56×56	4×4	2×2	Custom (PH=2, PW=2)	1×1	112×112

These examples match common CNN and decoder layer patterns.

Formula used

Forward convolution output (per dimension)

K_eff = D × (K − 1) + 1

Out = ⌊(In + P_total − K_eff) / S⌋ + 1

Where P_total = P_before + P_after.

SAME padding (target)

Out = ⌈In / S⌉

P_total = max((Out − 1)×S + K_eff − In, 0)

The tool splits padding into before/after, possibly uneven.

Transposed convolution output (per dimension)

Out = (In − 1)×S − P_total + D×(K − 1) + OutPad + 1

OutPad is often used to hit an exact target size.

All results are computed independently for height and width (or length for 1D).

How to use this calculator

Choose a mode: compute output, solve stride, or transposed output.
Select 1D or 2D, then enter input and kernel sizes.
Set dilation if you use dilated layers.
Pick padding type. For custom, enter total padding values.
Enter stride (or desired output if solving stride), then calculate.
Use the download buttons to export your history as CSV or PDF.

Why stride matters in feature extraction

Stride controls how densely a kernel samples the input. With stride 1, adjacent receptive fields overlap heavily, preserving detail. With stride 2, you roughly halve spatial resolution, reducing compute and activation memory by ~4× in 2D. In common backbones, early stride choices drive feature map sizes (e.g., 224→112→56) and determine where fine texture is lost. If accuracy drops, move the first downsample later. Stacking two stride‑2 layers yields an effective stride of 4, so a 16×16 patch in input maps to one output cell.

Output size math you can trust

For each dimension, the forward output is Out = floor((In + Ptotal − Keff)/S) + 1, where Keff = D×(K−1)+1. This calculator applies the formula independently to height and width (or length for 1D), so mixed strides like 2×1 are handled correctly for anisotropic inputs. The effective-kernel readout helps verify dilated blocks quickly.

Padding, dilation, and edge behavior

SAME padding targets Out = ceil(In/S) and computes the total padding needed, then splits it into before/after values that may be asymmetric. Odd kernels with stride 1 often yield symmetric padding, while larger strides can force uneven borders. VALID padding uses Ptotal=0, shrinking maps by Keff−1 when stride is 1. Dilation increases Keff without increasing parameters, expanding context while keeping stride unchanged; the trade‑off is more boundary dependence and possible gridding at high dilation.

Stride solving for target shapes

When you know the desired output size (for skip connections, concatenation, or patch grids), solving for stride can save trial and error. Because floor rounding is involved, not every target is achievable with a single integer stride. The solver reports the closest valid stride and shows the achieved output so you can adjust padding, kernel, or dilation to match exactly.

Transposed stride for upsampling

In decoders and segmentation heads, transposed layers expand resolution: Out = (In−1)×S − Ptotal + D×(K−1) + OutPad + 1. Output padding is a small correction term (often 0..S−1) used to land on exact sizes like 56→112. Use the history table to compare configurations, document decisions, and export reproducible notes.

FAQs

What does effective kernel mean with dilation?

Effective kernel equals D×(K−1)+1. Dilation spreads taps apart, so the receptive field grows without adding weights. The calculator shows Keff for each dimension to avoid manual expansion.

Why is SAME padding sometimes asymmetric?

When stride is greater than 1, the required total padding may be odd. The tool splits it as floor(total/2) before and the remainder after, which can shift one border by one pixel.

How does stride change compute and memory?

In 2D, doubling stride typically halves height and width, cutting activations by about 4× and reducing convolution FLOPs similarly. This speeds training, but it can remove small objects and fine edges.

Why can the stride solver miss my exact target?

The output formula contains a floor operation, so only certain outputs are reachable for a fixed input, kernel, dilation, and padding. If the closest stride is off, adjust padding or kernel, or redesign the downsampling schedule.

What is output padding in transposed convolution?

Output padding is a small extra term that increases the transposed output size by 0..(stride−1). It is useful for matching skip‑connection sizes without changing kernel or stride.

When should I use 1D mode?

Use 1D for sequences such as audio frames, token embeddings, or sensor signals. Enter the length as “input height/length”; width-related fields are ignored so you can focus on the single dimension.

Recent calculation history

Saved in your current browser session.

No calculations yet. Run the calculator to build a history.

Stride Calculator for Neural Layers