Conv2d benchmark

Benchmarking fp32 conv2d kernels on a 1×64×256×256 input with 128 filters of size 3×3.

Strategies

One untimed warmup run, followed by one measured run.

Results

Sorted fastest first.

No samples yet.

Run an individual strategy or start the full queue.