Conv2d benchmark
Benchmarking fp32 conv2d kernels on a 1×64×256×256 input with 128 filters of size 3×3.
One untimed warmup run, followed by one measured run.
No samples yet.
Run an individual strategy or start the full queue.
Benchmarking fp32 conv2d kernels on a 1×64×256×256 input with 128 filters of size 3×3.
One untimed warmup run, followed by one measured run.
No samples yet.
Run an individual strategy or start the full queue.