Matmul benchmark
Benchmarking fp32 matmul kernels on 4096×4096 matrices.
One untimed warmup run, followed by one measured run.
No samples yet.
Run an individual strategy or start the full queue.
Benchmarking fp32 matmul kernels on 4096×4096 matrices.
One untimed warmup run, followed by one measured run.
No samples yet.
Run an individual strategy or start the full queue.