Benchmark
Last updated
Was this helpful?
Last updated
Was this helpful?
Please refer to the for detailed performance benchmark results.
When measuring GPU times on your own on Linux, set the environment variable CUDA_MODULE_LOADING=EAGER
to avoid CUDA API overheads during the first kernel execution.