CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers

threadblock → gemm Relation

File in include/cutlass/epilogue/threadblock  Includes file in include/cutlass/gemm
default_epilogue_complex_tensor_op.h          include/cutlass/gemm/gemm.h
default_epilogue_simt.h                       include/cutlass/gemm/gemm.h
default_epilogue_tensor_op.h                  include/cutlass/gemm/gemm.h
default_epilogue_volta_tensor_op.h            include/cutlass/gemm/gemm.h
default_epilogue_wmma_tensor_op.h             include/cutlass/gemm/gemm.h
default_thread_map_simt.h                     include/cutlass/gemm/gemm.h
default_thread_map_tensor_op.h                include/cutlass/gemm/gemm.h
default_thread_map_volta_tensor_op.h          include/cutlass/gemm/gemm.h
default_thread_map_wmma_tensor_op.h           include/cutlass/gemm/gemm.h
direct_epilogue_tensor_op.h                   include/cutlass/gemm/gemm.h
epilogue.h                                    include/cutlass/gemm/gemm.h
epilogue_base.h                               include/cutlass/gemm/gemm.h
interleaved_epilogue.h                        include/cutlass/gemm/gemm.h