Mojo module
conv
Structs
-
ConvDirectNHWC: Implement the outer loops for direct convolution. Collapse N, HO, WO into one dimension n_ho_wo. Tile n_ho_wo, C, and F. The tile factor for C and F are chosen by a heuristic prioritizing C. n_ho_wo is tiled by micro kernel's height. -
CuDNNConvMeta: -
Naive2dConvolution: Struct wrapper for naive 2d convolution implementation.
Functions
-
accumulate_wo_tile_1d: Update one row in the output for a given (c, f) tile. -
accumulate_wo_tile_2d: -
accumulate_wo_tile_3d: -
check_cudnn_error: -
conv1d_update_wo_tile: -
conv2d_gpu_naive_nhwc_rscf: -
conv2d_update_wo_tile: -
conv3d_gpu_naive_ndhwc_qrscf: -
conv3d_update_wo_tile: -
conv_cudnn: -
conv_gpu: -
conv_nhwc_direct: -
conv_shape: Compute the output shape of aconvoperation, and assert the inputs are compatible. -
get_cudnn_dtype: Map Mojo DType to cuDNN data type. -
pack_conv_filter_shape: Compute the output shape of convolution filter packing. -
pack_filter: This packs the filter form RSCF to FRSCf. Use the default micro kernel size for dynamic shapes. -
pack_filter_shape: Compute the shape of packed filter. The packed layout is FRSCf. shape_ref should be allocated with size 5 outside this kernel. -
pack_filter_shape_impl: Compute the shape of packed filter. The packed layout is FRSCf. shape_ref should be allocated with size 5 outside this kernel.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!