Block

Here we include instructions on how to use the block averaging in AdaHessian for different kernel shapes (e.g. matrix, 1D Conv, 2D Conv, etc.). The instructions are based on the adahessian.py file in this directory. In particular, Lines 105 -- 156 of that file show how to apply block averaging to the various kernel types. For 1D, 3D, and 4D tensors, we give two choices for setting the spatial averaging. Further instructions are included in the code comments.
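To make the idea concrete, here is a minimal sketch of block averaging applied to a per-element Hessian diagonal estimate. It is not the code from adahessian.py: the function names `block_average_1d` and `spatial_average_conv2d` and the `block_length` parameter are hypothetical, and the sketch assumes the tensor size is divisible by the block length and that 4D kernels are laid out as (out_channels, in_channels, kH, kW).

```python
import torch


def block_average_1d(hut_diag, block_length):
    """Replace each entry by the mean of |entries| in its block.

    Hypothetical helper: splits a 1D tensor into consecutive blocks of
    `block_length` entries and averages the absolute values per block.
    """
    n = hut_diag.numel()
    if n % block_length != 0:
        raise ValueError("tensor size must be divisible by block_length")
    blocks = hut_diag.abs().view(-1, block_length)
    means = blocks.mean(dim=1, keepdim=True)
    # Broadcast each block mean back to every entry of its block.
    return means.expand(-1, block_length).reshape(n)


def spatial_average_conv2d(hut_diag):
    """Average |entries| over the spatial dims of a 4D conv kernel.

    Hypothetical helper: every (out, in) filter slice of shape (kH, kW)
    is replaced by its mean absolute value.
    """
    means = hut_diag.abs().mean(dim=[2, 3], keepdim=True)
    return means.expand_as(hut_diag).clone()


vec = torch.tensor([1.0, -3.0, 2.0, 2.0])
print(block_average_1d(vec, block_length=2))  # tensor([2., 2., 2., 2.])

kernel = torch.arange(8.0).view(1, 2, 2, 2)
print(spatial_average_conv2d(kernel)[0, :, 0, 0])  # tensor([1.5000, 5.5000])
```

Setting the block length to 1 leaves each absolute value unchanged, and a larger block trades per-element resolution for a smoother, less noisy curvature estimate.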

AdaHessian is currently used in the same way as any other PyTorch optimizer, except that the backward pass must be called with create_graph=True:

loss.backward(create_graph=True)
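The reason for `create_graph=True` is that a Hessian-based optimizer has to differentiate the gradient a second time. The sketch below illustrates this on a toy quadratic with plain PyTorch calls. It is an illustration of the general Hutchinson-style estimate, not the optimizer's actual code, and the names `a`, `w`, and `z` are chosen here for the example.

```python
import torch

torch.manual_seed(0)

# Toy problem: loss = 0.5 * sum(a * w^2), whose Hessian is diag(a).
a = torch.tensor([1.0, 2.0, 3.0])
w = torch.nn.Parameter(torch.ones(3))

loss = 0.5 * (a * w ** 2).sum()
# Keep the graph of the backward pass so that w.grad is itself differentiable.
loss.backward(create_graph=True)

# Rademacher probe vector with entries in {-1, +1}.
z = torch.randint_like(w, high=2) * 2 - 1

# Hessian-vector product Hz, obtained by differentiating the gradient.
(hz,) = torch.autograd.grad(w.grad, w, grad_outputs=z)

# Hutchinson estimate of the Hessian diagonal. It is exact here because
# the Hessian of this toy loss is diagonal.
diag_estimate = z * hz
print(diag_estimate)  # tensor([1., 2., 3.])
```

Without `create_graph=True`, `w.grad` carries no graph and the `torch.autograd.grad` call above raises an error.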

Please refer to the image classification example for a complete usage example.