-
-
Notifications
You must be signed in to change notification settings - Fork 5.6k
Issues: labmlai/annotated_deep_learning_paper_implementations
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weโll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
How to use my own database for training and evaluating Retro for Question-Answering?
#260
opened Jun 27, 2024 by
Zahin112
updated Jun 27, 2024
RETRO: RuntimeError: stack expects each tensor to be equal size, but got [2, 32] at entry 0 and [1, 32] at entry 29
question
Further information is requested
#135
opened Jul 21, 2022 by
mocarsha
updated Jun 27, 2024
question about RotaryPEMultiHeadAttention: rotary_percentage
#246
opened Mar 13, 2024 by
YOONSEOKHEO
updated Jun 24, 2024
Training code or references for training the latent diffusion model on a custom dataset
paper implementation
New paper implementation
#195
opened Jul 8, 2023 by
risejl
updated Apr 19, 2024
"pip install labml-nn" generated errors. How to resolve it and complete the installation?
#247
opened Mar 14, 2024 by
jxwanguab
updated Mar 14, 2024
Request for Paper Implementation - Neural Operators
paper implementation
New paper implementation
#219
opened Oct 26, 2023 by
Robertboy18
updated Nov 7, 2023
Stride = 2 only for the first block of the entire ResNet
#205
opened Aug 16, 2023 by
PavelShtykov
updated Aug 16, 2023
Small issue in nucleus sampling explanation + implementation
#203
opened Aug 10, 2023 by
ascher8
updated Aug 10, 2023
q,k,v have different shape but torch.stack works?
#202
opened Aug 8, 2023 by
junsukha
updated Aug 8, 2023
Bug in Transformer-XL shift method
bug
Something isn't working
#185
opened May 16, 2023 by
Bearnardd
updated Jul 15, 2023
Dimension of subsequent layers in Hypernetwork
question
Further information is requested
#169
opened Feb 27, 2023 by
Simply-Adi
updated Jun 30, 2023
MultiHeadAttention parameter setting
improvement
#180
opened Apr 30, 2023 by
LXXiaogege
updated Jun 30, 2023
The classifier-free guidance of diffusion models is wrong.
bug
Something isn't working
#177
opened Apr 9, 2023 by
luowyang
updated Jun 30, 2023
Previous Next
ProTip!
Whatโs not been updated in a month: updated:<2024-09-07.