-
Notifications
You must be signed in to change notification settings - Fork 300
Issues: pytorch/torchtune
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
RuntimeError: CUDA error: an illegal memory access was encountered
#1202
opened Jul 20, 2024 by
epollwait
wikitext_dataset
fails in model forward with "Expected Long, Int; but got torch.cuda.FloatTensor"
bug
#1191
opened Jul 17, 2024 by
RdoubleA
Error resuming lora training using config if checkpoint_dir is different from output_dir
bug
Something isn't working
#1189
opened Jul 17, 2024 by
oliviernabuio
expandable_segments with PYTORCH_CUDA_ALLOC_CONF reduces VRAM
discussion
Start a discussion
#1185
opened Jul 16, 2024 by
winglian
[RFC] Adding RoPE scaling methods to support long context modeling
rfc
#1183
opened Jul 16, 2024 by
joecummings
load_dataset fails on distributed recipes for datasets with remote code
bug
Something isn't working
#1178
opened Jul 15, 2024 by
pbontrager
[feature request] Saving / Loading packed dataset
enhancement
New feature or request
help wanted
Extra attention is needed
#1149
opened Jul 8, 2024 by
ScottHoang
generate is correct but generate from quantization get error:
help wanted
Extra attention is needed
question
Further information is requested
#1148
opened Jul 8, 2024 by
artisanclouddev
Resize token embedding.
help wanted
Extra attention is needed
question
Further information is requested
#1145
opened Jul 7, 2024 by
hungphongtrn
Shape error when using torchtune.modules.RotaryPositionalEmbeddings
question
Further information is requested
#1157
opened Jul 6, 2024 by
Leo-Lifeblood
safe_torch_load failed when resume from checkpoint
bug
Something isn't working
question
Further information is requested
#1142
opened Jul 3, 2024 by
ScottHoang
text_completion_dataset removed?
question
Further information is requested
#1140
opened Jul 3, 2024 by
wiiiktor
Quantization for Llama-70b raises CUDA OOM
question
Further information is requested
#1128
opened Jun 27, 2024 by
lulmer
Support for Phi-3-mini-128k-instruct and larger context length models
#1120
opened Jun 25, 2024 by
dcsuka
Missing non-LoRA key tok_embeddings.weight from base model dict
#1110
opened Jun 22, 2024 by
vasicvuk
Previous Next
ProTip!
no:milestone will show everything without a milestone.