Insights: huggingface/nanotron
Overview
- 1 Merged pull request
- 0 Open pull requests
- 0 Closed issues
- 2 New issues
There hasn’t been any commit activity on huggingface/nanotron in the last week.
1 Pull request merged by 1 person
- readme (#145), merged Jul 21, 2024
2 Issues opened by 2 people
- Will audio input training be supported in the future? (#210), opened Jul 26, 2024
- multi-node pp hang when enable gradient accumulation (#209), opened Jul 24, 2024
9 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- Ring attention (#181), commented on Jul 24, 2024 • 3 new comments
- Fix tp mem cache (#203), commented on Jul 23, 2024 • 1 new comment
- Memory optimization in async tp-linear (#208), commented on Jul 23, 2024 • 1 new comment
- Fineweb Configuration (#195), commented on Jul 23, 2024 • 0 new comments
- Request for detailed FineWeb-ablation-models training strategy & hyperparams (#201), commented on Jul 23, 2024 • 0 new comments
- "datatrove" is missing from the examples folder (#175), commented on Jul 25, 2024 • 0 new comments
- Fix _RowLinearAsyncCommunication (#172), commented on Jul 22, 2024 • 0 new comments
- Llama3 conversion scripts 🦙 (#174), commented on Jul 25, 2024 • 0 new comments
- Supporting datatrove tokenized documents with Nanosets (#189), commented on Jul 25, 2024 • 0 new comments