swiss-ai
Popular repositories
- nanotron Public Forked from huggingface/nanotron
Minimalistic large language model 3D-parallelism training
- video2dataset Public Forked from iejMac/video2dataset
Easily create large video dataset from video urls
Python 1
- Megatron-LLM Public Forked from epfLLM/Megatron-LLM
distributed trainer for LLMs
Python
- data-PDF-pipeline Public
PDF pipeline for creating training corpora (mainly for llm, multimodal and alignment horizontals)
Python
- data-tooling Public Forked from huggingface/datatrove
Tool set for data preparation and selection in the context of Swiss-AI (forked from DataTrove)
Python
Repositories
- nanotron-multilingual Public Forked from swiss-ai/nanotron
A copy of nanotron for multilingual training
- ml-4m Public Forked from apple/ml-4m
4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)
- video2dataset Public Forked from iejMac/video2dataset
Easily create large video dataset from video urls
- data-tooling Public Forked from huggingface/datatrove
Tool set for data preparation and selection in the context of Swiss-AI (forked from DataTrove)
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
- data-PDF-pipeline Public
PDF pipeline for creating training corpora (mainly for llm, multimodal and alignment horizontals)