Highlights
- Pro
-
verbatim-memorization Public
Forked from explanare/verbatim-memorizationDemystifying Verbatim Memorization in Large Language Models
Python MIT License UpdatedSep 12, 2024 -
acr-memorization Public
Forked from locuslab/acr-memorizationPython BSD 3-Clause "New" or "Revised" License UpdatedSep 10, 2024 -
pythia Public
Forked from EleutherAI/pythiaThe hub for EleutherAI's work on interpretability and learning dynamics
Jupyter Notebook Apache License 2.0 UpdatedSep 6, 2024 -
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedAug 29, 2024 -
mmarius.github.io Public
Forked from alshedivat/al-folioA beautiful, simple, clean, and responsive Jekyll theme for academics
JavaScript MIT License UpdatedAug 14, 2024 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedMay 10, 2024 -
-
peft Public
Forked from huggingface/peft🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python Apache License 2.0 UpdatedMar 22, 2024 -
examples Public
Forked from mosaicml/examplesFast and flexible reference benchmarks
Shell Apache License 2.0 UpdatedJan 23, 2024 -
composer Public
Forked from mosaicml/composerSupercharge Your Model Training
Python Apache License 2.0 UpdatedJan 8, 2024 -
evaluate Public
Forked from huggingface/evaluate🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
Python Apache License 2.0 UpdatedNov 17, 2022 -
awesome-finetuning Public
A curated list of resources on fine-tuning language models.
-
Megatron-DeepSpeed Public
Forked from microsoft/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedJun 9, 2022 -
-
BayesianTransferLearning Public
Forked from hsouri/BayesianTransferLearningPython UpdatedMay 23, 2022 -
-
-
adapter-transformers Public
Forked from adapter-hub/adaptersHuggingface Transformers + Adapters = ❤️
Python Apache License 2.0 UpdatedMar 14, 2022 -
-
OpenPrompt Public
Forked from thunlp/OpenPromptAn Open-Source Toolkit for Prompt-Learning.
Python Apache License 2.0 UpdatedNov 4, 2021 -
HF-Megatron-DeepSpeed Public
Forked from bigscience-workshop/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedNov 2, 2021 -
-
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedMay 24, 2021 -
promptsource Public
Forked from bigscience-workshop/promptsourceToolkit for collecting and applying templates of prompting instances
Python Apache License 2.0 UpdatedMay 20, 2021 -
dynamic-lm-kb Public
Forked from zouharvi/dynamic-lm-kbLM with limited access to KB
Python UpdatedApr 27, 2021 -
pet Public
Forked from TevenLeScao/petThis repository contains the code for "How many data points is a prompt worth?"
Python Apache License 2.0 UpdatedApr 7, 2021 -
shortformer Public
Forked from ofirpress/shortformerCode for the Shortformer model, from the paper by Ofir Press, Noah A. Smith and Mike Lewis.
Python MIT License UpdatedMar 6, 2021 -
BIG-bench Public
Forked from google/BIG-benchBeyond the Imitation Game collaborative benchmark for enormous language models
Jupyter Notebook Apache License 2.0 UpdatedJan 27, 2021 -
AtypicalAnimacy Public
Forked from Living-with-machines/AtypicalAnimacyRepository for code underlying the paper 'Living Machines: A Study of Atypical Animacy' (COLING2020)
Jupyter Notebook Other UpdatedDec 17, 2020 -
mlxtend Public
Forked from rasbt/mlxtendA library of extension and helper modules for Python's data analysis and machine learning libraries.
Python Other UpdatedNov 24, 2020