Skip to content
View loretoparisi's full-sized avatar
🐍
NightShift
🐍
NightShift

Organizations

@Musixmatchdev @musixmatchresearch
Block or Report

Block or report loretoparisi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Language Modeling

245 repositories

Repo for external large-scale work

Python 6,427 722 Updated Apr 27, 2024

Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"

Python 44 4 Updated May 31, 2022

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Python 846 106 Updated Oct 30, 2023

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

Python 459 76 Updated Feb 24, 2024

Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

Lua 11,526 2,580 Updated Oct 24, 2023

Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible for beginners and as seamlesscustomizable and as possible for…

Python 170 269 Updated Nov 30, 2023

BERT score for text generation

Jupyter Notebook 1,501 206 Updated Jun 14, 2024

UmBERTo: an Italian Language Model trained with Whole Word Masking.

Python 100 2 Updated Dec 15, 2022

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,140 880 Updated Jul 12, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,646 610 Updated Jul 25, 2023

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Python 227 40 Updated Oct 30, 2019

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 21,954 5,433 Updated Jun 11, 2024

Language Modeling with the H3 State Space Model

Assembly 504 54 Updated Sep 29, 2023
Jupyter Notebook 475 61 Updated Mar 14, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 11,988 825 Updated Jul 11, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,708 978 Updated Jul 12, 2024

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

Python 338 53 Updated Mar 4, 2023

Making large AI models cheaper, faster and more accessible

Python 38,337 4,308 Updated Jul 12, 2024

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,132 404 Updated Apr 24, 2023

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,712 313 Updated Jun 12, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,076 528 Updated Apr 19, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,107 750 Updated Jul 10, 2024

Utilities to use the Hugging Face Hub API

TypeScript 1,288 164 Updated Jul 12, 2024

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Python 4,331 182 Updated Jul 12, 2024

LLM inference in C/C++

C++ 61,566 8,801 Updated Jul 12, 2024

The simplest way to run LLaMA on your local machine

CSS 13,092 1,429 Updated Jun 18, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,143 4,015 Updated Mar 12, 2024

C++ implementation for BLOOM

C 809 65 Updated May 13, 2023

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

JavaScript 10,218 598 Updated Jul 12, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,402 2,196 Updated Feb 23, 2024