- Italy, Bologna
- @loretoparisi
Block or Report
Block or report loretoparisi
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (32)
Sort Name ascending (A-Z)
3D Scene Reconstruction
Audio
Audio Generation
Audio synthesisAudio Source Separation
Automatic Speech Recognition
Speech to TextBERT
Bidirectional Encoder Representations from TransformersCapsNet
Capsule Neural NetworksChinese NLP
CLIP
Contrastive Language-Image Pre-trainingComputer Vision
Elixir
Embedding
Erlang
FastText
Word2vec & FastText text embeddingsGraph Neural Networks
GNNImage Classification
Classification of imagesImage Generation
Neural generation of imagesJavaScript
Language Modeling
MIDI
NER
Named Entities RecognitionONNX
Onnx runtime and models weightsProtein Structure Prediction
Python
RL
Reinforcement Learning and AgentsRust
Search
Search EnginesSemantic Search
TensorRT
Text Classification
Wasm
Web AssemblyWord2vec
Stars
Language
Sort by: Recently starred
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf
An extension to the Amazon SQS client that enables sending and receiving messages up to 2GB via Amazon S3.
Opensource IDE For Exploring and Testing Api's (lightweight alternative to postman/insomnia)
Supplementary code for the paper "Hyperbolic Image Embeddings".
The official implementation of Self-Play Fine-Tuning (SPIN)
Comprehensive dynamic time warping module for python
3D ResNets for Action Recognition (CVPR 2018)
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
On-device Speech Recognition for Apple Silicon
Advanced Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
An Open Source text-to-speech system built by inverting Whisper.
SGLang is yet another fast serving framework for large language models and vision language models.
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
The public release of LeftoverLocals code
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
OCR, layout analysis, reading order, line detection in 90+ languages
Peter-obi / voice-assistant
Forked from linyiLYi/voice-assistantVoice assistant using apple mlx_framework
Official implementation of Half-Quadratic Quantization (HQQ)
Run Mixtral-8x7B models in Colab or consumer desktops
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
A multi-voice TTS system trained with an emphasis on quality