- Italy, Bologna
- @loretoparisi
Block or Report
Block or report loretoparisi
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (32)
Sort Oldest
Audio Source Separation
Embedding
Automatic Speech Recognition
Speech to TextGraph Neural Networks
GNNAudio Generation
Audio synthesis3D Scene Reconstruction
Protein Structure Prediction
Semantic Search
Text Classification
Word2vec
Computer Vision
Elixir
Erlang
TensorRT
Audio
Chinese NLP
Wasm
Web AssemblyImage Classification
Classification of imagesPython
ONNX
Onnx runtime and models weightsCLIP
Contrastive Language-Image Pre-trainingSearch
Search EnginesNER
Named Entities RecognitionLanguage Modeling
RL
Reinforcement Learning and AgentsImage Generation
Neural generation of imagesFastText
Word2vec & FastText text embeddingsJavaScript
Rust
BERT
Bidirectional Encoder Representations from TransformersCapsNet
Capsule Neural NetworksMIDI
Stars
Language
Sort by: Recently starred
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Run FFmpeg as an API with fluent-ffmpeg compatibility, queues and S3 storage.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Official repository of the DanteLLM family!
React Native Expo wrapper for the Swift WhisperKit library
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
WebAssembly binding for llama.cpp - Enabling in-browser LLM inference
Code repository for the paper - "Matryoshka Representation Learning"
Reaching LLaMA2 Performance with 0.1M Dollars
Zero-Shot Speech Editing and Text-to-Speech in the Wild
It's a player for play SSE streaming chunk from OpenAI audio speech API.
When do we not need larger vision models?
Efficiently read embedding in streaming from any filesystem
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
eltociear / LLM4Decompile
Forked from albertan017/LLM4DecompileReverse Engineering: Decompiling Binary Code with Large Language Models