- Italy, Bologna
- @loretoparisi
Lists (32)
- MIDI
- CapsNet: Capsule Neural Networks
- BERT: Bidirectional Encoder Representations from Transformers
- Rust
- JavaScript
- FastText: Word2vec & FastText text embeddings
- Image Generation: Neural generation of images
- RL: Reinforcement Learning and Agents
- Language Modeling
- NER: Named Entities Recognition
- Search: Search Engines
- CLIP: Contrastive Language-Image Pre-training
- ONNX: Onnx runtime and models weights
- Python
- Image Classification: Classification of images
- Wasm: Web Assembly
- Chinese NLP
- Audio
- TensorRT
- Erlang
- Elixir
- Computer Vision
- Word2vec
- Text Classification
- Semantic Search
- Protein Structure Prediction
- 3D Scene Reconstruction
- Audio Generation: Audio synthesis
- Graph Neural Networks: GNN
- Automatic Speech Recognition: Speech to Text
- Embedding
- Audio Source Separation
Stars
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Run FFmpeg as an API with fluent-ffmpeg compatibility, queues and S3 storage.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Official repository of the DanteLLM family!
React Native Expo wrapper for the Swift WhisperKit library
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
WebAssembly binding for llama.cpp - Enabling in-browser LLM inference
Code repository for the paper - "Matryoshka Representation Learning"
Reaching LLaMA2 Performance with 0.1M Dollars
Zero-Shot Speech Editing and Text-to-Speech in the Wild
A player for SSE streaming chunks from the OpenAI audio speech API.
When do we not need larger vision models?
Efficiently read embeddings in streaming fashion from any filesystem
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
eltociear / LLM4Decompile
Forked from albertan017/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models