matteosoo
  • National Tsing Hua University
  • Taipei, Taiwan


A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 5,853 632 Updated Sep 10, 2024

The Memory layer for your AI apps

Python 21,457 1,953 Updated Sep 10, 2024

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,366 60 Updated Sep 7, 2024

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 6,264 486 Updated Sep 10, 2024

Implementation for MatMul-free LM.

Python 2,854 175 Updated Aug 28, 2024

Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Downloading files and images to a local hard drive.

Python 1,091 127 Updated Aug 23, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,530 2,211 Updated Jul 29, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 44,656 6,270 Updated Sep 10, 2024

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

TypeScript 2,657 308 Updated Aug 21, 2024

Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)

Python 259 21 Updated Mar 19, 2024

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

Python 518 69 Updated Nov 10, 2023

LlamaIndex is a data framework for your LLM applications

Python 35,307 4,960 Updated Sep 10, 2024

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 1,984 320 Updated Nov 14, 2023

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python 7,540 753 Updated Feb 11, 2024

Inference code for CodeLlama models

Python 15,845 1,840 Updated Aug 12, 2024

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Python 19,787 2,201 Updated Aug 21, 2024

SoftVC VITS Singing Voice Conversion

Python 25,298 4,751 Updated Nov 11, 2023

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,605 920 Updated Apr 23, 2024

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,143 168 Updated Feb 5, 2024

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Python 273 34 Updated Jul 16, 2023

Inference code for Llama models

Python 55,409 9,446 Updated Aug 18, 2024

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 294 35 Updated Jul 22, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,651 3,438 Updated May 18, 2024

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,686 706 Updated Jul 3, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,508 2,483 Updated Aug 28, 2024

Identification and conversion functions for Chinese text processing

Python 56 19 Updated May 25, 2024

List of speech synthesis papers.

991 120 Updated Jul 24, 2023

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 124,951 23,183 Updated Aug 31, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,407 305 Updated Jan 4, 2024

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, etc.)

Python 653 82 Updated Oct 11, 2023