blackbook-lab

Yancy Dan blackbook-lab

0 followers · 10 following

Lists (11)

AI learning

Audio Clone

BlockChain

CV

dataset

DataWhale

Engineering

hugging face

LLM

STT

16 repositories

TTS

Stars

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 32,887 3,957 Updated Nov 16, 2024

InternLM / InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,472 455 Updated Oct 10, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,122 4,250 Updated Aug 19, 2024

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 25,888 4,826 Updated Nov 11, 2023

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,003 1,132 Updated Nov 15, 2024

facebookresearch / hydra

Hydra is a framework for elegantly configuring complex applications

Python 8,812 635 Updated Nov 16, 2024

kahne / SpeechTransProgress

Tracking the progress in end-to-end speech translation

254 25 Updated Oct 25, 2023

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,258 2,237 Updated Aug 12, 2024

openaudiolab / LLaST

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models

Python 18 1 Updated Aug 11, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,935 1,058 Updated Nov 14, 2024

formiel / fairseq

Forked from facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 19 Updated Aug 1, 2024

SophonPlus / ChineseNlpCorpus

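Collects, organizes, and publishes Chinese natural language processing corpora and datasets, working with like-minded contributors to advance Chinese NLP.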

Jupyter Notebook 5,905 1,399 Updated Jan 29, 2019

mt-upc / ZeroSwot

Pushing the Limits of Zero-shot End-to-End Speech Translation

Python 21 3 Updated Aug 17, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,866 1,263 Updated Dec 6, 2023

FFmpeg / FFmpeg

Mirror of https://git.ffmpeg.org/ffmpeg.git

C 46,067 12,168 Updated Nov 17, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 35,716 4,072 Updated Nov 7, 2024

hcy71o / TransferTTS

TransferTTS (Zero-Shot learning of VITS)

Python 90 11 Updated Sep 23, 2022

TencentGameMate / chinese_speech_pretrain

Chinese speech pretrained models

Shell 1,035 87 Updated Aug 23, 2024

zyds / transformers-code

A hands-on, step-by-step Huggingface Transformers course; the course videos are published in sync on Bilibili and YouTube.

Jupyter Notebook 2,074 301 Updated Jul 15, 2024

ReneeYe / XSTNet

This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)

Python 19 3 Updated May 1, 2022

ictnlp / NAST-S2x

A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.

Python 60 4 Updated Oct 22, 2024

ictnlp / STEMM

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Python 36 7 Updated Oct 25, 2023

ReneeYe / ConST

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Python 62 6 Updated May 25, 2022

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 14,249 1,332 Updated Nov 17, 2024

facebookresearch / libri-light

dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.

Python 480 78 Updated Jul 11, 2023

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,485 1,449 Updated Oct 21, 2024

huggingface / notebooks

Notebooks using the Hugging Face libraries 🤗

Jupyter Notebook 3,669 1,538 Updated Nov 12, 2024

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 32,278 4,757 Updated Nov 14, 2024

google-research / vision_transformer

Jupyter Notebook 10,457 1,294 Updated May 21, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,947 3,321 Updated Jul 23, 2024