Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 32,887 3,957 Updated Nov 16, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,472 455 Updated Oct 10, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,122 4,250 Updated Aug 19, 2024

SoftVC VITS Singing Voice Conversion

Python 25,888 4,826 Updated Nov 11, 2023

vits2 backbone with multilingual-bert

Python 8,003 1,132 Updated Nov 15, 2024

Hydra is a framework for elegantly configuring complex applications

Python 8,812 635 Updated Nov 16, 2024

Tracking the progress in end-to-end speech translation

254 25 Updated Oct 25, 2023

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,258 2,237 Updated Aug 12, 2024

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models

Python 18 1 Updated Aug 11, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,935 1,058 Updated Nov 14, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 19 Updated Aug 1, 2024

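Collecting, organizing, and publishing Chinese natural language processing corpora/datasets, and working with like-minded people to advance Chinese natural language processing.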

Jupyter Notebook 5,905 1,399 Updated Jan 29, 2019

Pushing the Limits of Zero-shot End-to-End Speech Translation

Python 21 3 Updated Aug 17, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,866 1,263 Updated Dec 6, 2023

Mirror of https://git.ffmpeg.org/ffmpeg.git

C 46,067 12,168 Updated Nov 17, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 35,716 4,072 Updated Nov 7, 2024

TransferTTS (Zero-Shot learning of VITS)

Python 90 11 Updated Sep 23, 2022

Chinese speech pretrained models

Shell 1,035 87 Updated Aug 23, 2024

A hands-on, step-by-step Huggingface Transformers course; course videos are updated in sync on Bilibili and YouTube

Jupyter Notebook 2,074 301 Updated Jul 15, 2024

This is an implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)

Python 19 3 Updated May 1, 2022

A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.

Python 60 4 Updated Oct 22, 2024

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Python 36 7 Updated Oct 25, 2023

Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Python 62 6 Updated May 25, 2022

Fast and memory-efficient exact attention

Python 14,249 1,332 Updated Nov 17, 2024

Dataset for lightly supervised training using the LibriVox audiobook recordings. https://librivox.org/

Python 480 78 Updated Jul 11, 2023

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,485 1,449 Updated Oct 21, 2024

Notebooks using the Hugging Face libraries 🤗

Jupyter Notebook 3,669 1,538 Updated Nov 12, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 32,278 4,757 Updated Nov 14, 2024
Jupyter Notebook 10,457 1,294 Updated May 21, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,947 3,321 Updated Jul 23, 2024