Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 720 46 Updated Oct 28, 2024

This repository collects an extensive list of awesome papers about Story Generation / Storytelling, primarily focusing on the era of Large Language Models (LLMs).

315 20 Updated Jun 25, 2024

Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.

Python 23 3 Updated Oct 18, 2024

Groq-powered chat assistant for generating contextual responses to user queries.

Jupyter Notebook 5 1 Updated Mar 22, 2024

Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models

Python 140 15 Updated Dec 18, 2023

The Programmable Cypher-based Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected

Jupyter Notebook 687 62 Updated Sep 25, 2024

Infinite Alchemy is an AI-powered game where you mix and match elements to create basically anything

JavaScript 24 10 Updated May 18, 2024

Comparison of Language Model Inference Engines

189 6 Updated Sep 2, 2024

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook 230 15 Updated May 19, 2024

Awesome speech/audio LLMs, representation learning, and codec models

669 31 Updated Nov 1, 2024

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 4,950 311 Updated Oct 18, 2023

whiteboard / infinite canvas SDK

TypeScript 35,684 2,187 Updated Nov 2, 2024

draw.io is a JavaScript, client-side editor for general diagramming.

JavaScript 41,212 7,648 Updated Oct 23, 2024

A Python package to analyze and compare voices with deep learning

Python 2,770 427 Updated Oct 12, 2023

A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.

89 20 Updated Apr 26, 2024

Inference Llama 2 in one file of pure C

C 17,418 2,079 Updated Aug 6, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,150 1,816 Updated Aug 19, 2024

A generative language model which seeks to maximize rhyming syllables. Based on OpenAI's GPT-2.

Python 1 Updated Aug 22, 2023

VTuber application which only requires your voice and microphone, no need for a webcam or other tracking nonsense.

C++ 13 2 Updated Sep 29, 2024

OrgChart with a twist! Simpler, faster navigation and exporting capabilities for those power users who need to manage bulky organizations.

JavaScript 1 3 Updated Aug 26, 2021

A browser extension that removes YouTube suggestions, comments, shorts, and more

JavaScript 394 33 Updated Jul 6, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,255 485 Updated Nov 1, 2024

Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. In this project, we specifically compared three self-supervised models: Wav2vec (2019, 2020), HuBERT (2021…

Python 204 18 Updated May 9, 2022

Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.

Jupyter Notebook 1 Updated May 9, 2022

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)

Jupyter Notebook 120 13 Updated Feb 24, 2023

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages

Python 66 20 Updated May 31, 2022

Genetic programming using the Elixir AST

Elixir 9 1 Updated Sep 15, 2022

Library for Textless Spoken Language Processing

Python 529 51 Updated Aug 29, 2023

THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.

C++ 6,757 255 Updated Jun 29, 2022