Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

Python 512 53 Updated Oct 7, 2024

haoheliu / SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 132 8 Updated Aug 25, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 437 39 Updated Jun 9, 2024

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 515 44 Updated Oct 5, 2024

KbsdJames / Omni-MATH

The official repository of the Omni-MATH benchmark.

Python 26 Updated Sep 16, 2024

matthewrenze / self-reflection

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

20 2 Updated May 3, 2024

3DTopia / Phidias-Diffusion

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

151 2 Updated Sep 19, 2024

marlaman / show-me

A visual and transparent alternative to open-source ChatGPT O1

Python 571 56 Updated Sep 26, 2024

Time-MoE / Time-MoE

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Python 145 11 Updated Oct 7, 2024

UCSC-VLAA / o1_medical

Python 32 Updated Sep 25, 2024

zhaoxlpku / SubgoalXL

Python 17 1 Updated Aug 23, 2024

menyifang / MIMO

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

1,054 46 Updated Sep 27, 2024

OpenBMB / MiniCPM-CookBook

This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achieving exceptional performance on the edge.

Python 64 4 Updated Sep 29, 2024

Muennighoff / kto

Python 3 Updated May 4, 2024

baaivision / Emu3

Next-Token Prediction is All You Need

Python 844 25 Updated Sep 30, 2024

VectorSpaceLab / Video-XL

Effective and efficient hour-scale long video understanding model

20 1 Updated Sep 22, 2024

hyperknot / openfreemap

Free and open-source map hosting solution with custom styles for websites and apps, using OpenStreetMap data

Python 2,063 39 Updated Sep 29, 2024

mks0601 / ExAvatar_RELEASE

Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024.

Python 377 28 Updated Oct 3, 2024

neo4j / NaLLM

Repository for the NaLLM project

TypeScript 1,245 244 Updated Jun 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lijun20

Block or report lijun20

Stars

ImagineAILab / ai-by-hand-excel

benfinkelshtein / CoGNN

memfreeme / memfree

john-hewitt / implicit-ins

MAGIC-AI4Med / MMedLM

adithya-s-k / VARAG

ivcylc / qa-mdt

jyrao / MatchTime

raphael-baena / DTLR

sugarandgugu / Text2Image-Retrieval

TAG-Research / lotus

DrewThomasson / ebook2audiobookXTTS