PDF to Markdown with vision models

Python 6,257 343 Updated Nov 17, 2024

Document to Markdown OCR library with Llama 3.2 vision

TypeScript 1,089 73 Updated Nov 12, 2024

Convert any PDF into a podcast episode!

Python 1,491 163 Updated Nov 4, 2024

An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI

Python 1,074 114 Updated Nov 16, 2024

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch

Python 1,423 90 Updated Oct 31, 2024

RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.

C# 1,597 308 Updated Nov 15, 2024

Zep | The Memory Foundation For Your AI Stack

Go 2,703 384 Updated Oct 4, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,471 108 Updated Nov 13, 2024

Building AI agents, atomically

Python 941 80 Updated Nov 16, 2024

Get your documents ready for gen AI

Python 9,439 449 Updated Nov 17, 2024
Python 933 93 Updated Nov 6, 2024

tl/dw (Too Long, Didn't Watch): Your Personal Research Multi-Tool - a naive attempt at 'A Young Lady's Illustrated Primer'

Python 374 12 Updated Nov 16, 2024

Official implementation of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation

Python 160 9 Updated Nov 12, 2024

[ECCV 2024] HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.

Python 180 5 Updated Nov 2, 2024

Run your business smarter 🪄

TypeScript 5,801 525 Updated Nov 16, 2024

Official inference framework for 1-bit LLMs

C++ 11,156 757 Updated Nov 11, 2024

AI Browser

JavaScript 3,809 315 Updated Sep 17, 2024

A library to generate LaTeX expression from Python code.

Python 7,248 387 Updated May 13, 2024

Vision model based document ingestion

Python 1,238 57 Updated Nov 17, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,864 1,263 Updated Dec 6, 2023

The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integration for optimal answers. Simulating a team that discusses, …

Python 130 22 Updated Oct 18, 2024

Entropy Based Sampling and Parallel CoT Decoding

Python 3,012 311 Updated Nov 13, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 16,959 1,226 Updated Nov 17, 2024

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 4,104 454 Updated Nov 15, 2024

Effortlessly run LLM backends, APIs, frontends, and services with one command.

TypeScript 508 33 Updated Nov 10, 2024

Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.

JavaScript 344 105 Updated Nov 10, 2024

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 2,628 184 Updated Nov 1, 2024

This repository will host the code for the SIGGRAPH Asia 2024 Paper titled: "GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations"

19 Updated Sep 18, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 14,801 793 Updated Nov 16, 2024