kash203

2 followers · 0 following

Stars

Plachtaa / seed-vc

State-of-the-art zero-shot voice conversion & singing voice conversion with in-context learning

Python 353 37 Updated Oct 10, 2024

AnswerDotAI / byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 517 49 Updated Oct 3, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,473 126 Updated Oct 16, 2024

flowvqa / flowvqa

The official dataset of the flowvqa project.

9 1 Updated Mar 26, 2024

japan-opendata / awesome-japan-opendata

Awesome Japan Open Data - A curated list and summary of Japanese open data resources

123 Updated Aug 25, 2024

aws-samples / aws-ml-jp

A collection of notebooks and teaching materials for learning how to build, train, and deploy machine learning models with SageMaker

Jupyter Notebook 153 41 Updated Sep 11, 2024

ostris / ai-toolkit

Various AI scripts. Mostly Stable Diffusion stuff.

Python 3,080 305 Updated Oct 15, 2024

Cinnamon / kotaemon

An open-source RAG-based tool for chatting with your documents.

Python 14,240 1,093 Updated Oct 16, 2024

tegnike / aituber-kit

AITuber Kit

TypeScript 278 54 Updated Oct 15, 2024

Boese0601 / MagicDance

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Python 690 63 Updated Jul 3, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,805 1,074 Updated Sep 10, 2024

LLaVA-VL / LLaVA-NeXT

Python 2,634 208 Updated Oct 16, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,259 858 Updated Sep 13, 2024

mizuumi / JDocQA

21 2 Updated May 19, 2024

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, D…

TypeScript 18,176 3,037 Updated Oct 16, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 48,171 6,879 Updated Oct 16, 2024

taleinat / fuzzysearch

Find parts of long text or data, allowing for some changes/typos.

Python 305 26 Updated Aug 5, 2024

lxn96 / awesome-few-shot-object-detection

A collection of papers and datasets on few-shot object detection for computer vision.

150 15 Updated Sep 26, 2023

llm-proxy / llm-proxy

⚡Simplify and optimize the use of LLMs

Python 12 1 Updated May 24, 2024

traceloop / openllmetry

Open-source observability for your LLM application, based on OpenTelemetry

Python 2,180 229 Updated Oct 15, 2024

llm-jp / awesome-japanese-llm

Overview of Japanese LLMs

TypeScript 986 30 Updated Oct 12, 2024

TMElyralab / MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,179 154 Updated Aug 7, 2024

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 141,231 26,697 Updated Oct 8, 2024

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 22,950 664 Updated Oct 16, 2024

Azure / azure-openai-benchmark

Azure OpenAI benchmarking tool

Python 125 52 Updated May 28, 2024

lllyasviel / stable-diffusion-webui-forge

Python 7,957 773 Updated Oct 15, 2024

namtuanly / MTL-TabNet

MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition

Python 85 12 Updated May 30, 2024

litagin02 / Style-Bert-VITS2

Forked from fishaudio/Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 718 87 Updated Sep 9, 2024

VikParuchuri / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 11,415 726 Updated Oct 14, 2024

NVlabs / SegFormer

Official PyTorch implementation of SegFormer

Python 2,522 351 Updated Aug 2, 2024