Skip to content
View kash203's full-sized avatar

Block or report kash203

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning

Python 353 37 Updated Oct 10, 2024

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 517 49 Updated Oct 3, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,473 126 Updated Oct 16, 2024

The official dataset of the flowvqa project.

9 1 Updated Mar 26, 2024

Awesome Japan Open Data - 日本のオープンデータ情報一覧・まとめ

123 Updated Aug 25, 2024

SageMakerで機械学習モデルを構築、学習、デプロイする方法が学べるNotebookと教材集

Jupyter Notebook 153 41 Updated Sep 11, 2024

Various AI scripts. Mostly Stable Diffusion stuff.

Python 3,080 305 Updated Oct 15, 2024

An open-source RAG-based tool for chatting with your documents.

Python 14,240 1,093 Updated Oct 16, 2024

AITuber Kit

TypeScript 278 54 Updated Oct 15, 2024

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Python 690 63 Updated Jul 3, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,805 1,074 Updated Sep 10, 2024
Python 2,634 208 Updated Oct 16, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,259 858 Updated Sep 13, 2024
21 2 Updated May 19, 2024

Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, D…

TypeScript 18,176 3,037 Updated Oct 16, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 48,171 6,879 Updated Oct 16, 2024

Find parts of long text or data, allowing for some changes/typos.

Python 305 26 Updated Aug 5, 2024

Collect some papers and datastes about few-shot object detection for computer vision.

150 15 Updated Sep 26, 2023

⚡Simplify and optimize the use of LLMs

Python 12 1 Updated May 24, 2024

Open-source observability for your LLM application, based on OpenTelemetry

Python 2,180 229 Updated Oct 15, 2024

日本語LLMまとめ - Overview of Japanese LLMs

TypeScript 986 30 Updated Oct 12, 2024

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,179 154 Updated Aug 7, 2024

Stable Diffusion web UI

Python 141,231 26,697 Updated Oct 8, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 22,950 664 Updated Oct 16, 2024

Azure OpenAI benchmarking tool

Python 125 52 Updated May 28, 2024

MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition

Python 85 12 Updated May 30, 2024

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 718 87 Updated Sep 9, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 11,415 726 Updated Oct 14, 2024

Official PyTorch implementation of SegFormer

Python 2,522 351 Updated Aug 2, 2024
Next