Skip to content
View yunjiallm's full-sized avatar

Block or report yunjiallm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Python 7,963 1,966 Updated May 13, 2024

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 30,958 7,843 Updated Aug 3, 2024

[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"

Python 1,033 128 Updated Jun 18, 2024

StoryMaker: Towards consistent characters in text-to-image generation

Python 477 39 Updated Sep 26, 2024

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Python 325 23 Updated Sep 30, 2024

Build mindmaps with plain text

TypeScript 8,858 610 Updated Oct 2, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,857 150 Updated Sep 25, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,884 737 Updated Oct 5, 2024

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 12,350 2,970 Updated Oct 7, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 4,949 405 Updated Oct 2, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,475 140 Updated Oct 4, 2024

🔥 🔥 🔥 Open Source JIRA, Linear, Monday, and Asana Alternative. Plane helps you track your issues, epics, and product roadmaps in the simplest way possible.

TypeScript 29,575 1,631 Updated Oct 7, 2024

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 15,469 3,259 Updated Sep 19, 2024

One-click Face Swapper and Restoration powered by insightface 🔥

Python 501 76 Updated Apr 16, 2024

A high resolution face dataset for face editing purpose

Python 405 32 Updated Jul 19, 2024

face-to-sticker

Python 621 63 Updated Mar 1, 2024

face-to-sticker

Python 2 Updated Mar 1, 2024

中医药大模型

101 13 Updated Aug 8, 2023

Collection of awesome medical dataset resources.

290 21 Updated Sep 29, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 7,297 736 Updated Oct 7, 2024

ALL IN ONE Hacking Tool For Hackers

Python 50,047 5,374 Updated Jul 31, 2024

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,182 246 Updated Oct 7, 2024

The official code for paper "parallel speculative decoding with adaptive draft length."

Python 17 Updated Aug 23, 2024

Run macOS VM in a Docker! Run near native OSX-KVM in Docker! X11 Forwarding! CI/CD for OS X Security Research! Docker mac Containers.

Shell 46,524 2,509 Updated Sep 26, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 41,724 4,928 Updated Oct 7, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,839 406 Updated Oct 2, 2024
Python 34 5 Updated Jun 22, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,225 115 Updated Oct 7, 2024

截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator

TypeScript 3,912 303 Updated Oct 7, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,439 403 Updated Oct 7, 2024
Next