zqxuturbo

3 followers · 6 following

SAIC
Shanghai

Stars

w3ng-git / qwen2-export-onnx

Export Qwen2 models to onnx.

Python 3 1 Updated Aug 12, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,172 850 Updated Sep 13, 2024

X-PLUG / MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 2,787 261 Updated Sep 26, 2024

quic / ai-hub-models

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 446 65 Updated Sep 27, 2024

jeinlee1991 / chinese-llm-benchmark

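Chinese LLM capability benchmark leaderboard: currently covers 115 large models, including commercial models such as chatgpt, gpt4o, Baidu ERNIE Bot, Alibaba Tongyi Qianwen, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as Baichuan, qwen2, glm4, yi, InternLM2, and llama3, evaluated across multiple capability dimensions. It provides not only a ranked leaderboard of capability scores but also the raw outputs of all models!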

2,517 120 Updated Oct 7, 2024

deepcam-cn / yolov5-face

YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931, ECCV Workshops 2022)

Python 2,056 497 Updated Jul 22, 2024

carla-simulator / carla

Open-source simulator for autonomous driving research.

C++ 11,193 3,620 Updated Oct 8, 2024

NVlabs / BEV-Planner

Python 166 7 Updated Jun 18, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 34,437 4,166 Updated Aug 16, 2024

ageitgey / face_recognition

The world's simplest facial recognition api for Python and the command line

Python 53,070 13,456 Updated Aug 21, 2024

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,737 246 Updated Jun 4, 2024

opendilab / LMDrive

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Jupyter Notebook 635 52 Updated Jul 7, 2024

mnotgod96 / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 4,924 534 Updated Aug 8, 2024

nutonomy / nuscenes-devkit

The devkit of the nuScenes dataset.

Python 2,250 624 Updated Sep 30, 2024

fundamentalvision / BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 3,276 533 Updated Aug 15, 2024