Skip to content
View zqxuturbo's full-sized avatar

Block or report zqxuturbo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Export Qwen2 models to onnx.

Python 3 1 Updated Aug 12, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,172 850 Updated Sep 13, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 2,787 261 Updated Sep 26, 2024

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 446 65 Updated Sep 27, 2024

中文大模型能力评测榜单:目前已囊括115个大模型,覆盖chatgpt、gpt4o、百度文心一言、阿里通义千问、讯飞星火、商汤senseChat、minimax等商用模型, 以及百川、qwen2、glm4、yi、书生internLM2、llama3等开源大模型,多维度能力评测。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!

2,517 120 Updated Oct 7, 2024

YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931) ECCV Workshops 2022)

Python 2,056 497 Updated Jul 22, 2024

Open-source simulator for autonomous driving research.

C++ 11,193 3,620 Updated Oct 8, 2024
Python 166 7 Updated Jun 18, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 34,437 4,166 Updated Aug 16, 2024

The world's simplest facial recognition api for Python and the command line

Python 53,070 13,456 Updated Aug 21, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,737 246 Updated Jun 4, 2024

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Jupyter Notebook 635 52 Updated Jul 7, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 4,924 534 Updated Aug 8, 2024

The devkit of the nuScenes dataset.

Python 2,250 624 Updated Sep 30, 2024

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 3,276 533 Updated Aug 15, 2024

awesome-autonomous-driving

648 73 Updated Aug 19, 2024

End-to-End Object Detection with Transformers

Python 13,433 2,426 Updated Mar 12, 2024

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

2,145 216 Updated Aug 15, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,383 143 Updated Sep 27, 2024

Mamba SSM architecture

Python 12,777 1,078 Updated Oct 7, 2024

Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time

Python 775 124 Updated Dec 16, 2023

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,033 137 Updated Sep 3, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 93,179 14,983 Updated Oct 8, 2024

This is the official repository for Retrieval Augmented Visual Question Answering

Python 169 14 Updated Sep 3, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,123 210 Updated Oct 8, 2024

a lightweight LLM model inference framework

C++ 685 86 Updated Apr 7, 2024

State-of-the-art 2D and 3D Face Analysis Project

Python 23,029 5,370 Updated Sep 30, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

15,275 1,424 Updated Sep 19, 2024

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

2,949 94 Updated May 23, 2024
Next