Skip to content
View kq-chen's full-sized avatar

Highlights

  • Pro

Organizations

@shikras

Block or report kq-chen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,309 233 Updated Sep 14, 2024

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

3,975 359 Updated Sep 13, 2024

A lightweight library for PyTorch training tools and utilities

Python 1,656 266 Updated Sep 13, 2024

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,025 82 Updated Aug 8, 2024

Long Context Transfer from Language to Vision

Python 293 16 Updated Aug 26, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,332 91 Updated Sep 14, 2024

The Memory layer for your AI apps

Python 21,643 1,970 Updated Sep 14, 2024

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Python 561 54 Updated Aug 24, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,683 857 Updated Aug 21, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

729 20 Updated Jul 31, 2024

RecordRTC is WebRTC JavaScript library for audio/video as well as screen activity recording. It supports Chrome, Firefox, Opera, Android, and Microsoft Edge. Platforms: Linux, Mac and Windows.

JavaScript 6,527 1,750 Updated May 13, 2024

Android ViewServer and ADB client

Python 1,607 344 Updated Apr 30, 2024
Python 36 6 Updated Jun 13, 2024

A Gradio web UI for Large Language Models.

Python 39,515 5,196 Updated Sep 9, 2024

Tensor library for machine learning

C++ 10,838 998 Updated Sep 8, 2024
Python 19 3 Updated Apr 13, 2024

A pytorch template for beginners based on pytorch_lightning

Python 33 5 Updated Feb 1, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 2,537 161 Updated Sep 14, 2024

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"

Python 444 14 Updated Aug 9, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,168 904 Updated Sep 10, 2024

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion

Python 1,326 246 Updated Jul 29, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,997 533 Updated May 31, 2024

Implementation of a Transformer, but completely in Triton

Python 241 14 Updated Apr 5, 2022

🛁 Clean Code concepts adapted for Python

Python 4,386 770 Updated Jun 10, 2023

Implement minimal boilerplate CLIs derived from type hints and parse from command line, config files and environment variables

Python 316 44 Updated Sep 13, 2024

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 1,712 150 Updated Sep 8, 2024

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

TypeScript 120 7 Updated Jul 30, 2024

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,528 142 Updated Sep 9, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,538 2,485 Updated Aug 28, 2024
Python 154 6 Updated Jul 12, 2024
Next