qijimrc

Organizations

@THU-KEG

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 559 33 Updated Jul 26, 2024

P2P terminal game about space pirates playing basketball across the galaxy

Rust 160 4 Updated Jul 13, 2024

A lightweight, terminal-based application to view and query delimiter separated value formatted documents, such as CSV or TSV files.

Rust 235 9 Updated Jul 27, 2024

Your journal app if you live in a terminal

Rust 314 10 Updated Jul 22, 2024

A feature-rich command-line audio/video downloader

Python 77,898 6,110 Updated Jul 28, 2024

SGLang is yet another fast serving framework for large language models and vision language models.

Python 3,275 201 Updated Jul 28, 2024

LVBench: An Extreme Long Video Understanding Benchmark

Python 37 1 Updated Jul 9, 2024

Python scraper based on AI

Python 13,520 1,051 Updated Jul 26, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 3,931 286 Updated Jul 27, 2024

Matryoshka Multimodal Models

Python 61 3 Updated Jun 3, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,661 90 Updated Jul 25, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,860 114 Updated May 15, 2024

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Jupyter Notebook 205 29 Updated Jul 10, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 1,899 29 Updated Jun 6, 2024

BY Blog ->

HTML 3,093 8,089 Updated Jul 9, 2024

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Python 263 17 Updated Jul 19, 2024

Multimodal Models in Real World

Jupyter Notebook 343 17 Updated Jul 12, 2024

Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"

Python 20 1 Updated Sep 8, 2023

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,065 78 Updated Jul 26, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,514 459 Updated Jul 26, 2024

Grok open release

Python 49,210 8,315 Updated May 29, 2024

ImageBind: One Embedding Space to Bind Them All

Python 8,116 738 Updated Jul 10, 2024

The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]

Python 248 19 Updated Apr 29, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

659 26 Updated Jul 24, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,023 983 Updated Jul 27, 2024

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 461 17 Updated Jun 26, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,104 195 Updated Jul 21, 2024

LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation

Python 162 11 Updated Apr 22, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,168 421 Updated Jul 25, 2024

Reading list for research topics in multimodal machine learning

5,721 831 Updated Jun 19, 2024