ZQSIAT

🎯

Focusing

QingsongZhao ZQSIAT

I was previously a graduate student at the Chinese Academy of Sciences; I am now a Ph.D. candidate at Tongji University.

15 followers · 5 following

Tongji University
Shanghai, China

Starred repositories

WowCZ / LongMIT

LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets

Python 31 Updated Sep 30, 2024

zz-haooo / LLMs-Preference-Optimization

8 Updated May 31, 2024

princeton-nlp / SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 668 42 Updated Aug 22, 2024

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,002 89 Updated May 8, 2024

Oxen-AI / Self-Rewarding-Language-Models

This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.

Python 103 9 Updated Apr 25, 2024

WooooDyy / LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python 66 4 Updated Feb 9, 2024

bklieger-groq / g1

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 3,563 330 Updated Oct 7, 2024

ezelikman / quiet-star

Code for Quiet-STaR

Python 572 81 Updated Aug 21, 2024

expz / quiet-star

Implementation of the Quiet-STaR paper (https://arxiv.org/pdf/2403.09629.pdf)

Python 35 2 Updated Aug 8, 2024

jun0wanan / awesome-large-multimodal-agents

311 19 Updated Sep 25, 2024

liuzard / transformers_zh_docs

Chinese documentation for Hugging Face transformers

Python 155 19 Updated Nov 8, 2023

DLUT-LYZ / CODA-LM

Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595)

Python 58 2 Updated Jul 12, 2024

OpenDriveLab / DriveLM

[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

HTML 815 53 Updated Oct 8, 2024

MILVLG / activitynet-qa

A VideoQA dataset based on the videos from ActivityNet

Python 66 9 Updated Nov 22, 2020

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

12,079 772 Updated Oct 9, 2024

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,775 254 Updated Sep 25, 2024

ZQSIAT / AEDC

For the paper "Learning Discriminative Action Representations in Videos via Embedding Distance Correlation"

1 Updated Sep 13, 2024

InternLM / MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,813 483 Updated Sep 25, 2024

reshalfahsi / separableconv-torch

PyTorch implementation of Depthwise Separable Convolution

Python 11 Updated Aug 28, 2022

magicproduct / hash-hop

Long context evaluation for large language models

Python 177 15 Updated Oct 8, 2024

chatanywhere / GPT_API_free

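Free ChatGPT API key. Free ChatGPT API with support for the GPT-4 API (free); a free forwarding API for ChatGPT that is usable within mainland China, with a direct connection and no proxy required. Works with software and plugins such as ChatBox, greatly reducing API usage costs. Chat without restrictions from within China.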

Python 22,088 1,658 Updated Sep 26, 2024

YueFan1014 / VideoAgent

This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)

Python 111 5 Updated Sep 9, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,653 2,161 Updated Aug 12, 2024

seungjunlee96 / Depthwise-Separable-Convolution_Pytorch

Implementation of Depthwise Separable Convolution (pytorch)

Python 70 6 Updated Mar 11, 2020

NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,871 149 Updated Sep 25, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,368 71 Updated Oct 9, 2024

deepcs233 / Visual-CoT

[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 101 6 Updated Oct 9, 2024

stoneMo / EZ-VSL

Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)

Python 29 8 Updated Oct 2, 2022

hche11 / Localizing-Visual-Sounds-the-Hard-Way

Localizing Visual Sounds the Hard Way

Python 76 15 Updated Jul 6, 2022

Ziyang412 / VideoTree

Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"

Python 72 3 Updated Aug 6, 2024

Starred topics

independent-component-analysis

bilevel-optimization

gait-recognition