Skip to content
View kugwzk's full-sized avatar
💬
I may be slow to respond.
💬
I may be slow to respond.
Block or Report

Block or report kugwzk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

A family of compressed models obtained via pruning and knowledge distillation

66 5 Updated Jul 26, 2024
Python 204 21 Updated Jul 19, 2024

Open weights language model from Google DeepMind, based on Griffin.

Python 577 23 Updated Jul 9, 2024
Python 163 12 Updated Jul 26, 2024

GUICourse: From General Vision Langauge Models to Versatile GUI Agents

Python 44 5 Updated Jul 17, 2024

Sparse Backpropagation for Mixture-of-Expert Training

Python 15 Updated Jul 2, 2024
Python 72 4 Updated Jul 8, 2024

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".

Python 107 4 Updated Jul 26, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,350 313 Updated Jul 26, 2024

The official evaluation suite and dynamic data release for MixEval.

Python 182 25 Updated Jul 24, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 415 19 Updated Jul 27, 2024

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

118 5 Updated Jun 12, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 875 67 Updated Jul 25, 2024

The homepage of OneBit model quantization framework.

Python 120 2 Updated Jun 27, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,145 119 Updated Jun 26, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,260 115 Updated Jun 13, 2024
Python 205 11 Updated Apr 30, 2024

Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"

Python 38 1 Updated May 31, 2024

An LLM-based Web Navigating Agent (KDD'24)

Python 541 44 Updated May 5, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 947 75 Updated Jul 23, 2024

Web-grounded natural language instructions

HTML 11 4 Updated Mar 19, 2024

[ICML2024]Adaptive decoding balances the diversity and coherence of open-ended text generation.

Python 10 1 Updated Jun 2, 2024

VisualWebArena is a benchmark for multimodal agents.

Python 195 34 Updated Jul 24, 2024

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 651 94 Updated Jul 22, 2024

The model, data and code for the visual GUI Agent SeeClick

HTML 149 9 Updated Jul 15, 2024

Grok open release

Python 49,207 8,312 Updated May 29, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 20,915 1,981 Updated Jul 25, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,063 78 Updated Jul 26, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 550 68 Updated Jul 27, 2024
Next