Skip to content
View xfgao's full-sized avatar

Highlights

  • Pro

Block or report xfgao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Python 72 7 Updated Oct 20, 2024

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 421 26 Updated Oct 7, 2024

Fully automated end-to-end framework to extract data from bar plots and other figures in scientific research papers using modules such as OpenCV, AWS-Rekognition.

Jupyter Notebook 97 21 Updated Jul 15, 2021

Minecraft ReplayMod

Java 892 150 Updated Sep 9, 2024

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

Python 81 3 Updated Sep 23, 2024
Python 565 27 Updated Feb 15, 2024

CUDA Accelerated Robot Library

Python 762 118 Updated Aug 13, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,831 2,177 Updated Aug 12, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,947 374 Updated Aug 7, 2024

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

Python 133 5 Updated Jun 20, 2023

[CVPR23] DialMAT: Dialogue-Enabled Transformer with Moment-based Adversarial Training

Python 7 Updated Jul 1, 2023

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,801 962 Updated Oct 11, 2024

[ICCV 2023] Official code repository for ARNOLD benchmark

Jupyter Notebook 135 7 Updated Apr 1, 2024

NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants

Jupyter Notebook 9 1 Updated Mar 12, 2023

A benchmark environment for fully cooperative human-AI performance.

Jupyter Notebook 700 147 Updated Aug 27, 2024

Repository for DialFRED.

Python 41 3 Updated Sep 14, 2023

Learning for task and motion planning in a 2D kitchen.

Python 36 10 Updated Jul 1, 2020

2024中国翻墙软件VPN推荐以及科学上网避坑,稳定好用。对比SSR机场、蓝灯、V2ray、老王VPN、VPS搭建梯子等科学上网与翻墙软件,中国最新科学上网翻墙梯子VPN下载推荐,访问Chatgpt。

HTML 15,850 1,465 Updated Oct 15, 2024

深度学习经典、新论文逐段精读

26,762 2,419 Updated Aug 8, 2024

Code for EmBERT, a transformer model for embodied, language-guided visual task completion.

Python 57 12 Updated Apr 10, 2024

awesome grounding: A curated list of research papers in visual grounding

1,015 97 Updated Apr 9, 2023

VS Code in the browser

TypeScript 68,156 5,597 Updated Oct 18, 2024

Bayesian Inference Tools in Python

HTML 108 34 Updated Jun 5, 2023

Efficiently Scaling Up Video Annotation with Crowdsourced Marketplaces. IJCV 2012

HTML 607 255 Updated Jul 15, 2020

UCLA Thesis LaTeX style

TeX 129 84 Updated Jun 15, 2020

Code for the paper Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration

Python 85 14 Updated Jul 15, 2022

calibration tests for wearable eye-tracking glasses

Python 6 2 Updated Apr 12, 2018
Next