Skip to content
View waizei's full-sized avatar

Block or report waizei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning

Python 227 14 Updated Aug 29, 2023

Repository of 3D Object Detection with Pointformer (CVPR2021)

Python 155 14 Updated Mar 30, 2023

[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds

Python 53 5 Updated Jan 29, 2023

[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning

Python 33 3 Updated Aug 26, 2022

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Python 100 15 Updated Sep 6, 2022

Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.

Python 42 4 Updated Apr 30, 2024

💎 免费的编程资源大全,持续更新!🔥 覆盖各种语言和方向(Java \ Python \ C++ \ JavaScript \ Golang \ 前端 \ 后端等)的学习路线、贴心教程、项目实战、编程书籍、面试合集、实用资源等,对程序员非常有帮助!

HTML 3,005 540 Updated Dec 12, 2022

[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"

Python 65 18 Updated Oct 11, 2021

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,920 3,321 Updated Jul 23, 2024

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Markdown 125,868 23,227 Updated Sep 22, 2024

3D model viewer app (STL, OBJ, PLY) for Android.

Kotlin 186 34 Updated Nov 1, 2024

A library for show 3d model in a easy way that can analysis STL/OBJ/3DS file and support rotation and zooming operations. 一个基于OpenGL ES的简单易用的3D模型展示框架。自动分类解析STL、OBJ、3DS等模型文件,支持对模型进行旋转和缩放等操作。

Java 479 96 Updated May 22, 2018

VR全景图+Opengl3D模型展示

Java 290 65 Updated Mar 3, 2018

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Python 519 136 Updated Dec 21, 2022

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

JavaScript 51 2 Updated Aug 19, 2024

Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).

Jupyter Notebook 195 31 Updated Jun 8, 2022
Python 15 5 Updated Oct 27, 2020

PanDownload的个人维护版本

HTML 8,298 1,700 Updated Sep 25, 2020

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]

Python 56 5 Updated Apr 5, 2022

Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning

Python 63 12 Updated Sep 12, 2020

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

1,139 101 Updated Aug 19, 2022

End-to-End Object Detection with Transformers

Python 13,611 2,454 Updated Mar 12, 2024

Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.

Python 63 13 Updated Sep 15, 2021

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 51,938 11,536 Updated Nov 13, 2024

「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!

Java 146,903 45,604 Updated Nov 12, 2024

《Java 程序员眼中的 Linux》

Shell 8,582 2,474 Updated Jun 11, 2022

一份超级详细的Java面试题【大厂面试真题+Java学习指南+工作总结】

4,232 1,126 Updated Jun 1, 2024

Deep Modular Co-Attention Networks for Visual Question Answering

Python 443 88 Updated Dec 16, 2020