- Kuala Lumpur, Malaysia
- dicksonneoh.com
- @dicksonneoh7
Block or Report
Block or report dnth
Contact GitHub support about this userβs behavior. Learn more about reporting abuse.
Report abuseLists (1)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Run SOTA Vision-Language Model Florence-2 on your data!
Asynchronous Python PostgreSQL driver written in Rust
A repository for the FiftyOne Plugin Outlier Detection
A FiftyOne Plugin that allows you to search across any modality in your videos!
[CVPR2024] Efficient Dataset Distillation via Minimax Diffusion
Official Implementation of "CAT-Segπ±: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
GLM-4 series: Open Multilingual Multimodal Chat LMs | εΌζΊε€θ―θ¨ε€ζ¨‘ζε―Ήθ―樑ε
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. π₯ [Paper + Code + Demo]
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)
MINT-1T: A one trillion token multimodal interleaved dataset.
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
π₯π₯π₯ δΈζ³¨δΊYOLOv5οΌYOLOv7γYOLOv8γYOLOv9ζΉθΏζ¨‘εοΌSupport to improve backbone, neck, head, loss, IoU, NMS and other modulesπ
VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".
Generate BM25 sparse vector inside PostgreSQL
Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)
Continual Learning tutorials and demo running on Google Colaboratory.
Orchestrate zero-shot computer vision models
Solve and install Python packages quickly with rip (pip in Rust)
Official repo of ππ£π©π§ππ£π¨ππ ππ€ππΌ: πΌ πππ£ππ§ππ‘ππ¨π© πΌπ₯π₯π§π€πππ ππ€π§ πΏππ¨ππ€π«ππ§ππ£π ππ£π€π¬π‘ππππ ππ£ πππ£ππ§ππ©ππ«π ππ€πππ‘π¨, which is previously titled (ππ¦π―π¦π³π’π΅πͺπ·π¦ ππ°π₯π¦ππ΄: ππ©π’π΅ π₯π° π΅π©π¦πΊ π¬π―π°πΈ? ππ° π΅π©π¦πΊ π¬π―π°πΈ π΅π©πͺπ―π¨π΄? ππ¦π΅'π΄ π§β¦
γCVPR 2024 HighlightγMonkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 main)