Stars
A comprehensive survey on Internal Consistency and Self-Feedback in Large Language Models.
Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects
A system for agentic LLM-powered data processing and ETL
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Implementation of paper 'Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models'
High accuracy RAG for answering questions from scientific documents with citations
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
A tool to modify ONNX models visually, based on Netron and Flask.
Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"
Code for *Eventfulness for Interactive Video Alignment*
Multilingual large voice-generation model, providing full-stack inference, training, and deployment capabilities.
Accepted as a [NeurIPS 2024] Spotlight Presentation paper
A tool to project equirectangular panorama into perspective images
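The projection that tool performs can be sketched in a few lines: each pixel of the perspective view defines a ray, which is converted to longitude/latitude and looked up in the equirectangular panorama. The function below is a minimal stdlib-only sketch of that mapping (not the repository's actual code); the rotation conventions and the `fov_deg`/`yaw`/`pitch` parameter names are my own assumptions.

```python
import math

def perspective_to_equirect_uv(x, y, out_w, out_h,
                               fov_deg=90.0, yaw=0.0, pitch=0.0):
    """Map perspective pixel (x, y) to normalized equirectangular
    coordinates (u, v) in [0, 1). yaw/pitch are in radians.
    Illustrative sketch only; conventions are assumptions."""
    # Focal length derived from the horizontal field of view.
    f = (out_w / 2.0) / math.tan(math.radians(fov_deg) / 2.0)
    # Ray through the pixel center in camera space (x right, y down, z forward).
    cx = x - out_w / 2.0 + 0.5
    cy = y - out_h / 2.0 + 0.5
    vx, vy, vz = cx, cy, f
    # Rotate the ray: pitch about the x-axis, then yaw about the y-axis.
    vy, vz = (vy * math.cos(pitch) - vz * math.sin(pitch),
              vy * math.sin(pitch) + vz * math.cos(pitch))
    vx, vz = (vx * math.cos(yaw) + vz * math.sin(yaw),
              -vx * math.sin(yaw) + vz * math.cos(yaw))
    # Ray direction -> longitude/latitude -> normalized texture coords.
    lon = math.atan2(vx, vz)                                   # [-pi, pi]
    lat = math.asin(vy / math.sqrt(vx * vx + vy * vy + vz * vz))  # [-pi/2, pi/2]
    return lon / (2 * math.pi) + 0.5, lat / math.pi + 0.5
```

A full tool would evaluate this per output pixel and bilinearly sample the panorama at `(u * pano_w, v * pano_h)`; with no rotation, the view center lands at the panorama center `(0.5, 0.5)`.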
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas)
Local-model support for Microsoft's graphrag using ollama (llama3, mistral, gemma2, phi3) — LLM & embedding extraction
[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Official implementation of the CVPR24 highlight paper: Matching Anything by Segmenting Anything
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"
[BMVC2021] The first image composition assessment dataset. Used in the paper "Image Composition Assessment with Saliency-augmented Multi-pattern Pooling". Useful for image composition assessment, i…