Skip to content
View hukenovs's full-sized avatar
๐Ÿข
hi ._.
๐Ÿข
hi ._.

Organizations

@ai-forever

Block or report hukenovs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

nanoGPT style version of Llama 3.1

Python 1,070 40 Updated Aug 8, 2024
Python 13 1 Updated Aug 10, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use thโ€ฆ

Jupyter Notebook 9,863 723 Updated Aug 21, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

714 18 Updated Jul 31, 2024

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 3,904 567 Updated Aug 9, 2024

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,120 1,216 Updated Aug 14, 2024

LLM101n: Let's build a Storyteller

27,177 1,485 Updated Aug 1, 2024

YOLOv10: Real-Time End-to-End Object Detection

Python 8,950 803 Updated Aug 8, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,805 104 Updated Jul 31, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,259 1,299 Updated Aug 23, 2024

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

C++ 1,638 252 Updated Aug 12, 2024

Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

Python 115 3 Updated Nov 13, 2023

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Python 289 15 Updated Aug 18, 2024

Patch-based harmonization network

Python 11 4 Updated May 20, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 1,962 186 Updated Apr 24, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 8,647 551 Updated Apr 16, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,745 1,351 Updated Aug 9, 2024

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Python 756 63 Updated Jun 2, 2024

MiVOLO age & gender transformer neural network

Python 295 52 Updated Aug 5, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,173 407 Updated Jul 30, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,884 118 Updated May 15, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,738 175 Updated Aug 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,323 3,660 Updated Aug 23, 2024

OMG-LLaVA and OMG-Seg codebase

Python 1,194 47 Updated Aug 16, 2024

We write your reusable computer vision tools. ๐Ÿ’œ

Python 18,394 1,421 Updated Aug 23, 2024

Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other interesting work. Quick start your awesome work with us!! ๐ŸคŸ๐ŸคŸ๐ŸคŸ

67 1 Updated Aug 10, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,400 2,464 Updated Aug 22, 2024

The official repo of Qwen-Audio (้€šไน‰ๅƒ้—ฎ-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,366 97 Updated Jul 5, 2024

The official repo of Qwen-VL (้€šไน‰ๅƒ้—ฎ-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,601 345 Updated Aug 7, 2024
Next