hukenovs

🐢

hi ._.

Alexander Kapitanov hukenovs

🐢

hi ._.

Data Scientist, ex. FPGA Engineer

260 followers · 71 following

https://habr.com/ru/users/hukenovs/

Achievements

Organizations

Stars

karpathy / nano-llama31

nanoGPT style version of Llama 3.1

Python 1,070 40 Updated Aug 8, 2024

ai-forever / LIBRA

Python 13 1 Updated Aug 10, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 9,863 723 Updated Aug 21, 2024

mlfoundations / MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

714 18 Updated Jul 31, 2024

obss / sahi

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 3,904 567 Updated Aug 9, 2024

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,120 1,216 Updated Aug 14, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

27,177 1,485 Updated Aug 1, 2024

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection

Python 8,950 803 Updated Aug 8, 2024

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,805 104 Updated Jul 31, 2024

KindXiaoming / pykan

Kolmogorov Arnold Networks

Jupyter Notebook 14,259 1,299 Updated Aug 23, 2024

kfrlib / kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

C++ 1,638 252 Updated Aug 12, 2024

SCUT-DLVCLab / GPT-4V_OCR

Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

Python 115 3 Updated Nov 13, 2023

thunlp / LLaVA-UHD

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Python 289 15 Updated Aug 18, 2024

befozg / PHNet

Patch-based harmonization network

Python 11 4 Updated May 20, 2024

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 1,962 186 Updated Apr 24, 2024

facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 8,647 551 Updated Apr 16, 2024

WongKinYiu / yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,745 1,351 Updated Aug 9, 2024

NVlabs / FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Python 756 63 Updated Jun 2, 2024

WildChlamydia / MiVOLO

MiVOLO age & gender transformer neural network

Python 295 52 Updated Aug 5, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,173 407 Updated Jul 30, 2024

PKU-YuanGroup / MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Python 1,884 118 Updated May 15, 2024

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,738 175 Updated Aug 2, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,323 3,660 Updated Aug 23, 2024

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Python 1,194 47 Updated Aug 16, 2024

roboflow / supervision

We write your reusable computer vision tools. 💜

Python 18,394 1,421 Updated Aug 23, 2024

ZechengLi19 / Awesome-Sign-Language

Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other interesting work. Quick start your awesome work with us!! 🤟🤟🤟

67 1 Updated Aug 10, 2024

AIGCDesignGroup / ReplaceAnything

2,332 96 Updated May 17, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,400 2,464 Updated Aug 22, 2024

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,366 97 Updated Jul 5, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,601 345 Updated Aug 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Alexander Kapitanov hukenovs

Achievements

Achievements

Organizations

Block or report hukenovs

Stars

karpathy / nano-llama31

ai-forever / LIBRA

facebookresearch / segment-anything-2

mlfoundations / MINT-1T

obss / sahi

open-mmlab / mmaction2

karpathy / LLM101n

THU-MIG / yolov10

THUDM / CogVLM2

KindXiaoming / pykan

kfrlib / kfr

SCUT-DLVCLab / GPT-4V_OCR

thunlp / LLaVA-UHD

befozg / PHNet

deepseek-ai / DeepSeek-VL

facebookresearch / nougat

WongKinYiu / yolov9

NVlabs / FasterViT

WildChlamydia / MiVOLO

AILab-CVC / YOLO-World

PKU-YuanGroup / MoE-LLaVA

hustvl / Vim

vllm-project / vllm

lxtGH / OMG-Seg

roboflow / supervision

ZechengLi19 / Awesome-Sign-Language

AIGCDesignGroup / ReplaceAnything

microsoft / unilm

QwenLM / Qwen-Audio

QwenLM / Qwen-VL