Stars

34 stars written in Python

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…

Python 23,570 4,550 Updated Oct 15, 2023

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,409 2,402 Updated Apr 28, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,122 2,438 Updated Jul 7, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 14,949 1,424 Updated Jul 8, 2024

End-to-End Object Detection with Transformers

Python 13,097 2,374 Updated Mar 12, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,061 750 Updated Jul 2, 2024

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

Python 5,347 907 Updated Oct 19, 2023

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Python 4,184 735 Updated Jun 2, 2024

An open-source framework for training large multimodal models.

Python 3,562 270 Updated May 25, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,589 236 Updated Jun 4, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,466 79 Updated Jul 8, 2024

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 884 54 Updated Jun 27, 2024

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 867 83 Updated Jun 12, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 667 31 Updated Jun 2, 2024

Audio Dataset for training CLAP and other models

Python 603 53 Updated Feb 5, 2024

Official PyTorch implementation of "A Comprehensive Overhaul of Feature Distillation" (ICCV 2019)

Python 410 78 Updated Jun 23, 2020

Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"

Python 399 39 Updated Mar 21, 2024

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python 354 58 Updated Aug 14, 2022

ZazuML - easy AutoML for Object Detection

Python 336 44 Updated May 22, 2023

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 331 59 Updated Jul 27, 2022

Papers and resources related to the security and privacy of LLMs 🤖

Python 306 20 Updated Jun 30, 2024

A New Tamil Large Language Model (LLM) Based on Llama 2

Python 239 33 Updated Apr 5, 2024

MU-LLaMA: Music Understanding Large Language Model

Python 210 16 Updated Mar 25, 2024

The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"

Python 139 7 Updated Apr 10, 2024

Audio Captioning datasets for PyTorch.

Python 91 5 Updated Jun 14, 2024
Python 68 9 Updated Oct 29, 2019

Notes about LLaMA 2 model

Python 36 4 Updated Aug 30, 2023

Cross-modal background suppression for audio-visual event localization

Python 32 6 Updated Mar 18, 2022

A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)

Python 27 2 Updated Jul 6, 2023