Skip to content
View prakamya-mishra's full-sized avatar
🍻
Github & Chillzz
🍻
Github & Chillzz

Organizations

@coala @Breeze18 @LASC-SNU @FOSS-SNU @Breeze19
Block or Report

Block or report prakamya-mishra

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Sparse autoencoders

Python 177 16 Updated Jul 9, 2024

This repository collects all relevant resources about interpretability in LLMs

138 7 Updated Jul 10, 2024

Neural Collapse in Multi-label Learning with Pick-all-label Loss

Jupyter Notebook 4 Updated Oct 27, 2023

A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...

228 8 Updated Jul 2, 2024

Codebase for arXiv:2405.17767, based on GPT-Neo and TinyStories.

Python 4 Updated Jul 8, 2024

[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features

Python 50 8 Updated Jul 19, 2022

Mutual Information in Pytorch

Python 96 10 Updated Aug 23, 2023

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,019 318 Updated Jun 27, 2024

Set of tools to assess and improve LLM security.

Python 2,147 355 Updated Jul 9, 2024

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 229 51 Updated Jun 19, 2024

For optimization algorithm research and development.

Python 208 16 Updated Mar 22, 2024

Transformers with Arbitrarily Large Context

Python 571 43 Updated Jul 8, 2024

Tile primitives for speedy kernels

Cuda 1,359 45 Updated Jul 10, 2024

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 12,489 813 Updated Jul 10, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 14,180 1,082 Updated Jul 10, 2024

RCCL Performance Benchmark Tests

Cuda 38 35 Updated Jun 14, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,203 393 Updated Jul 10, 2024

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 236 19 Updated Jul 10, 2024

🔥Highlighting the top ML papers every week.

9,362 538 Updated Jul 8, 2024

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 782 65 Updated Jul 10, 2024

the AI-native open-source embedding database

Rust 13,544 1,148 Updated Jul 10, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,324 427 Updated May 3, 2024

PyTorch extensions for high performance and large scale training.

Python 2,988 267 Updated Jun 18, 2024

A playbook for systematically maximizing the performance of deep learning models.

25,778 2,157 Updated Jun 18, 2024

A framework for few-shot evaluation of language models.

Python 5,821 1,550 Updated Jul 10, 2024

A framework to optimize Parameter-Efficient Fine-Tuning for Fairness in Medical Image Analysis

Python 7 1 Updated Feb 29, 2024

Collection of papers on state-space models

477 16 Updated Jul 8, 2024

maximal update parametrization (µP)

Jupyter Notebook 1,234 88 Updated May 6, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,571 740 Updated May 19, 2024

A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks

Jupyter Notebook 222 10 Updated Jun 29, 2024
Next