prakamya-mishra

🍻

Github & Chillzz

Prakamya mishra prakamya-mishra

🍻

Github & Chillzz

Working on novel techniques to efficiently train LLMs & image/video generation AI models on large-scale clusters.

45 followers · 66 following

Applied Research Engineer @amd AI | MS CS UMass Amherst | ex-Applied Scientist Intern @amazon-science
Seattle
https://prakamya-mishra.github.io/
in/pkms
@PrakamyaMishra

Achievements

Highlights

Developer Program Member

Organizations

Block or Report

Block or report prakamya-mishra

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

EleutherAI / sae

Sparse autoencoders

Python 177 16 Updated Jul 9, 2024

ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

138 7 Updated Jul 10, 2024

Heimine / NC_MLab

Neural Collapse in Multi-label Learning with Pick-all-label Loss

Jupyter Notebook 4 Updated Oct 27, 2023

MinghuiChen43 / awesome-deep-phenomena

A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...

228 8 Updated Jul 2, 2024

rhubarbwu / linguistic-collapse

Codebase for arXiv:2405.17767, based on GPT-Neo and TinyStories.

Python 4 Updated Jul 8, 2024

tding1 / Neural-Collapse

[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features

Python 50 8 Updated Jul 19, 2022

connorlee77 / pytorch-mutual-information

Mutual Information in Pytorch

Python 96 10 Updated Aug 23, 2023

google-research / arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,019 318 Updated Jun 27, 2024

meta-llama / PurpleLlama

Set of tools to assess and improve LLM security.

Python 2,147 355 Updated Jul 9, 2024

AI-secure / DecodingTrust

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 229 51 Updated Jun 19, 2024

facebookresearch / optimizers

For optimization algorithm research and development.

Python 208 16 Updated Mar 22, 2024

lhao499 / ringattention

Transformers with Arbitrarily Large Context

Python 571 43 Updated Jul 8, 2024

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 1,359 45 Updated Jul 10, 2024

unslothai / unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 12,489 813 Updated Jul 10, 2024

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—foundation models

Python 14,180 1,082 Updated Jul 10, 2024

ROCm / rccl-tests

RCCL Performance Benchmark Tests

Cuda 38 35 Updated Jun 14, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,203 393 Updated Jul 10, 2024

SqueezeAILab / KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 236 19 Updated Jul 10, 2024

dair-ai / ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

9,362 538 Updated Jul 8, 2024

alxndrTL / mamba.py

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 782 65 Updated Jul 10, 2024

chroma-core / chroma

the AI-native open-source embedding database

Rust 13,544 1,148 Updated Jul 10, 2024

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,324 427 Updated May 3, 2024

facebookresearch / fairscale

PyTorch extensions for high performance and large scale training.

Python 2,988 267 Updated Jun 18, 2024

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

25,778 2,157 Updated Jun 18, 2024

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 5,821 1,550 Updated Jul 10, 2024

Raman1121 / FairTune

A framework to optimize Parameter-Efficient Fine-Tuning for Fairness in Medical Image Analysis

Python 7 1 Updated Feb 29, 2024

radarFudan / Awesome-state-space-models

Collection of papers on state-space models

477 16 Updated Jul 8, 2024

microsoft / mup

maximal update parametrization (µP)

Jupyter Notebook 1,234 88 Updated May 6, 2024

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,571 740 Updated May 19, 2024

Strivin0311 / long-llms-learning

A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks

Jupyter Notebook 222 10 Updated Jun 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prakamya mishra prakamya-mishra

Achievements

Achievements

Highlights

Organizations

Block or report prakamya-mishra

Stars

EleutherAI / sae

ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models

Heimine / NC_MLab

MinghuiChen43 / awesome-deep-phenomena

rhubarbwu / linguistic-collapse

tding1 / Neural-Collapse

connorlee77 / pytorch-mutual-information

google-research / arxiv-latex-cleaner

meta-llama / PurpleLlama

AI-secure / DecodingTrust

facebookresearch / optimizers

lhao499 / ringattention

HazyResearch / ThunderKittens

unslothai / unsloth

stanfordnlp / dspy

ROCm / rccl-tests

allenai / OLMo

SqueezeAILab / KVQuant

dair-ai / ML-Papers-of-the-Week

alxndrTL / mamba.py

chroma-core / chroma

jzhang38 / TinyLlama

facebookresearch / fairscale

google-research / tuning_playbook

EleutherAI / lm-evaluation-harness

Raman1121 / FairTune

radarFudan / Awesome-state-space-models

microsoft / mup

RUCAIBox / LLMSurvey

Strivin0311 / long-llms-learning