Matryoshka Representation Learning

MRL.ipynb is a replication of Matryoshka embeddings from this paper. They're a hierarchical representation scheme that lets one model train embedding vectors of several sizes simultaneously, with each smaller vector nested inside the larger ones like Russian nesting dolls. I also walk through this in a YouTube explanation.
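As a rough illustration of the nesting idea (separate from the video), here is a minimal NumPy sketch of a Matryoshka-style loss: every prefix of the embedding gets its own classifier and the per-size losses are summed. The names `matryoshka_loss`, `heads` and `nesting_dims`, the sizes, and the choice of one separate linear head per size are all assumptions for the example, not necessarily what MRL.ipynb does.

```python
import numpy as np

def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy of integer labels under a row-wise softmax."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

def matryoshka_loss(embeddings, labels, heads, nesting_dims):
    """Sum the classification loss over every nested prefix of the embedding."""
    total = 0.0
    for m in nesting_dims:
        prefix = embeddings[:, :m]      # keep only the first m coordinates
        logits = prefix @ heads[m]      # heads[m] has shape (m, num_classes)
        total += softmax_cross_entropy(logits, labels)
    return total

# Toy data (hypothetical sizes): 4 examples, 64-dim embeddings, 10 classes.
rng = np.random.default_rng(0)
nesting_dims = [8, 16, 32, 64]
num_classes = 10
embeddings = rng.normal(size=(4, 64))
labels = rng.integers(0, num_classes, size=4)
heads = {m: rng.normal(size=(m, num_classes)) * 0.01 for m in nesting_dims}

loss = matryoshka_loss(embeddings, labels, heads, nesting_dims)
```

Because every size is trained against the same first coordinates, you can truncate an embedding to any of the `nesting_dims` at inference time and it still works as a smaller embedding.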

MatFormer+

In MatFormer+.ipynb I've made the entire model exhibit the same slicing behavior as above within the inner workings of the GPT. For example, the kv multiplication uses these smaller d lengths and correspondingly smaller head sizes. This has already been done by MatFormer, except they only implemented it on the feedforward network, not the MHA, whereas I've done it with literally every part of the model.
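To make the attention part concrete, here is a minimal NumPy sketch of causal multi-head attention that only touches a nested slice of full-size weights. It assumes the number of heads stays fixed and each head keeps the first `h_sub` of its dimensions; that layout, the function name `sliced_attention`, and the toy sizes are assumptions for illustration, not a copy of the notebook.

```python
import numpy as np

def sliced_attention(x, Wq, Wk, Wv, Wo, n_heads, d_sub):
    """Causal multi-head self-attention using a nested slice of the full weights.

    x:              (T, d_sub) activations of the sub-model
    Wq, Wk, Wv, Wo: (d_full, d_full) full-size weight matrices
    """
    d_full = Wq.shape[0]
    h_full, h_sub = d_full // n_heads, d_sub // n_heads
    T = x.shape[0]

    def project(W):
        # View the columns as (head, head_dim), then keep the first d_sub
        # input rows and the first h_sub dims of every head.
        W_sub = W.reshape(d_full, n_heads, h_full)[:d_sub, :, :h_sub]
        return np.einsum("td,dnh->nth", x, W_sub)        # (n_heads, T, h_sub)

    q, k, v = project(Wq), project(Wk), project(Wv)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(h_sub)   # (n_heads, T, T)
    future = np.triu(np.ones((T, T), dtype=bool), k=1)   # True above the diagonal
    scores = np.where(future, -np.inf, scores)           # block attention to the future
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    out = weights @ v                                    # (n_heads, T, h_sub)

    # Rows of Wo are grouped by head, so slice them the same way.
    Wo_sub = Wo.reshape(n_heads, h_full, d_full)[:, :h_sub, :d_sub]
    return np.einsum("nth,nhd->td", out, Wo_sub)         # (T, d_sub)

# Toy usage (hypothetical sizes): full width 16, sub-model width 8, 4 heads.
rng = np.random.default_rng(0)
d_full, n_heads, T = 16, 4, 5
Wq, Wk, Wv, Wo = (rng.normal(size=(d_full, d_full)) for _ in range(4))
x_small = rng.normal(size=(T, 8))
y_small = sliced_attention(x_small, Wq, Wk, Wv, Wo, n_heads, d_sub=8)
```

With `d_sub == d_full` this reduces to ordinary multi-head attention, so the small models are literally sub-blocks of the big one rather than separately stored weights.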

MatryoshkaGPT

In matryoshkaGPT.ipynb I'm incorporating the ideas from this paper to make MatFormer+ subsettable not only in all weight matrices but also across layers. Basically, if you don't want to use all $L$ layers you can just cut the model off at your desired $l \le L$ and multiply by the final output matrix. This is just one more element of matryoshka-ness. Not sure if I'll bother doing a video on this one.
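The layer cut-off can be sketched in a few lines of NumPy. Everything here is a stand-in: `forward_truncated`, the toy residual blocks, and the sizes are hypothetical, and a real GPT would typically also apply a final layer norm before the output matrix.

```python
import numpy as np

def forward_truncated(x, layers, W_out, n_layers=None):
    """Run only the first n_layers residual blocks, then project to logits."""
    if n_layers is None:
        n_layers = len(layers)          # default: use the full depth L
    for layer in layers[:n_layers]:
        x = x + layer(x)                # residual connection around each block
    return x @ W_out                    # shared final output matrix

# Toy model (hypothetical sizes): 6 layers, width 16, vocabulary of 50 tokens.
rng = np.random.default_rng(0)
d_model, vocab_size, L = 16, 50, 6
block_weights = [rng.normal(size=(d_model, d_model)) * 0.1 for _ in range(L)]
# Each toy "layer" is a single tanh projection standing in for a transformer block.
layers = [lambda h, W=W: np.tanh(h @ W) for W in block_weights]
W_out = rng.normal(size=(d_model, vocab_size))

x = rng.normal(size=(5, d_model))                           # 5 token positions
logits_full = forward_truncated(x, layers, W_out)           # all L layers
logits_l3 = forward_truncated(x, layers, W_out, n_layers=3) # cut off at l = 3
```

For the truncated outputs to be any good, the shallower exits presumably need their own loss term during training, in the same spirit as the nested embedding sizes.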

Misc

tangents.ipynb is a rant I went on at one point that's somehow related.
If you're looking for the models pertaining to imposed & emergent hierarchical embeddings, they have been moved to this repo.
P.S. The code in this repo is based on Andrej Karpathy's minGPT.

evintunador/matryoshkaGPT

Folders and files

Latest commit

History

Repository files navigation

Matryoshka Representation Learning

MatFormer+

MatryoshkaGPT

Misc

About

Resources

Stars

Watchers

Forks

Languages