- Abu Dhabi, United Arab Emirates
- https://jaygala24.github.io
- @jaygala24
Stars
Automatically split your PyTorch models across multiple GPUs for training & inference
A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
A lightweight library for generating synthetic instruction tuning datasets from your data without GPT.
Annotated version of the Mamba paper
AI4Bharat / IndicInstruct
Forked from allenai/open-instruct
Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Machine Learning Engineering Open Book
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
A Unified Library for Parameter-Efficient and Modular Transfer Learning
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Additional resources from our AACL tutorial
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
An open collection of implementation tips, tricks and resources for training large language models
An open collection of methodologies to help with successful training of large language models.
prompt2model - Generate Deployable Models from Natural Language Instructions
Python programs, usually short, of considerable difficulty, to perfect particular skills.
Tutorial on neural theorem proving
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning PaLM with only five examples per language. We use the synthet…
LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath
MTEB: Massive Text Embedding Benchmark
A Multilingual Replicable Instruction-Following Model
Reading list for instruction tuning. The trend starts from Natural-Instruction (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
Fast & Simple repository for pre-training and fine-tuning T5-style models