Skip to content
View jaygala24's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@djunicode @AI4Bharat

Block or report jaygala24

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Automatically split your PyTorch models on multiple GPUs for training & inference

Python 612 38 Updated Jan 2, 2024

LLM101n: Let's build a Storyteller

27,321 1,496 Updated Aug 1, 2024

A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)

Python 237 38 Updated Sep 30, 2022

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,332 64 Updated Mar 8, 2024

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…

Python 18,164 2,359 Updated Aug 8, 2024

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Python 645 42 Updated Aug 26, 2024

Annotated version of the Mamba paper

Jupyter Notebook 441 16 Updated Feb 27, 2024
Python 3,827 250 Updated Mar 15, 2024

Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"

Python 52 6 Updated Aug 6, 2024

Examples in the MLX framework

Python 5,749 820 Updated Aug 27, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 620 32 Updated Aug 19, 2024

Machine Learning Engineering Open Book

Python 10,559 635 Updated Aug 26, 2024

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Python 1,229 75 Updated Apr 18, 2024

MLX: An array framework for Apple silicon

C++ 16,279 927 Updated Aug 28, 2024

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,508 336 Updated Aug 22, 2024

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Python 423 32 Updated Aug 25, 2024

Additional resources from our AACL tutorial

10 1 Updated Nov 13, 2023

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

HTML 285 32 Updated Dec 26, 2023

An open collection of implementation tips, tricks and resources for training large language models

Python 452 20 Updated Mar 8, 2023

An open collection of methodologies to help with successful training of large language models.

Python 437 31 Updated Feb 15, 2024

Inference code for Llama models

Python 55,243 9,413 Updated Aug 18, 2024

prompt2model - Generate Deployable Models from Natural Language Instructions

Python 1,938 173 Updated May 20, 2024

Python programs, usually short, of considerable difficulty, to perfect particular skills.

Jupyter Notebook 22,580 2,397 Updated Aug 20, 2024

Tutorial on neural theorem proving

Jupyter Notebook 149 14 Updated Jan 5, 2024

QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning PaLM with only five examples per language. We use the synthet…

33 5 Updated Aug 15, 2023

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,168 711 Updated Aug 5, 2024

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 1,758 231 Updated Aug 27, 2024

A Multilingual Replicable Instruction-Following Model

Python 93 3 Updated Jun 11, 2023

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

747 24 Updated Jul 20, 2023

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 956 70 Updated Aug 21, 2024
Next