RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 11,951 1,142 Updated Jul 12, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 21,812 2,239 Updated Jul 9, 2024

Fast and customizable framework for automatic ML model creation (AutoML)

Python 1,085 47 Updated May 30, 2024

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 56,217 28,780 Updated Jul 11, 2024

Source code for all Elastic connectors, developed by the Search team at Elastic, and home of our Python connector development framework

Python 61 120 Updated Jul 12, 2024

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 739 40 Updated Jul 8, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 1,909 126 Updated Jul 12, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,021 133 Updated Jun 25, 2024

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Python 492 49 Updated Jul 6, 2024

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,010 140 Updated Jul 5, 2024

Sparsity-aware deep learning inference runtime for CPUs

Python 2,940 169 Updated Jul 5, 2024

LLM Workshop by Sourab Mangrulkar

Jupyter Notebook 297 109 Updated Jun 16, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,590 295 Updated Jul 13, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 20,559 1,948 Updated Jul 12, 2024

llama.cpp gguf file parser for javascript

JavaScript 23 1 Updated Jun 18, 2024

Jupyter Notebook 508 37 Updated May 1, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,558 244 Updated Jul 5, 2024

Go ahead and axolotl questions

Python 6,908 757 Updated Jul 13, 2024

Describes the IndicNLP corpus and associated datasets

Python 150 24 Updated Apr 16, 2023

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 5,365 488 Updated Jul 13, 2024

Custom data types and layouts for training and inference

Python 407 53 Updated Jul 13, 2024

Faster PyTorch bitsandbytes 4-bit fp4 nn.Linear ops

Python 17 Updated Mar 16, 2024

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

MDX 46,259 4,459 Updated Jul 10, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,459 806 Updated Jul 12, 2024

Temporary anonymous version

Python 23 2 Updated Mar 20, 2024

Detect file content types with deep learning

Python 7,550 395 Updated Jul 10, 2024

Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'

Python 242 15 Updated Jul 8, 2024

Set-of-Mark Prompting for LMMs

Python 1,031 81 Updated Jun 5, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,727 794 Updated Jul 1, 2024