Skip to content
View h2stein's full-sized avatar
Block or Report

Block or report h2stein

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 15 2 Updated Jun 14, 2024

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

Python 855 82 Updated May 28, 2024

utilities for decoding deep representations (like sentence embeddings) back to text

Python 637 74 Updated May 26, 2024
Python 635 50 Updated May 24, 2024

A playbook for systematically maximizing the performance of deep learning models.

25,687 2,150 Updated Jun 18, 2024

LLM training in simple, raw C/CUDA

Cuda 21,196 2,294 Updated Jun 27, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 28,982 2,648 Updated Jun 27, 2024

Flax is a neural network library for JAX that is designed for flexibility.

Python 5,773 607 Updated Jun 27, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 9 3 Updated Mar 30, 2024

Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget

Python 97 4 Updated Mar 29, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,469 231 Updated May 1, 2024

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,423 133 Updated Jun 19, 2024

1.58-bit LLaMa model

Python 77 6 Updated Apr 3, 2024

Toolkit for attaching, training, saving and loading of new heads for transformer models

Jupyter Notebook 207 18 Updated Apr 28, 2024

Streamlit — A faster way to build and share data apps.

Python 33,039 2,884 Updated Jun 27, 2024

Build Conversational AI in minutes ⚡️

TypeScript 5,990 765 Updated Jun 25, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,063 2,432 Updated Jun 24, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 7,676 1,416 Updated Jun 27, 2024

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

Python 5,505 241 Updated Jun 27, 2024

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Python 40,319 4,288 Updated Jun 27, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 19,711 1,872 Updated Jun 27, 2024

MLX: An array framework for Apple silicon

C++ 15,576 883 Updated Jun 26, 2024

Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.

JavaScript 12,092 594 Updated Feb 26, 2024

Search photos on Unsplash using natural language

Jupyter Notebook 949 103 Updated Oct 13, 2022

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 12,000 777 Updated Jun 26, 2024

CLI to manage your datacontract.yaml files

Python 352 60 Updated Jun 27, 2024
Python 3,803 248 Updated Mar 15, 2024

Foundational model for human-like, expressive TTS

Python 3,425 610 Updated Jun 21, 2024

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 486 36 Updated Jun 24, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 23,339 3,093 Updated Jun 4, 2024
Next