Skip to content
View stereoplegic's full-sized avatar
🎰
🎰
  • 🧪Science🖌️Art🪄Magic
  • Remote (not Hybrid), The Internet
  • 21:29 (UTC -05:00)
  • X @SciArtMagic
Block or Report

Block or report stereoplegic

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

GoldFinch and other hybrid transformer components

Python 19 2 Updated Jul 18, 2024

[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Python 12 1 Updated Jun 12, 2024

GraphRAG using Ollama with Gradio UI and Extra Features

Python 509 42 Updated Jul 18, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 11,835 963 Updated Jul 19, 2024
Python 3 Updated Jun 24, 2024

Hackathon winner at AI Engineer World Fair Hackathon: Transforming code, one function at a time, to reduce digital carbon footprints and create a more sustainable digital world.

Python 4 Updated Jul 16, 2024

Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""

Python 6 Updated Jul 2, 2024

Official code for "Block Transformer: Global-to-Local Language Modeling for Fast Inference"

Python 112 6 Updated Jul 2, 2024

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.

Python 542 17 Updated Jul 18, 2024

The modern replacement for Jupyter Notebooks

TypeScript 1,895 121 Updated Jul 16, 2024

convert files / GitHub repos into LLM-ready markdown.md files

Python 67 8 Updated Jun 30, 2024

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023

Python 96 3 Updated Apr 20, 2024

Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Python 39 2 Updated Jan 14, 2024

Research project on the capabilities of RNNs.

Python 7 Updated Jul 16, 2024

Continual Resilient (CoRe) Optimizer for PyTorch

Python 7 Updated Jun 10, 2024

SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining

Python 13 1 Updated Jul 1, 2024
Shell 14 Updated Jun 11, 2024
Python 114 6 Updated Jun 15, 2024

MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024

Python 3 1 Updated Jun 11, 2024

Official implementation of Goldfish Loss: Mitigating Memorization in Generative LLMs

Python 64 4 Updated Jun 24, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

97 1 Updated Jun 23, 2024

ACL2024 Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation

Python 6 1 Updated Jun 17, 2024

Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".

Python 56 7 Updated Jun 29, 2024
Python 107 Updated Jun 13, 2024

This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"

Python 29 Updated Jul 16, 2024

LLM Analytics

TypeScript 562 20 Updated Jul 18, 2024

TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data

Python 4 Updated May 30, 2024

Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.

Python 248 25 Updated Jun 16, 2024

Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".

Jupyter Notebook 43 10 Updated Mar 11, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,040 38 Updated Jul 14, 2024
Next