Stars
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
Testing baseline LLM performance across various models
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Equinox implementation of llama3 and llama3.1
A simple, performant and scalable Jax LLM!
Open weights LLM from Google DeepMind.
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
A collection of AWESOME things about mixture-of-experts
A curated reading list of research in Mixture-of-Experts (MoE).
Neural Networks and the Chomsky Hierarchy
A VA-API implementation using NVIDIA's NVDEC
This is the code repository associated with the paper "Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers"
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…
Annotated version of the Mamba paper
Firefox user.js for speed, privacy, and security. Your favorite browser, but better.
Sioyek is a PDF viewer with a focus on textbooks and research papers
constrained nonlinear optimization for scientific machine learning, UQ, and AI
Local support for Tuya devices in Home Assistant
Use any linux distribution inside your terminal. Enable both backward and forward compatibility with software and freedom to use whatever distribution you’re more comfortable with. Mirror available…
GPU programming related news and material links