-
Carnegie Mellon University
- Pittsburgh
- https://www.cs.cmu.edu/~jlaurent/
Highlights
- Pro
Block or Report
Block or report jonathan-laurent
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
A Julia implementation of choice sequence based PBT, inspired by Hypothesis
💯 Curated coding interview preparation materials for busy software engineers
Playing Pokemon Red with Reinforcement Learning
Programming language for literate programming law specification
Plotly Dash components based on Mantine React Components
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
A high-throughput and memory-efficient inference and serving engine for LLMs
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
PRIMA is a package for solving general nonlinear optimization problems without using derivatives. It provides the reference implementation for Powell's derivative-free optimization methods, i.e., C…
Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)
A guidance language for controlling large language models.
Effortless Python bindings for OCaml modules
A language for constraint-guided and efficient LLM programming.
🦜🔗 Build context-aware reasoning applications
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
A new markup-based typesetting system that is powerful and easy to learn.
Compositional Differentiable Programming Library
Adding guardrails to large language models.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Machine Learning Engineering Open Book
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python 3.8+ toolbox for submitting jobs to Slurm