Skip to content
View TuanNguyen27's full-sized avatar
☀️
☀️

Block or report TuanNguyen27

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Class notes for the course "Long Term Memory in AI - Vector Search and Databases" COS 597A @ Princeton Fall 2023

TeX 305 33 Updated Nov 18, 2023

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,928 3,224 Updated Aug 17, 2024

A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.

Python 1,359 83 Updated Aug 29, 2024

Fast SHAP value computation for interpreting tree-based models

Python 506 31 Updated Jun 26, 2023

The release of the Twitter algorithm, annotated for recsys

480 26 Updated Apr 15, 2023

Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.

Jupyter Notebook 111 19 Updated Mar 29, 2024

Statistical Rethinking Course for Jan-Mar 2023

R 2,184 246 Updated Nov 28, 2023

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,355 723 Updated Aug 29, 2024

Flyte Documentation 📖

Python 74 107 Updated Aug 29, 2024

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

57,936 5,984 Updated Sep 1, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 11,439 1,413 Updated Aug 18, 2024

by ex-googlers, for ex-googlers - a lookup table of similar tech & services

14,503 1,039 Updated Jul 26, 2024

The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!

Python 6,944 775 Updated Sep 3, 2024

Approximate Nearest Neighbor Search for Sparse Data in Python!

Python 915 145 Updated Oct 2, 2020

It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced research…

4,501 296 Updated Jan 21, 2022

Coarse-grained lineage and tracing for machine learning pipelines.

Python 464 29 Updated Nov 11, 2022

A collection of (mostly) technical things every software developer should know about

82,060 7,693 Updated Aug 6, 2024

A light-weight, flexible, and expressive statistical data testing library

Python 3,231 300 Updated Sep 3, 2024

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

Python 1,954 93 Updated Aug 16, 2024

Source code accompanying O'Reilly book: Machine Learning Design Patterns

Jupyter Notebook 1,865 527 Updated Apr 28, 2021

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python 2,018 135 Updated Sep 3, 2024

📌 Papers, guides, and mentor interviews on applying machine learning for ApplyingML.com—the ghost knowledge of machine learning.

MDX 190 31 Updated Jun 5, 2024

Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tuplex has similar Python APIs to Apache Spark or Dask, but rath…

C++ 811 46 Updated Mar 28, 2024

Automatically visualize your pandas dataframe via a single print! 📊 💡

Python 5,135 364 Updated Mar 20, 2024

Preparation links and resources for system design questions

8,755 2,453 Updated May 10, 2024

📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)

535 90 Updated Mar 16, 2023

State of the Art Natural Language Processing

Scala 3,798 707 Updated Sep 3, 2024

MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.

Python 321 62 Updated Aug 29, 2024

A C++ standalone library for machine learning

C++ 5,239 495 Updated Aug 6, 2024

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python 2,587 482 Updated Aug 30, 2024
Next