  • Yale School of Medicine - Yale University

Starred repositories

Plotting library for IPython/Jupyter notebooks

TypeScript 3,597 463 Updated May 21, 2024

Python toolkit for quantitative finance

Jupyter Notebook 6,350 802 Updated Jul 10, 2024

Shortest solutions for CS231n 2021-2024

Jupyter Notebook 228 52 Updated May 25, 2024

LLM101n: Let's build a Storyteller

15,346 733 Updated Jun 28, 2024
Python 7 1 Updated Jun 10, 2023

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,722 793 Updated Jul 1, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 14,192 1,082 Updated Jul 10, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 9,827 1,135 Updated Jul 10, 2024

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python 2,550 470 Updated Jul 10, 2024

Convert PDF to HTML without losing text or format.

HTML 3,545 354 Updated Jul 4, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 5,724 537 Updated Jul 10, 2024

Collection of data science projects in Python

Jupyter Notebook 1,463 393 Updated Nov 11, 2023

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 29,381 7,325 Updated Jul 8, 2024

Things that you should (and should not) do in your Materials Informatics research.

Jupyter Notebook 163 72 Updated Nov 17, 2023

A guideline for building practical production-level deep learning systems to be deployed in real world applications.

4,271 634 Updated Nov 17, 2023

Notes and links from the book club meetings

Jupyter Notebook 459 95 Updated Jun 16, 2024

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 282,334 37,612 Updated Jul 10, 2024

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,081 207 Updated Jul 3, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,033 193 Updated Jun 24, 2024

The Open Source Feature Store for Machine Learning

Python 5,373 954 Updated Jul 11, 2024

Open-source Library PyGDebias: Graph Datasets and Fairness-Aware Graph Mining Algorithms

Python 56 7 Updated May 7, 2024

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…

Python 26,731 7,765 Updated Mar 20, 2024

📖 A collection of pure bash alternatives to external processes.

Shell 36,286 3,255 Updated Nov 28, 2023

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 263,821 44,728 Updated Jun 29, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,094 751 Updated Jul 10, 2024

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,637 668 Updated Jan 14, 2024

Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the training on multiple AWS GPU instances

Python 49 6 Updated Jun 20, 2023

A lightweight AutoML library.

Python 148 11 Updated Apr 1, 2024