Starred repositories
Notes and material for the "Machine Learning Engineer Nanodegree" (MLND) by Udacity.
Notebooks using the Hugging Face libraries 🤗
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Practice your pandas skills!
An Open Source Machine Learning Framework for Everyone
Tensors and Dynamic neural networks in Python with strong GPU acceleration
High-Performance Serverless event and data processing platform
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K…
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
A high-throughput and memory-efficient inference and serving engine for LLMs
A tool for converting numerous python objects (numpy, sympy) into latex strings
⚡ A Fast, Extensible Progress Bar for Python and CLI
The official Python client for the Huggingface Hub.
This is a sample Scrapy project for educational purposes
Stable Diffusion with Core ML on Apple Silicon
Neo4j Movies Example with Spring Data Neo4j
The code and the dataset of experiments reported in papers "Revisiting the Tag Relevance Prediction Problem" and "The tag genome: Encoding community knowledge to support novel interaction."
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, …
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Read, play, and download millions of books; served by archive.org.
Unsupervised text tokenizer for Neural Network-based text generation.
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
A conda-smithy repository for tensorflow-datasets.