Stars
Source code of Paper: Scalable and Interpretable One-class SVMs with Deep Learning and Random Fourier features
A quick visualization tool for Jupyter and Neo4J
Faker is a Python package that generates fake data for you.
sidetable builds simple but useful summary tables of your data
pyforest - feel the bliss of automated imports
Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
pandas, scikit-learn, xgboost and seaborn integration
DuckDB is an analytical in-process SQL database management system
Streamlit — A faster way to build and share data apps.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Statsmodels: statistical modeling and econometrics in Python
A scikit-learn compatible neural network library that wraps PyTorch
Visualize and compare datasets, target values and associations, with one line of code.
A library of extension and helper modules for Python's data analysis and machine learning libraries.
An open source python library for automated feature engineering
A simple and efficient tool to parallelize Pandas operations on all available CPUs
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Missing data visualization module for Python.
Modin: Scale your Pandas workflows by changing a single line of code
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
An open-source, low-code machine learning library in Python
Visual analysis and diagnostic tools to facilitate machine learning model selection.
A graph-based tool for visualizing effective access and resource relationships in AWS environments.