OpenRefine is a free, open source power tool for working with messy data and improving it
-
Updated
Nov 1, 2024 - Java
OpenRefine is a free, open source power tool for working with messy data and improving it
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Carefully curated resource links for data science in one place
Blazing-fast Data-Wrangling toolkit
A Python toolbox for gaining geometric insights into high-dimensional data
Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Prepping tables for machine learning
AI-data warehouse to enrich, transform and analyze data from cloud storages
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
Materials for following along with Hands-On Data Analysis with Pandas – Second Edition
Materials for following along with Hands-On Data Analysis with Pandas.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.
Data Analysis and Visualization in R for Ecologists
Pacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)
Like awk but with SQL and table joins
Add a description, image, and links to the data-wrangling topic page so that developers can more easily learn about it.
To associate your repository with the data-wrangling topic, visit your repo's landing page and select "manage topics."