Welcome to DataScrub Pro, a powerful data cleaning tool designed for professionals dealing with large datasets. Whether you're a data scientist, researcher, or analyst, DataScrub Pro simplifies and accelerates the process of preparing your data for analysis.
Key Features
- Upload & View: Instantly upload CSV and Excel files and view data in a table format.
- Cleaning Options: Customize data cleaning with various options:
- Remove duplicates
- Handle missing values (remove, fill with mean, mode, median, custom values, forward fill, backward fill)
- Correct date formats
- Delete outliers
- Convert data types
- Standardize capitalization
- Ensure structural consistency
- Merge duplicate rows
- Real-Time Updates: Visualize original and cleaned data side-by-side.
- Export: Export cleaned data to CSV with one click.
- Log: Detailed log of cleaning actions for transparency.
Technologies Used
-
Frontend:
- HTML
- CSS
- JavaScript
-
Libraries:
- PapaParse for CSV parsing
- SheetJS (xlsx) for Excel file handling
- Lodash for data manipulation
How to Use
- Upload File: Click "Upload File" to select and load your data (supports CSV, XLS, XLSX formats).
- Select Cleaning Options: Choose from various cleaning options.
- Clean Data: Click "Clean Data" to process the selected cleaning options.
- View and Export: See the cleaned data in real-time. Click "Export Data" to download the cleaned data as CSV.
- Clear: Use "Clear" to reset and start fresh.
Getting Started
To get started with DataScrub Pro:
- Clone this repository:
git clone https://github.com/MohammedNainia/DataScrub-Pro.git
- Open
index.html
in your web browser.
Feel free to explore, customize, and enhance DataScrub Pro based on your specific needs and requirements.
Author
- Mohammed Nainia
- GitHub: @MohammedNainia
- LinkedIn: Mohammed Nainia