- Import dependencies
- pandas
- Used for all data related tasks (ex. data manipulation)
- Import and explore data
- Read in CSV
- View columns in data
- Gather basic statistics using the describe function
- View categories with the most and least entries
- Find the top 5 clients which had the most entries in the data
- Store client IDs in a list for the top 5
- Transform the data
- Create a column that calculates the subtotal for each line using the unit_price and the qty
- Create a column for shipping price, total price, cost, and profit
- Confirm work
- Check totals with the totals provided in the code
- Summarize and Analyze
- Figure out how much each of the top 5 clients spent by quantity
- Create a summary DataFrame
- Include : total units purchased, total shipping price, total revenue, and total profit
- Sort by total profit
- Format the data and rename the columns
- Sort the updated data by "Total Profit" from highest to lowest