josephinerobideau/pandas-challenge-1

Challenge 4 - Pandas


Overview and agenda for Pandas challenge

  1. Import dependencies
  • pandas
    • Used for all data-related tasks (e.g., data manipulation)
  2. Import and explore data
  • Read in the CSV
  • View the columns in the data
  • Gather basic statistics using the describe function
  • View the categories with the most and fewest entries
  • Find the top 5 clients with the most entries in the data
    • Store the client IDs of the top 5 in a list
  3. Transform the data
  • Create a column that calculates the subtotal for each line using unit_price and qty
  • Create columns for shipping price, total price, cost, and profit
  4. Confirm work
  • Check the calculated totals against the totals provided in the code
  5. Summarize and analyze
  • Determine how much each of the top 5 clients by quantity spent
  • Create a summary DataFrame
    • Include: total units purchased, total shipping price, total revenue, and total profit
    • Sort by total profit
  • Format the data and rename the columns
  • Sort the updated data by "Total Profit" from highest to lowest
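
The agenda above can be sketched end to end on a small in-memory DataFrame. This is a minimal illustration, not the notebook's actual code. Only `unit_price` and `qty` are named in this README; the columns `client_id`, `category`, `unit_cost`, and `unit_weight`, the `SHIPPING_RATE` value, the shipping and cost formulas, and all data values are assumptions made for the example. With the real data, the DataFrame would come from `pd.read_csv` and `TOP_N` would be 5.

```python
import pandas as pd

# Hypothetical stand-in for the challenge CSV. With the real file this would
# be loaded with pd.read_csv(...). Only `unit_price` and `qty` come from the
# README; every other column name and all values are assumed.
df = pd.DataFrame({
    "client_id": [101, 101, 202, 202, 303, 101],
    "category": ["decor", "decor", "consumables", "decor", "software", "consumables"],
    "unit_price": [10.0, 20.0, 5.0, 8.0, 100.0, 2.5],
    "unit_cost": [6.0, 15.0, 3.0, 5.0, 60.0, 1.0],
    "unit_weight": [1.0, 2.0, 0.5, 1.0, 0.0, 0.5],
    "qty": [3, 1, 10, 5, 2, 4],
})

# --- Explore ---
print(df.columns.tolist())           # column names
print(df.describe())                 # basic statistics for numeric columns
category_counts = df["category"].value_counts()   # most to fewest entries

TOP_N = 2  # the challenge uses 5; 2 suits this tiny sample
top_clients = df["client_id"].value_counts().head(TOP_N).index.tolist()

# --- Transform ---
df["subtotal"] = df["unit_price"] * df["qty"]

SHIPPING_RATE = 2.0  # assumed price per unit of weight, not from the README
df["shipping_price"] = df["unit_weight"] * df["qty"] * SHIPPING_RATE
df["total_price"] = df["subtotal"] + df["shipping_price"]
df["line_cost"] = df["unit_cost"] * df["qty"]      # assumed cost formula
df["profit"] = df["total_price"] - df["line_cost"]

# --- Confirm work: compare against known totals (here, hand-computed) ---
assert df["subtotal"].sum() == 350.0
assert df["total_price"].sum() == 384.0

# --- Summarize: one row per top client, sorted by profit, highest first ---
summary = (
    df[df["client_id"].isin(top_clients)]
    .groupby("client_id")
    .agg(
        total_units=("qty", "sum"),
        total_shipping=("shipping_price", "sum"),
        total_revenue=("total_price", "sum"),
        total_profit=("profit", "sum"),
    )
    .sort_values("total_profit", ascending=False)
    .rename(columns={
        "total_units": "Total Units",
        "total_shipping": "Total Shipping",
        "total_revenue": "Total Revenue",
        "total_profit": "Total Profit",
    })
)

# --- Format: currency strings go in a copy so `summary` stays numeric ---
formatted = summary.copy()
for col in ["Total Shipping", "Total Revenue", "Total Profit"]:
    formatted[col] = formatted[col].map("${:,.2f}".format)

print(formatted)
```

Sorting happens before formatting because the formatted columns hold strings, and sorting strings such as "$55.00" would order them alphabetically rather than numerically.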

Additional sources

Sources such as ChatGPT, Xpert Learning Assistant, AskBCS, Google, YouTube, and Columbia-provided tutoring were used for clarification and debugging where needed.
