Skip to content

zoelesv/airbnbnyc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Airbnb NYC Exploratory Data Analysis

New York City Python Pandas Plotly HTML5 CSS JavaScript

About

Airbnb, Inc. is an American company that operates an online marketplace for lodging, with a primary focus on homestays for vacation rentals and tourism activities. Headquartered in San Francisco, California, the platform is accessible through both a website and a mobile app. Our project delves into the large dataset provided by the company to extract important references that significantly impact business metrics.

Project Description

In this project we use Python to explore data and statistical measures to communicate data concepts.

Data Exploratory

  • Get the dataset from Kaggle.
  • Examine the data information.
  • There are total:
    • 102599 listings
    • 5 neighborhood groups: Manhattan, Brooklyn, Queens, Bronx, Staten Island.
    • 4 room types: Private room, Entire home/Apartments, Shared room, Hotel room.
  • Perform basic data cleaning, such as handling missing values and data type conversions.

Afterwards, we create the following plots using Plotly and Matplotlib.

Choropleth Map

choropleth Visualize the distribution of Airbnb listings across a geographic area using a choropleth map.

Scatter Plot

scatterplot Visualize the distribution of Airbnb price and rating in each neighborhood.

Word Cloud

word cloud Generate a word cloud from the names of listings.

Stacked Bar Graph and Pie Chart

stackedbargraph Create a bar graph to visualize the average price of room types (private room, entire home, shared room, hotel) in each neighborhood group. Implement Gestalt Principles: Law of Focal Point + Connectedness + Similarity to highlight insights.

  • Although Private rooms and Entire homes/Apartments dominate the number of listings on Airbnb (almost 98%), they generally have lower price and ratings compared to Shared rooms and Hotels. Given their prevalence, it can be inferred that a significant portion of Airbnb's revenue is generated from commissions on these signature room types.

Outcomes

This exploratory data analysis shows that there is not much of a correlation between price and customer satisfaction; it really depends on customers' budget and past experience. However, there are a lot of potential for private rooms and entire homes/apartments to be improved.

The visualization demonstrates a strong contrast and captures the interest of the audience. Please visit our website for an interactive and responsive experience!

Image 1 Image 2 Image 3 Image 4

Files and Directories

  • assets - a directory contains all images and visualization files.
  • notebooks - a directory contains all Jupiter Notebook files.
  • README.md - project description
  • index.html - main template for homepage when users go to the website.
  • script.js
  • style.css
  • white.html - white template for homepage when users click the lightbulb.

Installation

  • Fork this repository.
  • Update assets and notebooks.
  • Update index.html and white.html with your own assets.
  • Deploy with Github Pages or using your own domains.

Authors

  • Visualization: Arios Tong, John Bailey, Zoe Le
  • Website: Zoe Le
  • Git Repository

About

Data visualization with Python & Plotly.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published