Skip to content

Victor-Miron/How_high

Repository files navigation

How High Project

Project Description

The How High Project focuses on analyzing and processing datasets related to F1 drivers and constructor teams. The project aims to provide insights into the performance metrics of F1 entities and to visualize the data using tools like Tableau.

Future Enhancements

  • Set up a local database and read data in with SQLite.
  • Clean data and perform a SQL join with data sets using plain SQL.
  • Develop a user-friendly interface for data interaction through Tableau.
  • Utilize a virtual environment and include instructions in the README.md file on how to set one up.
  • Annotate .py files with well-written comments and a clear README.md.

Prerequisites

  • Python 3.10 or later
  • Virtual Environment (optional but recommended)
  • Visual Studio Code (VS Code) or any other preferred IDE with Jupyter Notebook support

Installation Instructions

To set up this project, follow these steps:

  1. Clone the Repository:

    Open your terminal or IDE terminal and run:

    git clone [email protected]:Victor-Miron/How_high.git
  2. Navigate to the Project Directory:

    cd How_high
  3. Create and Activate a Virtual Environment:

    python -m venv myvenv #python3 -m venv myvenv on linux
    source myvenv/bin/activate   # On Windows, use `myvenv\Scripts\activate`
  4. Install the Required Dependencies:

    pip install -r requirements.txt
  5. Install Jupyter Notebook (if not already installed):

    pip install notebook

Usage

Running the Project

  1. Open the Project in Your IDE:

    • Open Visual Studio Code (VS Code) or your preferred IDE.
    • Go to File > Open Folder and select the project directory.
  2. Run the Main Script:

    • Open the terminal within your IDE (View > Terminal in VS Code).

    • Run the main script:

      python Create_Greatest.py # python3 Create_Greatest.py on linux

    This will execute all necessary steps to process the data and create the database Greatest.db.

Visualizing the Clean Data

To visualize the clean data, use DB Browser for SQLite:

  1. Install DB Browser for SQLite:

  2. Open the Database:

    • Launch DB Browser for SQLite and open the generated SQLite database file located in the project directory.
  3. Explore the Data:

    • Use the built-in tools to explore tables, run queries, and visualize the clean data.
    • The clean data will be in two tables:
      • Constructor Data
      • Drivers Data

Tableau Visualization

Visualize the processed data in Tableau by visiting the following link (I advise using Chrome or Firefox for a better experience):

View on Tableau

Running the Data Handling Demo

  1. Launch Jupyter Notebook from Your IDE:

    • Open the terminal within your IDE (View > Terminal in VS Code).

    • Run Jupyter Notebook:

      jupyter notebook
  2. Open the Notebook:

    • In the Jupyter Notebook interface, navigate to the project directory and open Data_handling_demo.ipynb.
  3. Run the Notebook:

    • Execute the cells in the notebook to run the data handling demo.

Script Descriptions

For reviewers who want more information about the code, each Python script in this project serves a specific purpose. Below is the order and description of each script:

  1. Raw_Database.py: Initializes the database structure required for storing F1 data.

  2. Drivers_data.py: Creates and cleans the drivers data table using join.

  3. Constructor_Data.py: Creates and cleans the constructor data table using join.

  4. Convert_Drivers_data_to_sql.py: Transforms and loads driver data into the SQL database for further analysis.

  5. Convert_Constructor_data_to_sql.py: Transforms and loads constructor data into the SQL database for further analysis.

  6. Data_handling_demo.ipynb: Demonstrates data manipulation techniques in Python. Also, available as a Jupyter Notebook for interactive exploration.

Directory Structure

  • archive/: Contains archived datasets and backup files.
  • .gitignore/: Used to specify files and directories that should not be tracked by Git. This is important to keep the repository clean and avoid committing files that are not necessary for others to run the project.

Contributors

  • Victor Miron: Project Lead and Developer

This README.md file provides a comprehensive overview of the project, including setup instructions, usage guidelines, visualization tools, and a link to visualize the data in Tableau. Feel free to further customize any sections or provide additional details as needed!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published