Skip to content

catarinaacsilva/kdd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Knowledge Discovery in Dataset

A set of tasks as an introduction to data mining.

  • Task 1: Exploratory Data Analysis
  • Task 2: Rank the numerical features using ANOVA; PCA and KPCA
  • Task 3: Linear Discriminant Functions and KNN
  • Task 4: Model Evaluation
  • Paper analysis: Replicate the steps followed on a published paper and added improvements on it

Note: Initial folder contains the first project just to initial exploration

Requirements

  • ubuntu 20.04
  • python 3

Installation

  1. Clone of the repository and cd kdd

  2. Create a virtual environment:

    1.1 python -m venv venv 1.2 source venv/bin/activate

  3. Install the requirementspip install -r requirements.txt

Use jupyter

  1. Run: jupyter notebook

  2. Open the following address on browser: http:https://localhost:8888/tree

  3. Run: jupyter notebook list

  4. Copy the token and paste on http:https://localhost:8888/tree box to password or token

Authors

License

This project is licensed under the MIT License - see the LICENSE file for details