SMU MBA ML/AI Workshop
- Install PyCharm
- Setup a Github account: https://github.com/
- Download the Github Desktop app for your OS
- Click on the top left tab that says "Current Repository"
- Click add and clone the repository: https://github.com/esperie/ml_basics.git
- Organize all your repositories into a folder
- Install PyCharm
- Create New Project in PyCharm and point to the local repository folder
- Under Python interpreter, select "New Virtual Environment"
- (Optional) Some settings to consider
- Settings > Tools > Actions on Save > Check "Reformat Code"
- Settings > Build, Execution, Deployment > Console > Uncheck "Show Console Variables by Default"
- Our data is shared
in: https://www.dropbox.com/sh/2fju8lxw7fk2hzc/AABulJvzPjnzzk7S2iOHczCra?dl=0
- Copy the files to your current project folder under store
- Open the terminal in PyCharm.
- Install the packages using the following
command:
pip3 install -U pip pandas colorama tqdm numpy matplotlib scikit-learn scipy plotly pycaret"[full]" ydata-profiling django fastapi"[all]" httpx openpyxl passlib psycopg2-binary pyarrow requests s3fs transformers
- Go to https://pytorch.org/get-started/locally/ and follow the instructions to install pytorch on your OS (and whether your machine comes with a GPU or not).
- Install XCode command line tools using the following command:
xcode-select --install
- Install homebrew with:
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
- Install miniforge with:
brew install miniforge
- Go to settings > Project > Project Interpreter > Click on Add Interpreter >
Click Add Local Interpreter > Choose Conda Environment and key in the
following in the box:
/opt/homebrew/bin/conda
- Create new environment.
- Install the packages using the following
command:
conda install pip pandas colorama tqdm numpy matplotlib scikit-learn scipy plotly ydata-profiling django httpx openpyxl passlib psycopg2-binary pyarrow requests s3fs transformers pycaret
- Run:
pip3 install fastapi"[all]"
- All functionalities are now consolidated under the applications folder.
- Each applications folder should have the following structure.
data: all data etl and wrangling codes
algo: logic, analytics, ML, DL, AI codes
models (if created): Django data models
other django files (if created.)
views: to serve the data and algo models
- logs: All log files are kept here.
- store: All data files are kept here. Model files too.
- tools: All tools that are not applications specific are kept here.
- config: All configuration files are kept here.
# All imports on the top
# Setup and global variables section
# Functions section
# Classes section
if __name__ == "__main__':
# execution code
# test code