Skip to content

Commit

Permalink
🎉 Missing Values in Data
Browse files Browse the repository at this point in the history
  • Loading branch information
mgobeameli committed May 9, 2024
0 parents commit 7142ab4
Show file tree
Hide file tree
Showing 14 changed files with 790 additions and 0 deletions.
Binary file added .DS_Store
Binary file not shown.
1 change: 1 addition & 0 deletions Init notebook.ipynb
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
{"cells":[{"cell_type":"markdown","source":"## Initialization Notebook\nThis is your initialization notebook.\n\n**What's this for?**\n\nYou can put custom code you want us to run every time we setup your environment in here. \n\n**Awesome! Anything I should not put in here?**\n\nPlease don't install `jupyter` or `jedi` packages, they would break your Deepnote environment. Also, no need to put `!pip install`s here, we already save those automatically!\n\n**I want to learn more!**\n\nGreat! Just [head over to our docs](https://deepnote.com/docs/custom-initialization).","metadata":{"tags":[],"cell_id":"e0fa9723dd234fc9ac112578e7a15235","deepnote_cell_type":"markdown"}},{"cell_type":"code","source":"%%bash\n# If your project has a 'requirements.txt' file, we'll install it here apart from blacklisted packages that interfere with Deepnote (see above).\nif test -f requirements.txt\n then\n sed -i '/jedi/d;/jupyter/d;' ./requirements.txt\n pip install -r ./requirements.txt\n else echo \"There's no requirements.txt, so nothing to install. This is the case with most projects.\"\nfi","metadata":{"tags":[],"cell_id":"26fd03a25a704e5db772178d12a11389","source_hash":"60b37886","execution_start":1666732261190,"execution_millis":21514,"deepnote_to_be_reexecuted":false,"deepnote_cell_type":"code"},"outputs":[{"name":"stdout","text":"Collecting pyjanitor\n Downloading pyjanitor-0.23.1-py3-none-any.whl (147 kB)\n ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 147.7/147.7 KB 14.1 MB/s eta 0:00:00\nCollecting missingno\n Downloading missingno-0.5.1-py3-none-any.whl (8.7 kB)\nRequirement already satisfied: numpy in /shared-libs/python3.9/py/lib/python3.9/site-packages (from -r ./requirements.txt (line 3)) (1.23.4)\nCollecting matplotlib==3.5.1\n Downloading matplotlib-3.5.1-cp39-cp39-manylinux_2_5_x86_64.manylinux1_x86_64.whl (11.2 MB)\n ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 11.2/11.2 MB 48.3 MB/s eta 0:00:00\nRequirement already satisfied: pandas in /shared-libs/python3.9/py/lib/python3.9/site-packages (from -r ./requirements.txt (line 5)) (1.2.5)\nCollecting pyreadr\n Downloading pyreadr-0.4.7-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (365 kB)\n ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 365.3/365.3 KB 32.1 MB/s eta 0:00:00\nRequirement already satisfied: seaborn in /shared-libs/python3.9/py/lib/python3.9/site-packages (from -r ./requirements.txt (line 7)) (0.12.1)\nCollecting session-info\n Downloading session_info-1.0.0.tar.gz (24 kB)\n Preparing metadata (setup.py): started\n Preparing metadata (setup.py): finished with status 'done'\nCollecting upsetplot==0.6.1\n Downloading UpSetPlot-0.6.1.tar.gz (18 kB)\n Preparing metadata (setup.py): started\n Preparing metadata (setup.py): finished with status 'done'\nRequirement already satisfied: python-dateutil>=2.7 in /shared-libs/python3.9/py-core/lib/python3.9/site-packages (from matplotlib==3.5.1->-r ./requirements.txt (line 4)) (2.8.2)\nRequirement already satisfied: cycler>=0.10 in /shared-libs/python3.9/py/lib/python3.9/site-packages (from matplotlib==3.5.1->-r ./requirements.txt (line 4)) (0.11.0)\nRequirement already satisfied: packaging>=20.0 in /shared-libs/python3.9/py-core/lib/python3.9/site-packages (from matplotlib==3.5.1->-r ./requirements.txt (line 4)) (21.3)\nRequirement already satisfied: fonttools>=4.22.0 in /shared-libs/python3.9/py/lib/python3.9/site-packages (from matplotlib==3.5.1->-r ./requirements.txt (line 4)) (4.37.4)\nRequirement already satisfied: pyparsing>=2.2.1 in /shared-libs/python3.9/py-core/lib/python3.9/site-packages (from matplotlib==3.5.1->-r ./requirements.txt (line 4)) (3.0.9)\nRequirement already satisfied: pillow>=6.2.0 in /shared-libs/python3.9/py/lib/python3.9/site-packages (from matplotlib==3.5.1->-r ./requirements.txt (line 4)) (9.2.0)\nRequirement already satisfied: kiwisolver>=1.0.1 in /shared-libs/python3.9/py/lib/python3.9/site-packages (from matplotlib==3.5.1->-r ./requirements.txt (line 4)) (1.4.4)\nRequirement already satisfied: scipy in /shared-libs/python3.9/py/lib/python3.9/site-packages (from pyjanitor->-r ./requirements.txt (line 1)) (1.9.3)\nCollecting pandas-flavor\n Downloading pandas_flavor-0.3.0-py3-none-any.whl (6.3 kB)\nCollecting natsort\n Downloading natsort-8.2.0-py3-none-any.whl (37 kB)\nCollecting multipledispatch\n Downloading multipledispatch-0.6.0-py3-none-any.whl (11 kB)\nRequirement already satisfied: pytz>=2017.3 in /shared-libs/python3.9/py/lib/python3.9/site-packages (from pandas->-r ./requirements.txt (line 5)) (2022.5)\nCollecting stdlib_list\n Downloading stdlib_list-0.8.0-py3-none-any.whl (63 kB)\n ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 63.5/63.5 KB 2.2 MB/s eta 0:00:00\nRequirement already satisfied: six>=1.5 in /shared-libs/python3.9/py-core/lib/python3.9/site-packages (from python-dateutil>=2.7->matplotlib==3.5.1->-r ./requirements.txt (line 4)) (1.16.0)\nCollecting lazy-loader==0.1rc2\n Downloading lazy_loader-0.1rc2-py3-none-any.whl (5.1 kB)\nCollecting xarray\n Downloading xarray-2022.10.0-py3-none-any.whl (947 kB)\n ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 947.6/947.6 KB 77.3 MB/s eta 0:00:00\nCollecting pandas\n Downloading pandas-1.5.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (12.2 MB)\n ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 12.2/12.2 MB 47.1 MB/s eta 0:00:00\nBuilding wheels for collected packages: upsetplot, session-info\n Building wheel for upsetplot (setup.py): started\n Building wheel for upsetplot (setup.py): finished with status 'done'\n Created wheel for upsetplot: filename=UpSetPlot-0.6.1-py3-none-any.whl size=19089 sha256=97af255b1c0f04b836d3fd2461e1bdd5570c4a616970e01b86466de2f385a092\n Stored in directory: /root/.cache/pip/wheels/f5/7e/35/f589284b59821650401f948f353cfd843889d2e359ec85bafd\n Building wheel for session-info (setup.py): started\n Building wheel for session-info (setup.py): finished with status 'done'\n Created wheel for session-info: filename=session_info-1.0.0-py3-none-any.whl size=8048 sha256=192189aa9de298c21a45feb1d6f1e467407bde607af261b5ff9ffce0de4faf72\n Stored in directory: /root/.cache/pip/wheels/d4/fc/2e/00ca60bac7954b84907efd41baa9b4853500eaeec4228410c6\nSuccessfully built upsetplot session-info\nInstalling collected packages: stdlib_list, session-info, natsort, multipledispatch, lazy-loader, pandas, matplotlib, xarray, upsetplot, pyreadr, pandas-flavor, missingno, pyjanitor\n Attempting uninstall: pandas\n Found existing installation: pandas 1.2.5\n Not uninstalling pandas at /shared-libs/python3.9/py/lib/python3.9/site-packages, outside environment /root/venv\n Can't uninstall 'pandas'. No files were found to uninstall.\n Attempting uninstall: matplotlib\n Found existing installation: matplotlib 3.6.0\n Not uninstalling matplotlib at /shared-libs/python3.9/py/lib/python3.9/site-packages, outside environment /root/venv\n Can't uninstall 'matplotlib'. No files were found to uninstall.\nSuccessfully installed lazy-loader-0.1rc2 matplotlib-3.5.1 missingno-0.5.1 multipledispatch-0.6.0 natsort-8.2.0 pandas-1.5.1 pandas-flavor-0.3.0 pyjanitor-0.23.1 pyreadr-0.4.7 session-info-1.0.0 stdlib_list-0.8.0 upsetplot-0.6.1 xarray-2022.10.0\nWARNING: You are using pip version 22.0.4; however, version 22.3 is available.\nYou should consider upgrading via the '/root/venv/bin/python -m pip install --upgrade pip' command.\n","output_type":"stream"}],"execution_count":1},{"cell_type":"markdown","source":"<a style='text-decoration:none;line-height:16px;display:flex;color:#5B5B62;padding:10px;justify-content:end;' href='https://deepnote.com?utm_source=created-in-deepnote-cell&projectId=694a3d08-7f18-421d-9e2f-c2820a79680e' target=\"_blank\">\n<img alt='Created in deepnote.com' style='display:inline;max-height:16px;margin:0px;margin-right:7.5px;' src='data:image/svg+xml;base64,PD94bWwgdmVyc2lvbj0iMS4wIiBlbmNvZGluZz0iVVRGLTgiPz4KPHN2ZyB3aWR0aD0iODBweCIgaGVpZ2h0PSI4MHB4IiB2aWV3Qm94PSIwIDAgODAgODAiIHZlcnNpb249IjEuMSIgeG1sbnM9Imh0dHA6Ly93d3cudzMub3JnLzIwMDAvc3ZnIiB4bWxuczp4bGluaz0iaHR0cDovL3d3dy53My5vcmcvMTk5OS94bGluayI+CiAgICA8IS0tIEdlbmVyYXRvcjogU2tldGNoIDU0LjEgKDc2NDkwKSAtIGh0dHBzOi8vc2tldGNoYXBwLmNvbSAtLT4KICAgIDx0aXRsZT5Hcm91cCAzPC90aXRsZT4KICAgIDxkZXNjPkNyZWF0ZWQgd2l0aCBTa2V0Y2guPC9kZXNjPgogICAgPGcgaWQ9IkxhbmRpbmciIHN0cm9rZT0ibm9uZSIgc3Ryb2tlLXdpZHRoPSIxIiBmaWxsPSJub25lIiBmaWxsLXJ1bGU9ImV2ZW5vZGQiPgogICAgICAgIDxnIGlkPSJBcnRib2FyZCIgdHJhbnNmb3JtPSJ0cmFuc2xhdGUoLTEyMzUuMDAwMDAwLCAtNzkuMDAwMDAwKSI+CiAgICAgICAgICAgIDxnIGlkPSJHcm91cC0zIiB0cmFuc2Zvcm09InRyYW5zbGF0ZSgxMjM1LjAwMDAwMCwgNzkuMDAwMDAwKSI+CiAgICAgICAgICAgICAgICA8cG9seWdvbiBpZD0iUGF0aC0yMCIgZmlsbD0iIzAyNjVCNCIgcG9pbnRzPSIyLjM3NjIzNzYyIDgwIDM4LjA0NzY2NjcgODAgNTcuODIxNzgyMiA3My44MDU3NTkyIDU3LjgyMTc4MjIgMzIuNzU5MjczOSAzOS4xNDAyMjc4IDMxLjY4MzE2ODMiPjwvcG9seWdvbj4KICAgICAgICAgICAgICAgIDxwYXRoIGQ9Ik0zNS4wMDc3MTgsODAgQzQyLjkwNjIwMDcsNzYuNDU0OTM1OCA0Ny41NjQ5MTY3LDcxLjU0MjI2NzEgNDguOTgzODY2LDY1LjI2MTk5MzkgQzUxLjExMjI4OTksNTUuODQxNTg0MiA0MS42NzcxNzk1LDQ5LjIxMjIyODQgMjUuNjIzOTg0Niw0OS4yMTIyMjg0IEMyNS40ODQ5Mjg5LDQ5LjEyNjg0NDggMjkuODI2MTI5Niw0My4yODM4MjQ4IDM4LjY0NzU4NjksMzEuNjgzMTY4MyBMNzIuODcxMjg3MSwzMi41NTQ0MjUgTDY1LjI4MDk3Myw2Ny42NzYzNDIxIEw1MS4xMTIyODk5LDc3LjM3NjE0NCBMMzUuMDA3NzE4LDgwIFoiIGlkPSJQYXRoLTIyIiBmaWxsPSIjMDAyODY4Ij48L3BhdGg+CiAgICAgICAgICAgICAgICA8cGF0aCBkPSJNMCwzNy43MzA0NDA1IEwyNy4xMTQ1MzcsMC4yNTcxMTE0MzYgQzYyLjM3MTUxMjMsLTEuOTkwNzE3MDEgODAsMTAuNTAwMzkyNyA4MCwzNy43MzA0NDA1IEM4MCw2NC45NjA0ODgyIDY0Ljc3NjUwMzgsNzkuMDUwMzQxNCAzNC4zMjk1MTEzLDgwIEM0Ny4wNTUzNDg5LDc3LjU2NzA4MDggNTMuNDE4MjY3Nyw3MC4zMTM2MTAzIDUzLjQxODI2NzcsNTguMjM5NTg4NSBDNTMuNDE4MjY3Nyw0MC4xMjg1NTU3IDM2LjMwMzk1NDQsMzcuNzMwNDQwNSAyNS4yMjc0MTcsMzcuNzMwNDQwNSBDMTcuODQzMDU4NiwzNy43MzA0NDA1IDkuNDMzOTE5NjYsMzcuNzMwNDQwNSAwLDM3LjczMDQ0MDUgWiIgaWQ9IlBhdGgtMTkiIGZpbGw9IiMzNzkzRUYiPjwvcGF0aD4KICAgICAgICAgICAgPC9nPgogICAgICAgIDwvZz4KICAgIDwvZz4KPC9zdmc+' > </img>\nCreated in <span style='font-weight:600;margin-left:4px;'>Deepnote</span></a>","metadata":{"tags":[],"created_in_deepnote_cell":true,"deepnote_cell_type":"markdown"}}],"nbformat":4,"nbformat_minor":0,"metadata":{"deepnote":{},"orig_nbformat":2,"deepnote_notebook_id":"f8df5845ef104aeca7c399dd598d9943","deepnote_execution_queue":[]}}
7 changes: 7 additions & 0 deletions data/missing_data_enconding_example.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
x,y,z
1,A,-100
3,N/A,-99
NA,NA,-98
-99,E,-101
-98,F,-1
-99,G,-1
Binary file added data/oceanbuoys.rda
Binary file not shown.
Binary file added data/pedestrian.rda
Binary file not shown.
Loading

0 comments on commit 7142ab4

Please sign in to comment.