lesson3-planet download links using kaggle api #468

Open
freddyrayes opened this issue Feb 7, 2020 · 4 comments

Comments

@freddyrayes

In the notebook lesson3-planet (nbs/dl1/lesson3-planet.ipynb), the Kaggle API commands fail to download the training files:

! kaggle competitions download -c planet-understanding-the-amazon-from-space -f train-jpg.tar.7z -p {path}  
! kaggle competitions download -c planet-understanding-the-amazon-from-space -f train_v2.csv -p {path}  

Both return 404 File not Found.

The file listing shows that train-jpg is not present:

kaggle competitions files -c planet-understanding-the-amazon-from-space
name                                                size  creationDate         
-------------------------------------------------  -----  -------------------  
test_v2_file_mapping.csv/test_v2_file_mapping.csv  600KB  2019-12-15 22:14:13  
sample_submission_v2.csv/sample_submission_v2.csv    3MB  2019-12-15 22:14:13  
train_v2.csv/train_v2.csv                            1MB  2019-12-15 22:14:13  
Kaggle-planet-train-tif.torrent                      1MB  2019-12-15 22:14:13  
Kaggle-planet-test-tif.torrent                       2MB  2019-12-15 22:14:13  

Partial solution: the download of train_v2.csv works when using the updated path:
kaggle competitions download -c planet-understanding-the-amazon-from-space -f train_v2.csv/train_v2.csv
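A minimal sketch of how the updated path could be looked up from the file listing instead of being hard-coded. The helper names `resolve_remote_name` and `download_command` are made up for illustration, and the listing text is simply copied from the output above; the `-c`, `-f` and `-p` flags are the ones already used in the notebook commands.

```python
from typing import List, Optional

# Listing copied from the `kaggle competitions files` output in this issue.
LISTING = """\
name                                                size  creationDate
-------------------------------------------------  -----  -------------------
test_v2_file_mapping.csv/test_v2_file_mapping.csv  600KB  2019-12-15 22:14:13
sample_submission_v2.csv/sample_submission_v2.csv    3MB  2019-12-15 22:14:13
train_v2.csv/train_v2.csv                            1MB  2019-12-15 22:14:13
Kaggle-planet-train-tif.torrent                      1MB  2019-12-15 22:14:13
Kaggle-planet-test-tif.torrent                       2MB  2019-12-15 22:14:13
"""


def resolve_remote_name(listing: str, wanted: str) -> Optional[str]:
    """Return the listed name whose last path component equals `wanted`, or None."""
    for line in listing.splitlines():
        parts = line.split()
        if not parts:
            continue
        name = parts[0]  # first column of the listing is the file name
        if name.rsplit("/", 1)[-1] == wanted:
            return name
    return None


def download_command(competition: str, remote_name: str, dest: str) -> List[str]:
    """Build the argument list for the download command (hypothetical helper)."""
    return ["kaggle", "competitions", "download",
            "-c", competition, "-f", remote_name, "-p", dest]


remote = resolve_remote_name(LISTING, "train_v2.csv")
if remote is not None:
    cmd = download_command("planet-understanding-the-amazon-from-space", remote, "data")
    print(" ".join(cmd))
```

The same lookup returns `None` for train-jpg.tar.7z, which matches the 404 seen for that file.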

Workaround:
Download train-jpg.tar manually from Kaggle's competition data page (https://www.kaggle.com/c/planet-understanding-the-amazon-from-space/data)
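Once the archive has been downloaded by hand, it still needs to be unpacked into the data folder. A rough sketch, assuming the manual download really is a plain .tar file (not the .tar.7z the notebook expects) and that it has been copied next to the notebook; `extract_tar` and the paths in the usage comment are placeholders, not names from the notebook.

```python
import tarfile
from pathlib import Path
from typing import List


def extract_tar(archive, dest) -> List[str]:
    """Extract a tar archive into `dest` and return the top-level entries created there."""
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    # tarfile.open detects compression automatically when opening for reading.
    with tarfile.open(archive) as tar:
        tar.extractall(dest)
    return sorted(p.name for p in dest.iterdir())


# Usage (placeholder paths, adjust to wherever the file was saved):
# print(extract_tar("train-jpg.tar", "data/planet"))
```

Only extract archives from a source you trust, since `extractall` writes whatever paths the archive contains.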

@lbergelson

It seems like someone has uploaded the jpgs to a different project as well.

You can get what I believe are the same files from nikitarom/planets-dataset using:

!kaggle datasets download nikitarom/planets-dataset 

The training csv has a different name and the paths are slightly different, but it seems to be a way forward.
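Since the csv name and the image paths differ from what the notebook expects, it may help to inspect the extracted folder before editing the notebook cells. A small sketch, assuming the dataset zip has already been unpacked into some local folder; `summarize_dataset` and the folder name in the usage comment are hypothetical, and no particular layout is assumed.

```python
from pathlib import Path
from typing import Dict, List


def summarize_dataset(root) -> Dict[str, List[str]]:
    """List csv files and the folders that contain jpg images, relative to `root`."""
    root = Path(root)
    csv_files = sorted(p.relative_to(root).as_posix() for p in root.rglob("*.csv"))
    jpg_dirs = sorted({p.parent.relative_to(root).as_posix() for p in root.rglob("*.jpg")})
    return {"csv_files": csv_files, "jpg_dirs": jpg_dirs}


# Usage (placeholder folder name):
# print(summarize_dataset("data/planets-dataset"))
```

The reported csv path and image folder can then be substituted for the `train_v2.csv` and `train-jpg` names used in the notebook.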

@lbergelson

lbergelson commented Feb 19, 2020

This issue confused a whole group of us who are doing the class together. It would be good to update the notebook with some notes about a workaround.

@yiquncode

Get your cookie and use wget to download the big file. More info here: https://forums.fast.ai/t/lesson-3-cant-download-planet-data-images-tar-archive/60309/21?u=yiqun
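A rough sketch of what the wget call could look like when driven from Python. This is an assumption about the approach, not a copy of the forum post: it assumes the browser cookies have been exported to a cookies.txt file, and the URL shown is a placeholder, since the real download link has to be copied from the Kaggle page while logged in. `--load-cookies` and `-O` are standard wget options; `wget_with_cookies` is a made-up helper name.

```python
from typing import List


def wget_with_cookies(url: str, cookies_file: str, output: str) -> List[str]:
    """Build a wget argument list that sends cookies from an exported cookies.txt file."""
    return ["wget", "--load-cookies", str(cookies_file), "-O", str(output), url]


# Placeholder URL: replace with the actual download link from the Kaggle data page.
cmd = wget_with_cookies("https://example.com/path/to/train-jpg.tar",
                        "cookies.txt", "train-jpg.tar")
print(" ".join(cmd))

# To actually run it (requires wget to be installed):
# import subprocess
# subprocess.run(cmd, check=True)
```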

@brettmoreton

You can get what I believe are the same files from nikitarom/planets-dataset using:

THANK YOU!
