Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[KED-2131] Incomplete documentation about the Spaceflights tutorial #527

Closed
falcaopetri opened this issue Sep 27, 2020 · 6 comments
Closed
Labels
Issue: Bug Report 🐞 Bug that needs to be fixed

Comments

@falcaopetri
Copy link

Description

The docs regarding the Spaceflights tutorial are incomplete, which makes it harder to successfully finish it.

This has been partially discussed in kedro-org/kedro-community#58 (including the issue reproducibility). Therefore, I will discuss it in a complementary way.

Context

There is apparently an ongoing internal issue about improving the organization and sync between the repos kedro-examples, kedro-training, and kedro-starter-spaceflights (kedro-org/kedro-training#1).

As I understood, spaceflight full-repo is moving from kedro-examples/kedro-tutorial to kedro-training/kedro/exercises/spaceflight.

I'm not sure about what is being tracked internally, so I will list what I've found related to Spaceflights' requirements.txt:

Actionable

  1. kedro-examples/kedro-tutorial's requirements.txt should be updated to contain kedro[pandas.CSVDataSet,pandas.ExcelDataSet].
  2. Docs should be improved to show that kedro[pandas.CSVDataSet,pandas.ExcelDataSet] is required. Set up the spaceflights project#Install project dependencies is probably the right place.
  3. This also applies to kedro-training docs at Create a new project#kedro install.
  4. Latest and stable Kedro docs points to kedro-examples as the full source to the spaceflights project. I'm not sure, but I guess that this will/should be eventually changed to kedro-training/kedro-exercises/spaceflight.

I could work on 1., 2., and 3. if it makes sense (note that they are at 3 different repos).

PS: Sorry for the cross-repos references everywhere. I considered that here was the best place to report it.

@falcaopetri falcaopetri added the Issue: Bug Report 🐞 Bug that needs to be fixed label Sep 27, 2020
@921kiyo 921kiyo changed the title Incomplete documentation about the Spaceflights tutorial [KED-2131] Incomplete documentation about the Spaceflights tutorial Sep 28, 2020
@921kiyo
Copy link
Contributor

921kiyo commented Sep 28, 2020

Thank you for reporting it! We will address your feedback in the docs and spaceflight example code.

@falcaopetri
Copy link
Author

Thanks for addressing it, @921kiyo.

If there is still time, I'd like to add three more notes regarding the docs and kedro-training:

shuttles:
  type: kedro_tutorial.io.xls_local.ExcelLocalDataSet
  filepath: data/01_raw/shuttles.xlsx
  layer: raw

@laisbsc
Copy link
Contributor

laisbsc commented Sep 30, 2020

Hey there @falcaopetri.

Thanks for reporting this. It is not too late at all!

I will add your additional comments to our ticket. In the meantime, if you'd like to fix the issues, feel free to make a PR and work on it 😄 We truly appreciate it!

Chat soon.

@guludo
Copy link
Contributor

guludo commented Nov 27, 2020

From @falcaopetri's comment:

Doc's Data science pipeline#Update dependencies adds the scikit-learn dependency to src/requirements.txt and then runs kedro install. Shouldn't it be src/requirements.in + kedro build-deps && kedro install instead?

I have the same question. Adding it to src/requirements.in makes more sense to me too.

@lorenabalan
Copy link
Contributor

@guludo Apologies, just noticed this has been merged into develop, which is not visible on ReadTheDocs. Indeed it should be in src/requirements.in, the next version will have the correct docs.

@lorenabalan
Copy link
Contributor

Closing this as resolved through linked PRs/issues, as well as 589d6a7 and 0fd6b62 . Please feel free to open a new issue if there are other observations!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Issue: Bug Report 🐞 Bug that needs to be fixed
Projects
None yet
Development

No branches or pull requests

6 participants