#385 improve documentation on chsnks and chunk size parameter #423

earthgecko · 2018-08-30T06:39:30Z

Updated docs with with more info on chunksize
Updated chunksize docstrings with more info on chunksize

Modified:
docs/text/parallelization.rst
tsfresh/convenience/relevant_extraction.py
tsfresh/feature_extraction/extraction.py
tsfresh/feature_selection/relevance.py
tsfresh/feature_selection/selection.py
tsfresh/transformers/feature_augmenter.py
tsfresh/transformers/relevant_feature_augmenter.py

- Updated docs with with more info on chunksize - Updated chunksize docstrings with more info on chunksize Modified: docs/text/parallelization.rst tsfresh/convenience/relevant_extraction.py tsfresh/feature_extraction/extraction.py tsfresh/feature_selection/relevance.py tsfresh/feature_selection/selection.py tsfresh/transformers/feature_augmenter.py tsfresh/transformers/relevant_feature_augmenter.py

coveralls · 2018-08-30T06:47:35Z

Coverage remained the same at 97.444% when pulling 8ab8d12 on earthgecko:improve_chunk_size_docs into abb3237 on blue-yonder:master.

Added a removed blank line that was introduced from some other testing Modified: tsfresh/feature_extraction/extraction.py

MaxBenChrist · 2018-09-03T07:07:22Z

Good idea! Can you change that description a little bit? I would write

:class:`multiprocessing.Pool` is parallelisation parameter. One data chunk is defined as a singular time series for one id and one kind. The chunksize is the number of chunks that are submitted as one task to one worker process.  
If you set the chunksize to 10, then it means that one worker task corresponds to calculate all features for 10 id/kind time series combinations.  
If it is set it to None, depending on distributor, heuristics are used to find the optimal chunksize.
The chunksize can have an crucial influence on the optimal cluster performance and should be optimised in benchmarks for the problem at hand.

earthgecko · 2018-09-03T07:34:16Z

@MaxBenChrist modified as requested, all done,

MaxBenChrist · 2018-09-03T08:23:17Z

Thx!

blue-yonder#385 improve documentation on chsnks and chunk size parameter

c10556c

Added a removed blank line that was introduced from some other testing Modified: tsfresh/feature_extraction/extraction.py

Modifiy chunksize documentation

8ab8d12

MaxBenChrist merged commit 925dd64 into blue-yonder:master Sep 3, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#385 improve documentation on chsnks and chunk size parameter #423

#385 improve documentation on chsnks and chunk size parameter #423

earthgecko commented Aug 30, 2018

coveralls commented Aug 30, 2018 •

edited

Loading

MaxBenChrist commented Sep 3, 2018 •

edited

Loading

earthgecko commented Sep 3, 2018

MaxBenChrist commented Sep 3, 2018

#385 improve documentation on chsnks and chunk size parameter #423

#385 improve documentation on chsnks and chunk size parameter #423

Conversation

earthgecko commented Aug 30, 2018

coveralls commented Aug 30, 2018 • edited Loading

MaxBenChrist commented Sep 3, 2018 • edited Loading

earthgecko commented Sep 3, 2018

MaxBenChrist commented Sep 3, 2018

coveralls commented Aug 30, 2018 •

edited

Loading

MaxBenChrist commented Sep 3, 2018 •

edited

Loading