
[MetaSchedule] Restore num_threads parameter in tuning API #13561

Merged (11 commits) on Dec 9, 2022

Conversation

@masahi (Member) commented Dec 6, 2022

The num_threads parameter in the Relay tuning API was (accidentally?) removed in #12895. This PR restores the parameter and also uses it consistently for XGB model training and for the builder / runner.


@tvm-bot (Collaborator) commented Dec 6, 2022

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@junrushao (Member) commented:

I am not sure it should be a top-level parameter, as it's less frequently used by introductory-level users, but I don't have a strong opinion.

In the meantime, I'm not sure whether using the same num_threads for XGB and for the evolutionary search is a good idea, because I assume XGB prefers physical cores? I don't have numbers handy, so I'm not 100% sure.

I removed this because @spectrometerHBH @Hzfengsy @jinhongyii @MasterJH5574 suggested it, so I'd love to hear more from you all.

@spectrometerHBH (Contributor) commented Dec 6, 2022

I would suggest using a better name, because num_threads can be confusing, especially when the TIR programs being tuned use parallelization themselves.

@masahi (Member, Author) commented Dec 6, 2022

The main use case for this param is tuning on a high-core system shared by many users. Currently, if one user starts tuning, it occupies all CPU resources, which disrupts other users. So the goal is to limit the number of cores used by MS throughout the tuning process (evolutionary search, post-order apply, XGB training, builder / runner).
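The capping idea can be sketched in plain Python (`run_capped` and the `num_tuning_cores` keyword here are illustrative names, not TVM's actual implementation): a bounded worker pool keeps the whole pipeline from fanning out across every core of a shared machine.

```python
from concurrent.futures import ThreadPoolExecutor


def run_capped(fn, tasks, num_tuning_cores):
    # Bound the worker pool so tuning does not occupy every core on
    # a shared machine. (TVM's builder/runner actually use worker
    # *processes*; a thread pool keeps this sketch self-contained.)
    with ThreadPoolExecutor(max_workers=num_tuning_cores) as pool:
        return list(pool.map(fn, tasks))


# e.g. cap this stage of the pipeline at 4 workers
results = run_capped(lambda x: x * x, range(8), num_tuning_cores=4)
```

The same `max_workers`-style bound would be applied uniformly to evolutionary search, XGB training, and the builder / runner, which is what this PR's consistency goal amounts to.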

Also, the tune_tir API has a num_threads param as well:

`num_threads: Union[Literal["physical", "logical"], int] = "physical"`
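Somewhere, that `"physical"` / `"logical"` literal has to be resolved to a concrete integer. A minimal sketch of such a resolution (a hypothetical helper, not the actual TuneContext internals; note that `os.cpu_count()` reports logical CPUs, and portable physical-core detection would need an extra dependency such as psutil):

```python
import os
from typing import Union


def resolve_num_threads(spec: Union[str, int]) -> int:
    """Turn a "physical"/"logical" literal or an explicit int into a count."""
    if isinstance(spec, int):
        # An explicit integer wins.
        return spec
    if spec == "logical":
        # os.cpu_count() reports logical (SMT) CPUs.
        return os.cpu_count() or 1
    if spec == "physical":
        # Portable physical-core detection needs an extra dependency
        # (e.g. psutil); fall back to the logical count here.
        return os.cpu_count() or 1
    raise ValueError(f"unknown thread spec: {spec!r}")
```

This also illustrates why `"physical"` vs. `"logical"` matters for the XGB question above: the two literals can resolve to different counts on SMT machines.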

> I would suggest using a better name because num_threads can be confusing

Agreed, but that's what TuneContext calls it... I can replace num_threads in the high-level API with max_workers or something, and initialize TuneContext with num_threads=max_workers. If people think this is better I can do that; otherwise I'd keep the existing convention.

@zxybazh (Member) left a comment:

I'm kind of in favor of this change because it allows more customization. We may want to find a good name and default value for num_threads, though.

@junrushao (Member) commented:

I don't have a clear idea (I'm very bad at naming). Perhaps @tqchen could suggest one?

@masahi (Member, Author) commented Dec 7, 2022

I'm going to go with num_tuning_cores, per the discussion with @zxybazh, if there is no objection. For now I'll keep num_threads in TuneContext.

@junrushao (Member) commented:

The reason I don't like "cores" is that these are threads, which are not necessarily tied to physical CPU cores. Perhaps num_threads makes more sense at the moment.

@zxybazh (Member) commented Dec 7, 2022

Just want to add that the Runner & Builder spawn new processes; that's why we may want to consider other terminology. On the other hand, if we don't have a very good idea for a new name, I don't mind sticking with the current name num_threads until then.

@masahi (Member, Author) commented Dec 7, 2022

I think "cores" is better because we also use this parameter to set the number of workers used by the builder / runner. Also, when a user wants to limit the amount of CPU resources used by MS, they think in terms of the number of "cores", not "threads". So as an API, "core" sounds more intuitive to me.

@junrushao (Member) commented:

The argument that builder / runner workers use processes rather than threads makes sense to me. Thanks for the explanation! Then I'm happy with the "core" terminology.

@masahi (Member, Author) commented Dec 8, 2022

Replaced num_threads with num_tuning_cores in the API.

@zxybazh (Member) left a comment:

LGTM

@junrushao (Member) left a comment:

LGTM

@masahi masahi merged commit 3b001ef into apache:main Dec 9, 2022
fzi-peccia pushed a commit to fzi-peccia/tvm that referenced this pull request Mar 27, 2023
…13561)

* [MetaSchedule] Restore num_threads argument in tune_relay

* pass num_threads to XGBModel

* fix default

* pass num_threads as max_workers to Builder and Runner

* add test

* clean up

* fix kwarg

* num_threads -> num_tuning_cores

* typo

* num_threads -> num_tuning_cores in contrib/torch

* typo in document
mikeseven pushed a commit to mikeseven/tvm that referenced this pull request Sep 27, 2023