Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add data limitation #255

Merged
merged 9 commits into from
Nov 2, 2023
Merged

Add data limitation #255

merged 9 commits into from
Nov 2, 2023

Conversation

gkumbhat
Copy link
Collaborator

@gkumbhat gkumbhat commented Nov 1, 2023

Description

  • Add training data limit which should be configurable by module and model

caikit_nlp/config/config.yml Show resolved Hide resolved
if max_num_examples > 0:
error.value_check(
"<NLP77627434E>",
len(train_stream) <= max_num_examples,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We'll probably need to make sure a release with caikit/caikit#553 is made and bumped here to make sure we don't run into the same len problem as #223?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yep, thats definitely required

max_num_examples = get_config().training_data_limit.get(cls.MODULE_ID, {}).get(base_model_name, -1)

if max_num_examples > 0:
error.value_check(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe good to have a tiny unit test case with temp config? :)

Copy link
Collaborator

@evaline-ju evaline-ju left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM but we should probably not merge until caikit bump

@gkumbhat gkumbhat mentioned this pull request Nov 1, 2023
@@ -14,7 +14,7 @@ classifiers=[
"License :: OSI Approved :: Apache Software License"
]
dependencies = [
"caikit[runtime-grpc,runtime-http]>=0.22.0,<0.23.0",
"caikit[runtime-grpc,runtime-http]>=0.23.2,<0.25.0",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suppose chances can be taken that 0.24 won't be breaking for this library

@gkumbhat gkumbhat merged commit e512728 into caikit:main Nov 2, 2023
4 checks passed
@gkumbhat gkumbhat deleted the add_data_limitation branch November 2, 2023 16:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants