Instantiate model from automodel #601
Conversation
…eTrainedModel instance
@@ -60,8 +60,8 @@ def __init__(
                trust_remote_code=trust_remote_code,
            )

-        else:
+        elif isinstance(pretrained, str):
Something I realized last minute: this is a bit cleaner than using an assertion. The else block now raises a TypeError, which is more descriptive.
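The pattern described here can be sketched in isolation. This is a simplified illustration of the diff above, not the PR's actual code: the class and return values are stand-ins, since transformers is not imported here.

```python
# Sketch of the pattern discussed above: prefer an explicit TypeError
# over an assertion when the argument type is unsupported.
# FakePretrainedModel is a stand-in for transformers.PreTrainedModel.

class FakePretrainedModel:
    """Illustrative stand-in, not the real transformers class."""


def resolve_pretrained(pretrained):
    if isinstance(pretrained, FakePretrainedModel):
        # Already-instantiated model: use it directly.
        return pretrained
    elif isinstance(pretrained, str):
        # The real code would call AutoModel.from_pretrained(pretrained) here.
        return f"loaded:{pretrained}"
    else:
        # More descriptive than `assert`, and not stripped under `python -O`.
        raise TypeError(
            f"pretrained must be a str or a PreTrainedModel, "
            f"got {type(pretrained).__name__}"
        )
```

An assertion would only report `AssertionError`; the TypeError names the expected types and the type actually received.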
lm_eval/models/gpt2.py
Outdated
            198,
            198,
            31373,
        ], self.tokenizer.encode("hello\n\nhello")
This tokenizer assert is no longer required! It's a holdover from earlier commits, where this model type was assumed to be gpt2 if using a GPT2Tokenizer type.
        # Initialize model
        if isinstance(pretrained, transformers.PreTrainedModel):
Minor nit: it'd be nice if we could confirm this is of type AutoModelForCausalLM or a related subclass, since this LM subclass assumes a causal, decoder-only model type.
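One wrinkle with this suggestion: AutoModelForCausalLM is a factory, so an isinstance check against it won't match the concrete class that from_pretrained returns. A hedged sketch of an alternative is to check membership against a collection of accepted causal-LM classes. With transformers installed, that collection could plausibly be built from MODEL_FOR_CAUSAL_LM_MAPPING, but the helper below takes it as a parameter so the sketch stays self-contained; all names here are illustrative, not the PR's code.

```python
def ensure_causal_lm(model, causal_classes):
    """Raise TypeError unless `model` is an instance of an accepted class.

    `causal_classes` is any collection of accepted model classes. In a real
    setup it might (an assumption, not something this PR does) be built from
    transformers.MODEL_FOR_CAUSAL_LM_MAPPING.values().
    """
    if not isinstance(model, tuple(causal_classes)):
        raise TypeError(
            f"expected a causal decoder-only model, got {type(model).__name__}"
        )
    return model


# Illustrative stand-ins for concrete model classes:
class DummyCausalLM:
    pass


class DummySeq2SeqLM:
    pass
```

Checking by class membership avoids relying on class-name conventions, which would break for models like GPT2LMHeadModel whose name does not end in "ForCausalLM".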
@@ -72,6 +76,11 @@ def simple_evaluate(
        lm = lm_eval.models.get_model(model).create_from_arg_string(
            model_args, {"batch_size": batch_size, "max_batch_size": max_batch_size, "device": device}
        )
+    elif isinstance(model, transformers.PreTrainedModel):
+        lm = HFLM(
+            pretrained=model,
We also want to pass batch_size=batch_size to this, I believe. Agreed that we should assume the user has already placed their model onto the correct device, though!
Preferably, use lm = lm_eval.models.get_model("hf-causal") here too, instead of instantiating HFLM directly.
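Taken together, the two review comments above suggest a dispatch along these lines. This is a pure-Python sketch with stand-in classes, since neither lm_eval nor transformers is imported here; the real get_model / HFLM APIs are mimicked, not reproduced.

```python
class StubPreTrainedModel:
    """Stand-in for transformers.PreTrainedModel."""


class StubHFLM:
    """Stand-in for lm_eval's HF model wrapper."""

    def __init__(self, pretrained, batch_size=1):
        self.pretrained = pretrained
        # Per the review: forward batch_size when wrapping a passed-in model.
        self.batch_size = batch_size


def build_lm(model, batch_size=1):
    if isinstance(model, str):
        # The real code resolves the registered model class via
        # lm_eval.models.get_model(model).create_from_arg_string(...).
        return StubHFLM(pretrained=model, batch_size=batch_size)
    elif isinstance(model, StubPreTrainedModel):
        # Per the review: pass batch_size along, but assume the user has
        # already placed the model on the correct device.
        return StubHFLM(pretrained=model, batch_size=batch_size)
    else:
        raise TypeError(f"unsupported model argument: {type(model).__name__}")
```

The string branch and the model-instance branch end up building the same wrapper; the reviewers' point is that routing both through the model registry ("hf-causal") keeps that symmetry in one place.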
Thanks so much for this PR, and I apologize for the slow review! It looks great, but I left a couple of minor nitpicks. Happy to return to these later today to fix and merge. EDIT: Testing this now.
PRed changes to your branch here: svenhendrikx#1. Once these are merged, LGTM!
Fixes for passing AutoModel
Thanks again for the contribution!!
…-from-Automodel Instantiate model from automodel
This pull request implements #521, adding functionality that allows users to run lm-eval tasks directly on transformers.PreTrainedModel instances using simple_evaluate. It contains three commits:

1. Add logic to the HFLM class such that a transformers.PreTrainedModel instance can be passed as the pretrained argument, as @haileyschoelkopf suggested here: Add a way to instantiate from HF.AutoModel #521
2. Add an __init__.py file to the bigbench_resources directory, so that it is included in the build. Without it, you get an error when trying to import the tasks after installing the package via pip and the GitHub link.
3. Add logic to simple_evaluate so that you can pass it a transformers.PreTrainedModel instance as well. I chose to directly instantiate the object, whereas when a string is passed, the models.get_model function is used. Directly instantiating it seemed like the simplest solution, but the functionality could also be added to the get_model function. Let me know what you think.

This is my first contribution to lm-eval, so feel free to share tips.