
[Usage] The deterministic mode is not set in the eval_model() function #1013

Open
y-vectorfield opened this issue Jan 26, 2024 · 1 comment

@y-vectorfield

Describe the issue

Issue: The internal parameters for deterministic mode are not set in the eval_model() function.

According to the example code for eval_model(), the temperature parameter is set to 0:

# Imports as used in the LLaVA README for this example
from llava.mm_utils import get_model_name_from_path
from llava.eval.run_llava import eval_model

model_path = "liuhaotian/llava-v1.5-7b"
prompt = "What are the things I should be cautious about when I visit here?"
image_file = "https://llava-vl.github.io/static/images/view.jpg"

args = type('Args', (), {
    "model_path": model_path,
    "model_base": None,
    "model_name": get_model_name_from_path(model_path),
    "query": prompt,
    "conv_mode": None,
    "image_file": image_file,
    "sep": ",",
    "temperature": 0,
    "top_p": None,
    "num_beams": 1,
    "max_new_tokens": 512
})()

eval_model(args)

I think that if we set this parameter to 0, we should also explicitly set the following additional parameters.

torch.use_deterministic_algorithms(True)
torch.backends.cudnn.deterministic = True
torch.backends.cudnn.benchmark = False

Hence, we should add the following conditional logic to the eval_model() function.

if args.temperature == 0:
    torch.use_deterministic_algorithms(True)
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
else:
    torch.use_deterministic_algorithms(False)
    torch.backends.cudnn.deterministic = False
    torch.backends.cudnn.benchmark = True
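
For context, here is a minimal sketch of how this could be wrapped in a helper. The set_deterministic name and the seeding are my own additions, not part of the repo. Note that torch.use_deterministic_algorithms() is a function call, and enforcing deterministic algorithms on CUDA also requires the CUBLAS_WORKSPACE_CONFIG environment variable to be set.

import os
import random

import numpy as np
import torch


def set_deterministic(enabled: bool, seed: int = 0) -> None:
    # Hypothetical helper: toggle PyTorch deterministic behaviour in one place.
    if enabled:
        # Some CUDA ops require this env var when deterministic algorithms are enforced.
        os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":4096:8")
        # Seed all RNGs so repeated runs start from the same state.
        random.seed(seed)
        np.random.seed(seed)
        torch.manual_seed(seed)
        torch.cuda.manual_seed_all(seed)
    torch.use_deterministic_algorithms(enabled)
    torch.backends.cudnn.deterministic = enabled
    torch.backends.cudnn.benchmark = not enabled


# Usage inside eval_model(), before generation:
# set_deterministic(args.temperature == 0)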
@OliverXUZY

Hi,
Thank you for bringing up this issue. I encountered a similar problem even after explicitly setting torch.backends.cudnn.deterministic and related flags. I've noticed that the discrepancies occur specifically in the CLIP ViT encoders, where the vision embeddings produce different values across separate runs.

When comparing two identical inference processes, I observed that image_forward_out varies despite using the same image input. This occurs at the following line:

image_forward_out = self.vision_tower(image.to(device=self.device, dtype=self.dtype).unsqueeze(0), output_hidden_states=True)

The discrepancy occurs from the second example onward, through the last one.

I'm curious to know if you're still experiencing this issue and if you've found a solution. Any insights would be greatly appreciated. Thank you for your time!
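
For anyone trying to narrow this down, here is a minimal sketch of how the two runs could be compared. The save paths, the hook placement, and the hidden_states indexing are assumptions for illustration, not part of the repository.

import torch

# Run 1: inside the vision tower's forward(), right after the call shown above,
# dump the raw encoder output (hypothetical debugging hook):
#   torch.save(image_forward_out.hidden_states[-1].cpu(), "run1_vision_out.pt")
# Run 2: same hook, saved as "run2_vision_out.pt". Then compare offline:

a = torch.load("run1_vision_out.pt")
b = torch.load("run2_vision_out.pt")
print("max abs diff:", (a - b).abs().max().item())
print("allclose:", torch.allclose(a, b, atol=1e-6))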
