[Dynamo] TB hf_Reformer graph breaks #101154

Open
yanboliang opened this issue May 11, 2023 · 5 comments
Labels
dynamo-triage-june2024 · module: dynamo · module: graph breaks · oncall: pt2 · triaged

Comments

@yanboliang
Contributor

yanboliang commented May 11, 2023

🐛 Describe the bug

Repro:

import torch
import logging
import sys
import torch._dynamo

# torch._logging.set_logs(dynamo=logging.DEBUG, bytecode=True)
torch._dynamo.config.print_graph_breaks = True

import torch.nn as nn
import torch.nn.functional as F

class MyModel(nn.Module):

    def __init__(self):
        super(MyModel, self).__init__()
        self.linear = torch.nn.Linear(5, 5)
        self.dropout = torch.nn.Dropout()

    def _init_attention_seed(self):
        """
        This function sets a new seed for the attention layer to make dropout deterministic for both forward calls: 1
        normal forward call and 1 forward call in backward to recalculate activations.
        """

        # randomize seeds
        # use cuda generator if available
        if hasattr(torch.cuda, "default_generators") and len(torch.cuda.default_generators) > 0:
            # GPU
            device_idx = torch.cuda.current_device()
            self.attention_seed = torch.cuda.default_generators[device_idx].seed()
        else:
            # CPU
            self.attention_seed = int(torch.seed() % sys.maxsize)

        torch.manual_seed(self.attention_seed)

    def forward(self, x):
        self._init_attention_seed()
        return self.dropout(self.linear(x))

x = torch.randn(5, 5)

m = MyModel()
print(m(x))

opt_m = torch.compile(backend="eager")(m)
print(opt_m(x))

There are several graph breaks:

[2023-05-11 04:12:58,513] torch._dynamo.symbolic_convert: [WARNING] Graph break: hasattr: TorchVariable(<module 'torch.cuda' from '/scratch/ybliang/work/repos/pytorch/torch/cuda/__init__.py'>) from user code at   File "/scratch/ybliang/work/repos/debug/debug3.py", line 39, in forward
    self._init_attention_seed()
  File "/scratch/ybliang/work/repos/debug/debug3.py", line 28, in _init_attention_seed
    if hasattr(torch.cuda, "default_generators") and len(torch.cuda.default_generators) > 0:

[2023-05-11 04:12:58,748] torch._dynamo.symbolic_convert: [WARNING] Graph break: inlining disallowed: <function current_device at 0x7f2ec26d8430> from user code at   File "/scratch/ybliang/work/repos/debug/debug3.py", line 30, in <resume in _init_attention_seed>
    device_idx = torch.cuda.current_device()

[2023-05-11 04:12:58,754] torch._dynamo.symbolic_convert: [WARNING] Graph break: call_method UserDefinedObjectVariable(seed) __call__ [] {} from user code at   File "/scratch/ybliang/work/repos/debug/debug3.py", line 31, in <resume in _init_attention_seed>
    self.attention_seed = torch.cuda.default_generators[device_idx].seed()
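
For reference, the break reasons can also be collected programmatically instead of scraping the warnings, via torch._dynamo.explain. A minimal sketch, assuming the repro above (reusing its m and x); note that the explain() calling convention has changed across PyTorch releases, so the exact invocation below is an assumption for recent versions:

# Appended to the repro above; `m` and `x` are defined there.
# Recent releases call explain as torch._dynamo.explain(fn)(*args).
import torch._dynamo

explanation = torch._dynamo.explain(m)(x)
print(explanation)  # summarizes graph count, graph break count, and break reasons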

Versions

N/A

cc @ezyang @anijain2305 @chauhang @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @soumith @msaroufim @wconstab @ngimel @bdhirsh @Xia-Weiwen @desertfire

@yanboliang yanboliang added the triaged, oncall: pt2, and module: dynamo labels on May 11, 2023
@anijain2305
Contributor

anijain2305 commented Jun 2, 2023

Repro for the 1st error

import torch

@torch.compile(fullgraph=True)
def fn(x):
    if hasattr(torch.cuda, "default_generators"):
        return x
    return x + 1

fn(torch.randn(1))

Repro for the 3rd error

import torch

@torch.compile(fullgraph=True)
def fn(x):
    device_idx = 0
    n_seed = torch.cuda.default_generators[device_idx].seed()
    return x + 1

fn(torch.randn(1))

@yanboliang
Contributor Author

yanboliang commented Jun 2, 2023

1/ Need to add call_hasattr to TorchVariable.
2/ Intended graph break; doesn't need a fix.
3/ AOT Autograd can't handle Generator.seed well; needs more discussion, deferred to after hackday.

@gmagogsfm
Contributor

Issue 1 is addressed; issue 3 is still there. According to tests, there are still 44 graph breaks remaining in this model.
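
One hedged way to reproduce such a count locally is Dynamo's internal counters (torch._dynamo.utils.counters is an undocumented utility, so the "graph_break" key below is an assumption that may change between releases); a sketch, reusing m and x from the original repro:

import torch
import torch._dynamo.utils

# Clear the internal counters, run the compiled model once,
# then tally the recorded break reasons.
torch._dynamo.utils.counters.clear()
opt_m = torch.compile(m, backend="eager")
opt_m(x)
breaks = torch._dynamo.utils.counters["graph_break"]
print(sum(breaks.values()), dict(breaks))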

@anijain2305
Contributor

3/ It seems we should also graph break in this case. If a model calls torch.cuda.default_generators[device_idx].seed(), it probably wants to reset the random state on every invocation, and torch.compile should respect that.
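
For models that want a clean, explicit break around the seed reset, one workaround (a sketch, not part of this issue's fix) is to opt the method out of tracing with torch._dynamo.disable so it always runs eagerly; the restructured MyModel below is only an illustration of that pattern on the CPU path of the repro:

import sys
import torch
import torch._dynamo

class MyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(5, 5)
        self.dropout = torch.nn.Dropout()

    # Opt this method out of tracing: Dynamo breaks the graph around the
    # call and runs it eagerly, so the RNG state really is reset on every
    # invocation, matching the model's intent.
    @torch._dynamo.disable
    def _init_attention_seed(self):
        self.attention_seed = int(torch.seed() % sys.maxsize)
        torch.manual_seed(self.attention_seed)

    def forward(self, x):
        self._init_attention_seed()
        return self.dropout(self.linear(x))

opt_m = torch.compile(MyModel(), backend="eager")
print(opt_m(torch.randn(5, 5)))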

@tugsbayasgalan
Contributor

@yanboliang, @anijain2305 any update on this issue?
