-
Notifications
You must be signed in to change notification settings - Fork 333
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Does not work with TensorFlow versions >= 1.3 #1
Comments
Hi and thanks for the comment. Which TF version are you using? The newest version seems to hang up in the way you describe. TF 1.2.1 should work. I think the issue is in either weight norm or ema model code or maybe their combination. Unfortunately I haven’t had time to hunt it down. PRs welcome if you figure it out. |
I changed to TF 1.2.1. Now it works fine. |
Nice! Thanks for your comment. |
See the discussion in pull request #5. This is related to the data-dependent initialization of weight-normalized layers. Smart variable initialization appears to be broken in TensorFlow since 1.3, and may get fixed in the next release. Here's the currently active bug report in TensorFlow issues: TensorFlow variable initializers broken #13351. As a possible workaround, here's a smarter initialization scheme from OpenAI: smart_initialize.py. |
your python --version? @WeitaoVan |
I can not download the tf1.2.x with python3.x |
I think I used both Python 3.5 and 3.6 in the development of this, and TF 1.2.x. |
On Tensorflow 1.8, I found a workaround: Comment the line 851&852 around in pyfolder/Lib/site-packages/tensorflow/python/ops/variables.py:
|
tf 1.2.0 has a bug using dropout , see here . tf 1.2.1 is ok. |
Tried running svhn.py. Do you know what might be wrong |
Tensorflow 1.x is not available for python 3.8, use python 3.5 and 3.6 instead. See more at issue #39768 of tensorflow |
Thanks for your inspiring idea and the corresponding code.
I try to run the tensorflow code train_cifar10.py.
But it takes more than 2 hours to construct the computational graph and I'm still stuck here.
The screen does not print any thing.
If I replace the CNNs in the original code with a plain ResNet-32 (without Weight Normalization or other tricks), the whole code goes on well.
Do you know what might be wrong?
Thanks.
The text was updated successfully, but these errors were encountered: