Fix unexplained shader issue (glsl compiler bug??) #4757

Rogier-5 · 2016-11-09T10:17:24Z

For some reason, in the original code, the clamp() operations seem
to be ignored on my system (e.g. erroneously optimized away by the
glsl compiler ? driver bug ?)

The changed code, which should be 100% equivalent, does not suffer
from the problem.

It is not known if more than one person is affected, or if this issue
is already fixed in a more recent version of the software (driver,
glsl compiler ... ?)

See also the discussion in #4691

Screenshots without and with this patch:

Note: as so far I'm the only one experiencing this problem, I don't expect
this to be merged soon. However, this PR may be of use if and when other
people report this problem as well.

Fixer-007 · 2016-11-09T12:01:08Z

On my ATI HD6870 it works the same as before, performance also good. 👍

paramat · 2016-11-10T00:41:16Z

client/shaders/nodes_shader/opengl_fragment.glsl

- float d = clamp((fogDistance - length(eyeVec)) / (fogDistance * 0.6), 0.0, 1.0);
+ // On some systems, the order of operations matters here.
+ // See the commit message of the commit that changed this.
+ float d = clamp(1.0 / 0.6 - length(eyeVec) / (fogDistance * 0.6), 0.0, 1.0);
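
For reference, the algebra behind the "100% equivalent" claim can be written out. Here F stands for fogDistance and \ell for length(eyeVec); both symbols are shorthand introduced for this note, and F is assumed to be non-zero.

```latex
\frac{F - \ell}{0.6\,F}
  = \frac{F}{0.6\,F} - \frac{\ell}{0.6\,F}
  = \frac{1}{0.6} - \frac{\ell}{0.6\,F}
```

The two forms agree in exact arithmetic. In floating point they may differ by rounding, which should be far below anything visible in the fog.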


1.0 / 0.6 can be simplified to 1.66667. I think 5 decimal places is enough; that's still accurate for a view range of 1000.

@paramat: Changed to 1.7, which is probably just as good as 1.66667.

The code now reads: 1.7 - 1.7 * length(eyeVec) / fogDistance

Is that OK with you ?

IMO the exact value is not paramount. 1.0 / 0.6 makes the first fog start at 40% of the fog distance, 1.7 makes it start at 41%. Nobody would notice.
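
The 40% and 41% figures can be checked by solving for the distance at which the unclamped value drops to 1, which is where the fog starts to show. As a sketch, with k the constant under discussion, F = fogDistance and \ell = length(eyeVec) (shorthand assumed for this note):

```latex
\begin{aligned}
d = k\left(1 - \frac{\ell}{F}\right) = 1
  \;&\Longleftrightarrow\; \frac{\ell}{F} = 1 - \frac{1}{k} \\
k = \tfrac{1}{0.6}: \quad \frac{\ell}{F} &= 1 - 0.6 = 0.4 \\
k = 1.7: \quad \frac{\ell}{F} &= 1 - \frac{1}{1.7} \approx 0.412
\end{aligned}
```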

I would prefer to keep it fairly precise at 40% even if not visually noticeable.
How about 1.667?

paramat: Does it really matter?

paramat · 2016-11-10T00:42:01Z

It's no more complex so +1 once simplified.

nerzhul · 2016-11-10T13:38:30Z

client/shaders/nodes_shader/opengl_fragment.glsl

- float d = clamp((fogDistance - length(eyeVec)) / (fogDistance * 0.6), 0.0, 1.0);
+ // DO NOT change the way this value is computed. On some systems,
+ // it will cause problems !
+ // e.g.: clamp(1.7 * (1 - length(eyeVec) / fogDistance), 0.0, 1.0) did *not* work !


Maybe the problem is just 1 instead of 1.0; I think this could be the cause. Can you verify?

Rogier-5 · 2016-11-10T14:28:32Z

@nerzhul: I tried 1.0, but that does not make a difference.

Maybe that was caused by something else. Can't reproduce it now.

Rogier-5 · 2016-11-10T15:54:31Z

The following code also works:
float d = clamp(0.01 + 1.7 * (1 - length(eyeVec) / fogDistance), 0.0, 1.0);
while the following code doesn't:
float d = clamp(0.00 + 1.7 * (1 - length(eyeVec) / fogDistance), 0.0, 1.0);

So apparently the last operation must be an addition or subtraction (of a non-zero value), or else the clamp() seems to be ignored.

lhofhansl · 2016-11-10T20:27:28Z

Generally, do we want to support old/buggy glsl compilers? The new formula looks again like magic, and what if we want to do exponential fog?

It seems at some point we'd have to draw a line and say it's not supported.

Rogier-5 · 2016-11-10T20:57:19Z

@lhofhansl: My intention was primarily to document this, and make a patch available in case other people also start to report the issue. As long as it is not known exactly which version(s) of which software suffer from the problem, it's hard to foretell whether that will happen.

If the new formula is too much like magic: the problem is also solved for example by adding 0.001 to your original formula:
float d = clamp(0.001 + (fogDistance - length(eyeVec)) / (fogDistance * 0.6), 0.0, 1.0);
Of course this addition of 0.001 needs a clarifying comment.
If this formula is preferred, I'll update the patch

lhofhansl · 2016-11-10T21:17:12Z

float d = clamp(0.001 + (fogDistance - length(eyeVec)) / (fogDistance * 0.6), 0.0, 1.0); with an explanation is super cool! Maybe 0.0001 is better (not sure about the actual scales)

What does 0.001 - 0.001 + ... do? :)

Rogier-5 · 2016-11-10T21:40:15Z

not sure about the actual scales

?? How do you mean ??

A difference of 0.01 for instance would make a difference of 1 node at a viewing range of about 167. I doubt anyone would notice, much less complain :-)
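
The "1 node at about 167" estimate can be reproduced from the formula with an added offset. This is a sketch that assumes the viewing range corresponds to fogDistance; \varepsilon is the offset, F = fogDistance and \ell = length(eyeVec) are shorthand introduced here.

```latex
\begin{aligned}
\varepsilon + \frac{F - \ell}{0.6\,F} = 1
  \;&\Longrightarrow\; \ell = F\,(0.4 + 0.6\,\varepsilon) \\
\Delta\ell &= 0.6\,\varepsilon\,F \\
\varepsilon = 0.01,\ \Delta\ell = 1
  \;&\Longrightarrow\; F = \frac{1}{0.006} \approx 166.7
\end{aligned}
```

By the same relation, an offset of 0.001 would shift the fog start by one node only at a range of roughly 1667.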

Rogier-5 · 2016-11-10T21:59:25Z

Pushed a new version with an updated formula.

@lhofhansl That doesn't work. I assume the compiler is smart enough to optimize that away - even though it (assuming it's the compiler's fault) is not smart enough to compute the formula correctly.

lhofhansl · 2016-11-10T22:30:38Z

👍

paramat · 2016-11-11T00:14:13Z

Would this be better since it's simpler, more precise and avoids a magic 0.001?
float d = clamp(1.667 - 1.667 * length(eyeVec) / fogDistance, 0.0, 1.0);

lhofhansl · 2016-11-11T02:24:57Z

@paramat I think it would just be magic numbers and a magic formula again. As long as the comment explains it, it doesn't matter, I guess.

paramat · 2016-11-11T21:40:40Z

I request float d = clamp(1.667 - 1.667 * length(eyeVec) / fogDistance, 0.0, 1.0); since it's equivalent, simpler and faster. +1 with that.

Rogier-5 · 2016-11-11T23:22:08Z

@lhofhansl @paramat How about the following formula, which replaces the magic constants with a symbolic constant?

const float fogStart = 0.4;    // fraction of fogDistance at which the fog starts to appear
[...]
float clarity = clamp(1 / (1 - fogStart) + 1 / (1 - fogStart) * (fogDistance - length(eyeVec)) / fogDistance, 0.0, 1.0);

paramat · 2016-11-11T23:50:56Z

Nope because that's more complex and therefore slower.
My request has no unnecessary magic number at all.

lhofhansl · 2016-11-12T06:43:17Z

Any formula is good as long as we put the "canonical" formula (fogMax - distance)/(fogMax - fogMin) in a comment somewhere. 1.667 is magic, unless there's a comment about how we got to that.
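
As a sketch of how 1.667 falls out of the canonical range-fog formula: assume fogMin is a fixed fraction s of fogMax (s = 0.4 in this thread, fogMax = fogDistance), and write \ell for the distance. The symbol names are shorthand chosen for this note.

```latex
\begin{aligned}
\frac{F_{\max} - \ell}{F_{\max} - F_{\min}}
  &= \frac{F_{\max} - \ell}{F_{\max}\,(1 - s)}
   = \frac{1}{1 - s}\left(1 - \frac{\ell}{F_{\max}}\right) \\
s = 0.4: \quad \frac{1}{1 - s} &= \frac{1}{0.6} \approx 1.667
\end{aligned}
```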

paramat · 2016-11-13T03:44:00Z

Yes that comment is needed.

Rogier-5 · 2016-11-13T18:31:21Z

@paramat, @lhofhansl Pushed a new version.

@paramat: I used your formula, and I tried to make it self-documenting, so I made the number a symbolic constant. OK ?

Zeno- · 2016-11-14T07:06:50Z

👍

sofar · 2016-11-15T00:00:40Z

is this relevant, perhaps?

https://www.opengl.org/discussion_boards/showthread.php/176167-ATI-GLSL-compiler-bug

Zeno- · 2016-11-15T09:11:12Z

@sofar it might be relevant yes. I was talking about these difficult to find "issues" the other day in a related PR

nerzhul · 2016-11-16T13:24:42Z

@paramat Can you profile it to prove what you said? The compiler will do those calculations itself.

paramat · 2016-11-16T14:49:46Z

Ok i was asking for too much simplification above.

+const float fogStart = 0.4;
+const float fogShadingParameter = 1 / ( 1 - fogStart);

+       float clarity = clamp(fogShadingParameter
+           + fogShadingParameter * (fogDistance - length(eyeVec)) / fogDistance, 0.0, 1.0);

I don't object to the const floats, but the form of the clarity equation can be simpler.
The original equation is
float d = clamp((fogDistance - length(eyeVec)) / (fogDistance * 0.6), 0.0, 1.0);

fogShadingParameter is 1.0 / 0.6.
So we can have this instead:

+const float fogStart = 0.4;
+const float fogShadingParameter = 1 / ( 1 - fogStart);

+       float clarity = clamp(fogShadingParameter -
+           fogShadingParameter * length(eyeVec) / fogDistance, 0.0, 1.0);

+1 for this. One less term and one less operator.
It's good practice to reduce to the simplest equation. Compared to this, the PR as it is currently is no closer to the standard form fog equation, and is no more readable.
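
The equivalence of the original and the rearranged formula can also be checked numerically. The following is a minimal sketch in Python rather than GLSL, so it runs in double precision on the CPU and says nothing about how a particular GLSL compiler or driver behaves. The function names and the sample distances are made up for this illustration; only the two formulas and the 0.4 constant come from the thread.

```python
# Stand-in for the shader arithmetic; names below are hypothetical.
FOG_START = 0.4  # fraction of fogDistance at which fog starts (from the thread)
FOG_SHADING_PARAMETER = 1.0 / (1.0 - FOG_START)


def clamp(x, lo, hi):
    """Mirror GLSL clamp(): min(max(x, lo), hi)."""
    return min(max(x, lo), hi)


def clarity_original(dist, fog_distance):
    """Original formula quoted in the thread."""
    return clamp((fog_distance - dist) / (fog_distance * 0.6), 0.0, 1.0)


def clarity_rearranged(dist, fog_distance):
    """Rearranged formula: the last operation before clamp() is a subtraction."""
    return clamp(
        FOG_SHADING_PARAMETER - FOG_SHADING_PARAMETER * dist / fog_distance,
        0.0,
        1.0,
    )


if __name__ == "__main__":
    fog_distance = 100.0  # arbitrary sample value
    for dist in (0.0, 40.0, 70.0, 100.0, 150.0):
        a = clarity_original(dist, fog_distance)
        b = clarity_rearranged(dist, fog_distance)
        print(f"dist={dist:6.1f}  original={a:.6f}  rearranged={b:.6f}")
```

Both functions should agree up to rounding for any positive fog distance, which is all the rearrangement is meant to preserve.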

paramat · 2016-11-16T15:06:18Z

Zeno this is not trivial and has only one approval, you almost impulsively merged against the rules again :) [EDIT i was wrong, see below].
As you can see your complaint was justified and i have changed my request, so waiting and discussing it instead of merging it worked out well.

Zeno- · 2016-11-16T15:14:35Z

Zeno this is not trivial and has only one approval, you almost impulsively merged against the rules again :)

I did not almost impulsively do anything at all! Do not attribute actions to me that never even existed, please.

<Zeno`> Rogier-5's change (using a constant) is the correct way
<Zeno`> and I disagree with paramat. Premature optimisation is the root of all evil
<Zeno`> Look, I'm the first person to push optimisations, I think you all know that
<Zeno`> but this is silly
<Zeno`> the bugfix is the more important of the two issues!
<Zeno`> by far
<Zeno`> there is no evidence that it even needs optimisation. I suggest it be merged
<nore> +1

You do NOT block a PR because you want someone to use a literal constant instead of a symbolic constant without a very good reason, and proof, to explain.

paramat · 2016-11-16T15:25:24Z

<Zeno> I'll merge in 15 minutes then if there are no objections
Oh but nore +1 just before, ok.
However i was in discussion and the author had not responded for a while, the discussion was ongoing, so it was a little irritating.

Zeno- · 2016-11-16T15:28:20Z

<Zeno> I'll merge in 15 minutes then if there are no objections

Yes, because at that time there were two approvals. My approval, and nore's approval.

When sfan5 raised an objection I said I would not merge. How did I act inappropriately?

paramat · 2016-11-16T15:29:26Z

My mistake, see my edited comment above.

Zeno- · 2016-11-16T15:30:58Z

Please don't update previous comments in cases like this... it confuses things.

paramat · 2016-11-16T15:34:46Z

You do NOT block a PR because you want someone to use a literal constant instead of a symbolic constant without a very good reason, and proof, to explain.

Yes that's fair, i changed my request and gave reasons in my long comment above.

I didn't see nore's +1 in IRC chat at first.

paramat · 2016-11-16T15:39:46Z

Sorry i missed that.

paramat · 2016-11-16T15:48:18Z

Often I will edit a comment I have just posted. If someone then comments while I'm editing, it can look like I edited after that person's comment. That happened here once.

For some reason, in the original code, the clamp() operations seem to be ignored in at least one graphics stack (e.g. erroneously optimized away by the glsl compiler ? or a driver bug ?)

It is not known if many people are affected, or if this issue is already fixed in a more recent version of the software (driver, glsl compiler ... ?)

The problem seems not to happen if the last mathematical operation before the clamp() is an addition (instead of a multiplication).

See also the discussion in minetest#4691 and minetest#4757

Note that the new code is probably more efficient as well.

Rogier-5 · 2016-11-16T16:46:33Z

@paramat: I implemented and tested your formula and it works the way it should.

I did some reading about efficient graphics processor programming, and I even think this patch is also a performance improvement, as additions are essentially zero-cost after performing a multiplication.

I pushed a new version.

paramat · 2016-11-16T16:54:22Z

Thanks 👍

Zeno- · 2016-11-16T18:24:13Z

Paramat, I do expect a proper apology by the way. I know it's petty, but I did nothing wrong. At all.

paramat · 2016-11-16T18:43:37Z

I did apologise above, but i do feel bad about not seeing that +1.

I think I felt a little annoyed because 2 devs (nore too, not just yourself) were pushing for a merge when the discussion was ongoing, when I had an objection and when the author had not responded for a while. I would have preferred that someone had commented and taken issue with my objection. As you can see, in the end I realised I was being unreasonable and I changed my request.

Essentially if there's controversy and ongoing discussion it's often best (and advised in MT dev) to wait and not rush a merge.
It also seemed to be similar to yesterday, disagreement with my objection and a rush to merge without discussion. It became more annoying because of yesterday.
So i do think you (and nore) made a minor mistake too, but only very minor.

lhofhansl · 2016-11-16T23:40:50Z

As an outsider here, let me just note that everybody wants to do the right thing here and there was simply a communication problem. (I would have preferred the common range-fog formula be mentioned in the comment, but that's a nit). So let me thank all of you for keeping MT alive and kicking and being passionate about doing so.

sfan5 added the @ Client / Audiovisuals, Trivial, and Bugfix 🐛 labels Nov 9, 2016

sfan5 added this to the 0.4.15 milestone Nov 9, 2016

paramat reviewed Nov 10, 2016

Rogier-5 force-pushed the shader-issue branch from 37eec47 to 7b0f186 Compare November 10, 2016 09:40

nerzhul reviewed Nov 10, 2016

Rogier-5 force-pushed the shader-issue branch from 7b0f186 to 44e6b20 Compare November 10, 2016 21:53

paramat removed the Trivial label Nov 11, 2016

Rogier-5 force-pushed the shader-issue branch from 44e6b20 to a5145b3 Compare November 13, 2016 18:23

Rogier-5 force-pushed the shader-issue branch from a5145b3 to bec9428 Compare November 13, 2016 18:32

Zeno- added the One approval ✅ ◻️ label Nov 14, 2016

Rogier-5 force-pushed the shader-issue branch from bec9428 to 8617084 Compare November 16, 2016 16:34

Rogier-5 force-pushed the shader-issue branch from 8617084 to d418069 Compare November 16, 2016 16:39

paramat added >= Two approvals ✅ ✅ and removed One approval ✅ ◻️ labels Nov 16, 2016

Zeno- merged commit 5f0dc8e into minetest:master Nov 16, 2016

Rogier-5 deleted the shader-issue branch November 16, 2016 16:57
