Fix a bug in the LLVM memory model (with SMT array backed allocations) #603

travitch · 2020-12-23T22:33:15Z

Most of this commit is additional comments that try to clarify some of the
things that came up while diagnosing the bug. The actual fix is in
writeMemWithAllocationCheck.

The observed behavior was that a test case (in another repository) generated
formulas where reads and writes to SMT array-backed allocations did not refer to
the same byte offsets, when they should have. Specifically, the scenario was
that the test case has a single large allocation backed by an SMT array. The
function exhibiting the problem started with a pointer to the middle of that
allocation. Some of the reads and writes were properly in the middle of the
allocation (with a large offset appearing in SMT array select operations),
while others were using inappropriate low offsets (around zero).

The problem was a bug in the optimization in the memory model that coalesces
writes to SMT array backed allocations (which reduces the number of muxes
generated in many cases). The goal of this optimization (which is now
documented) is to turn sequential updates to an SMT array backed allocation into
a single entry in the write log by pretending that the small write actually
overwrites the entire SMT array with new values (which are the original contents
of the SMT array plus the new values just written, expressed as an SMT array
update).

To do this correctly, the updated array must overwrite the entire
allocation (i.e., starting at offset 0 within the block). The bug was that the
array was being written at the offset in the pointer to the start of the new
write. In the case outlined above, this meant that the new array was spliced
over the old one halfway through the allocation. This had a few consequences:

The offsets of reads no longer matched the offsets of their corresponding
writes (due to the intricacies of how read offsets are calculated in SMT array
allocations)
The expected optimized path for array reads was no longer taken, as writes
looked like they spanned two different array allocations
The ultimate SMT queries actually had no update operations in them at all
because the reads looked independent of all of the writes (due to the
misalignment)

Most of this commit is additional comments that try to clarify some of the things that came up while diagnosing the bug. The actual fix is in `writeMemWithAllocationCheck`. The observed behavior was that a test case (in another repository) generated formulas where reads and writes to SMT array-backed allocations did not refer to the same byte offsets, when they should have. Specifically, the scenario was that the test case has a single large allocation backed by an SMT array. The function exhibiting the problem started with a pointer to the middle of that allocation. Some of the reads and writes were properly in the middle of the allocation (with a large offset appearing in SMT array `select` operations), while others were using inappropriate low offsets (around zero). The problem was a bug in the optimization in the memory model that coalesces writes to SMT array backed allocations (which reduces the number of muxes generated in many cases). The goal of this optimization (which is now documented) is to turn sequential updates to an SMT array backed allocation into a single entry in the write log by pretending that the small write actually overwrites the entire SMT array with new values (which are the original contents of the SMT array plus the new values just written, expressed as an SMT array `update`). To do this correctly, the updated array must overwrite the entire allocation (i.e., starting at offset 0 within the block). *The bug was that the array was being written at the offset in the pointer to the start of the new write.* In the case outlined above, this meant that the new array was spliced over the old one halfway through the allocation. This had a few consequences: - The offsets of reads no longer matched the offsets of their corresponding writes (due to the intricacies of how read offsets are calculated in SMT array allocations) - The expected optimized path for array reads was no longer taken, as writes looked like they spanned two different array allocations - The ultimate SMT queries actually had no `update` operations in them at all because the reads looked independent of all of the writes (due to the misalignment)

kquick

The additional documentation really helps, thanks!

robdockins · 2020-12-23T22:48:13Z

Nice catch, that's a subtle bug! How hard would it be to add a test case for this situation?

travitch · 2020-12-23T22:58:14Z

I tried - the previous test I added attempted to trigger this (a simplified version of the real program), but some of the optimizations in the memory model ended up eliminating all of the actual memory options. Andrei said he thinks he can cook up a test case that will tickle it.

andreistefanescu · 2021-02-26T07:28:44Z

@travitch the problem was that asMemAllocationArrayStore did not check that the pointer offset is 0, thus incorrectly identifying an array store at an incorrect pointer as covering the entire allocation (yes, in this case, two wrongs make a right).

travitch requested review from andreistefanescu and kquick December 23, 2020 22:33

kquick approved these changes Dec 23, 2020

View reviewed changes

travitch assigned andreistefanescu Jan 16, 2021

travitch and others added 5 commits February 8, 2021 09:51

Merge remote-tracking branch 'origin' into tr/array-mem-model-fix

d475a05

Merge branch 'master' into tr/array-mem-model-fix

d1e34a4

Check pointer offset is 0 in asMemAllocationArrayStore.

364c875

Merge remote-tracking branch 'origin/master' into tr/array-mem-model-fix

c262a69

Update testMemArray.

a95e35c

andreistefanescu approved these changes Feb 26, 2021

View reviewed changes

andreistefanescu force-pushed the tr/array-mem-model-fix branch from 91f21d5 to a95e35c Compare February 26, 2021 08:17

andreistefanescu merged commit dc5a5ae into master Feb 26, 2021

andreistefanescu deleted the tr/array-mem-model-fix branch February 26, 2021 19:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix a bug in the LLVM memory model (with SMT array backed allocations) #603

Fix a bug in the LLVM memory model (with SMT array backed allocations) #603

travitch commented Dec 23, 2020

kquick left a comment

robdockins commented Dec 23, 2020

travitch commented Dec 23, 2020

andreistefanescu commented Feb 26, 2021

Fix a bug in the LLVM memory model (with SMT array backed allocations) #603

Fix a bug in the LLVM memory model (with SMT array backed allocations) #603

Conversation

travitch commented Dec 23, 2020

kquick left a comment

Choose a reason for hiding this comment

robdockins commented Dec 23, 2020

travitch commented Dec 23, 2020

andreistefanescu commented Feb 26, 2021