Stack Traces: TNG #450

vmg · 2016-03-27T10:30:35Z

This PR is a followup on #448 and updates several tools to use the new stack trace APIs in 4.6+.

As always, details on the changes are on each individual commit message. The man pages have been updated to reflect the changes, and the old versions using the manual stack walker have been saved to tools/old.

TODO

memleak.py
offcputime.py
stackcount.py
stacksnoop.py

Instead of manually walking the (kernel) stack inside the eBPF code, create a `BPF_STACK_TRACE` table and store the stack IDs in the alloc info struct. This greatly simplifies the leak detection code: instead of storing the full stack trace as a string to de-duplicate the allocation point for the different allocations, we can store the `stack_id`. Since we're creating stack IDs with `BPF_F_REUSE_STACKID`, the kernel will take care of deduplication for us. Additionally, the `StackDecoder` class has been specialized into a `UStackDecoder` and `KStackDecoder` (for userland and kernel stacks, respectively). This lets us pass the decode class directly to `stack_traces.walk()` to automate symbol resolution. A new class, `Allocation` has been created to encapsulate what previously was a 2-element tuple of `(count, size)`. This

The manual walking for kernel stacks in the eBPF code has been replaced with a `BPF_STACK_TRACE` table; the stack ID is now stored as an integer in the main key. Thanks to the `StackTrace.walk` iterator, the printing code can also be significantly simplified for both folded and full stack modes.

4ast · 2016-03-27T16:05:47Z

lgtm. should the man page say that kversion<=4.5 can be found in tools/old ? probably not. I guess that's implicit assumption for all tools in old and we don't need to partition it further per kernel version.

The offset is now returned after a `+` symbol, instead of directly attached to the symbol name (which made reading the output very confusing) Before: tick_do_update_jiffies64a0 After: tick_do_update_jiffies64+0xa0

vmg · 2016-03-27T16:46:28Z

@4ast: Thank you! I've updated stacksnoop and stackcount too, which afaict are the two remaining tools that still had a manual stack walker.

@brendangregg: For stacksnoop, I've significantly simplified the output of the tool, given that we no longer log a line for each frame in the trace. The examples and manpage have been updated accordingly. Please let me know if you think the new output is acceptable.

Also, 4.6-rc1 is out! Perfect timing to start trying this stuff out. http:https://lkml.iu.edu/hypermail/linux/kernel/1603.3/01187.html 🎉

Instead of manually walking the stack and printing each frame to the trace log, we can use a BPF_STACK_TRACE table to store the stack traces, and print to the trace log their IDs every time they are traced. The output of the tool has been slightly modified: we no longer prefix each line of the stack trace with the timestamp and the other headers, since the whole stack trace is fetched in one go from the table and the information would be highly redundant. The man page and the examples have been updated to reflect the new output.

The changes for this script are minimal: the inline C probe has been _significantly_ simplified, and should now perform better since the stack walk happens in native code inside the kernel thanks to the BPF_STACK_TRACE table. The output of the tool should be virtually identical, so the man page and included examples have essentially no changes.

brendangregg · 2016-03-27T17:06:44Z

@vmg great, thanks! New output of stacksnoop looks good. (Old output was partially for debug anyway.)

@4ast I'm thinking we need a /docs/Ubuntu-Xenial.md for specific instructions, which points people to tools/old. And /docs can be for more misc stuff that isn't high level (like /INSTALL.md).

brendangregg · 2016-03-27T17:22:06Z

... wow, so BPF_MAP_TYPE_STACK_TRACE is in 4.6-rc1. I must have missed the final integration email.

evverx · 2016-05-26T03:05:45Z

Oh, I can simply run tools/old/offcputime.py instead of

$ git checkout e82fb1baa26b0~1
$ cmake .. -DCMAKE_INSTALL_PREFIX=/usr

:)

So, yeah,

@4ast I'm thinking we need a /docs/Ubuntu-Xenial.md for specific instructions, which points people to tools/old

makes sense.

vmg added 2 commits March 26, 2016 01:29

vmg force-pushed the vmg/stacks-next branch from cb303e8 to 0445210 Compare March 27, 2016 10:31

vmg mentioned this pull request Mar 27, 2016

Better stack walking APIs #448

Merged

BPF: better format for ksymaddr

3a137db

The offset is now returned after a `+` symbol, instead of directly attached to the symbol name (which made reading the output very confusing) Before: tick_do_update_jiffies64a0 After: tick_do_update_jiffies64+0xa0

vmg force-pushed the vmg/stacks-next branch from 0445210 to aed90a8 Compare March 27, 2016 16:42

vmg added 2 commits March 27, 2016 18:51

vmg force-pushed the vmg/stacks-next branch from aed90a8 to b4ebed0 Compare March 27, 2016 16:52

brendangregg merged commit c256e94 into iovisor:master Mar 27, 2016

evverx mentioned this pull request May 26, 2016

Install tools/old #552

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stack Traces: TNG #450

Stack Traces: TNG #450

vmg commented Mar 27, 2016

4ast commented Mar 27, 2016

vmg commented Mar 27, 2016

brendangregg commented Mar 27, 2016

brendangregg commented Mar 27, 2016

evverx commented May 26, 2016

Stack Traces: TNG #450

Stack Traces: TNG #450

Conversation

vmg commented Mar 27, 2016

TODO

4ast commented Mar 27, 2016

vmg commented Mar 27, 2016

brendangregg commented Mar 27, 2016

brendangregg commented Mar 27, 2016

evverx commented May 26, 2016