runtime/trace: emit events traceEv(String|Stack|Frequency) before dependent events #18744

cstockton · 2017-01-22T06:46:49Z

This is a follow up from a few weeks ago from the mailing list thread I wrote when I was taking a stab at implementing a Go trace Encoder/Decoder. While plenty of this feature still escapes me like the state machine that makes events consistent, it was a good learning exercise and I see the potential for a separate category of tooling. You could stream events to simple dashboards/gui's in real time for a high level view of whats going on in your application or use it during runtime within the same process and filter uninteresting outbound events.

Based on my LIMITED knowledge (disclaimer here) I believe we may be able to construct a complete (processor, goroutineID's, timing, stack, string id refs) stream of events from a shallow look-behind state with a couple changes. First is to emit traceEvString and traceEvStack when discovered so dependents are emitted with referential integrity. The runtime/trace is a bit over my head, but I was able to implement this change as a POC to ensure it worked and tests passed.

The next I am less sure about, that is timing via traceEvFrequency. If it was emitted after EvBatch then all future events could calculate time based on the previous events offset. If i understand correctly it's the cpu ticks, which should be consistent at any point in the trace. But I haven't implemented the timing yet in my library and am not entirely sure on this and my suggestion may not make sense if the timing is based on the elapsed trace duration in some way.

My apologies if these suggestions don't make sense, thanks for taking the time to review regardless.

ALTree · 2017-01-23T17:32:09Z

Hi,

is this a proposal (to be evaluated) or a feature-request-issue that you plan to fix by sending those patches you mention?

cstockton · 2017-01-23T18:50:00Z

If the changes are agreed upon by the Go team, I am comfortable cleaning up my patches and sending them for review.

ALTree · 2017-01-23T19:01:08Z

No proposal label then, just cc @dvyukov for opinions.

gopherbot · 2017-02-06T05:15:37Z

CL https://golang.org/cl/36325 mentions this issue.

hyangah · 2017-02-06T19:22:04Z

@heschik

aclements · 2017-02-13T17:16:38Z

Taking a step back here, is the high-level goal here to make the trace format (more) streamable?

If so, how do you plan to deal with time being out of order in the trace? In the current format, it may be necessary to buffer the entire trace before it can be sorted; for example, if some P is producing events so slowly that they all get buffered until the end of the trace.

cstockton · 2017-02-13T19:51:05Z

@aclements Yes, that would be one of the high level goals. Currently the entire trace HAS to be buffered for complete events regardless of batch ordering, my goal with this change is to lift that restriction to allow tackling the second problem you mention with slow P's. The built in Go tool which gives the very detailed report and holistic view of a trace would not benefit much, but it does enable a new tool category that focuses on aggregate information in sliding windows. For example a tool could keep 30 second buffer and run for many minutes, or hours if needed and you could create a simple threshold for specific metrics that you have trouble catching in production to send to a trace file for deeper analysis with the go trace tool later. Maybe this wouldn't be very useful to most but I think it might allow some cool stuff (that likely does't lose value in the event of a single slow P, as it wouldn't effect a large aggregate view in most cases).

The problem with slow P's I think can be managed now that I have a better understanding of tracing. The general idea being once a buffer has been active for a given duration to flush it. I think the problem of solving a single slow P that has many other active P's covers a lot of cases and is much less nuanced than a scenario of ALL slow P's. The latter would need the wake event expanded for the parking of the ReadTrace G, I likely see obstacles (or don't see) where you and Dmitry may not since you have much better understandings of these things.

cstockton · 2017-02-24T16:10:09Z

@aclements Is this something the Go team would accept a patch for if implemented sanely?

ALTree added NeedsDecision Suggested labels Jan 23, 2017

cstockton closed this as completed Feb 28, 2017

cstockton mentioned this issue Apr 2, 2017

Few notes on this repository cstockton/go-trace#1

Closed

golang locked and limited conversation to collaborators Feb 28, 2018

gopherbot added the FrozenDueToAge label Feb 28, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

runtime/trace: emit events traceEv(String|Stack|Frequency) before dependent events #18744

runtime/trace: emit events traceEv(String|Stack|Frequency) before dependent events #18744

cstockton commented Jan 22, 2017

ALTree commented Jan 23, 2017

cstockton commented Jan 23, 2017

ALTree commented Jan 23, 2017

gopherbot commented Feb 6, 2017

hyangah commented Feb 6, 2017

aclements commented Feb 13, 2017

cstockton commented Feb 13, 2017 •

edited

Loading

cstockton commented Feb 24, 2017

runtime/trace: emit events traceEv(String|Stack|Frequency) before dependent events #18744

runtime/trace: emit events traceEv(String|Stack|Frequency) before dependent events #18744

Comments

cstockton commented Jan 22, 2017

ALTree commented Jan 23, 2017

cstockton commented Jan 23, 2017

ALTree commented Jan 23, 2017

gopherbot commented Feb 6, 2017

hyangah commented Feb 6, 2017

aclements commented Feb 13, 2017

cstockton commented Feb 13, 2017 • edited Loading

cstockton commented Feb 24, 2017

cstockton commented Feb 13, 2017 •

edited

Loading