-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC: Add data-file output option #305
Comments
Please see the A datetime field would need to be added, PR welcome. We don't currently count the rows, I'm not sure what it adds / why it's necessary? As there's lots of threads making requests at the same time we'd have to "fake" it with the counter. Finally, the individual response times are not adjusted, so the last two rows aren't an option (and this is the same in HdrHistogram). If you review the mitigation section of the README you can see an example of what shows up in the Requests log. Specifically, if the |
BTW: It's also worth noting that I'm reworking the logger logic in this issue which should get merged soon: |
Yes the requests file is a reasonable workaround.
It adds two things:
Apologies for seeming obtuse, but I found it very confusing. I'll add comments in the relevant PR. |
I'd consider a PR that makes it configurable which fields from a given entity are logged, but for now I think it's enough to just be sure that all necessary fields are included. I don't want to get to bogged down in this just yet when tools already exist that can easily map the Due to the way that Instead, perhaps it would work better to filter by a given
Not at all: it's new documentation and a work in progress. PRs to improve readability are welcome. |
Agree the requests file is a existing workaround. This is aimed at making it convenient to generate data ready for analysis. I've amended the proposal to reflect this: --analysis-file NAME Create an CSV-formatted data file for analysis
--analysis-fields F1,F2,..FN Comma delimited field names to output to analysis-file. Field names: <TBD>
Not a problem. Generally people seem interested in the distributional properties, e.g. the tail, 99th %-tile etc. For distributional purposes the order requests are made is not relevant, hence the absence of a request time stamp - the While time-series properties are, IMO, of interest, e.g. are slow/fast responses clustered (tests for heteroskedasticity are well established), they are not at this stage the subject of this RFC. Though, as we mentioned elsewhere, it is not clear to us these systems are always ergodic - perhaps that is why we generally see no sophisticated analysis of this type?
Hopefully all that analysis is possible given the updated RFC.
OK we'll open a RFC to discover direction. |
We won't be able to pursue this beyond the documentation contribution which has landed. |
Prompted by issue #303 and PR #304
Add a data-file output option (under Metrics)
This produces a data file of response times suitable for analysis such as that described in PR #304.
Example (data comes from README mitigation section):
where all data files contain the fields:
datetime
: ISO-8601 date-time stamp of the start of the Goose run. Used to distinguish between Goose runs. Not used to calculate response times. Not used to debug.sample
: integer, row counteruser
: The Goose user ID (integer)code
: The HTTP response codeData files include additional fields passed as a comma delimited list in the option
--data-fields
raw
: integer, raw response time (nanoseconds)co_adjusted
: integer, CO-adjusted response time (nanoseconds)elapsed
co_elapsed
The text was updated successfully, but these errors were encountered: