Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance improvement: record-batching #779

Merged
merged 26 commits into from
Dec 13, 2021
Merged

Performance improvement: record-batching #779

merged 26 commits into from
Dec 13, 2021

Conversation

johnkerl
Copy link
Owner

@johnkerl johnkerl commented Dec 13, 2021

This is a third performance PR, following after #765 and #774.

This affects all Miller-supported file formats. Some performance numbers for CSV, on a Mac laptop:

Note that the last number is the runtime we'll have after the WIP JIT branch is PR'ed and merged. At that point, single-verb performance will be noticeably better than Miller 5. Furthermore, for multi-verb then-chains, the performance improvement will be even more noticeable, since Miller <= 5 (in C) is single-core whereas Miller 6 (in Go) is multicore.

See also new files in the scripts directory on this PR which help for doing measurements of this sort.

@johnkerl johnkerl merged commit f233923 into main Dec 13, 2021
@johnkerl johnkerl deleted the batched-reader branch December 13, 2021 05:57
johnkerl added a commit that referenced this pull request Dec 13, 2021
johnkerl added a commit that referenced this pull request Dec 13, 2021
johnkerl added a commit that referenced this pull request Dec 13, 2021
johnkerl added a commit that referenced this pull request Dec 13, 2021
@johnkerl johnkerl mentioned this pull request Dec 15, 2021
johnkerl added a commit that referenced this pull request Dec 15, 2021
johnkerl added a commit that referenced this pull request Dec 21, 2021
* JIT mlrval type-interfence: mlrval package

* mlrmap refactor

* complete merge from #779

* iterating

* mlrval/format.go

* mlrval/copy.go

* bifs/arithmetic_test.go

* iterate on bifs/collections_test.go

* mlrval_cmp.go

* mlrval JSON iterate

* iterate applying mlrval refactors to dependent packages

* first clean compile in a long while on this branch

* results of first post-compile profiling

* testing

* bugfix in ofmt formatting

* bugfix in octal-supporess

* go fmt

* neaten

* regression tests all passing
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant