Proposal: Significant performance improvements for Data.Map
Don Stewart
dons at galois.com
Sun Aug 29 09:15:45 EDT 2010
http://hackage.haskell.org/trac/ghc/ticket/4277
Proposal: Significant performance improvements for Data.Map
Description
Milan Straka's recent [52] Haskell Symposium paper (PDF) shed light on
the containers:Data.Map library, indicating there were both algorithmic
and stylistic performance improvements to be made.
52. http://research.microsoft.com/~simonpj/papers/containers/containers.pdf
This proposal provides a patch that [53] dramatically improves
performance across the API. Three standard techniques are applied to
the code:
* [54] Worker/Wrapper transformations of recursive functions with
constant arguments (aka. the [55] static argument transformation).
* Explicit inlining of wrapper functions.
* Explicit strictness of keys to functions.
53. http://is.gd/eJHIE
54. http://www.workerwrapper.com/
55. http://hackage.haskell.org/trac/ghc/ticket/888
Three complementary, but orthogonal patches are provided.
1. Add a testsuite, [56] with coverage data (currently 91% of top
level functions, and all core functions).
2. Add a micro-benchmark suite based on [57] Criterion, for empirical
evidence of improvements to each function.
3. The optimization patch itself.
All 3 patches should be applied, under this proposal.
56. http://code.haskell.org/~dons/tests/containers/hpc_index.html
57. http://hackage.haskell.org/package/criterion
The [58] benchmark results are very strong:
* An average speedup across the core api of 47% (on intel i7 / linux
64 bit), and;
* 36% on Mac / intel core 2 duo 32 bit).
http://i.imgur.com/05BWe.png
58. http://is.gd/eJHIE
No bugs were identified in the development of the comprehensive
testsuite, which includes a large set of unit tests from [60] Kazu
Yamamoto
60. http://twitter.com/kazu_yamamoto/status/22351727559
A fork of the containers package, with the 3 patches (and a version
bump to make it easier to test out) is available:
darcs get http://code.haskell.org/~dons/code/containers/
Separately, the patches are attached to this ticket.
The consideration period is 3 weeks (before ICFP).
__________________________________________________________________
Work carried out over 28/29th August, 2010 in Utrecht, NL, by Johan
Tibell and Don Stewart.
More information about the Libraries
mailing list