[Haskell-cafe] Structural sharing in haskell data structures?

Don Stewart dons at galois.com
Wed May 13 18:58:30 EDT 2009


wren:
> Jan-Willem Maessen wrote:
>> I wanted to clear up one misconception here...
>>
>> wren ng thornton wrote:
>> > In heavily GCed languages like Haskell allocation and collection is  
>> > cheap, so we don't mind too much; but in Java and the like, both > 
>> allocation and collection are expensive so the idea of cheap throwaway  
>> > objects is foreign.
>>
>> Not true!
>
> I was speaking of Java, not Clojure. I believe the costs in Java are  
> well documented, though I don't know enough about the JVM to know where  
> the blame belongs. (All I know of Clojure is that it's a Lisp-like on  
> the JVM :)
>
>
>> If you look at the internals of Clojure, you'll discover they're using 
>> trees with *very* wide fanout (eg fanout-64 leaf trees for lists).  
>> Why?  Because it's so cheap to allocate and GC these structures!  By 
>> using shallow-but-wide trees we reduce the cost of indexing and 
>> accessing list elements.  I suspect you'd still be hard-pressed to 
>> support this kind of allocation behavior in any of the present Haskell 
>> implementations, and Haskell implementations of the same kinds of 
>> structures have limited fanout to 2-4 elements or so.
>
> I was under the impression that the reason datastructures in Haskell  
> tend to be limited to 4-fanout had more to do with the cleanliness of  
> the implementations--- pattern matching on 64-wide cells is quite ugly,  
> as is dealing with the proliferation of corner cases for complex  
> structures like finger trees, patricia trees, etc. The use of view  
> patterns could clean this up significantly. On the other hand, we do  
> have things like lazy ByteStrings and UVector which do have wide fanouts.
>

Yes, agreed. This is the first I've heard of a penalty for fan out.
More details please!

-- Don


More information about the Haskell-Cafe mailing list