[Haskell-cafe] Re: Mining Twitter data in Haskell and Clojure

Don Stewart dons at galois.com
Tue Jun 15 17:26:10 EDT 2010


deliverable:
> Wren -- thanks for the clarification!  Someone said that Foldable on
> Trie may not be very efficient -- is that true?
> 
> I use ByteString as a node type for the graph; these are Twitter user
> names.  Surely it's useful to replace them with Int, which I'll try,
> but Clojure works with Java String fine and it simplifies all kinds of
> exploratory data mining and debugging to keep it as a String, so I'll
> try to get the most mileage from other things before interning.

bytestring seems appropriate.

> What's the exact relationship between Trie and Map and their
> respective performance?

Tries specialized to bytestring keys should outperform the generic Map.

-- Don


More information about the Haskell-Cafe mailing list