Proposal: Allow gunfold for Data.Map, Data.IntMap, etc.
ekmett at gmail.com
Thu Aug 30 01:54:57 CEST 2012
On Wed, Aug 29, 2012 at 3:24 PM, Milan Straka <fox at ucw.cz> wrote:
> Hi Edward,
> > I would like to propose improving the Data instances for a number of
> > currently completely opaque data types in the containers package, by
> > virtual constructors.
> > The instance for Data.Map already uses fromList for gfoldl, it just stops
> > there.
> > Extending it to be able to gunfold and mention the name of that
> > would enable generic traversal libraries like uniplate, etc. to work over
> > the contents of the Map, rather than bailing out in fear or crashing at
> > sight of a mkNoRepType.
> > An example of the changes for Data.Map are highlighted below.
> > instance (Data k, Data a, Ord k) => Data (Map k a) where
> > gfoldl f z m = z fromList `f` toList m
> > toConstr _ = fromListConstr
> > gunfold k z c = case constrIndex c of
> > 1 -> k (z fromList)
> > _ -> error "gunfold"
> > dataTypeOf _ = mapDataType
> > dataCast2 f = gcast2 f
> > fromListConstr :: Constr
> > fromListConstr = mkConstr mapDataType "fromList"  Prefix
> > mapDataType :: DataType
> > mapDataType = mkDataType "Data.Map.Map" [fromListConstr]
> > I've used this approach for years on my own libraries to great effect.
> +1 here.
> I am not very familiar with the Data instances -- is it true that the
> parameter of the `fromList` in the Data instance will often be sorted
> (i.e., result of `toList` or `filter . toList`)? If so, we could use
> fromMaybeAscList which would look like
> fromMaybeAscList list | isDistinctAsc list = fromDistinctAscList list
> | otherwise = fromList list
> There is a big gain in using a linear-time fromDistinctAscList over O(N
> log N) fromList, but there is a linear-time check and the list must be
> kept around until isDistinctAsc finishes.
The users of Data.Data could in theory do anything they want to the keys,
but I do confess for most scenarios they'll come back to you ordered.
Hrmm. A more nuanced fromList construction could definitely help, though I
suppose that could apply in the general case as well.
We should be able to fuse this "try to construct linearly, but fall back on
N-log-N" version of fromList in one pass even for normal uses of fromList.
e.g. assume that you are constructing a sorted tree until you find a key
out of order, then take the tree you've built so far and union it
appropriately with the slower constructed fromList of the remainder. That
way you don't have to retain the storage for both the list and the map, and
we only force the list once.
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Libraries