From anthony_clayden at clear.net.nz  Wed Oct  3 11:45:08 2018
From: anthony_clayden at clear.net.nz (Anthony Clayden)
Date: Thu, 4 Oct 2018 00:45:08 +1300
Subject: [Hugs-users] TypeCastery
Message-ID: <CAM7nRYTYJo7feDLvUh8OHTPqSA-=xfX1Kzouwrj0roTN4XiA6A@mail.gmail.com>

A typical consequence of combining FunDeps + Overlapping instances is that
you have to make the result parameter more general than it needs be, then
use a TypeCast constraint to improve it.

There's a classic example in the HList paper [*], Section 9 'A generic type
equality predicate'.

> class TypeEq x y b  | x y -> b
>
> instance TypeEq x x HTrue                   -- HTrue, HFalse are just two
datatypes
>
> instance (TypeCast HFalse b)
>                  => TypeEq x y b                    -- can't put TypeEq x
y HFalse

(And really all of the tricky type improvement in that paper boils down to
that technique, whether or not there's an explicit TypeEq test.)

The paper says the TypeEq solution is GHC-specific. That's true but .... I
wonder if they discussed their results with the Hugs team at the time,
because it's not needed drastic surgery to make Hugs do that. (Perhaps I
broke something, but if so it's going to take a convoluted example to find
it. I've built a routine that merges TRex records by matching label names,
with horrendous amounts of overlap and typecasting.)

The technique relies on TypeCast, which does use FunDeps to mutually
improve/unify its two parameters. Nowadays in GHC you'd use the (~)
constraint. TypeCast does use a complicated chain of "indirection"
superconstraints on instances, but does not use overlaps -- indeed there's
only a single instance for each of the classes involved. I'd been happily
using the version of TypeCast given in Appendix D 'Generic type unification
cont'd'. Then I re-read the paper, which claimed it didn't work in Hugs
[Section 9 'Reification of type unification']:

"The most generic implementation of TypeCast, which works for both Hugs and
GHC, is ... For this implementation to work, we need ... Otherwise, type
simplification will ... and thereby inline the unification." IOW
unification will be too 'eager'/too smart and see that we're trying to
evade the FunDep consistency rule.
"Alas, this implementation [Appendix D alternative] is specific to GHC; it
does not work in Hugs because of the peculiarities of that system with
regard to MPTCs and functional dependencies, ..."

I'm experiencing that the Section 9 implementation claimed to work in Hugs,
doesn't; and the Appendix D implementation claimed to not work in Hugs,
does. That's using the legitimate distro version of Hugs Sep2006. Possibly
Hugs changed after 2004 when the HList work was reported. (In Section 6 of
the paper "We give up on persuading Hugs." so I guess they didn't explore
further.)

That is, the Section 9 implementation exhibits the 'eager' type
unification, whether or not I declare TypeCast in a separate module and
"import it at a higher level in the module hierarchy" [Section 9]. That is,
if I'm doing that right: what does "separate compilation" mean in context
of Hugs? It's an interpreter not a compiler; it doesn't produce
executables/object code. Indeed if you import a library -- even one of the
standard Report-defined libraries, it goes and gets the library source and
compiles that alongside the client program. There's no Haskell
interface/.hi files that I can see(?)

Oh, and there's another advantage to avoiding separate compilation: Hugs
can see all the instances and their constraints (and instances of the
constraint classes) across all the modules. So no 'orphan instances' -- the
problems with which were amongst the reasons for Section 6 'Overlapping
banned'. "we do not want to depend on the doubtful future of overlapping
instances in general ... GHC's instance selection is lazy, whereas Hugs' is
eager".

This leads the paper on to "what's known as the Data.Typeable approach at
the type level". But the cure is worse than the disease! It's just not
scalable and not workable. So the version of HList that has persisted to
this day does use the combo of Fundeps + Overlaps, and despite the paper's
misgivings, that has persistently been stable in GHC. No "doubtful future"
after a dozen years.

To pick up a misrepresentation: "GHC's instance selection is lazy": no,
GHC's validation of overlapping instances is lazy; but its selection is
eager, too eager: it commits to an instance inside each module, ignoring
the possibility this module is imported into another with an overlapping
module which is a better fit. Whereas the alleged "Hugs' is eager" is also
opposite to the truth: Hugs' selection of an instance is delayed as much as
possible until it's sure there's only one fit (it can see all instances in
all imports). Validation of overlap is eager: instances (heads) must be in
strict substitution sequence. That's often a nuisance, but a price I'm
prepared to pay to get better coherence.

'Orphan instances' (and overlap thereof) are certainly a danger in GHC.
Avoiding them was part of the motivation for Closed Type Families to be
grouped in a single syntactic unit/in a single module. And yet GHC knows
how to defer instance selection if there's no unique/suitable instance
visible in a module. Why can't it do that everywhere? Or at least warn if
imports contain instances that are overlapped in a different module.

Then it seems to me:
* not only does GHC have a "bogus" implementation of the FunDep consistency
rule; but also
* GHC's implementation of Overlaps is broken.

Hugs' implementation of both is a lot more restrictive, and more coherent.
I'm not finding it prevents any programs. It does require I structure them
in specific ways.


AntC


[*] Strongly Typed Heterogeneous Collections 2004, Oleg Kiselyov, Ralf
Lämmel, Keean Schupke
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.haskell.org/pipermail/hugs-users/attachments/20181004/7e1fbf3b/attachment.html>

From doug at cs.dartmouth.edu  Wed Oct  3 14:37:37 2018
From: doug at cs.dartmouth.edu (Doug McIlroy)
Date: Wed, 03 Oct 2018 10:37:37 -0400
Subject: [Hugs-users] Hugs-Users Digest, Vol 41, Issue 1
In-Reply-To: <mailman.7.1538568002.14692.hugs-users@haskell.org>
References: <mailman.7.1538568002.14692.hugs-users@haskell.org>
Message-ID: <201810031437.w93EbbgW028958@tahoe.cs.Dartmouth.EDU>

!e dead*

From doug at cs.dartmouth.edu  Wed Oct  3 14:40:10 2018
From: doug at cs.dartmouth.edu (Doug McIlroy)
Date: Wed, 03 Oct 2018 10:40:10 -0400
Subject: [Hugs-users] Hugs-Users Digest, Vol 41, Issue 1
In-Reply-To: <mailman.7.1538568002.14692.hugs-users@haskell.org>
References: <mailman.7.1538568002.14692.hugs-users@haskell.org>
Message-ID: <201810031440.w93EeAth029315@tahoe.cs.Dartmouth.EDU>

Sorry for whatever I just posted. It was a slip of the
fingers in responding to a completely unrelated email

Doug

From fa-ml at ariis.it  Wed Oct  3 14:56:16 2018
From: fa-ml at ariis.it (Francesco Ariis)
Date: Wed, 3 Oct 2018 16:56:16 +0200
Subject: [Hugs-users] Hugs-Users Digest, Vol 41, Issue 1
In-Reply-To: <201810031440.w93EeAth029315@tahoe.cs.Dartmouth.EDU>
References: <mailman.7.1538568002.14692.hugs-users@haskell.org>
 <201810031440.w93EeAth029315@tahoe.cs.Dartmouth.EDU>
Message-ID: <20181003145616.sk3darlfk4y3dqst@x60s.casa>

On Wed, Oct 03, 2018 at 10:40:10AM -0400, Doug McIlroy wrote:
> Sorry for whatever I just posted. It was a slip of the
> fingers in responding to a completely unrelated email

It suspiciously looks as vim shortcuts!

From anthony_clayden at clear.net.nz  Tue Oct  9 10:57:36 2018
From: anthony_clayden at clear.net.nz (Anthony Clayden)
Date: Tue, 9 Oct 2018 23:57:36 +1300
Subject: [Hugs-users] ~ Swansong [was: TypeCastery]
In-Reply-To: <CAM7nRYTYJo7feDLvUh8OHTPqSA-=xfX1Kzouwrj0roTN4XiA6A@mail.gmail.com>
References: <CAM7nRYTYJo7feDLvUh8OHTPqSA-=xfX1Kzouwrj0roTN4XiA6A@mail.gmail.com>
Message-ID: <CAM7nRYTPXwXyTowygzrMi8rejMh2UxKE6e+p=OGR_DjxWcveAA@mail.gmail.com>

(More from the annals of oh-so-easy in yacc.)

On Thu, 4 Oct 2018 at 12:45 AM, Anthony Clayden <
anthony_clayden at clear.net.nz> wrote:

> A typical consequence of combining FunDeps + Overlapping instances is that
> you have to make the result parameter more general than it needs be, then
> use a TypeCast constraint to improve it.
>
> ... TypeCast, which does use FunDeps to mutually improve/unify its two
> parameters. Nowadays in GHC you'd use the (~) constraint.
>

I got so fed up writing out TypeCast constraints; and I'm so used to
writing infix (~) in GHC, I implemented (~) in Hugs. This is hard-coded
syntax in constraints; I've not implemented type operators. (Mark you, it's
also hard-coded in GHC, because (~) is a reserved symbol.)

This needed only changes to the yacc syntax. From that I insisted that "~"
is a legitimate name for a class. There is a downside that equality
constraints are printed with the '~' prefix -- because all class names are
prefix.

To be clear: this is not as powerful or well-integrated as (~) in GHC. You
still need at the term level to explicitly cast.

Then I got so fed up writing out explicit typeCast calls ..., I picked up
the postfix operators idea
https://mail.haskell.org/pipermail/hugs-users/2018-September/000909.html
and invented a postfix operator (~::) to do the job.
Trailing double-colon says I'm doing something typeful; tilde connects it
to the equality constraint.

Then here's the classic, compiled in Hugs, also exhibiting the FunDeps +
Overlaps combo

class TypeEq t t' r  | t t' -> r  where
  typeEq :: t -> t' -> r

instance TypeEq t t TTrue  where
  typeEq _ _ = TTrue

instance (TFalse ~ f) => TypeEq t t' f  where
  typeEq _ _ = (TFalse ~::)      -- without explicit cast, complains type
is not general enough


AntC
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.haskell.org/pipermail/hugs-users/attachments/20181009/f1faf09a/attachment.html>

From anthony_clayden at clear.net.nz  Wed Oct 31 11:31:06 2018
From: anthony_clayden at clear.net.nz (Anthony Clayden)
Date: Thu, 1 Nov 2018 00:31:06 +1300
Subject: [Hugs-users] [haskell-beginners] Lifting over record syntax
Message-ID: <CAM7nRYSkXSuT3emStLFczqAAfefTVsvHxCmp_L9QhRUYeA+_qw@mail.gmail.com>

musing on this
https://mail.haskell.org/pipermail/beginners/2018-October/018363.html

to which my reply talked mainly about Trex.

Does it make sense to regard a Trex record (or any free-standing
anonymous/extensible record) as a container, with values 'lifted' into the
structure, and an interface given by the label names, _not_ by order of
fields? That is, "container" in the Monad sense?

Does it further make sense if all the values are already lifted into some
Functor (like Maybe or (Either e) ) to hoist the Functor out over the
record structure?

I'm not sure I'm up with the program for Functor/Applicative/Monad
everywhere. If a record structure is a Functor, what's the type
constructor? `Rec`? If record structure is an Applicative, what's the
constructor for `pure` that lifts into `Rec`? The `<*>` aka `ap` or bind
for a Monad presumably is to take two records (with disjoint labels) and
concatenate them to a single record.

Given

data Person = Person (Rec( name :: String, age :: Int))

we want, corresponding to the question:

Person <?$>  (age = 27) <*> (name = "Joe")

in which operator <?$> is map-like: it takes a function to its left that
expects a record of n fields, then maps over n singleton records with
corresponding labels appearing in any order.

I might be able to get there by generics/reflection over the field names
embedded in data constructor `Person`s type, then overloading <?$> to look
for each label. That is, if Hugs had any sort of generics/TypeRep.

Label names in Trex are literals; there's no such thing as a label
variable. (Which is why it's a tad annoying that they start lower case.)
Furthermore the same label name must appear in both terms and types -- in
fact labels occupy a namespace separate vs terms or types. So Trex is a
long way from from generic record handling like:

recAppend :: ( rho'\__x ) => Rec rho' -> Rec ( __x :: a) -> Rec ( __x :: a
| rho')
recAppend rho ( __x = x ) = ( __x = x | rho )

in which I've used double-underscore prefix to signify a label variable.
This is intended to extend a record `rho` with a singleton record. If we
try appending a record with more than one field, beware that field order is
arbitrary, so this

recAppend rho ( __x = x, __y = y) = ...

has no principal type (a familiar difficulty). The programmer doesn't care
which way round labels `__x, __y` bind, providing they're distinct, but the
typing does care.

Some sort of generic record extend/concatenate would be great. You might,
looking at Trex syntax, think that `|` is it. This is valid:

( x = "x" | (y = 'y' | (z = 3.14)))
( x = "x", y = 'y', z = 3.14 )           -- equivalent

I could put any record value in place of `(z = 3.14)`. But not in place of
the `x = ..` or `y = ...`: `|` is not an operator, not commutative, not
associative. Furthermore this is a place where parentheses make a
difference, unlike usually in Haskell. So the following are not equivalent,
indeed they're all invalid syntax

( (x = "x") | (y = 'y' | (z = 3.14)))       -- x = ... has parens
( x = "x"  | (y = 'y' | z = 3.14 ))          -- z = ... doesn't have parens
( x = "x" | y = 'y' | (z = 3.14) )           -- z = ... in parens OK, but y
= ... is not
( (x = "x") | (y = 'y') | (z = 3.14) )      -- no chance

I'd like to write term `( rho1 | rho2 )` to concatenate two records. That's
currently unrecognised syntax, so I think could be added as such. What
would be its type/is it principal?

( rho1 | rho2 ) :: (rho1' \\ rho2') => Rec( rho1' | rho2' )    -- inventing
more syntax

in which constraint `(rho1' \\ rho2')` requires the two rows' labels be
mutually disjoint -- read "lacks all". Ur/web has something like this. Note
I'm not envisaging `|` as a genuine operator: this is still hard-wired
syntax; pipe is a reserved symbol in H98 anyway.


AntC
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.haskell.org/pipermail/hugs-users/attachments/20181101/a2248477/attachment.html>