[GHC] #10681: Teach GHC to interpret all hs files as two levels of hs-boot files (abstract types only/full types + values)
GHC
ghc-devs at haskell.org
Fri Jul 31 00:59:01 UTC 2015
#10681: Teach GHC to interpret all hs files as two levels of hs-boot files
(abstract types only/full types + values)
-------------------------------------+-------------------------------------
Reporter: ezyang | Owner: ezyang
Type: feature request | Status: new
Priority: normal | Milestone:
Component: Compiler | Version: 7.11
Resolution: | Keywords:
Operating System: Unknown/Multiple | Architecture:
| Unknown/Multiple
Type of failure: None/Unknown | Test Case:
Blocked By: | Blocking:
Related Tickets: | Differential Revisions:
-------------------------------------+-------------------------------------
Old description:
> This is a new proposal for solving #1409. The big addition here is that
> we create **two** hs-boot files for each hs file: one that is a full hs-
> boot file to be imported by hs files to break loops, and a second which
> only includes abstract types for hs-boot files to import. C.f. #10679
>
> **Discussion.** Here is a slightly goofy `hs-boot` file I've excerpted
> from GHC:
>
> {{{
> module Packages where
>
> import {-# SOURCE #-} Module (PackageKey)
> import {-# SOURCE #-} DynFlags (DynFlags)
>
> packageKeyPackageIdString :: DynFlags -> PackageKey -> Maybe String
> }}}
>
> The `hs-boot` file must itself import `hs-boot` files, because this boot
> file is used by `Module` and `DynFlags`; without `{-# SOURCE #-}`, the
> boot file itself will participate in a cycle!
>
> But notice that there is something very interesting: a boot file is ONLY
> ever interested in importing other modules to get types. Never to import
> constructors or functions!
>
> We can use this observation to give us a mechanical transformation of an
> `hs` file to an `hs-boot` file, ASSUMING we can define a "second level"
> of `hs-boot` file to record our abstract types.
>
> **Example.** In this example, we have chosen to break the loop from `A`s
> import to `B`.
>
> {{{
> module A where
> import {-# SOURCE #-} B
> data A = A B
> f :: A -> Bool
> f (A (B (A b))) = g b
> f _ = True
>
> module B where
> import A
> data B = B A
> g :: B -> Bool
> g (B (A (B b))) = f b
> g _ = False
> }}}
>
> The first-level `hs-boot`s are:
>
> {{{
> module A where -- not actually used
> import {-# SOURCE 2 #-} B
> data A = A B
> f :: A -> Bool
>
> module B where
> import {-# SOURCE 2 #-} A
> data B = B A
> g :: B -> Bool
> }}}
>
> The second-level `hs-boot`s are:
>
> {{{
> module A where
> data A
>
> module B where -- not actually used
> data B
> }}}
>
> **Commentary.** Here are some remarks:
>
> 1. Because we have to lift the transitive dependencies of anything we
> `{-# SOURCE #-}` import, it doesn't make sense to have a pragma which
> explicitly says what to put in the `hs-boot` file; instead, we just put
> in everything that we *can* handle in an `hs-boot` file (so exclude
> anything with missing type signatures, type families, etc.) Ideally,
> these automatic hs-boot files are generated lazily, but they should be
> reused as necessary.
>
> 2. This facility actually makes `{-# SOURCE #-}` a lot more attractive
> for increasing separate compilation: you can mark an import `{-# SOURCE
> #-}` to ensure that if its implementation changes, you don't have to
> recompile this module / you can build the module in parallel with that
> module. The downside is that when the imported file is modified, we have
> to regenerate the `hs-boot` stub before we conclude that the types have
> not changed (as opposed to with separate `hs-boot` files, where a
> modification to `hs` would not bump the timestamp on `hs-boot`.
>
> 3. This seems to definitely suggest that you should never need more than
> two levels of hs-boot nesting, or perhaps three with kinding. (But maybe
> someone has a fancy type system feature for which this is not true!)
> Maybe this applies to signature files too.
>
> 4. We can't force the first level of `hs-boot` files to be abstract
> types, for two reasons: (1) a source file importing the hs-boot file may
> really need the selector/constructor, and (2) the `hs-boot` files will
> reflect any cycles from the source files, that's no good! Rolling out to
> the second level breaks the cycle because abstract types never need any
> imports.
>
> 5. What about type class instances? I propose that instances be lifted to
> the `hs-boot` level (so hs file usages of the instance continue to work),
> but not the `hs-boot2` level (so that we can still "bottom out"). This
> can result in some slightly unintuitive behavior, however:
> {{{
> module A where
> instance Eq (a -> b) where ...
> module B where
> import A
> module C where
> import {-# SOURCE #-} B
> }}}
> In this case, `C` would NOT see the `Eq` instance for functions
> defined in `A`.
New description:
This is a new proposal for solving #1409. The big addition here is that we
create **two** hs-boot files for each hs file: one that is a full hs-boot
file to be imported by hs files to break loops, and a second which only
includes abstract types for hs-boot files to import. C.f. #10679
**Discussion.** Here is a slightly goofy `hs-boot` file I've excerpted
from GHC:
{{{
module Packages where
import {-# SOURCE #-} Module (PackageKey)
import {-# SOURCE #-} DynFlags (DynFlags)
packageKeyPackageIdString :: DynFlags -> PackageKey -> Maybe String
}}}
The `hs-boot` file must itself import `hs-boot` files, because this boot
file is used by `Module` and `DynFlags`; without `{-# SOURCE #-}`, the
boot file itself will participate in a cycle!
But notice that there is something very interesting: a boot file is ONLY
ever interested in importing other modules to get types. Never to import
constructors or functions!
We can use this observation to give us a mechanical transformation of an
`hs` file to an `hs-boot` file, ASSUMING we can define a "second level" of
`hs-boot` file to record our abstract types.
**Example.** In this example, we have chosen to break the loop from `A`s
import to `B`.
{{{
module A where
import {-# SOURCE #-} B
data A = A B
f :: A -> Bool
f (A (B (A b))) = g b
f _ = True
module B where
import A
data B = B A
g :: B -> Bool
g (B (A (B b))) = f b
g _ = False
}}}
The first-level `hs-boot`s are:
{{{
module A where -- not actually used
import {-# SOURCE 2 #-} B
data A = A B
f :: A -> Bool
module B where
import {-# SOURCE 2 #-} A
data B = B A
g :: B -> Bool
}}}
The second-level `hs-boot`s are:
{{{
module A where
data A
module B where -- not actually used
data B
}}}
**Commentary.** Here are some remarks:
1. Because we have to lift the transitive dependencies of anything we `{-#
SOURCE #-}` import, it doesn't make sense to have a pragma which
explicitly says what to put in the `hs-boot` file; instead, we just put in
everything that we *can* handle in an `hs-boot` file (so exclude anything
with missing type signatures, type families, etc.) Ideally, these
automatic hs-boot files are generated lazily, but they should be reused as
necessary.
2. This facility actually makes `{-# SOURCE #-}` a lot more attractive for
increasing separate compilation: you can mark an import `{-# SOURCE #-}`
to ensure that if its implementation changes, you don't have to recompile
this module / you can build the module in parallel with that module. The
downside is that when the imported file is modified, we have to regenerate
the `hs-boot` stub before we conclude that the types have not changed (as
opposed to with separate `hs-boot` files, where a modification to `hs`
would not bump the timestamp on `hs-boot`.
3. With Haskell98, you should never need more than two levels of hs-boot
nesting. However, with data kind promotion, you may need arbitrarily many
levels of nesting. You could simply exclude promoted data kinds ala
**Handling unsupported boot features**; however an alternate thing to do
is generalize hs-boot to arbitrarily many levels. However, this might be
annoying to implement because dependency analysis needs to know how to
determine universe stratification so it can tell how many levels of hs-
boot are necessary.
4. We can't force the first level of `hs-boot` files to be abstract types,
for two reasons: (1) a source file importing the hs-boot file may really
need the selector/constructor, and (2) the `hs-boot` files will reflect
any cycles from the source files, that's no good! Rolling out to the
second level breaks the cycle because abstract types never need any
imports.
5. What about type class instances? I propose that instances be lifted to
the `hs-boot` level (so hs file usages of the instance continue to work),
but not the `hs-boot2` level (so that we can still "bottom out"). This can
result in some slightly unintuitive behavior, however:
{{{
module A where
instance Eq (a -> b) where ...
module B where
import A
module C where
import {-# SOURCE #-} B
}}}
In this case, `C` would NOT see the `Eq` instance for functions defined
in `A`.
**Handling unsupported boot features.** Some type-level features in
Haskell are not supported at the boot-level (type families, etc), so the
automatic generation of `hs-boot` needs a way of transitively(!) excluding
these definitions from `hs-boot` files. We can exclude things from the
boot file in the following way:
1. If a declaration is not liftable to the `hs-boot` file, we replace it
with a "not bootable" declaration, which specifies that there is something
with this `Name`, but we don't have any information about it. (This is a
sort of generalized version of an abstract type).
2. If we are type-checking a declaration and make reference to a not
bootable declaration, the full declaration itself is considered not
bootable.
Alternately, we can just make sure all language features are supported in
boot files.
--
Comment (by ezyang):
Updated description with some remarks about handling type system features
which are not supported by boot files.
@goldfire: Intuitively, it seems like if you can figure out your universe
hierarchy, you can just write as many levels of `hs-boot` files as you
need. Unfortunately, because types and kinds are syntactically merged in
your nokinds branch, it's not immediately obvious prior to typechecking
what the universes are (the pain of de-stratifying!) which makes it much
more difficult to plan compilation. So it seems like it would be much
simpler to just not include these types of declarations in hs-boot files.
--
Ticket URL: <http://ghc.haskell.org/trac/ghc/ticket/10681#comment:7>
GHC <http://www.haskell.org/ghc/>
The Glasgow Haskell Compiler
More information about the ghc-tickets
mailing list