Build system idea
Simon Marlow
marlowsd at gmail.com
Thu Aug 28 09:59:16 EDT 2008
John Meacham wrote:
> unfortunately the cabal approach doesn't work. note, I am not saying a
> declarative configuration manager won't work. in fact, I have sketched a
> design for one on occasion. but cabal's particular choices are broken.
> It is treading the same waters that made 'imake' fail.
>
> the ideas of forwards and backwards compatability are _the_ defining
> features of a configuration manager. Think about this, I can take my old
> sunsite CD, burned _ten years_ ago and take the unchanged tarballs off
> that CD and ./configure && make and in general most will work. many were
> written before linux even existed, many were written with non gcc
> compilers, yet they work today. The cabal way wasn't able to handle a
> single release of ghc and keep forwards or backwards compatability.
>
> That any project ever had to be changed to use the flag 'split-base' is
> a travesty. What about all the projects on burnt cds or that don't have
> someone to update them? 20 years from now when we are all using 'fhc'
> (Fred's Haskell Compiler) will we still have this reference to
> 'split-base' in our cabal files? how many more flags will have
> accumulated by then? Sure it's declarative, but in a language that
> doesn't make sense without the rule-book. autoconf tests things like
> 'does a library named foo exist and export bar'. 'is char signed or
> unsigned on the target system'. those are declarative statement and
> have a defined meaning through all time. (though, implemented in a
> pretty ugly imperative way) That is what allows autoconfed packages to
> be compiled by compilers on systems that were never dreamed of when the
> packages were written.
The important thing about Cabal's way of specifying dependencies is that
they can be made sound with not much difficulty. If I say that my package
depends on base==3.0 and network==1.0, then I can guarantee that as long as
those dependencies are present then my package will build. ("but but
but..." I hear you say - don't touch that keyboard yet!)
Suppose you used autoconf tests instead. You might happen to know that
Network.Socket.blah was added at some point and write a test for that, but
alas if you didn't also write a test for Network.Socket.foo (which your
code uses but ends up getting removed in network-1.1) then your code
breaks. Autoconf doesn't help you make your configuration sound, and you
get no prior guarantee that your code will build.
Now, Cabal's dependencies have the well-known problem that they're
exceptionally brittle, because they either overspecify or underspecify, and
it's not possible to get it "just right". On the other hand, autoconf
configurations tend to underspecify dependencies, because you typically
only write an autoconf test for something that you know has changed in the
past - you don't know what's going to change in the future, so you usually
just hope for the best. For Cabal I can ask the question "if I modify the
API of package P, which other packages might be broken as a result?", but I
can't do that with autoconf.
Both systems are flawed, but neither fundamentally. For Cabal I think it
would be interesting to look into using more precise dependencies
(module.identifier::type, rather than package-version) and have them
auto-generated. But this has difficult implications: implementing
cabal-install's installation plans becomes much harder, for example.
>> So I accept that we do not yet cover the range of configuration choices
>> that are needed by the more complex packages (cf darcs), but I think
>> that we can and that the approach is basically sound. The fact that we
>> can automatically generate distro packages for hundreds of packages is
>> not insignificant. This is just not possible with the autoconf approach.
>
> This is just utterly untrue. autoconfed packages that generate rpms,
> debs, etc are quite common. The only reason cabal can autogenerate
> distro packages for so many is that many interesting or hard ones just
> _arn't possible with cabal at all_.
Exactly! Cabal is designed so that a distro packager can write a program
that takes a Cabal package and generates a distro package for their distro.
It has to do distro-specific stuff, but it doesn't typically need to do
package-specific stuff.
To generate a distro package from an autoconf package either the package
author has to include support for that distro, or a distro packager has to
write specific support for that package. There's no way to do generic
autoconf->distro package generation, like there is with Cabal.
Yes this means that Cabal is less general than autoconf. It was quite a
revelation when we discovered this during the design of Cabal - originally
we were going to have everything done programmatically in the Setup.hs
file, but then we realised that having the package configuration available
*as data* gave us a lot more scope for automation, albeit at the expense of
some generality.
That's the tradeoff - but there's still nothing stopping you from using
autoconf and your own build system instead if you need to!
> As for programs written in haskell, I don't want people's first
> impression of haskell being "oh crap, I gotta learn a new way to build
> things just because this program is written in some odd language called
> 'haskell'" I don't care how awesome a language is, I am going to be
> annoyed by having to deal with it when I just want to compile/install a
> program. It will leave a bad taste in my mouth. I would much rather
> peoples first impression be "oh wow, this program is pretty sweet. I
> wonder what it is written in?" hence they all use ./configure && make by
> design rather than necessity.
Python packages don't have ./configure or make...
> I sometimes hear that I just shouldn't use cabal for some projects but,
> when it comes down to it. If cabal is a limited build/configuration
> system in any way, why would I ever choose it when starting a project
> when I know it is either putting a limit on my projects ability to
> innovate or knowing that at some point in the future I am going to have
> to switch build systems?
Because if you *can* use Cabal, you get a lot of value-adds for free
(distro packages, cabal-install, Haddock, source distributions, Hackage).
What's more, it's really cheap to use Cabal: a .cabal file is typically
less than a screenful, so it's no big deal to switch to something else
later if you need to.
Cheers,
Simon
More information about the Glasgow-haskell-users
mailing list