Version control systems

Simon Marlow marlowsd at gmail.com
Mon Aug 11 11:17:59 EDT 2008


Manuel M T Chakravarty wrote:

> I think all *core* libraries must switch.  Seriously, requiring GHC 
> developer to use a mix of two vcs during development is a Very Bad 
> Idea.  Don was excited about getting more people to look at the source 
> when it is in git (see the comments he posted from reddit).  By 
> requiring two vcs you will get *less* people to look at the source.
> 
> This is not only to get the sources to hack them, but you effectively 
> require developers to learn the commands for two vcs (when they are 
> already reluctant to learn one).  For example, often enough somebody who 
> changes something in GHC will modify the base package, too.  Then, to 
> commit the overall work, you need to commit using both vcs.  If you need 
> to branch for your work, you need to create branches in two vcs (no idea 
> whether the semantics of a branch in git and darcs is anywhere 
> similar).  When you merge your branch, you need to merge in both vcs.  
> You can't seriously propose such a set up!

I completely agree this is a problem.  The main obstacle with just 
switching the core libraries is that they are shared by other 
implementations and other maintainers.  So I see no alternative but to 
create forks of those repositories for use by GHC, unless/until the other 
projects/maintainers want to migrate to git.  Some of the repositories are 
not shared - for example ghc-prim, integer-gmp, template-haskell, and these 
don't need to be forked.

One way we could create the forks would be to create a git repo for each 
package with two branches: the master branch that GHC builds, and a 
separate branch that tracks the main darcs repository, and is synced 
automatically whenever patches are pushed to the main darcs repo.  We'd 
have to explicitly merge the tracking branch into the master branch from 
time to time.  When we want to make changes locally, we can just commit 
them to the GHC branch and push the changes upstream in a batch later (and 
then we'd end up having to merge them back in to the GHC branch... but 
hopefully git's merge is clever enough to avoid manual intervention here). 
  This is complicated and ugly of course; better suggestions welcome.

> I *strongly* object to moving to git before this isn't sorted out.  As 
> Roman said before, GHC is heading into a dangerous direction.  It gets 
> progressively harder to contribute to the project at the moment.  First, 
> changing the build system to Cabal.  Now, proposing to use two vcs.  
> Somebody who is new to the project not only has to learn the internals 
> of GHC, but they also have to learn two new vcs, and if they need to 
> change the build system, they need to learn a new build tool.  Raising 
> the bar for developers to contribute to a project has been proven to be 
> a very bad idea many times.  Let's not take GHC down that path.

I'm not completely convinced we need to have this all worked out before GHC 
switches, although it would be nice of course.  We currently have 
infastructure in place for the build to work with a mixture of darcs and 
git repositories, and existing developers already have to learn git anyway. 
  They just need to remember to use darcs for libraries and git for the 
main GHC repo, and this is only a temporary situation.


As for Cabal - we had a thread on cvs-ghc last week, and as I said there 
we'd love to hear suggestions for how to improve things, including wild and 
crazy ideas for throwing it all away and starting again.  However, as I 
explained, there are good reasons for the way things are done now, the main 
one being that the build system for packages is not written twice.

Cheers,
	Simon


More information about the Glasgow-haskell-users mailing list