[GHC DevOps Group] CircleCI job accounting question

Manuel M T Chakravarty manuel.chakravarty at tweag.io
Tue Dec 12 08:05:07 UTC 2017


> Simon Peyton Jones <simonpj at microsoft.com>:
> 
> The problem is that many contributors, including Simon PJ, Richard, and
> me, tend to push batches of work
>  
> I have not been following this thread (“job accounting” seemed above my pay grade) but I saw this mention of my name 😊.   Without having read myself into the context there seem to be two issues
>  
> Every commit to master should be validate-clean, and this should be tested by the CI framework not by the contributor. This is essential.   I would be delighted if every commit I made went through that gate. I’m careful, but occasionally not careful enough.
> 
> Most – perhaps all – commits should go through a code-review process.  Here I freely admit that I tend to use (or abuse?) my status to make most of my commits without review, except perhaps informally with individuals.  I’d be absolutely willing to review this if (a) in fact people think that the extra step would really improve quality (perhaps looking at past commits) or (b) the very fact that I do so makes people feel cross.   

I think, the important point, especially in the context we are discussing here is the first one. We should ensure that every single commit goes through CI before hitting master.

> In both cases, the big thing from my point of view is that, once I’m ready to press “go”, I’d like to take it off my to-do list by pushing into a queue that will result in either
> a commit to master, or
> an email to me saying “more work to do here”
>  
> I’m sure that many contributors would value this property.  The tricky bit is, I suppose, deciding when the reviewers are happy.  Maybe that need central human intervention; I’m not sure.

The idea with pull requests on GitHub is basically what you are describing. CI automatically validates the pull request and a reviewer can just push the merge button if CI is green and they are happy. If they are unhappy, the comment, which results in a notification to you.

I always thought this is what Differentials in Phabricator are for. 

Ben, is that not so?

> Also important: I often accumulate a sequence of related patches:
>  
> I’d like to be able to queue them as a group, or at least with very low additional overhead per patch
>  
> Sometimes I have attempted to separate two things (e.g. a bug fix and a refactoring).  In the past I have not felt it important to guarantee a validate-clean state between the two; e.g. I might have erred in separating the two.  The cost benefit ratio of teasing them apart has not seemed worth the pain.  But separating them at least means we have two commit messages. 
>  
> On this point I can see that I might have to change my behaviour, because the CI would be obliged to stop as soon as it found one that didn’t validate.  I might need help to figure out how best to fix up a patch sequence by moving little bits from one to another, but I’m sure it’s possible.

This is where things get a bit tricky.

A standard solution is to ”squash” commits; i.e., these separate patches are combined into one when they are finally applied to master. This means that the separate patches are never tested separately and that ”broken” intermediate state is never observed by CI or bisection. (The patches are still separate during code review.)

Manuel

> I’m very keen to support a better CI process in any way I can.
>  
> Simon
>  
> From: Ghc-devops-group [mailto:ghc-devops-group-bounces at haskell.org] On Behalf Of Manuel M T Chakravarty
> Sent: 11 December 2017 06:17
> To: Ben Gamari <ben at well-typed.com>
> Cc: ghc-devops-group at haskell.org; Mateusz Kowalczyk <fuuzetsu at fuuzetsu.co.uk>
> Subject: Re: [GHC DevOps Group] CircleCI job accounting question
>  
> 23.11.2017 06:15 schrieb Ben Gamari <ben at well-typed.com <mailto:ben at well-typed.com>>:
> Simon Marlow <marlowsd at gmail.com <mailto:marlowsd at gmail.com>> writes:
> 
> 
> On 22 November 2017 at 15:31, Ben Gamari <ben at well-typed.com <mailto:ben at well-typed.com>> wrote:
> 
> 
> Manuel M T Chakravarty <manuel.chakravarty at tweag.io <mailto:manuel.chakravarty at tweag.io>> writes:
> 
> Why do we need the intermediate builds exactly? Wouldn’t they usually
> fail? (When I do PRs with multiple commits, the state of the tree
> between this commits will usually not be well-defined.)
> 
> No, every commit should build. This is in part a difference between
> Phabricator's patch-based model and GitHub's feature branch model.
> However, many projects using the latter also demand that all
> intermediate commits must be atomic, buildable changes. Sacrificing this
> property greatly complicates bisection.
> 
> Building all intermediates is desireable as ultimately we would like to
> preserve per-commit build artifacts for the last few months of commits
> to enable easy bisection.
> 
> 
> I don't quite understand this. Yes building all commits is desirable, but
> in the case of Phabricator each revision is going to be a single commit,
> no? So why is this an issue? Or is it an issue only for github PRs?
> 
> The problem is that many contributors, including Simon PJ, Richard, and
> me, tend to push batches of work. For instance, when I land
> contributors' differentials I first apply a batch, then validate
> locally, and then push as a chunk. We can change this if necessary, but
> it will need to be via social convention which hasn't worked very well
> historically.
>  
> I would like to understand this issue (which prompts #14505) a bit better. Do we agree that, in an ideal world, nobody should ever push to master, and instead, all contributions start as PRs or differentials, which are merged after being reviewed, which crucial includes building them in CI? If that were the case, then we wouldn’t have the issue with needing to force CircleCI to build non-head commits, right?
>  
> If we agree that this is the ideal scenario, then what is preventing us from getting to that scenario? In particular, I don’t think you should need to validate contributor code manually on your boxes. The purpose of an automatic CI pipeline is exactly to avoid such manual steps.
>  
> I am sorry if any of these questions appear naive.
>  
> Cheers,
> Manuel

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.haskell.org/pipermail/ghc-devops-group/attachments/20171212/20924db8/attachment-0001.html>


More information about the Ghc-devops-group mailing list