[openstack-dev] [git-review] Supporting development in local branches

Yuriy Taraday yorik.sar at gmail.com
Wed Aug 6 19:14:09 UTC 2014


On Wed, Aug 6, 2014 at 7:23 PM, Ben Nemec <openstack at nemebean.com> wrote:

> On 08/06/2014 12:41 AM, Yuriy Taraday wrote:
> > On Wed, Aug 6, 2014 at 1:17 AM, Ben Nemec <openstack at nemebean.com>
> wrote:
> >
> >> On 08/05/2014 03:14 PM, Yuriy Taraday wrote:
> >>> On Tue, Aug 5, 2014 at 10:48 PM, Ben Nemec <openstack at nemebean.com>
> >> wrote:
> >>>
> >>>> On 08/05/2014 10:51 AM, ZZelle wrote:
> >>>>> Hi,
> >>>>>
> >>>>>
> >>>>> I like the idea  ... with complex change, it could useful for the
> >>>>> understanding to split it into smaller changes during development.
> >>>>
> >>>> I don't understand this.  If it's a complex change that you need
> >>>> multiple commits to keep track of locally, why wouldn't reviewers want
> >>>> the same thing?  Squashing a bunch of commits together solely so you
> >>>> have one review for Gerrit isn't a good thing.  Is it just the warning
> >>>> message that git-review prints when you try to push multiple commits
> >>>> that is the problem here?
> >>>
> >>>
> >>> When you're developing some big change you'll end up with trying dozens
> >> of
> >>> different approaches and make thousands of mistakes. For reviewers this
> >> is
> >>> just unnecessary noise (commit title "Scratch my last CR, that was
> >>> bullshit") while for you it's a precious history that can provide basis
> >> for
> >>> future research or bug-hunting.
> >>
> >> So basically keeping a record of how not to do it?
> >
> >
> > Well, yes, you can call version control system "a history of failures".
> > Because if there were no failures there would've been one omnipotent
> commit
> > that does everything you want it to.
>
> Ideally, no.  In a perfect world every commit would work, so the version
> history would be a number of small changes that add up to this great
> application.  In reality it's a combination of new features, oopses, and
> fixes for those oopses.  I certainly wouldn't describe it as a history
> of failures though.  I would hope the majority of commits to our
> projects are _not_ failures. :-)
>

Well, new features are merged just to be later fixed and refactored - how
that's not a failure? And we basically do "keep a record of how not to do
it" in our repositories. Why prevent developers do the same on the smaller
scale?

>>  I get that, but I
> >> think I'm more onboard with the suggestion of sticking those dead end
> >> changes into a separate branch.  There's no particular reason to keep
> >> them on your working branch anyway since they'll never merge to master.
> >>
> >
> > The commits themselves are never going to merge to master but that's not
> > the only meaning of their life. With current tooling "working branch"
> ends
> > up a patch series that is constantly rewritten with no proper history of
> > when did that happen and why. As I said, you can't find roots of bugs in
> > your code, you can't dig into old versions of your code (what if you
> need a
> > method that you've already created but removed because of some wrong
> > suggestion?).
>
> You're not going to find the root of a bug in your code by looking at an
> old commit that was replaced by some other implementation.  If anything,
> I see that as more confusing.  And if you want to keep old versions of
> your code, either push it to Gerrit or create a new branch before
> changing it further.
>

So you propose two options:
- store history of your work within Gerrit's patchsets for each change
request, which don't fit "commit often" approach (who'd want to see how I
struggle with fixing some bug or write working test?);
- store history of your work in new branches instead of commits in the same
branch, which... is not how Git is supposed to be used.
And both this options don't provide any proper way of searching through
this history.

Have you ever used bisect? Sometimes I find myself wanting to use it
instead of manually digging through patchsets in Gerrit to find out which
change I made broke some usecase I didn't put in unittests yet.

>  They're basically unnecessary conflicts waiting to happen.
> >>
> >
> > No. They are your local history. They don't need to be rebased on top of
> > master - you can just merge master into your branch and resolve conflicts
> > once. After that your "autosquashed" commit will merge clearly back to
> > master.
>
> Then don't rebase them.  git checkout -b dead-end and move on. :-)
>

I never proposed to rebase anything. I want to use merge instead of rebase.

 >>> Merges are one of the strong sides of Git itself (and keeping them very
> >>> easy is one of the founding principles behind it). With current
> workflow
> >> we
> >>> don't use them at all. master went too far forward? You have to do
> rebase
> >>> and screw all your local history and most likely squash everything
> anyway
> >>> because you don't want to fix commits with known bugs in them. With
> >>> proposed feature you can just do merge once and let 'git review' add
> some
> >>> magic without ever hurting your code.
> >>
> >> How do rebases screw up your local history?  All your commits are still
> >> there after a rebase, they just have a different parent.  I also don't
> >> see how rebases are all that much worse than merges.  If there are no
> >> conflicts, rebases are trivial.  If there are conflicts, you'd have to
> >> resolve them either way.
> >>
> >
> > Merge is a new commit, new recorded point in history. Rebase is rewriting
> > your commit, replacing it with a new one, without any record in history
> (of
> > course there will be a record in reflog but there's not much tooling to
> > work with it). Yes, you just apply your patch to a different version of
> > master branch. And then fix some conflicts. And then fix some tests. And
> > then you end up with totally different commit.
>
> And with merge commits you end up with a tree that is meaningless except
> at the very tail end of the commit series, which I think is the root of
> your problems with rebasing.


With proposed workflow you don't need to rebase. That's the whole point -
let developer keep track of one's own progress, not rewrite history every 5
minutes.

I imagine it would be very painful to work
> in a way where the only commit that you can test against is the last one.
>

I don't understand how that's painful. You always test only against the
last commit in your branch. Or you checkout older commit and test that one.

> I totally agree that life's very easy when there's no conflicts and you've
> > written all your feature in one go. But that's almost never the true.
> >
> >
> >> I also reiterate my point about not keeping broken commits on your
> >> working branch.  You know at some point they're going to get
> >> accidentally submitted. :-)
> >>
> >
> > Well... As long as you use 'git review' to upload CRs, you're safe. If
> you
> > do 'git push gerrit HEAD:refs/for/master' you're screwed. But why would
> you
> > do that?
>
> Or if you forget this new proposed option to git-review. :-)
>

I don't propose it as an option. A concrete config parameter will turn on
new behavior for all your repos or for some of them.


> >> As far as letting git review do magic, how is that better than "git
> >> rebase once and no magic required"?  You deal with the conflicts and
> >> you're good to go.
> >
> >
> > In a number of manual tasks it's the same. If your patch cannot be merged
> > into master, you merge master to your local branch and you're good to go.
> > But as I said, merge will be remembered, rebase won't. And after that
> > rebase/merge you might end up having your tests failing, and you'll have
> to
> > rewrite your commit again with --amend, with no record in history.
>
> Again, this is why the tests should pass against all of your commits.
> If that's the case, you can verify your changes as you rebase before you
> update the commit.
>

Ok, one more time. You don't need to do rebase. You merge master with one
local commit resolving dependencies in the process and then fix tests and
everything with the second one. It's really simple.


> >> And if someone asks you to split a commit, you can
> >> do it.  With this proposal you can't, because anything but squashing
> >> into one commit is going to be a nightmare (which might be my biggest
> >> argument against this).
> >>
> >
> > You can do it with the new approach as well. See at the end of the
> > proposal. You split your current branch into a number of branches and let
> > git-review detect who depends on who between them.
>
> !
>
> So instead of a chain of commits, you want to have a separate branch for
> every commit?


For every change request, yes. That's what we basically have in Gerrit
(although those commits are not structured as well).

How would you test that locally?


Easy - you merge locally changes from branch 'cr-1' to branch 'cr-2' and
voila - 'cr-2' contains all changes you've applied to 'cr-1' and want to
test.

Even if git-review had
> the smarts to resolve dependencies that doesn't help me while I'm
> actively working on a change.
>

It does help because you can independently work on each change request
until they are ready to be combined once again. And you don't need to
manually keep track of which CR's require the one you're working on.


> This sounds very much like a reimplementation of git rebase -i.  Instead
> of picking the commits you want to edit with that, you'd switch branches
> to edit them.  Dependency issues aside, how is that an improvement?
>

You keep track of all changes that happen to every change request locally.
That's the point of this proposal.


> >
> >> And speaking about breaking down of change requests don't forget support
> >>> for change requests chains that this feature would lead to. How to you
> >> deal
> >>> with 5 consecutive change request that are up on review for half a
> year?
> >>> The only way I could suggest to my colleague at a time was "Erm...
> Learn
> >>> Git and dance with rebases, detached heads and reflogs!" My proposal
> >> might
> >>> take care of that too.
> >>>
> >>
> >> How does this relate to commit series?  Squashing all the commits into
> >> one isn't a solution to any of the problems with those (if it were, we
> >> could do that today :-).
> >>
> >
> > At the end of the proposal I mention how. You keep a separate branch for
> > every commit in the series. You merge them into each other as needed, you
> > can fix them separately, you can change the dependency order. And every
> > single step will be recorded in your local repo.
> >
> >
> >> FWIW, I have had long-lived patch series, and I don't really see what is
> >> so difficult about running git rebase master.  Other than conflicts, of
> >> course, which are going to be an issue with any long-running change no
> >> matter how it's submitted.  There isn't a ton of git magic involved.
> >>
> >
> > Yeah. As long as all you need is to keep them aligned with master, you
> can
> > just rebase. But what if you have 5 commits in the series and you need to
> > fix commit #2 that's heavily used in commit #4? You can either constantly
> > do interactive rebase (for each small change, actually) or work on top of
> > commit #4, then stash your changes and amend commit #2 with them. And you
> > can't "commit early, commit often" during this process because a "commit"
> > is a long task. You can only hope you won't lose one part of it while
> > fixing the other one.
>
> This doesn't fix that though.  If commit 4 depends on commit 2 and you
> have to make changes to commit 2, you still have to switch back to
> commit 4 to verify that your changes in 2 didn't break anything.  This
> is why having commits be self-contained is important - if commit 2 has
> good unit tests to verify the changes, you can hopefully be confident
> that your updates to it won't break commit 4.
>

But when unittests are not good (enough) you'll break your commit 4 and
you'll have to go back and fix that again. And some time later you'll find
out that some other issue appeared during this process and you'll want to
find out which change to commit #2 did that, and you run bisect... Only if
you didn't use rebase.

All this would change is the command you run to switch between commits.
>  Since git already provides this functionality, I don't think we should
> reinvent it.
>
> >
> >
> >> So as you may have guessed by now, I'm opposed to adding this to
> >> git-review.  I think it's going to encourage bad committer behavior
> >> (monolithic commits) and doesn't address a use case I find compelling
> >> enough to offset that concern.
> >>
> >
> > And I hope to show you that even if you wouldn't use this feature, others
> > might find it extremely useful.
> > It won't encourage having single monolithic change request. It just
> allows
> > you to keep track of its development locally. Nothing changes from
> > reviewer's point of view.
> >
>
> As I noted in the other e-mail, my concern isn't even so much whether
> this feature makes sense (although I think it's a sign of a flawed git
> workflow if you feel the need to use it, but I have enough flawed
> workflows that I can't really throw stones on that ;-), it's whether
> this feature is going to be abused by people who don't understand the
> intent.  I think it will, and because of that I think the fix here is to
> better explain the intended workflow for Git/git-review/Gerrit.
> Reimplementing stuff that Git already does in a slightly different way
> is only going to cause more confusion.
>

I'm not suggesting to reinvent Git features. I suggest to use them. I
suggest to embrace ability to keep track of everything, not only lousy
history of your uploads.

-- 

Kind regards, Yuriy.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-dev/attachments/20140806/5ed78f68/attachment.html>


More information about the OpenStack-dev mailing list