On pre-commit, black, ruff, ruff-format and everything else $linter related

12 Sep 2024

      o/

Seeing as we've been doing quite a bit of experimentation on this topic in the
SDK team recently, and it's come up in random discussions a few times over
recent days, I thought it might be helpful to share a few tips and tricks on how
our linting stored has evolved across SDK deliverables. Hopefully it's helpful
to someone.

Historically, teams have provided a `pep8` tox target to run their linters.
Originally this would have run flake8 or the hacking wrapper, but over the years
these targets have expanded to cover other linters like bandit, doc8 and mypy,
as well as various flake8 plugins.

In recent times, we have been slowly introducing use of the pre-commit tool [1].
Initially this was purely additive, as it's a nice help for users working in a
terminal (like me!) and mostly duplicated what was defined in the `pep8` target.
This duplication led to issues where pre-commit was doing slightly different
things from the `pep8` target, so we have evolved this so that the `pep8` target
now simply calls pre-commit. This is pretty easy to do now since most if not all
of the linters in common use across OpenStack - flake8, hacking, doc8, bandit,
etc. - provides pre-commit hooks. I recommend anyone using pre-commit does the
same thing otherwise these thing can drift. For example, your pep8 target can
become:

   [testenv:pep8] 
   description = 
   Run style checks. 
   skip_install = true 
   deps = 
   pre-commit 
   commands = 
   pre-commit run --all-files --show-diff-on-failure

We've since evolved this further to start changing the linters we've used in
general. We first introduced black [2], the automatic code formatting tool. The
rationale behind this was to remove the need to even think about code style and
focus on the actual semantic change. We're already overburdened, and going back
and forth on style issues is a waste of everyone's time. Contributors
(particularly new ones) can simply install pre-commit or use the `pep8` tox
target and their code will be automatically restyled in a generally non-
offensive manner. Is it perfect? No, but it's close enough that we've decided we
don't care. When we have introduced black, we have typically kept the changes
introduced by black separate from the changes to e.g. the `.pre-commit-
config.yaml` or `tox.ini` file. This allows us to introduce a `.git-blame-
ignore-revs` file afterwards which some software forges (like GitHub, though
sadly not Gittea (yet!)) can use to ignore these bulk update changes when using
`git-blame`. You can also configure git locally to ignore this, and I've made an
effort to place comments in the files whenever I've added them.

In addition to black, we also added other linters like bandit [3] and pyupgrade
[4], including these in pre-commit so that they would run automatically on
commit and in CI. However, as anyone reading the orange site or surveying the
broader Python landscape might be aware, the ruff [5] linter and formatter has
been gaining steam and is pretty mature at this point. In recent weeks, we have
been migrating pretty much all our linters and formatters - black, pyupgrade,
bandit and (most of) flake8 - to ruff and ruff-format. We have kept hacking but
only enable the H (hacking) ruleset plus any custom rules for now, but at some
point we will explore getting these few hacking rules into ruff or writing them
off. In any case, this migration has resulted in a significantly trimmer pre-
commit-config.yaml file (and much faster runtime to boot). You can see the end
result on a repo like e.g. osc-lib [6].

All in all, I think I can recommend this approach for anyone interested in
pursuing it. Linters are far from the most important aspect of development, but
I've found these tools to yield a nice improvement in developer experience.

Top tips:

 * Have your 'pep8' target (and any other linter-related target run 'pre-commit'
   rather than the linters themselves), and don't forget to remove linters from
   test-requirements.txt once you do (with the exception of hacking if you have
   custom hacking rules)
 * Consider using ruff and ruff-format in place of all non-hacking
   linters/formatters: it's seriously fast and provides rules to cover most of
   these.
 * When applying ruff-format or the UP (pyupgrade) ruff rules, do it separately
   so you can add the commit ID to a .git-blame-ignore-revs file. You should be
   able to do this pre-merge since Gerrit/Zuul merge patches rather than
   rebasing them so the commit ID won't change.

Other points:

 * When we first introduced pre-commit, some people noted that installing from
   Git rather than PyPI introduced a possible attack vector since tags can be
   changed. Given we only use a small, curated set of linters from trusted
   sources, I don't know how much of an issue this is but I'll raise it here.
 * Difficulty backporting code has been highlighted as an blocker for running a
   formatter like black or ruff-format. We don't do many backports in SDK, OSC
   etc. so this hasn't been a huge issue. If it were though, we would simply
   apply the formatter to the stable branch first before backporting.

Hope this helps,
Stephen

[1] https://pre-commit.com/
[2] https://github.com/psf/black
[3] https://github.com/PyCQA/bandit
[4] https://github.com/asottile/pyupgrade
[5] https://github.com/astral-sh/ruff
[6] https://opendev.org/openstack/osc-lib/

Stephen Finucane

Clark Boylan

smooney＠redhat.com

smooney＠redhat.com

tags

participants (3)