Split autoformatters and linters into different workflows and CI jobs #5167

pmeier · 2022-01-06T09:22:18Z

Status quo

Currently we mix formatters and linters with pre-commit:

formatters

Lines 9 to 11 in 578c154

    
           - id: mixed-line-ending 
        
             args: [--fix=lf] 
        
           - id: end-of-file-fixer

vision/.pre-commit-config.yaml

Lines 23 to 26 in 578c154

    
           - id: ufmt 
        
             additional_dependencies: 
        
               - black == 21.9b0 
        
               - usort == 0.6.4

linters

vision/.pre-commit-config.yaml

Lines 5 to 8 in 578c154

    
           - id: check-docstring-first 
        
           - id: check-toml 
        
           - id: check-yaml 
        
             exclude: packaging/.*

vision/.pre-commit-config.yaml

Lines 31 to 32 in 578c154

    
           - id: flake8 
        
             args: [--config=setup.cfg]

vision/.pre-commit-config.yaml

Line 37 in 578c154

- id: pydocstyle

In addition we have a separate CI job that lints with mypy

vision/.circleci/config.yml

Line 298 in 578c154

type_check_python:

Proposal

I propose we change the rationale from the two CI jobs from pre-commit and mypy to code format and lint. That means, we would only keep autoformatters in our pre-commit configuration and move all linters to what is currently the mypy job.

Pros

We could use pre-commit as "single source of truth" for formatting code. Currently they mentioned as "purely optional" in our contribution guide. Since pre-commit supports running the hooks manually, we can simply treat it as our way to bundle all autoformatters while the user does not need to know or care what exactly is run.
We could simply add new autoformatters if they are available through pre-commit, e.g. add prettier as non-code auto formatter #5158. If anyone applies code formatting through pre-commit anyway, that won't break any workflow.

Cons

We would loose the ability to run linters in a bundled manner. I don't think this is a strong con, since linters such as flake8 or pydocstyle trigger very seldom anyway due to the auto-formatting. Plus, we already need to run mypy separately.

cc @seemethere

The text was updated successfully, but these errors were encountered:

datumbox · 2022-01-07T09:09:44Z

Does it mean that the linters will run only via pre-commit instead of manually?

What I liked in the original proposal when you introduced formatting was the fact that you didn't force upon users a specific workflow aka using pre-commit. I personally prefer submitting these commands manually, after I'm done and committed my code changes and then review independently any automatic changes done by flake8, black etc.

pmeier · 2022-01-07T09:22:22Z

Does it mean that the linters will run only via pre-commit instead of manually?

Partially yes, but this is already true today. The following hooks cannot be run without pre-commit:

vision/.pre-commit-config.yaml

Lines 2 to 11 in 8c546f6

    
           - repo: https://github.com/pre-commit/pre-commit-hooks 
        
             rev: v4.0.1 
        
             hooks: 
        
               - id: check-docstring-first 
        
               - id: check-toml 
        
               - id: check-yaml 
        
                 exclude: packaging/.* 
        
               - id: mixed-line-ending 
        
                 args: [--fix=lf] 
        
               - id: end-of-file-fixer

For everything else, you can of course also run manually. It gets a little trickier if you have formatters that cannot be installed by pip. For example prettier in #5158 needs a JS environment. So you would need to set it up manually. pre-commit handles this automatically.

I personally prefer submitting these commands manually, after I'm done and committed my code changes and then review independently any automatic changes done by flake8, black etc.

flake8 is a linter and as proposed above it will be removed from our pre-commit config.

Regarding black I'm not sure what there is to review. It guarantees AST equality, so there should never be a situation where a manual review should be needed.

datumbox · 2022-01-07T09:27:24Z

flake8 often picks up import problems that black doesnt. I'm pretty use there are other things that are not covered by black, but don't remember specifics.

I'm skeptical about forcing upon developers pre-commit. I personally don't use it and I don't like the idea of having changes automatically done on my code silently without the ability to review them. We had issues in the past with tools that rewrote the python code that their outcome conflicted with JIT. If those rewrites are done on-commit, it's going to be a hell to figure out what happened and where.

pmeier · 2022-01-07T09:42:16Z

flake8 often picks up import problems that black doesnt.

~~That is not true~~ (EDIT: I missed the word "import" in @datumbox's statement. I agree, unused imports is one of the few things that ufmt will not handle). Especially in combination with usort, there are very few things that are left for it to pick up. Quoting from our contribution guidelines:

Similarly, you can check for flake8 errors with flake8 torchvision, although they should be fairly rare considering that most of the errors are automatically taken care of by ufmt already.

It basically comes down to stuff that cannot be fixed automatically, e.g. from foo import *.

I personally don't use it and I don't like the idea of having changes automatically done on my code silently without the ability to review them.

That can't happen. If you have the hooks installed, git commit something, and hook fails, it is as if the git commit has never happened. You will see all changes made by the hooks as unstaged changes that you need to manually stage again. Only if no hook fails, git commit is actually executed.

Besides, just using the pre-commit framework does not mean you have to use the hooks. You can simply do pre-commit run, which will execute all formatters on the staged files. If you don't pre-commit install, you will have no changes whatsoever to the git behavior. Thus, without installing the hooks, pre-commit basically acts as grouper for all formatters while also handling setting them up in their own environments.

NicolasHug · 2022-01-07T10:21:29Z

flake8 often picks up import problems that black doesnt.

That is not true.

From personal experience the one thing that black doesn't pick up is xyz imported but not used. This happens often when refactoring (or just writing) code.

I see value in allowing pre-commits, but I wouldn't make this mandatory either. It's just sugar on top of the more low-level tools, and I think it's important to keep them. To answer a question from @pmeier from another thread #5168 (comment)

Does a contributor need to know for example how to run all linters manually?

IMHO yes, if only for educational purpose. It's important that they know what's happening under the hood, for example to resolve errors more efficiently.

pmeier added needs discussion module: ci code quality labels Jan 6, 2022

pmeier mentioned this issue Jan 6, 2022

Streamline contributing experience #5168

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Split autoformatters and linters into different workflows and CI jobs #5167

Split autoformatters and linters into different workflows and CI jobs #5167

pmeier commented Jan 6, 2022 •

edited by pytorch-probot bot

Loading

datumbox commented Jan 7, 2022

Uh oh!

pmeier commented Jan 7, 2022

Uh oh!

datumbox commented Jan 7, 2022

Uh oh!

pmeier commented Jan 7, 2022 •

edited

Loading

Uh oh!

NicolasHug commented Jan 7, 2022

Uh oh!

Split autoformatters and linters into different workflows and CI jobs #5167

Split autoformatters and linters into different workflows and CI jobs #5167

Comments

pmeier commented Jan 6, 2022 • edited by pytorch-probot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Status quo

formatters

linters

Proposal

Pros

Cons

datumbox commented Jan 7, 2022

Uh oh!

pmeier commented Jan 7, 2022

Uh oh!

datumbox commented Jan 7, 2022

Uh oh!

pmeier commented Jan 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NicolasHug commented Jan 7, 2022

Uh oh!

pmeier commented Jan 6, 2022 •

edited by pytorch-probot bot

Loading

pmeier commented Jan 7, 2022 •

edited

Loading