Add a method to `Checker` for cached parsing of stringified type annotations #13158

AlexWaygood · 2024-08-30T10:20:49Z

Summary

This PR adds a method to ruff_linter::checkers::ast::Checker for cached parsing of stringified type annotations. It was decided in review of #12951 that this would be desirable, since ruff_python_parser::typing::parse_type_annotation can be quite expensive. However, since we already have a number of linter rules that call ruff_python_parser::typing::parse_type_annotation, it seemed to me like this would make sense as a standalone PR. (Adding caching was also more complicated than I expected, so separating this into its own PR should make life easier for reviewers.) If this is accepted, I'll rebase #12951 on top of it.

The PR should be easiest to review commit-by-commit. Each commit on its own passes the entire test suite. The first two commits do some refactoring to lay the groundwork for adding the new method. The third commit adds the new method and makes use of it in crates/ruff_linter/src/checkers/ast/mod.rs; the final commit makes use of the new method in various linter rules that currently use ruff_python_parser::typing::parse_type_annotation.

Test Plan

cargo test

AlexWaygood · 2024-08-30T10:31:48Z

This doesn't seem to have any impact on the Codspeed benchmarks one way or another as a standalone PR (but I don't know how many stringified annotations we have in those benchmarks?)

github-actions · 2024-08-30T10:39:41Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

crates/ruff_linter/src/checkers/ast/mod.rs

MichaReiser · 2024-09-02T10:23:40Z

This doesn't seem to have any impact on the Codspeed benchmarks one way or another as a standalone PR (but I don't know how many stringified annotations we have in those benchmarks?)

I don't think we have any benchmark containing stringified annotations. You could try to run some hyperfine benchmarks over a project that does use stringified annotations.

MichaReiser

Nice. Looks good to me.

The only part that's unclear to me is why we need to reset the cache. It would be nice if it could be avoided. If not, then it would help to extend the comment so that it explains what data gets invalidated.

crates/ruff_linter/src/checkers/ast/mod.rs

Daverball · 2024-09-02T11:05:11Z

This is more of a general comment, since I thought about adding something similar for my ongoing work on flake8-type-checking.

I am not sure to what degree this would be viable, but why not just eagerly parse string annotations, store them all on the checker as a struct that contains both the string and the parsed expression add that struct to the deferred string annotations vector instead of just the string and then visit the expression where we currently parse the annotation in the checker. That should preserve the same semantics for bindings and references.

I realize there are some corner cases with nested strings like 'list["str"]' where parsing either would still need to be deferred or the nested case would need to be handled eagerly while parsing the annotation, adding an arbitrary number of string / expression pairs, rather than only a single one. But it still seems viable to me, unless I'm forgetting an important detail.

MichaReiser · 2024-09-02T11:15:09Z

I thought about that too because we always end up parsing all type annotations.

My conclusion was that doing it inside of Checker would require one additional AST pass, which is unfortunate. It would be nice if we could do this directly in the parser but that's probably more involved.

…t from `visit_deferred_string_type_definitions`

AlexWaygood added the performance Potential performance improvement label Aug 30, 2024

AlexWaygood requested a review from charliermarsh August 30, 2024 10:20

AlexWaygood requested review from MichaReiser and dhruvmanila as code owners August 30, 2024 10:20

AlexWaygood mentioned this pull request Aug 30, 2024

[flake8-pyi] Teach various rules that annotations might be stringized #12951

Merged

charliermarsh reviewed Aug 30, 2024

View reviewed changes

crates/ruff_linter/src/checkers/ast/mod.rs Outdated Show resolved Hide resolved

AlexWaygood force-pushed the alex/cached-annotation-parsing branch from 8ff61ad to 3a3b810 Compare September 1, 2024 10:18

AlexWaygood requested a review from charliermarsh September 1, 2024 10:21

MichaReiser approved these changes Sep 2, 2024

View reviewed changes

AlexWaygood added 6 commits September 2, 2024 13:39

Refactor parse_type_annotation to improve encapsulation

2574c6d

Store a reference to the allocator on Checker instances

79d1148

Add a method for cached parsing of string type annotations, and use i…

49c375a

…t from `visit_deferred_string_type_definitions`

Use the caching method to parse type annotations in various linter rules

0f5d3c0

Trivial review comments

53eca4b

Only store successfully parsed annotations in the arena

6fa7a5b

AlexWaygood force-pushed the alex/cached-annotation-parsing branch from c4d222e to 6fa7a5b Compare September 2, 2024 12:39

AlexWaygood enabled auto-merge (squash) September 2, 2024 12:40

AlexWaygood merged commit b7c7b4b into main Sep 2, 2024
16 checks passed

AlexWaygood deleted the alex/cached-annotation-parsing branch September 2, 2024 12:44

dhruvmanila added the internal An internal refactor or improvement label Sep 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a method to `Checker` for cached parsing of stringified type annotations #13158

Add a method to `Checker` for cached parsing of stringified type annotations #13158

AlexWaygood commented Aug 30, 2024

AlexWaygood commented Aug 30, 2024 •

edited

Loading

github-actions bot commented Aug 30, 2024 •

edited

Loading

MichaReiser commented Sep 2, 2024

MichaReiser left a comment

Daverball commented Sep 2, 2024

MichaReiser commented Sep 2, 2024

Add a method to Checker for cached parsing of stringified type annotations #13158

Add a method to Checker for cached parsing of stringified type annotations #13158

Conversation

AlexWaygood commented Aug 30, 2024

Summary

Test Plan

AlexWaygood commented Aug 30, 2024 • edited Loading

github-actions bot commented Aug 30, 2024 • edited Loading

ruff-ecosystem results

Linter (stable)

Linter (preview)

Formatter (stable)

Formatter (preview)

MichaReiser commented Sep 2, 2024

MichaReiser left a comment

Choose a reason for hiding this comment

Daverball commented Sep 2, 2024

MichaReiser commented Sep 2, 2024

Add a method to `Checker` for cached parsing of stringified type annotations #13158

Add a method to `Checker` for cached parsing of stringified type annotations #13158

AlexWaygood commented Aug 30, 2024 •

edited

Loading

github-actions bot commented Aug 30, 2024 •

edited

Loading

`ruff-ecosystem` results