Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix surrounding pair performance #2700

Merged
merged 34 commits into from
Jan 7, 2025
Merged

Fix surrounding pair performance #2700

merged 34 commits into from
Jan 7, 2025

Conversation

AndreasArvidsson
Copy link
Member

@AndreasArvidsson AndreasArvidsson commented Dec 22, 2024

With a 10k line json. "take pair" went from about 12 seconds to 200 ms on my computer. Implements a cache and also eliminates nested loops and replaces them with a lookup tree.

Checklist

  • I have added tests
  • [/] I have updated the docs and cheatsheet
  • [/] I have not broken the cheatsheet

@AndreasArvidsson AndreasArvidsson requested a review from a team as a code owner December 22, 2024 08:39
@AndreasArvidsson AndreasArvidsson mentioned this pull request Dec 23, 2024
1 task
Copy link
Member

@pokey pokey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good stuff; see comments

@@ -32,6 +32,11 @@ export interface LanguageDefinitions {
*/
get(languageId: string): LanguageDefinition | undefined;

/**
* Clear the cache of all language definitions. This is run at the start of a command.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

worth saying why this is necessary. I'm also concerned that if you need to do this, then the cache is breaking things that don't happen as a result of a command, eg does hot reloading still work with the scope visualizer?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment added. The scope visualizes the works fine.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess we blow away the language def every time it hot reloads so that's why it works?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Probably true

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few unit tests would be nice here. It's complex enough, and would function as a form of documentation as well

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a few tests

@pokey
Copy link
Member

pokey commented Jan 7, 2025

Feel free to merge when you're happy

@AndreasArvidsson AndreasArvidsson added this pull request to the merge queue Jan 7, 2025
Merged via the queue into main with commit c5824a1 Jan 7, 2025
15 checks passed
@AndreasArvidsson AndreasArvidsson deleted the srPerformance branch January 7, 2025 15:51
github-merge-queue bot pushed a commit that referenced this pull request Jan 7, 2025
The tests should pass once
#2700 is merged

This is the result from running it locally on my desktop.
```
  Performance: 10202 lines JSON
    Remove token
      24 ms
    √ Remove token (108ms)
    Select character
      14 ms
    √ Select character (122ms)
    Select word
      10 ms
    √ Select word (113ms)
    Select token
      8 ms
    √ Select token (110ms)
    Select identifier
      7 ms
    √ Select identifier (115ms)
    Select line
      40 ms
    √ Select line (141ms)
    Select sentence
      25 ms
    √ Select sentence (126ms)
    Select paragraph
      5 ms
    √ Select paragraph (104ms)
    Select document
      8 ms
    √ Select document (115ms)
    Select nonWhitespaceSequence
      8 ms
    √ Select nonWhitespaceSequence (96ms)
    Select string
      247 ms
    √ Select string (353ms)
    Select map
      167 ms
    √ Select map (248ms)
    Select collectionKey
      135 ms
    √ Select collectionKey (223ms)
    Select value
      142 ms
    √ Select value (222ms)
    Select boundedParagraph
      11822 ms
    √ Select boundedParagraph (11906ms)
    Select boundedNonWhitespaceSequence
      13749 ms
    √ Select boundedNonWhitespaceSequence (13837ms)
    Select surroundingPair.any
      20893 ms
    √ Select surroundingPair.any (21039ms)
    Select surroundingPair.curlyBrackets
      365 ms
    √ Select surroundingPair.curlyBrackets (483ms)
  18 passing (50s)
```

## Checklist

- [x] I have added
[tests](https://www.cursorless.org/docs/contributing/test-case-recorder/)
- [-] I have updated the
[docs](https://github.com/cursorless-dev/cursorless/tree/main/docs) and
[cheatsheet](https://github.com/cursorless-dev/cursorless/tree/main/cursorless-talon/src/cheatsheet)
- [-] I have not broken the cheatsheet
github-merge-queue bot pushed a commit that referenced this pull request Jan 9, 2025
Uses the cache added in #2700 to also optimize all tree sitter queries,
not just surrounding pairs.

## Checklist

- [/] I have added
[tests](https://www.cursorless.org/docs/contributing/test-case-recorder/)
- [/] I have updated the
[docs](https://github.com/cursorless-dev/cursorless/tree/main/docs) and
[cheatsheet](https://github.com/cursorless-dev/cursorless/tree/main/cursorless-talon/src/cheatsheet)
- [/] I have not broken the cheatsheet

---------

Co-authored-by: pre-commit-ci-lite[bot] <117423508+pre-commit-ci-lite[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants