impl extract function, part of #746 #1786

asukaminato0721 · 2025-12-07T13:06:03Z

Summary

part of #746

Introduced extract_function_code_actions plus LocalRefactorCodeAction plumbing so the state layer can produce workspace edits for a selected block; it now dedents the selection, synthesizes helper definitions/calls, infers parameters/returns (including augmented assignments), and rejects unsupported selections (returns/breaks/etc.).

Added a visitor-based identifier collector that walks statements/expressions to classify loads, stores, post-selection reads, and synthetic aug-assign loads, which drives the parameter/return heuristics.

Updated the non-wasm LSP server to advertise REFACTOR_EXTRACT, convert LocalRefactorCodeAction edits into URIs, and merge them alongside existing quick fixes.

Test Plan

Added an integration test that selects a block via markers, requests refactor actions, applies the returned edits, and asserts the helper/function call match the expected structure.

Copilot

Pull request overview

This PR implements the "extract function" refactoring feature as part of issue #746. It adds the ability to select a block of Python code and extract it into a helper function, automatically inferring parameters and return values. The implementation includes a new LocalRefactorCodeAction type, visitor-based identifier collection, and integration with the LSP server to advertise the REFACTOR_EXTRACT capability.

Key Changes:

Introduced extract_function_code_actions that analyzes selected code blocks, identifies parameters (from loaded variables) and returns (from stored variables used later), and generates workspace edits to create a helper function and replace the selection with a function call
Added visitor-based identifier collection that distinguishes between loads, stores, and synthetic loads from augmented assignments
Integrated extract function actions into the LSP server alongside existing quickfix actions

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

File	Description
`pyrefly/lib/test/lsp/lsp_interaction/basic.rs`	Updated server capabilities test to include "refactor.extract" in advertised code action kinds
`pyrefly/lib/test/lsp/code_actions.rs`	Added test infrastructure (helper functions) and basic integration test for extract function with augmented assignment
`pyrefly/lib/state/lsp.rs`	Core implementation including `LocalRefactorCodeAction` struct, `extract_function_code_actions` method, and helper functions for identifier collection, dedenting, indenting, and name generation
`pyrefly/lib/lsp/non_wasm/server.rs`	Integrated extract function actions into the code action handler and registered `REFACTOR_EXTRACT` capability

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

pyrefly/lib/state/lsp.rs

Copilot · 2025-12-07T14:03:28Z

pyrefly/lib/test/lsp/code_actions.rs

+#[test]
+fn extract_function_basic_refactor() {
+    let code = r#"
+def process_data(data_list):
+    total_sum = 0
+    for item in data_list:
+        # EXTRACT-START
+        squared_value = item * item
+        if squared_value > 100:
+            print(f"Large value detected: {squared_value}")
+        total_sum += squared_value
+        # EXTRACT-END
+    return total_sum
+
+
+if __name__ == "__main__":
+    data = [1, 5, 12, 8, 15]
+    result = process_data(data)
+    print(f"The final sum is: {result}")
+"#;
+    let (handles, state) =
+        mk_multi_file_state_assert_no_errors(&[("main", code)], Require::Everything);
+    let handle = handles.get("main").unwrap();
+    let transaction = state.transaction();
+    let module_info = transaction.get_module_info(handle).unwrap();
+    let selection = find_marked_range(module_info.contents());
+    let actions = transaction
+        .extract_function_code_actions(handle, selection)
+        .unwrap_or_default();
+    assert!(!actions.is_empty(), "expected extract refactor action");
+    let updated = apply_refactor_edits_for_module(&module_info, &actions[0].edits);
+    let expected = r#"
+def extracted_function(item, total_sum):
+    squared_value = item * item
+    if squared_value > 100:
+        print(f"Large value detected: {squared_value}")
+    total_sum += squared_value
+    return total_sum
+
+def process_data(data_list):
+    total_sum = 0
+    for item in data_list:
+        # EXTRACT-START
+        total_sum = extracted_function(item, total_sum)
+        # EXTRACT-END
+    return total_sum
+
+
+if __name__ == "__main__":
+    data = [1, 5, 12, 8, 15]
+    result = process_data(data)
+    print(f"The final sum is: {result}")
+"#;
+    assert_eq!(expected.trim(), updated.trim());
+}


Test coverage for the extract function feature is limited. Only one basic scenario is tested. Consider adding tests for:

Selections with no parameters or return values

Multiple return values (tuple unpacking)

Edge cases: empty selections, whitespace-only selections

Rejection cases: selections containing return/break/continue/raise/function/class definitions

Various indentation scenarios

Synthetic loads from augmented assignments without regular loads

Name collision scenarios for the generated function name

agreed. please add more tests, even if they are disabled

some other ones I can think of:

function is within a class (but no classes in range), does the indent work?

pyrefly/lib/state/lsp.rs

asukaminato0721 · 2025-12-07T14:14:48Z

this aim to be a basic impl, as copilot said, too many edge cases...

kinto0

awesome work! this is exciting. a few comments:

lsp.rs is much too big. can you put all the new additions in it's own file? maybe in a lsp/quick_fixes folder? (no need to move the other stuff, we can do that later)
a few comments about reuse and tests. I don't mind if the feature isn't implemented, I just don't want it crashing because of a crazy situation

pyrefly/lib/state/lsp.rs

pyrefly/lib/test/lsp/code_actions.rs

kinto0 · 2025-12-09T00:39:34Z

pyrefly/lib/test/lsp/code_actions.rs

+#[test]
+fn extract_function_basic_refactor() {
+    let code = r#"
+def process_data(data_list):
+    total_sum = 0
+    for item in data_list:
+        # EXTRACT-START
+        squared_value = item * item
+        if squared_value > 100:
+            print(f"Large value detected: {squared_value}")
+        total_sum += squared_value
+        # EXTRACT-END
+    return total_sum
+
+
+if __name__ == "__main__":
+    data = [1, 5, 12, 8, 15]
+    result = process_data(data)
+    print(f"The final sum is: {result}")
+"#;
+    let (handles, state) =
+        mk_multi_file_state_assert_no_errors(&[("main", code)], Require::Everything);
+    let handle = handles.get("main").unwrap();
+    let transaction = state.transaction();
+    let module_info = transaction.get_module_info(handle).unwrap();
+    let selection = find_marked_range(module_info.contents());
+    let actions = transaction
+        .extract_function_code_actions(handle, selection)
+        .unwrap_or_default();
+    assert!(!actions.is_empty(), "expected extract refactor action");
+    let updated = apply_refactor_edits_for_module(&module_info, &actions[0].edits);
+    let expected = r#"
+def extracted_function(item, total_sum):
+    squared_value = item * item
+    if squared_value > 100:
+        print(f"Large value detected: {squared_value}")
+    total_sum += squared_value
+    return total_sum
+
+def process_data(data_list):
+    total_sum = 0
+    for item in data_list:
+        # EXTRACT-START
+        total_sum = extracted_function(item, total_sum)
+        # EXTRACT-END
+    return total_sum
+
+
+if __name__ == "__main__":
+    data = [1, 5, 12, 8, 15]
+    result = process_data(data)
+    print(f"The final sum is: {result}")
+"#;
+    assert_eq!(expected.trim(), updated.trim());
+}


agreed. please add more tests, even if they are disabled

kinto0 · 2025-12-09T00:40:35Z

pyrefly/lib/state/lsp.rs

+    found
+}
+
+fn find_enclosing_module_statement_range(


@jvansch1 i think you needed something similar for call hierarchy. any way to reuse it?

pyrefly/lib/state/lsp.rs

kinto0 · 2025-12-09T00:47:56Z

pyrefly/lib/test/lsp/code_actions.rs

+#[test]
+fn extract_function_basic_refactor() {
+    let code = r#"
+def process_data(data_list):
+    total_sum = 0
+    for item in data_list:
+        # EXTRACT-START
+        squared_value = item * item
+        if squared_value > 100:
+            print(f"Large value detected: {squared_value}")
+        total_sum += squared_value
+        # EXTRACT-END
+    return total_sum
+
+
+if __name__ == "__main__":
+    data = [1, 5, 12, 8, 15]
+    result = process_data(data)
+    print(f"The final sum is: {result}")
+"#;
+    let (handles, state) =
+        mk_multi_file_state_assert_no_errors(&[("main", code)], Require::Everything);
+    let handle = handles.get("main").unwrap();
+    let transaction = state.transaction();
+    let module_info = transaction.get_module_info(handle).unwrap();
+    let selection = find_marked_range(module_info.contents());
+    let actions = transaction
+        .extract_function_code_actions(handle, selection)
+        .unwrap_or_default();
+    assert!(!actions.is_empty(), "expected extract refactor action");
+    let updated = apply_refactor_edits_for_module(&module_info, &actions[0].edits);
+    let expected = r#"
+def extracted_function(item, total_sum):
+    squared_value = item * item
+    if squared_value > 100:
+        print(f"Large value detected: {squared_value}")
+    total_sum += squared_value
+    return total_sum
+
+def process_data(data_list):
+    total_sum = 0
+    for item in data_list:
+        # EXTRACT-START
+        total_sum = extracted_function(item, total_sum)
+        # EXTRACT-END
+    return total_sum
+
+
+if __name__ == "__main__":
+    data = [1, 5, 12, 8, 15]
+    result = process_data(data)
+    print(f"The final sum is: {result}")
+"#;
+    assert_eq!(expected.trim(), updated.trim());
+}


some other ones I can think of:

function is within a class (but no classes in range), does the indent work?

test

meta-codesync · 2025-12-10T00:50:43Z

@kinto0 has imported this pull request. If you are a Meta employee, you can view this in D88799566.

meta-cla bot added the cla signed label Dec 7, 2025

asukaminato0721 force-pushed the 746 branch 2 times, most recently from 5e5ecd1 to 41161a7 Compare December 7, 2025 13:55

asukaminato0721 marked this pull request as ready for review December 7, 2025 13:58

Copilot AI review requested due to automatic review settings December 7, 2025 13:58

Copilot started reviewing on behalf of asukaminato0721 December 7, 2025 13:58 View session

Copilot AI reviewed Dec 7, 2025

View reviewed changes

kinto0 self-assigned this Dec 9, 2025

kinto0 requested changes Dec 9, 2025

View reviewed changes

impl

1d5f4ab

test

asukaminato0721 force-pushed the 746 branch from 41161a7 to 1d5f4ab Compare December 9, 2025 07:52

asukaminato0721 added 2 commits December 9, 2025 17:11

update by comment

44f108a

clippy

99c9a6f

impl extract function, part of #746 #1786

Are you sure you want to change the base?

impl extract function, part of #746 #1786

Uh oh!

Conversation

asukaminato0721 commented Dec 7, 2025

Summary

Test Plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Dec 7, 2025

Choose a reason for hiding this comment

Uh oh!

kinto0 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

kinto0 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

asukaminato0721 commented Dec 7, 2025

Uh oh!

kinto0 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kinto0 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

kinto0 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

kinto0 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

meta-codesync bot commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants