parsing FUNCTION block by dcoutinho1328 · Pull Request #15 · Autonomy-Logic/xml2st

dcoutinho1328 · 2025-09-22T21:00:09Z

Summary by CodeRabbit

New Features
- Added support for parsing FUNCTION blocks, exposing name, type, and return type; treated like other top-level blocks.
Improvements
- Redesigned type classification to better distinguish simple vs. complex types, improving handling and richer retention of complex/struct blocks during parsing and rewrite.
- Declaration spreading and block filtering logic improved for clearer output.
Style
- Formatting and whitespace cleanups in glue-generation and XML-to-ST tooling with no behavioral impact.

coderabbitai · 2025-09-22T21:00:18Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

Reworks ComplexParser’s internal type classification (introduces simple_types/simple_types_names and complex_types) and changes how blocks are analyzed and rewritten (iterative multi-phase classification, embedding of complex block lines before function rewrite). Adds a new FUNCTION block type in STParser. GlueGenerator and xml2st receive formatting-only edits.

Changes

Cohort / File(s)	Summary
ComplexParser refactor `ComplexParser.py`	Replaced array_dependant tracking with `simple_types`, `simple_types_names`, and `complex_types`; reworked initialization/clearing; changed gating in `__getBlockLines` to use `complex_types`; rewrote `__analyseTypes` to iterative multi-phase classification; updated `__getSTLines`, `__getCustomType`, and `__spreadDeclarations` to use new type collections and to optionally embed block lines for complex StructInstances before rewriting as function-blocks. Internal state shape changed but public APIs unchanged.
New FUNCTION block in STParser `STParser.py`	Added `_Function(_NamedBlock)` to parse `FUNCTION <name> : <return_type>`, exposed as `FUNCTION`, and registered it in `ALL_BLOCKS` and `CLOSABLE_BLOCKS`; minor formatting tweak to `BASE_TYPES`.
Formatting-only updates `GlueGenerator.py`, `xml2st.py`	Cosmetic reformatting (multi-line loader init, quoting/whitespace, dict literal layout, argparse wrapping); no behavioral or API changes.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant CP as ComplexParser
  participant AL as __analyseTypes
  participant GL as __getSTLines
  participant BL as __getBlockLines
  participant B as StructInstance Block

  Note over CP: Type classification + ST line extraction
  CP->>AL: Run multi-phase analysis -> classify simple/complex types
  AL-->>CP: Update simple_types, simple_types_names, complex_types
  CP->>GL: Request ST lines for a block
  GL->>B: Inspect block
  alt block type ∈ complex_types AND ignoreComplexStructs True
    GL->>B: Count non-empty, non-_InsertLine lines
    alt count > threshold
      GL->>BL: __getBlockLines(B)
      BL-->>GL: full block lines
      GL->>CP: Emit embedded block lines then rewritten function-block
    else
      GL->>CP: Emit rewritten function-block only
    end
  else
    GL->>CP: Existing handling (use simple_types_names for filtering)
  end

sequenceDiagram
  autonumber
  participant STP as STParser
  participant SRC as Source Line
  participant REG as Block Registry

  Note over STP: New FUNCTION block recognition
  STP->>SRC: Read top-level line
  alt Line matches "FUNCTION <name> : <return_type>"
    STP->>STP: Instantiate `_Function` -> parse name,type,return_type
    STP->>REG: Ensure `FUNCTION` present in ALL_BLOCKS & CLOSABLE_BLOCKS
    STP-->>STP: Block exposes GetInfo(name,type,return_type)
  else
    STP->>STP: Defer to other block parsers
  end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

Rtop 35 add support for complex variables on debug.c #8 — Prior changes touching ComplexParser’s handling of STRUCT/complex variables; likely related to the type-classification and embedding work here.

Suggested reviewers

thiagoralves

Poem

I hop through types and blocks at dawn,
I nibble lines till tangles are gone,
A FUNCTION named, a struct shown bright,
I stitch their paths by dim lamp-light,
Code carrots crunch — hooray, that’s right! 🥕🐇

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The title "parsing FUNCTION block" is concise and directly reflects the main, visible functional change in the diff — STParser.py adds a _Function block class and a public FUNCTION object to enable parsing function blocks; this is the primary new feature a reviewer would expect from the PR. Other edits (ComplexParser refactor and cosmetic changes) are secondary and do not make the title misleading or off-topic. The title is short, specific, and understandable to teammates scanning history.
Docstring Coverage	✅ Passed	Docstring coverage is 87.50% which is sufficient. The required threshold is 80.00%.

Warning

Review ran into problems

🔥 Problems

Errors were encountered while retrieving linked issues.

Errors (1)

RTOP-71: Entity not found: Issue - Could not find referenced Issue.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

GlueGenerator.py (1)

26-35: Indexing bug parsing varName; use a regex instead

parts[2] will IndexError for names like "__QX0_1"/"__QW0". Extract kind/sub/indices via regex.

-        try:
-            parts = varName.split("_")
-            pos1 = int(parts[2][2:])  # number after QX0 or QW0
-            pos2 = int(parts[3]) if len(parts) > 3 else 0
-        except Exception as e:
-            raise Exception(f"Error parsing variable name '{varName}': {e}")
-
-        kind = varName[2]  # I, Q, M
-        sub = varName[3]  # X, B, W, D, L
+        m = re.match(r"^__([IQM])([XBWDL])(\d+)(?:_(\d+))?$", varName)
+        if not m:
+            raise Exception(
+                f"Error parsing variable name '{varName}': "
+                "expected __[IQM][XBWDL]<pos1>[_<pos2>]"
+            )
+        kind, sub = m.group(1), m.group(2)  # I/Q/M and X/B/W/D/L
+        pos1 = int(m.group(3))
+        pos2 = int(m.group(4) or 0)

🧹 Nitpick comments (2)

xml2st.py (1)
149-151: Don’t rely on filename “plc.xml” when computing output path

String replacement can mis-save output for non-”plc.xml” inputs. Use directory join.
-            st_file = os.path.abspath(args.generate_st).replace("plc.xml", "program.st")
+            base_dir = os.path.dirname(os.path.abspath(args.generate_st))
+            st_file = os.path.join(base_dir, "program.st")
GlueGenerator.py (1)
25-25: Reduce noisy stdout in library code

Consider logging at INFO/DEBUG instead of print().
-        print(f"Linking variable {varName}")
+        # logger.debug(f"Linking variable {varName}")

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 8cd6e6b and 9fb75ca.

📒 Files selected for processing (4)

ComplexParser.py (1 hunks)
GlueGenerator.py (3 hunks)
STParser.py (4 hunks)
xml2st.py (3 hunks)

🧰 Additional context used

🧠 Learnings (2)

📓 Common learnings

Learnt from: dcoutinho1328
PR: Autonomy-Logic/xml2st#11
File: ComplexParser.py:214-233
Timestamp: 2025-09-02T21:59:52.612Z
Learning: In the ComplexParser.py __getBlockLines method, the ignoreComplexStructs parameter doesn't need to be propagated in recursive calls because complex structs only appear at the first function call level, not nested within other structures in ST files.

📚 Learning: 2025-09-02T21:59:52.612Z

Learnt from: dcoutinho1328
PR: Autonomy-Logic/xml2st#11
File: ComplexParser.py:214-233
Timestamp: 2025-09-02T21:59:52.612Z
Learning: In the ComplexParser.py __getBlockLines method, the ignoreComplexStructs parameter doesn't need to be propagated in recursive calls because complex structs only appear at the first function call level, not nested within other structures in ST files.

Applied to files:

ComplexParser.py

🧬 Code graph analysis (1)

GlueGenerator.py (1)

util/paths.py (1)

AbsDir (36-37)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build-windows-arm

🔇 Additional comments (2)

xml2st.py (1)

86-88: Formatting-only change: OK

The multi-line endswith() check reads cleaner. No functional impact.

ComplexParser.py (1)

321-334: Verify duplicate/omitted content when embedding TYPE lines before rewrites

The >2 non-empty line gate appends TYPE block lines (excluding struct bodies) before function-block rewrites; otherwise TYPE wrapper is omitted. Please confirm this doesn’t (a) duplicate content across TYPE blocks or (b) drop needed prologue/metadata for short blocks.

Would you like a quick script to diff pre/post output on representative ST files?

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (3)

ComplexParser.py (3)

325-339: Clarify the complex filtering logic with better comments.

This conditional logic that filters non-empty, non-insert lines before extending with block content is hard to follow. The nested filter and lambda operations make the intent unclear.

-                    if (
-                        len(
-                            list(
-                                filter(
-                                    lambda x: not isinstance(x, _InsertLine)
-                                    and not EMPTY_LINE.match(x),
-                                    block.lines,
-                                )
-                            )
-                        )
-                        > 2
-                    ):
+                    # Check if block has substantial content (more than just open/close lines)
+                    non_trivial_lines = [
+                        line for line in block.lines 
+                        if not isinstance(line, _InsertLine) and not EMPTY_LINE.match(line)
+                    ]
+                    if len(non_trivial_lines) > 2:

347-355: Consider caching the Jinja environment for performance.

Creating a new Jinja Environment instance each time __rewriteStructsAsFunctionBlocks is called introduces unnecessary overhead. The core component of Jinja is the Environment() class. In this example, you create a Jinja environment without any arguments. Later you'll change the parameters of Environment to customize your environment.

+    def __init__(self):
         self.blocks = []
         self.arrays = []
         self.structs = []
         self.programs = []
         self.csv_vars = []
         self.simple_types = []
         self.simple_types_names = []
         self.complex_types = []
         self.complex_structs = []
         self.function_blocks = []
-        self.__loader = FileSystemLoader(
+        self.__jinja_env = Environment(loader=FileSystemLoader(
             os.path.join(paths.AbsDir(__file__), "templates")
-        )
+        ))

     def __rewriteStructsAsFunctionBlocks(self):
-        template = Environment(loader=self.__loader).get_template(
+        template = self.__jinja_env.get_template(
             FUNCTION_BLOCK_ST_TEMPLATE
         )

497-497: Consider caching the Jinja environment instance here as well.

Similar to the previous suggestion, creating a new Environment instance in __appendVarsToCSV introduces unnecessary overhead.

         template = Environment(loader=self.__loader).get_template(CSV_VARS_TEMPLATE)
+        template = self.__jinja_env.get_template(CSV_VARS_TEMPLATE)

This change would be dependent on implementing the previous suggestion to cache the Jinja environment in __init__.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 8ef90e8 and a24c222.

📒 Files selected for processing (1)

ComplexParser.py (7 hunks)

🔇 Additional comments (6)

ComplexParser.py (6)

139-141: LGTM! Improved type classification system.

The new type classification system with explicit simple_types, simple_types_names, and complex_types collections provides better clarity compared to the previous array_dependant tracking approach.

157-159: LGTM! Consistent clearing of new state variables.

The __clear method correctly resets all the new type classification collections to maintain clean state between parser runs.

225-225: LGTM! Improved block complexity determination.

Using complex_types to determine block complexity in __getBlockLines is more semantically clear than the previous implementation.

321-321: LGTM! Consistent usage of new naming convention.

The change from checking against a previous collection to using simple_types_names is consistent with the new type classification system.

371-371: LGTM! Consistent search scope update.

The change to search in simple_types instead of the previous array_dependant is consistent with the new type classification system.

413-413: LGTM! Consistent usage of new type names collection.

Using simple_types_names for type checking in __spreadDeclarations maintains consistency with the new classification system.

parsing FUNCTION block

9fb75ca

coderabbitai Bot reviewed Sep 22, 2025

View reviewed changes

Comment thread STParser.py

dcoutinho1328 added 2 commits September 23, 2025 16:58

hotfix

8ef90e8

fixed struct transformation only for complex types

a24c222

coderabbitai Bot reviewed Sep 24, 2025

View reviewed changes

Comment thread ComplexParser.py

code rabbit suggestions

1d8b56c

thiagoralves approved these changes Sep 24, 2025

View reviewed changes

dcoutinho1328 merged commit 168a096 into development Sep 24, 2025
7 checks passed

thiagoralves deleted the RTOP-71-Xm2st-is-break-on-large-files branch September 25, 2025 12:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

parsing FUNCTION block#15

parsing FUNCTION block#15
dcoutinho1328 merged 4 commits into
developmentfrom
RTOP-71-Xm2st-is-break-on-large-files

dcoutinho1328 commented Sep 22, 2025 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Sep 22, 2025 •

edited

Loading

Review skipped

Review ran into problems

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dcoutinho1328 commented Sep 22, 2025 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Review ran into problems

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dcoutinho1328 commented Sep 22, 2025 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Sep 22, 2025 •

edited

Loading