-
Notifications
You must be signed in to change notification settings - Fork 391
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No 'Variable Referenced Before Assignment' Rule? #1660
Comments
Thanks @Vacant0mens that is a feature of strict mode in PowerShell which can be a (runtime) workaround-- because of dynamic scoping in PowerShell this is impossible to always get right at the script analysis level-- this issue is further described in #1641 |
If it can't be caught as a "referenced before assignment" situation, is there a way to have the linter point out variables that are potentially null within the scope of the script itself? or is that covered under the same case that you mentioned? |
So just to expand on this a bit, since we discussed it. PowerShell uses dynamic scope. That means that a variable's value depends on the runtime stack. If you show me a scriptblock or a script that references a variable that isn't assigned anywhere in the script and ask whether it's defined, the answer is "depends how you call it at runtime". And because you can do anything at runtime, like make calls or variable definitions dependent on the output of a web request, there's really no way to say statically. As an explicit note, this is a departure from Python (for example), which uses lexical scope. If you don't assign an Does that mean we're out of luck? Perhaps not if we change the parameters of the problem. Perhaps we can just look for variables that can only be defined by virtue of dynamic scope — ones that haven't been assigned in the immediate scope. However, naively applying that rule is going to turn up a lot of results that users will see as false positives even though they're not. Consider: $extension = '*.txt'
Get-ChildItem | Where-Object { $_.Extension -like $extension }
So we could build in some "obvious false positive" logic. For that PSScriptAnalyzer must understand that So we could build a simple heuristic that says "for this command name, any scriptblock passed to it should be considered to be executed in scope". And in fact for the converse problem (the unassigned variable problem), we also want to know when scriptblocks are dot-sourced (since that propagates assignments upward), so we also need logic to understand that. Starting to get a bit more complicated. But then there's things like And we haven't yet discussed things like Basically, to embark on this a little bit is going to annoy people and create the expectation of being correct, when really it's just a heuristic. To annoy people less (but bear in mind it's still a heuristic), we'd need to do a lot of yak shaving. Of course, we'd accept a contribution that implemented it. But it's not a workitem that we're likely to tackle internally any time soon. |
Okay. I'm getting the problem a bit more now. Thanks for that. So I guess what I'm really asking for is something like "Warning: in the current script context/file, this variable is used/referenced, but in the lines above it there are no Unless I'm misunderstanding how all this works? I have seen "you're overriding a system variable" warnings, so it seems possible with my limited, new knowledge. |
Yeah it's possible. It should be done on a per-scriptblock basis (the AST visitor should build a stack of dictionaries of variables as it visits each scriptblock down a lexical scope). But the scriptblocks for scriptblock cmdlets will create false positives that people will complain about, so we need to be smarter than that. |
Oh okay. And that's what the other issue was getting into, right? I think I'm seeing it now. So PSScriptAnalyzer doesn't technically do syntax parsing on the text itself as text? It actually attempts to load the script file in as a script block first? Or did I miss something? It kind of sounds like cmdlet functions, modules, and other related items (things that don't execute within current context) will need to be handled differently than normal scripts/scriptblocks, but basic scripts/scriptblocks would be a good place to start. (Differently, meaning in a more advanced way, but based in similar parameters) At least for VSCode, I was just hoping there'd be a parser or something to do it within the text itself, rather than checking through the whole call stack or context. |
PSScriptAnalyzer takes the script text and uses the Parser API to get the AST and tokens corresponding to that text. Those outputs are simply structured representations of the text as the PowerShell parser sees them -- they don't contain any extra or special information. So at that point, we have an AST and we'd need to sift through each scriptblock in it (scriptblocks essentially correspond to scopes) and build a table of variables. You can see an example of that here: PSScriptAnalyzer/Rules/UseDeclaredVarsMoreThanAssignments.cs Lines 117 to 158 in 58c4423
But I stress that this is just a syntactic representation, we have to do the hard work of understanding the semantics. So for example, that linked rule makes a bunch of assumptions that aren't true in a number of cases, which is what #1641 is all about. |
You might find this blog post to be a helpful introduction on how PSSA works |
Okay, now that I mostly understand what's going on (I think), is there a rule in the works that's basically the inverse of the rule you referenced? (i.e. check variable references against the table created with the AST's) By the way, thank you for taking the time to help me understand this. |
There's no such rule today, for the reasons I describe above. |
From what I can tell, there's no linting rule to check if a variable is null or unassigned when it's being referenced. This would be super helpful to avoid nullref errors when running a script or module. I know there's a "variable assigned but not referenced/used" rule, but that's kind of the opposite of what I'm looking for.
I know this functionality exists in pylint for python, but I sorely miss this when working with PowerShell.
Does this exist? If so, how does one enable it?
Apologies if there is already an issue for this somewhere. I couldn't find one.
The text was updated successfully, but these errors were encountered: