Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix code snippet in 'Collection lookups with spans (What's new in .NET libraries for .NET 9)' documentation #43702

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

wipiano
Copy link

@wipiano wipiano commented Nov 22, 2024

Summary

Fixes #43701

@wipiano wipiano requested review from gewarren and a team as code owners November 22, 2024 08:07
@dotnetrepoman dotnetrepoman bot added this to the November 2024 milestone Nov 22, 2024
@dotnet-policy-service dotnet-policy-service bot added the community-contribution Indicates PR is created by someone from the .NET community. label Nov 22, 2024
@wipiano wipiano changed the title fix code snippet in 'What's new in .NET libraries for .NET 9' fix code snippet in 'Collection lookups with spans (What's new in .NET libraries for .NET 9)' documentation Nov 25, 2024
@gewarren
Copy link
Contributor

gewarren commented Apr 1, 2025

Thanks @wipiano, and sorry for the delay. You're right that the original code doesn't do what it's supposed to.

I wonder why ValueMatchEnumerator.Current isn't a Range like ValueSplitEnumerator.Current is. That would make your suggested code even simpler because you wouldn't have to instantiate the Range instance.

Since EnumerateSplits was introduced in .NET 9, I wonder if this code used it intentionally over EnumerateMatches. If so, we could still use EnumerateSplits with code like the following, where the "match" is actually the characters between the words. But it does require a check for empty ranges.

static Dictionary<string, int> CountWords(ReadOnlySpan<char> input)
{
    Dictionary<string, int> wordCounts = new(StringComparer.OrdinalIgnoreCase);
    Dictionary<string, int>.AlternateLookup<ReadOnlySpan<char>> spanLookup =
        wordCounts.GetAlternateLookup<ReadOnlySpan<char>>();

    foreach (Range wordRange in Regex.EnumerateSplits(input, @"\b\W+"))
    {
        if (wordRange.Start.Value == wordRange.End.Value)
        {
            continue; // Skip empty ranges.
        }
        ReadOnlySpan<char> word = input[wordRange];
        spanLookup[word] = spanLookup.TryGetValue(word, out int count) ? count + 1 : 1;
    }

    return wordCounts;
}

@stephentoub Do you have any preference as to whether the code that demonstrates GetAlternateLookup uses EnumerateMatches or EnumerateSplits?

@stephentoub
Copy link
Member

I wonder why ValueMatchEnumerator.Current isn't a Range like ValueSplitEnumerator.Current is.

Leaving the door open to add additional state as part of enabling EnumerateMatches to also gather capture group information.

Do you have any preference as to whether the code that demonstrates GetAlternateLookup uses EnumerateMatches or EnumerateSplits?

No preference.

@gewarren gewarren enabled auto-merge (squash) April 2, 2025 15:05
@wipiano
Copy link
Author

wipiano commented Apr 3, 2025

@gewarren Thanks for your review. I think your suggested code is simple and I prefer it. I'll check it and incorporate it into this pull request.

@wipiano wipiano requested a review from CamSoper as a code owner April 3, 2025 01:44
@gewarren gewarren disabled auto-merge April 3, 2025 01:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
community-contribution Indicates PR is created by someone from the .NET community. dotnet-fundamentals/svc
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Incorrect code sample in Collection lookups with spans documentation
3 participants