Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[JitDiff X64] [EgorBo] Remove some bounds checks around Vector.Create(Span) #1023

Open
MihuBot opened this issue Mar 1, 2025 · 3 comments
Open

Comments

@MihuBot
Copy link
Owner

MihuBot commented Mar 1, 2025

Job completed in 15 minutes 15 seconds (remote runner delay: 1 minute 12 seconds).
dotnet/runtime#113035

Diffs

Diffs
Found 85 files with textual diffs.

Summary of Code Size diffs:
(Lower is better)

Total bytes of base: 43801115
Total bytes of diff: 43803293
Total bytes of delta: 2178 (0.00 % of base)
Total relative delta: 15.01
    diff is a regression.
    relative diff is a regression.


Top file regressions (bytes):
        1277 : System.Net.Sockets.dasm (0.48 % of base)
         985 : System.Net.Primitives.dasm (0.96 % of base)

Top file improvements (bytes):
         -84 : System.Private.CoreLib.dasm (-0.00 % of base)

3 total files with Code Size differences (1 improved, 2 regressed), 260 unchanged.

Top method regressions (bytes):
         191 (32.21 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:Contains(System.Net.IPAddress):ubyte:this (FullOpts)
         116 (232.00 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:Equals(System.Net.IPNetwork):ubyte:this (FullOpts)
         106 (179.66 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:op_Equality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
          99 (30.65 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:EndReceive(System.IAsyncResult,byref):ubyte[]:this (FullOpts)
          97 (20.00 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:.ctor(int,int):this (FullOpts)
          94 (97.92 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:Equals(System.Object):ubyte:this (FullOpts)
          94 (37.90 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:ReceiveAsync(System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this (FullOpts)
          92 (27.71 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:Receive(byref):ubyte[]:this (FullOpts)
          92 (23.59 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:ReceiveAsync():System.Threading.Tasks.Task`1[System.Net.Sockets.UdpReceiveResult]:this (FullOpts)
          91 (108.33 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:op_Inequality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
          90 (48.91 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:IsLoopback(System.Net.IPAddress):ubyte (FullOpts)
          89 (61.38 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:BeginReceive(System.AsyncCallback,System.Object):System.IAsyncResult:this (FullOpts)
          83 (48.54 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:IsWildcardEndPoint(System.Net.EndPoint):ubyte (FullOpts)
          59 (147.50 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:IsBroadcast(System.Net.IPAddress):ubyte (FullOpts)
          57 (123.91 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:GetHashCode():int:this (FullOpts)
          54 (16.62 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:SetReceivingPacketInformation():this (FullOpts)
          51 (20.73 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:TryFormat(System.Span`1[ubyte],byref):ubyte:this (FullOpts)
          51 (21.52 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:TryFormat(System.Span`1[ushort],byref):ubyte:this (FullOpts)
          51 (60.71 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.MulticastOption:.ctor(System.Net.IPAddress):this (FullOpts)
          51 (20.56 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.TcpListener:.ctor(int):this (FullOpts)
          48 (30.77 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:.cctor() (FullOpts)
          47 (16.79 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:ToString():System.String:this (FullOpts)
          47 (2.67 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFrom(ubyte[],int,int,int,byref):int:this (FullOpts)
          47 (4.93 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFromAsync(System.Net.Sockets.SocketAsyncEventArgs,System.Threading.CancellationToken):ubyte:this (FullOpts)
          47 (20.89 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.SocketPal:SetMulticastOption(System.Net.Sockets.SafeSocketHandle,int,System.Net.Sockets.MulticastOption):int (FullOpts)
          46 (8.75 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFromAsync(System.Memory`1[ubyte],int,System.Net.EndPoint,System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]:this (FullOpts)
          46 (25.27 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.TcpListener:Create(int):System.Net.Sockets.TcpListener (FullOpts)
          46 (10.90 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:DropMulticastGroup(System.Net.IPAddress):this (FullOpts)
          46 (10.90 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:JoinMulticastGroup(System.Net.IPAddress):this (FullOpts)
          45 (214.29 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:get_BaseAddress():System.Net.IPAddress:this (FullOpts)
          43 (2.99 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFrom(System.Span`1[ubyte],int,byref):int:this (FullOpts)
          20 (3.88 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:.cctor() (FullOpts)
          20 (7.78 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:.ctor(System.ReadOnlySpan`1[ubyte],long):this (FullOpts)
          20 (10.75 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:.ctor(System.ReadOnlySpan`1[ubyte]):this (FullOpts)

Top method improvements (bytes):
         -14 (-16.09 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:ReadUInt16NumbersFromBytes(System.ReadOnlySpan`1[ubyte]):ushort[] (FullOpts)
         -12 (-32.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(System.ReadOnlySpan`1[float]):this (FullOpts)
         -12 (-32.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Create(System.ReadOnlySpan`1[float]):System.Numerics.Vector4 (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[double]):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[int]):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[long]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)

Top method regressions (percentages):
         116 (232.00 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:Equals(System.Net.IPNetwork):ubyte:this (FullOpts)
          45 (214.29 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:get_BaseAddress():System.Net.IPAddress:this (FullOpts)
         106 (179.66 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:op_Equality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
          59 (147.50 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:IsBroadcast(System.Net.IPAddress):ubyte (FullOpts)
          57 (123.91 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:GetHashCode():int:this (FullOpts)
          91 (108.33 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:op_Inequality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
          94 (97.92 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:Equals(System.Object):ubyte:this (FullOpts)
          89 (61.38 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:BeginReceive(System.AsyncCallback,System.Object):System.IAsyncResult:this (FullOpts)
          51 (60.71 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.MulticastOption:.ctor(System.Net.IPAddress):this (FullOpts)
          90 (48.91 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:IsLoopback(System.Net.IPAddress):ubyte (FullOpts)
          83 (48.54 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:IsWildcardEndPoint(System.Net.EndPoint):ubyte (FullOpts)
          94 (37.90 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:ReceiveAsync(System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this (FullOpts)
         191 (32.21 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:Contains(System.Net.IPAddress):ubyte:this (FullOpts)
          48 (30.77 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:.cctor() (FullOpts)
          99 (30.65 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:EndReceive(System.IAsyncResult,byref):ubyte[]:this (FullOpts)
          92 (27.71 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:Receive(byref):ubyte[]:this (FullOpts)
          46 (25.27 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.TcpListener:Create(int):System.Net.Sockets.TcpListener (FullOpts)
          92 (23.59 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:ReceiveAsync():System.Threading.Tasks.Task`1[System.Net.Sockets.UdpReceiveResult]:this (FullOpts)
          51 (21.52 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:TryFormat(System.Span`1[ushort],byref):ubyte:this (FullOpts)
          47 (20.89 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.SocketPal:SetMulticastOption(System.Net.Sockets.SafeSocketHandle,int,System.Net.Sockets.MulticastOption):int (FullOpts)
          51 (20.73 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:TryFormat(System.Span`1[ubyte],byref):ubyte:this (FullOpts)
          51 (20.56 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.TcpListener:.ctor(int):this (FullOpts)
          97 (20.00 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:.ctor(int,int):this (FullOpts)
          47 (16.79 % of base) : System.Net.Primitives.dasm - System.Net.IPNetwork:ToString():System.String:this (FullOpts)
          54 (16.62 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:SetReceivingPacketInformation():this (FullOpts)
          46 (10.90 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:DropMulticastGroup(System.Net.IPAddress):this (FullOpts)
          46 (10.90 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.UdpClient:JoinMulticastGroup(System.Net.IPAddress):this (FullOpts)
          20 (10.75 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:.ctor(System.ReadOnlySpan`1[ubyte]):this (FullOpts)
          46 (8.75 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFromAsync(System.Memory`1[ubyte],int,System.Net.EndPoint,System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]:this (FullOpts)
          20 (7.78 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:.ctor(System.ReadOnlySpan`1[ubyte],long):this (FullOpts)
          47 (4.93 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFromAsync(System.Net.Sockets.SocketAsyncEventArgs,System.Threading.CancellationToken):ubyte:this (FullOpts)
          20 (3.88 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:.cctor() (FullOpts)
          43 (2.99 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFrom(System.Span`1[ubyte],int,byref):int:this (FullOpts)
          47 (2.67 % of base) : System.Net.Sockets.dasm - System.Net.Sockets.Socket:ReceiveFrom(ubyte[],int,int,int,byref):int:this (FullOpts)

Top method improvements (percentages):
         -12 (-32.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(System.ReadOnlySpan`1[float]):this (FullOpts)
         -12 (-32.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Create(System.ReadOnlySpan`1[float]):System.Numerics.Vector4 (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[double]):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[int]):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[long]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
         -12 (-30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
         -14 (-16.09 % of base) : System.Net.Primitives.dasm - System.Net.IPAddress:ReadUInt16NumbersFromBytes(System.ReadOnlySpan`1[ubyte]):ushort[] (FullOpts)

42 total methods with Code Size differences (8 improved, 34 regressed), 248147 unchanged.

--------------------------------------------------------------------------------

Artifacts:

@MihuBot
Copy link
Owner Author

MihuBot commented Mar 1, 2025

Top method regressions

116 (232.00 % of base) - System.Net.IPNetwork:Equals(System.Net.IPNetwork):ubyte:this
 ; Assembly listing for method System.Net.IPNetwork:Equals(System.Net.IPNetwork):ubyte:this (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
-; rsp based frame
+; rbp based frame
 ; fully interruptible
 ; No PGO data
 ; 0 inlinees with PGO data; 2 single block inlinees; 2 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 this         [V00,T00] (  4,  3.50)   byref  ->  rdi         this single-def
 ;* V01 arg1         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg ld-addr-op single-def <System.Net.IPNetwork>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
 ;* V03 tmp1         [V03    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
 ;* V04 tmp2         [V04    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "impAppendStmt" <System.Net.IPAddress>
-;  V05 tmp3         [V05,T05] (  2,  2   )     ref  ->  rdi         class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V06 tmp4         [V06,T03] (  5,  2.50)     ref  ->  rdi        
+;  V05 tmp3         [V05,T05] (  2,  2   )     ref  ->  r15         class-hnd single-def "dup spill" <System.Net.IPAddress>
+;  V06 tmp4         [V06,T03] (  4,  2   )     ref  ->  r15        
 ;* V07 tmp5         [V07    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V08 tmp6         [V08,T04] (  5,  2.50)     ref  ->  rsi        
-;  V09 tmp7         [V09,T02] (  2,  1.50)     ref  ->  rsi         single-def "field V01._baseAddress (fldOffset=0x0)" P-INDEP
+;  V08 tmp6         [V08,T04] (  4,  2   )     ref  ->  rsi        
+;  V09 tmp7         [V09,T02] (  2,  1.50)     ref  ->  rbx         single-def "field V01._baseAddress (fldOffset=0x0)" P-INDEP
 ;  V10 tmp8         [V10,T01] (  2,  2   )     int  ->  rdx         single-def "field V01.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 0
 
 G_M60615_IG01:
-						;; size=0 bbWeight=1 PerfScore 0.00
+       push     rbp
+       push     r15
+       push     rbx
+       lea      rbp, [rsp+0x10]
+       mov      rbx, rsi
+						;; size=12 bbWeight=1 PerfScore 3.75
 G_M60615_IG02:
        cmp      dword ptr [rdi+0x08], edx
        je       SHORT G_M60615_IG05
 						;; size=5 bbWeight=1 PerfScore 4.00
 G_M60615_IG03:
        xor      eax, eax
 						;; size=2 bbWeight=0.50 PerfScore 0.12
 G_M60615_IG04:
+       pop      rbx
+       pop      r15
+       pop      rbp
        ret      
-						;; size=1 bbWeight=0.50 PerfScore 0.50
+						;; size=5 bbWeight=0.50 PerfScore 1.25
 G_M60615_IG05:
-       mov      rdi, gword ptr [rdi]
-       mov      rax, 0xD1FFAB1E      ; const ptr
-       test     rdi, rdi
-       cmove    rdi, gword ptr [rax]
+       mov      r15, gword ptr [rdi]
+       test     r15, r15
+       jne      SHORT G_M60615_IG07
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M60615_IG11
+						;; size=23 bbWeight=0.50 PerfScore 3.75
+G_M60615_IG06:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      r15, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M60615_IG07:
+       mov      rsi, rbx
        test     rsi, rsi
-       cmove    rsi, gword ptr [rax]
+       jne      SHORT G_M60615_IG09
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M60615_IG12
+						;; size=23 bbWeight=0.50 PerfScore 2.88
+G_M60615_IG08:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      rsi, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M60615_IG09:
+       mov      rdi, r15
        mov      rax, 0xD1FFAB1E      ; code for System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
        cmp      dword ptr [rdi], edi
-						;; size=39 bbWeight=0.50 PerfScore 6.00
-G_M60615_IG06:
+						;; size=15 bbWeight=0.50 PerfScore 1.75
+G_M60615_IG10:
+       pop      rbx
+       pop      r15
+       pop      rbp
        tail.jmp [rax]System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
-						;; size=3 bbWeight=0.50 PerfScore 1.00
+						;; size=7 bbWeight=0.50 PerfScore 1.75
+G_M60615_IG11:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M60615_IG06
+						;; size=24 bbWeight=0 PerfScore 0.00
+G_M60615_IG12:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M60615_IG08
+						;; size=24 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 50, prolog size 0, PerfScore 11.62, instruction count 13, allocated bytes for code 50 (MethodHash=da2c1338) for method System.Net.IPNetwork:Equals(System.Net.IPNetwork):ubyte:this (FullOpts)
+; Total bytes of code 166, prolog size 12, PerfScore 21.50, instruction count 43, allocated bytes for code 166 (MethodHash=da2c1338) for method System.Net.IPNetwork:Equals(System.Net.IPNetwork):ubyte:this (FullOpts)
 ; ============================================================
106 (179.66 % of base) - System.Net.IPNetwork:op_Equality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte
 ; Assembly listing for method System.Net.IPNetwork:op_Equality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
-; rsp based frame
+; rbp based frame
 ; fully interruptible
 ; No PGO data
 ; 0 inlinees with PGO data; 2 single block inlinees; 3 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;* V00 arg0         [V00    ] (  0,  0   )  struct (16) zero-ref    multireg-arg ld-addr-op single-def <System.Net.IPNetwork>
 ;* V01 arg1         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.Net.IPNetwork>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
 ;* V03 tmp1         [V03    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
 ;* V04 tmp2         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.Net.IPNetwork>
 ;* V05 tmp3         [V05    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
 ;* V06 tmp4         [V06    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "impAppendStmt" <System.Net.IPAddress>
 ;* V07 tmp5         [V07    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V08 tmp6         [V08,T04] (  5,  2.50)     ref  ->  rdi        
+;  V08 tmp6         [V08,T04] (  4,  2   )     ref  ->  r15        
 ;* V09 tmp7         [V09    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V10 tmp8         [V10,T05] (  5,  2.50)     ref  ->  rsi        
+;  V10 tmp8         [V10,T05] (  4,  2   )     ref  ->  rsi        
 ;  V11 tmp9         [V11,T02] (  2,  1.50)     ref  ->  rdi         single-def "field V00._baseAddress (fldOffset=0x0)" P-INDEP
 ;  V12 tmp10        [V12,T00] (  2,  2   )     int  ->  rsi         single-def "field V00.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
-;  V13 tmp11        [V13,T03] (  2,  1.50)     ref  ->  rdx         single-def "field V01._baseAddress (fldOffset=0x0)" P-INDEP
+;  V13 tmp11        [V13,T03] (  2,  1.50)     ref  ->  rbx         single-def "field V01._baseAddress (fldOffset=0x0)" P-INDEP
 ;  V14 tmp12        [V14,T01] (  2,  2   )     int  ->  rcx         single-def "field V01.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
 ;* V15 tmp13        [V15    ] (  0,  0   )     ref  ->  zero-ref    single-def "field V04._baseAddress (fldOffset=0x0)" P-INDEP
 ;* V16 tmp14        [V16    ] (  0,  0   )     int  ->  zero-ref    "field V04.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 0
 
 G_M11613_IG01:
-						;; size=0 bbWeight=1 PerfScore 0.00
+       push     rbp
+       push     r15
+       push     rbx
+       lea      rbp, [rsp+0x10]
+       mov      rbx, rdx
+						;; size=12 bbWeight=1 PerfScore 3.75
 G_M11613_IG02:
        cmp      esi, ecx
        je       SHORT G_M11613_IG05
 						;; size=4 bbWeight=1 PerfScore 1.25
 G_M11613_IG03:
        xor      eax, eax
 						;; size=2 bbWeight=0.50 PerfScore 0.12
 G_M11613_IG04:
+       pop      rbx
+       pop      r15
+       pop      rbp
        ret      
-						;; size=1 bbWeight=0.50 PerfScore 0.50
+						;; size=5 bbWeight=0.50 PerfScore 1.25
 G_M11613_IG05:
-       mov      rsi, 0xD1FFAB1E      ; const ptr
-       test     rdi, rdi
-       cmove    rdi, gword ptr [rsi]
-       mov      rsi, rdx
-       mov      rax, 0xD1FFAB1E      ; const ptr
+       mov      r15, rdi
+       test     r15, r15
+       jne      SHORT G_M11613_IG07
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M11613_IG11
+						;; size=23 bbWeight=0.50 PerfScore 2.88
+G_M11613_IG06:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      r15, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M11613_IG07:
+       mov      rsi, rbx
        test     rsi, rsi
-       cmove    rsi, gword ptr [rax]
+       jne      SHORT G_M11613_IG09
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M11613_IG12
+						;; size=23 bbWeight=0.50 PerfScore 2.88
+G_M11613_IG08:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      rsi, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M11613_IG09:
+       mov      rdi, r15
        mov      rax, 0xD1FFAB1E      ; code for System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
        cmp      dword ptr [rdi], edi
-						;; size=49 bbWeight=0.50 PerfScore 5.25
-G_M11613_IG06:
+						;; size=15 bbWeight=0.50 PerfScore 1.75
+G_M11613_IG10:
+       pop      rbx
+       pop      r15
+       pop      rbp
        tail.jmp [rax]System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
-						;; size=3 bbWeight=0.50 PerfScore 1.00
+						;; size=7 bbWeight=0.50 PerfScore 1.75
+G_M11613_IG11:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M11613_IG06
+						;; size=24 bbWeight=0 PerfScore 0.00
+G_M11613_IG12:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M11613_IG08
+						;; size=24 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 59, prolog size 0, PerfScore 8.12, instruction count 14, allocated bytes for code 59 (MethodHash=fc58d2a2) for method System.Net.IPNetwork:op_Equality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
+; Total bytes of code 165, prolog size 12, PerfScore 17.88, instruction count 43, allocated bytes for code 165 (MethodHash=fc58d2a2) for method System.Net.IPNetwork:op_Equality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
 ; ============================================================
99 (30.65 % of base) - System.Net.Sockets.UdpClient:EndReceive(System.IAsyncResult,byref):ubyte[]:this
 ; Assembly listing for method System.Net.Sockets.UdpClient:EndReceive(System.IAsyncResult,byref):ubyte[]:this (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rbp based frame
 ; partially interruptible
 ; No PGO data
 ; 0 inlinees with PGO data; 5 single block inlinees; 3 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 this         [V00,T00] (  7,  7   )     ref  ->  rbx         this class-hnd single-def <System.Net.Sockets.UdpClient>
 ;  V01 arg1         [V01,T03] (  3,  3   )     ref  ->  r15         class-hnd single-def <System.IAsyncResult>
 ;  V02 arg2         [V02,T04] (  3,  3   )   byref  ->  r14         single-def
-;  V03 loc0         [V03    ] (  3,  3   )     ref  ->  [rbp-0x20]  do-not-enreg[X] must-init addr-exposed ld-addr-op class-hnd <System.Net.EndPoint>
+;  V03 loc0         [V03    ] (  3,  3   )     ref  ->  [rbp-0x28]  do-not-enreg[X] must-init addr-exposed ld-addr-op class-hnd <System.Net.EndPoint>
 ;  V04 loc1         [V04,T08] (  4,  2.50)     int  ->  r15        
 ;* V05 loc2         [V05    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op <System.Span`1[ubyte]>
 ;# V06 OutArgs      [V06    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;  V07 tmp1         [V07,T10] (  3,  2   )     ref  ->  rdi        
+;  V07 tmp1         [V07,T10] (  3,  2   )     ref  ->  r13        
 ;* V08 tmp2         [V08    ] (  0,  0   )     ref  ->  zero-ref    class-hnd exact "NewObj constructor temp" <System.PlatformNotSupportedException>
 ;  V09 tmp3         [V09,T02] (  4,  6   )     ref  ->  rbx         class-hnd single-def "Inlining Arg" <ubyte[]>
 ;* V10 tmp4         [V10    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "NewObj constructor temp" <System.Span`1[ubyte]>
 ;  V11 tmp5         [V11,T11] (  3,  2   )     ref  ->  r14         class-hnd "Inline return value spill temp" <ubyte[]>
 ;  V12 tmp6         [V12,T07] (  3,  3   )     ref  ->  r14         class-hnd exact single-def "NewArr temp" <ubyte[]>
 ;* V13 tmp7         [V13    ] (  0,  0   )   byref  ->  zero-ref    "Inlining Arg"
 ;* V14 tmp8         [V14    ] (  0,  0   )   byref  ->  zero-ref    "Inlining Arg"
 ;* V15 tmp9         [V15    ] (  0,  0   )    long  ->  zero-ref    "Inlining Arg"
 ;* V16 tmp10        [V16    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V05._reference (fldOffset=0x0)" P-INDEP
 ;* V17 tmp11        [V17    ] (  0,  0   )     int  ->  zero-ref    "field V05._length (fldOffset=0x8)" P-INDEP
 ;  V18 tmp12        [V18,T12] (  3,  1.50)   byref  ->  rbx         "field V10._reference (fldOffset=0x0)" P-INDEP
 ;  V19 tmp13        [V19,T06] (  5,  3   )     int  ->  r15         "field V10._length (fldOffset=0x8)" P-INDEP
 ;  V20 rat0         [V20,T05] (  3,  4.50)     ref  ->  rsi         "Spilling to split statement for tree"
 ;  V21 rat1         [V21,T01] (  5,  7.50)     ref  ->  rax         "replacement local"
 ;  V22 rat2         [V22,T09] (  3,  2.50)    long  ->  rdi         "CSE for expectedClsNode"
 ;
-; Lcl frame size = 8
+; Lcl frame size = 16
 
 G_M16011_IG01:
        push     rbp
        push     r15
        push     r14
+       push     r13
        push     rbx
-       push     rax
-       lea      rbp, [rsp+0x20]
+       sub      rsp, 16
+       lea      rbp, [rsp+0x30]
        xor      eax, eax
-       mov      qword ptr [rbp-0x20], rax
+       mov      qword ptr [rbp-0x28], rax
        mov      rbx, rdi
        mov      r15, rsi
        mov      r14, rdx
-						;; size=27 bbWeight=1 PerfScore 7.50
+						;; size=32 bbWeight=1 PerfScore 7.75
 G_M16011_IG02:
        movzx    rdi, byte  ptr [rbx+0x1D]
        mov      rsi, rbx
        mov      rax, 0xD1FFAB1E      ; code for System.ObjectDisposedException:ThrowIf(ubyte,System.Object)
        call     [rax]System.ObjectDisposedException:ThrowIf(ubyte,System.Object)
        cmp      dword ptr [rbx+0x18], 2
-       jne      SHORT G_M16011_IG04
+       jne      SHORT G_M16011_IG05
 						;; size=26 bbWeight=1 PerfScore 9.50
 G_M16011_IG03:
-       mov      rdi, 0xD1FFAB1E      ; const ptr
-       mov      rdi, gword ptr [rdi]
-       jmp      SHORT G_M16011_IG05
-						;; size=15 bbWeight=0.50 PerfScore 2.12
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       G_M16011_IG21
+						;; size=19 bbWeight=0.50 PerfScore 2.12
 G_M16011_IG04:
-       mov      rdi, 0xD1FFAB1E      ; const ptr
-       mov      rdi, gword ptr [rdi]
-						;; size=13 bbWeight=0.50 PerfScore 1.12
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPEndPointStatics:Any
+       mov      r13, gword ptr [rdi]
+       jmp      SHORT G_M16011_IG07
+						;; size=15 bbWeight=0.50 PerfScore 2.12
 G_M16011_IG05:
-       mov      gword ptr [rbp-0x20], rdi
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       G_M16011_IG20
+						;; size=19 bbWeight=0.50 PerfScore 2.12
+G_M16011_IG06:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPEndPointStatics:IPv6Any
+       mov      r13, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M16011_IG07:
+       mov      gword ptr [rbp-0x28], r13
        mov      rdi, gword ptr [rbx+0x08]
-       lea      rdx, [rbp-0x20]
+       lea      rdx, [rbp-0x28]
        mov      rsi, r15
        mov      rax, 0xD1FFAB1E      ; code for System.Net.Sockets.Socket:EndReceiveFrom(System.IAsyncResult,byref):int:this
        cmp      dword ptr [rdi], edi
        call     [rax]System.Net.Sockets.Socket:EndReceiveFrom(System.IAsyncResult,byref):int:this
        mov      r15d, eax
-       mov      rsi, gword ptr [rbp-0x20]
+       mov      rsi, gword ptr [rbp-0x28]
        mov      rax, rsi
        test     rax, rax
-       je       SHORT G_M16011_IG08
+       je       SHORT G_M16011_IG10
 						;; size=44 bbWeight=1 PerfScore 12.75
-G_M16011_IG06:
+G_M16011_IG08:
        mov      rdi, 0xD1FFAB1E      ; System.Net.IPEndPoint
        cmp      qword ptr [rax], rdi
-       je       SHORT G_M16011_IG08
+       je       SHORT G_M16011_IG10
 						;; size=15 bbWeight=0.50 PerfScore 2.12
-G_M16011_IG07:
+G_M16011_IG09:
        mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_CHKCASTCLASS_SPECIAL
        call     [rax]CORINFO_HELP_CHKCASTCLASS_SPECIAL
 						;; size=12 bbWeight=0.25 PerfScore 0.81
-G_M16011_IG08:
+G_M16011_IG10:
        mov      rdi, r14
        mov      rsi, rax
        call     CORINFO_HELP_CHECKED_ASSIGN_REF
        mov      rbx, gword ptr [rbx+0x10]
        test     rbx, rbx
-       je       SHORT G_M16011_IG10
+       je       SHORT G_M16011_IG12
 						;; size=20 bbWeight=1 PerfScore 4.75
-G_M16011_IG09:
+G_M16011_IG11:
        cmp      dword ptr [rbx+0x08], r15d
-       jb       SHORT G_M16011_IG17
+       jb       SHORT G_M16011_IG19
        add      rbx, 16
-       jmp      SHORT G_M16011_IG11
+       jmp      SHORT G_M16011_IG13
 						;; size=12 bbWeight=0.50 PerfScore 3.12
-G_M16011_IG10:
+G_M16011_IG12:
        test     r15d, r15d
-       jne      SHORT G_M16011_IG17
+       jne      SHORT G_M16011_IG19
        xor      rbx, rbx
        xor      r15d, r15d
 						;; size=10 bbWeight=0.50 PerfScore 0.88
-G_M16011_IG11:
+G_M16011_IG13:
        test     r15d, r15d
-       je       SHORT G_M16011_IG13
+       je       SHORT G_M16011_IG15
 						;; size=5 bbWeight=1 PerfScore 1.25
-G_M16011_IG12:
+G_M16011_IG14:
        mov      esi, r15d
        mov      rdi, 0xD1FFAB1E      ; ubyte[]
        call     CORINFO_HELP_NEWARR_1_VC
        mov      r14, rax
        mov      edx, r15d
        lea      rdi, bword ptr [r14+0x10]
        mov      rsi, rbx
        mov      rax, 0xD1FFAB1E      ; code for System.SpanHelpers:Memmove(byref,byref,ulong)
        call     [rax]System.SpanHelpers:Memmove(byref,byref,ulong)
-       jmp      SHORT G_M16011_IG15
+       jmp      SHORT G_M16011_IG17
 						;; size=45 bbWeight=0.50 PerfScore 4.00
-G_M16011_IG13:
+G_M16011_IG15:
        mov      rdi, 0xD1FFAB1E      ; global ptr
        test     byte  ptr [rdi], 1
-       je       SHORT G_M16011_IG18
+       je       SHORT G_M16011_IG22
 						;; size=15 bbWeight=0.50 PerfScore 2.12
-G_M16011_IG14:
+G_M16011_IG16:
        mov      rax, 0xD1FFAB1E      ; data for System.Array+EmptyArray`1[ubyte]:Value
        mov      r14, gword ptr [rax]
 						;; size=13 bbWeight=0.50 PerfScore 1.12
-G_M16011_IG15:
+G_M16011_IG17:
        mov      rax, r14
 						;; size=3 bbWeight=1 PerfScore 0.25
-G_M16011_IG16:
-       add      rsp, 8
+G_M16011_IG18:
+       add      rsp, 16
        pop      rbx
+       pop      r13
        pop      r14
        pop      r15
        pop      rbp
        ret      
-						;; size=11 bbWeight=1 PerfScore 3.25
-G_M16011_IG17:
+						;; size=13 bbWeight=1 PerfScore 3.75
+G_M16011_IG19:
        mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException()
        call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException()
        int3     
 						;; size=13 bbWeight=0.50 PerfScore 1.75
-G_M16011_IG18:
+G_M16011_IG20:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPEndPointStatics
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      G_M16011_IG06
+						;; size=27 bbWeight=0 PerfScore 0.00
+G_M16011_IG21:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPEndPointStatics
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      G_M16011_IG04
+						;; size=27 bbWeight=0 PerfScore 0.00
+G_M16011_IG22:
        mov      rdi, 0xD1FFAB1E      ; System.Array+EmptyArray`1[ubyte]
        mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
        call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
-       jmp      SHORT G_M16011_IG14
+       jmp      SHORT G_M16011_IG16
 						;; size=24 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 323, prolog size 18, PerfScore 58.44, instruction count 84, allocated bytes for code 323 (MethodHash=77c3c174) for method System.Net.Sockets.UdpClient:EndReceive(System.IAsyncResult,byref):ubyte[]:this (FullOpts)
+; Total bytes of code 422, prolog size 23, PerfScore 63.44, instruction count 100, allocated bytes for code 422 (MethodHash=77c3c174) for method System.Net.Sockets.UdpClient:EndReceive(System.IAsyncResult,byref):ubyte[]:this (FullOpts)
 ; ============================================================
97 (20.00 % of base) - System.Net.Sockets.UdpClient:.ctor(int,int):this
 ; Assembly listing for method System.Net.Sockets.UdpClient:.ctor(int,int):this (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rbp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 4 single block inlinees; 4 inlinees without PGO data
+; 0 inlinees with PGO data; 2 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 this         [V00,T00] (  9,  9   )     ref  ->  rbx         this class-hnd single-def <System.Net.Sockets.UdpClient>
 ;  V01 arg1         [V01,T01] (  6,  4.50)     int  ->  r15         single-def
 ;  V02 arg2         [V02,T02] (  5,  4.50)     int  ->  r14         single-def
 ;  V03 loc0         [V03,T08] (  3,  2   )     ref  ->  r14         class-hnd <System.Net.IPEndPoint>
 ;# V04 OutArgs      [V04    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;  V05 tmp1         [V05,T05] (  2,  4   )     ref  ->  rax         class-hnd exact single-def "NewArr temp" <ubyte[]>
-;  V06 tmp2         [V06,T07] (  3,  3   )     ref  ->  r14         class-hnd exact single-def "NewObj constructor temp" <System.Net.IPEndPoint>
-;  V07 tmp3         [V07,T04] (  4,  4   )     ref  ->  r14         class-hnd exact single-def "NewObj constructor temp" <System.Net.IPEndPoint>
-;  V08 tmp4         [V08,T10] (  3,  0   )     ref  ->  rbx         class-hnd exact single-def "NewObj constructor temp" <System.ArgumentException>
-;  V09 tmp5         [V09,T11] (  3,  0   )     ref  ->  rbx         class-hnd exact single-def "NewObj constructor temp" <System.ArgumentOutOfRangeException>
-;* V10 tmp6         [V10,T17] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;  V11 tmp7         [V11,T09] (  2,  2   )     ref  ->  rsi         class-hnd exact single-def "Inlining Arg" <System.Net.IPAddress+ReadOnlyIPAddress>
-;* V12 tmp8         [V12    ] (  0,  0   )     ref  ->  zero-ref    class-hnd exact single-def "NewObj constructor temp" <System.ArgumentOutOfRangeException>
-;* V13 tmp9         [V13    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;  V14 tmp10        [V14,T03] (  3,  6   )     ref  ->  r13         class-hnd exact single-def "NewObj constructor temp" <System.Net.Sockets.Socket>
-;  V15 tmp11        [V15,T06] (  2,  4   )     int  ->  r15         "finalizable newobj spill"
-;* V16 tmp12        [V16    ] (  0,  0   )     ref  ->  zero-ref    single-def "argument with side effect"
-;  V17 tmp13        [V17,T12] (  2,  0   )     ref  ->  r15         single-def "argument with side effect"
-;  V18 tmp14        [V18,T13] (  2,  0   )     ref  ->  rsi         single-def "argument with side effect"
-;  V19 tmp15        [V19,T14] (  2,  0   )     ref  ->  r15         single-def "argument with side effect"
-;  V20 tmp16        [V20,T15] (  2,  0   )     ref  ->  rdx         single-def "argument with side effect"
-;  V21 tmp17        [V21,T16] (  2,  0   )     ref  ->  rsi         single-def "argument with side effect"
+;  V05 tmp1         [V05,T04] (  2,  4   )     ref  ->  rax         class-hnd exact single-def "NewArr temp" <ubyte[]>
+;  V06 tmp2         [V06,T06] (  3,  3   )     ref  ->  r14         class-hnd exact single-def "NewObj constructor temp" <System.Net.IPEndPoint>
+;  V07 tmp3         [V07,T07] (  3,  3   )     ref  ->  r14         class-hnd exact single-def "NewObj constructor temp" <System.Net.IPEndPoint>
+;  V08 tmp4         [V08,T11] (  3,  0   )     ref  ->  rbx         class-hnd exact single-def "NewObj constructor temp" <System.ArgumentException>
+;  V09 tmp5         [V09,T12] (  3,  0   )     ref  ->  rbx         class-hnd exact single-def "NewObj constructor temp" <System.ArgumentOutOfRangeException>
+;* V10 tmp6         [V10,T18] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;  V11 tmp7         [V11,T03] (  3,  6   )     ref  ->  r13         class-hnd exact single-def "NewObj constructor temp" <System.Net.Sockets.Socket>
+;  V12 tmp8         [V12,T05] (  2,  4   )     int  ->  r15         "finalizable newobj spill"
+;  V13 tmp9         [V13,T09] (  2,  2   )     ref  ->  rsi         single-def "argument with side effect"
+;  V14 tmp10        [V14,T10] (  2,  2   )     ref  ->  rsi         single-def "argument with side effect"
+;  V15 tmp11        [V15,T13] (  2,  0   )     ref  ->  r15         single-def "argument with side effect"
+;  V16 tmp12        [V16,T14] (  2,  0   )     ref  ->  rsi         single-def "argument with side effect"
+;  V17 tmp13        [V17,T15] (  2,  0   )     ref  ->  r15         single-def "argument with side effect"
+;  V18 tmp14        [V18,T16] (  2,  0   )     ref  ->  rdx         single-def "argument with side effect"
+;  V19 tmp15        [V19,T17] (  2,  0   )     ref  ->  rsi         single-def "argument with side effect"
 ;
 ; Lcl frame size = 0
 
 G_M57935_IG01:
        push     rbp
        push     r15
        push     r14
        push     r13
        push     rbx
        lea      rbp, [rsp+0x20]
        mov      rbx, rdi
        mov      r15d, esi
        mov      r14d, edx
 						;; size=22 bbWeight=1 PerfScore 6.25
 G_M57935_IG02:
        mov      rdi, 0xD1FFAB1E      ; ubyte[]
        mov      esi, 0x10000
        call     CORINFO_HELP_NEWARR_1_VC
        lea      rdi, bword ptr [rbx+0x10]
        mov      rsi, rax
        call     CORINFO_HELP_ASSIGN_REF
        mov      dword ptr [rbx+0x18], 2
        test     r15d, r15d
-       jl       G_M57935_IG11
+       jl       G_M57935_IG13
 						;; size=48 bbWeight=1 PerfScore 5.50
 G_M57935_IG03:
        cmp      r15d, 0xFFFF
-       jg       G_M57935_IG11
+       jg       G_M57935_IG13
 						;; size=13 bbWeight=0.50 PerfScore 0.62
 G_M57935_IG04:
        cmp      r14d, 2
        je       SHORT G_M57935_IG06
 						;; size=6 bbWeight=1 PerfScore 1.25
 G_M57935_IG05:
        cmp      r14d, 23
-       jne      G_M57935_IG12
+       jne      G_M57935_IG14
 						;; size=10 bbWeight=0.50 PerfScore 0.62
 G_M57935_IG06:
        mov      dword ptr [rbx+0x18], r14d
        cmp      dword ptr [rbx+0x18], 2
-       je       SHORT G_M57935_IG08
+       je       SHORT G_M57935_IG09
 						;; size=10 bbWeight=1 PerfScore 5.00
 G_M57935_IG07:
        mov      rdi, 0xD1FFAB1E      ; System.Net.IPEndPoint
        call     CORINFO_HELP_NEWSFAST
        mov      r14, rax
-       mov      rsi, 0xD1FFAB1E      ; const ptr
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       G_M57935_IG16
+						;; size=37 bbWeight=0.50 PerfScore 2.88
+G_M57935_IG08:
+       mov      rsi, 0xD1FFAB1E      ; data for System.Net.IPAddress:IPv6Any
        mov      rsi, gword ptr [rsi]
        mov      rdi, r14
        mov      edx, r15d
        mov      rax, 0xD1FFAB1E      ; code for System.Net.IPEndPoint:.ctor(System.Net.IPAddress,int):this
        call     [rax]System.Net.IPEndPoint:.ctor(System.Net.IPAddress,int):this
-       jmp      SHORT G_M57935_IG09
-						;; size=51 bbWeight=0.50 PerfScore 4.75
-G_M57935_IG08:
+       jmp      SHORT G_M57935_IG11
+						;; size=33 bbWeight=0.50 PerfScore 4.00
+G_M57935_IG09:
        mov      rdi, 0xD1FFAB1E      ; System.Net.IPEndPoint
        call     CORINFO_HELP_NEWSFAST
        mov      r14, rax
-       mov      rdi, 0xD1FFAB1E      ; const ptr
-       mov      rsi, gword ptr [rdi]
-       mov      dword ptr [r14+0x10], r15d
-       lea      rdi, bword ptr [r14+0x08]
-       call     CORINFO_HELP_ASSIGN_REF
-						;; size=44 bbWeight=0.50 PerfScore 3.12
-G_M57935_IG09:
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       G_M57935_IG15
+						;; size=37 bbWeight=0.50 PerfScore 2.88
+G_M57935_IG10:
+       mov      rsi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      rsi, gword ptr [rsi]
+       mov      rdi, r14
+       mov      edx, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Net.IPEndPoint:.ctor(System.Net.IPAddress,int):this
+       call     [rax]System.Net.IPEndPoint:.ctor(System.Net.IPAddress,int):this
+						;; size=31 bbWeight=0.50 PerfScore 3.00
+G_M57935_IG11:
        mov      r15d, dword ptr [rbx+0x18]
        mov      rdi, 0xD1FFAB1E      ; System.Net.Sockets.Socket
        call     CORINFO_HELP_NEWFAST
        mov      r13, rax
        mov      rdi, r13
        mov      esi, r15d
        mov      edx, 2
        mov      ecx, 17
        mov      rax, 0xD1FFAB1E      ; code for System.Net.Sockets.Socket:.ctor(int,int,int):this
        call     [rax]System.Net.Sockets.Socket:.ctor(int,int,int):this
        lea      rdi, bword ptr [rbx+0x08]
        mov      rsi, r13
        call     CORINFO_HELP_ASSIGN_REF
        mov      rdi, gword ptr [rbx+0x08]
        mov      rsi, r14
        mov      rax, 0xD1FFAB1E      ; code for System.Net.Sockets.Socket:Bind(System.Net.EndPoint):this
        cmp      dword ptr [rdi], edi
 						;; size=81 bbWeight=1 PerfScore 15.00
-G_M57935_IG10:
+G_M57935_IG12:
        pop      rbx
        pop      r13
        pop      r14
        pop      r15
        pop      rbp
        tail.jmp [rax]System.Net.Sockets.Socket:Bind(System.Net.EndPoint):this
 						;; size=11 bbWeight=1 PerfScore 4.50
-G_M57935_IG11:
+G_M57935_IG13:
        mov      rdi, 0xD1FFAB1E      ; System.ArgumentOutOfRangeException
        call     CORINFO_HELP_NEWSFAST
        mov      rbx, rax
        mov      edi, 0x196D
        mov      rsi, 0xD1FFAB1E
        call     CORINFO_HELP_STRCNS
        mov      rsi, rax
        mov      rdi, rbx
        mov      rax, 0xD1FFAB1E      ; code for System.ArgumentOutOfRangeException:.ctor(System.String):this
        call     [rax]System.ArgumentOutOfRangeException:.ctor(System.String):this
        mov      rdi, rbx
        call     CORINFO_HELP_THROW
        int3     
 						;; size=65 bbWeight=0 PerfScore 0.00
-G_M57935_IG12:
+G_M57935_IG14:
        mov      rdi, 0xD1FFAB1E      ; System.ArgumentException
        call     CORINFO_HELP_NEWSFAST
        mov      rbx, rax
        mov      rax, 0xD1FFAB1E      ; code for System.SR:get_net_protocol_invalid_family():System.String
        call     [rax]System.SR:get_net_protocol_invalid_family():System.String
        mov      r15, rax
        mov      edi, 0x2B06
        mov      rsi, 0xD1FFAB1E
        call     CORINFO_HELP_STRCNS
        mov      rsi, rax
        mov      rdi, r15
        mov      rax, 0xD1FFAB1E      ; code for System.SR:Format(System.String,System.Object):System.String
        call     [rax]System.SR:Format(System.String,System.Object):System.String
        mov      r15, rax
        mov      edi, 0x2A6A
        mov      rsi, 0xD1FFAB1E
        call     CORINFO_HELP_STRCNS
        mov      rdx, rax
        mov      rsi, r15
        mov      rdi, rbx
        mov      rax, 0xD1FFAB1E      ; code for System.ArgumentException:.ctor(System.String,System.String):this
        call     [rax]System.ArgumentException:.ctor(System.String,System.String):this
        mov      rdi, rbx
        call     CORINFO_HELP_THROW
        int3     
 						;; size=124 bbWeight=0 PerfScore 0.00
+G_M57935_IG15:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      G_M57935_IG10
+						;; size=27 bbWeight=0 PerfScore 0.00
+G_M57935_IG16:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      G_M57935_IG08
+						;; size=27 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 485, prolog size 13, PerfScore 46.62, instruction count 106, allocated bytes for code 485 (MethodHash=a04c1db0) for method System.Net.Sockets.UdpClient:.ctor(int,int):this (FullOpts)
+; Total bytes of code 582, prolog size 13, PerfScore 51.50, instruction count 121, allocated bytes for code 582 (MethodHash=a04c1db0) for method System.Net.Sockets.UdpClient:.ctor(int,int):this (FullOpts)
 ; ============================================================
94 (97.92 % of base) - System.Net.IPNetwork:Equals(System.Object):ubyte:this
 ; Assembly listing for method System.Net.IPNetwork:Equals(System.Object):ubyte:this (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rbp based frame
 ; fully interruptible
 ; No PGO data
 ; 0 inlinees with PGO data; 2 single block inlinees; 3 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 this         [V00,T01] (  4,  3   )   byref  ->  rdi         this single-def
 ;  V01 arg1         [V01,T00] (  6,  4.25)     ref  ->  rsi         class-hnd single-def <System.Object>
 ;* V02 loc0         [V02    ] (  0,  0   )  struct (16) zero-ref    <System.Net.IPNetwork>
 ;# V03 OutArgs      [V03    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
 ;* V04 tmp1         [V04    ] (  0,  0   )     int  ->  zero-ref    "spilling qmarkNull"
 ;* V05 tmp2         [V05    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
 ;* V06 tmp3         [V06    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.Net.IPNetwork>
 ;* V07 tmp4         [V07    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
 ;* V08 tmp5         [V08    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "impAppendStmt" <System.Net.IPAddress>
-;  V09 tmp6         [V09,T04] (  2,  2   )     ref  ->  rdi         class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V10 tmp7         [V10,T02] (  5,  2.50)     ref  ->  rdi        
+;  V09 tmp6         [V09,T04] (  2,  2   )     ref  ->  r15         class-hnd single-def "dup spill" <System.Net.IPAddress>
+;  V10 tmp7         [V10,T02] (  4,  2   )     ref  ->  r15        
 ;* V11 tmp8         [V11    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V12 tmp9         [V12,T03] (  5,  2.50)     ref  ->  rsi        
+;  V12 tmp9         [V12,T03] (  4,  2   )     ref  ->  rsi        
 ;* V13 tmp10        [V13    ] (  0,  0   )     ref  ->  zero-ref    "field V02._baseAddress (fldOffset=0x0)" P-INDEP
 ;* V14 tmp11        [V14    ] (  0,  0   )     int  ->  zero-ref    "field V02.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
-;  V15 tmp12        [V15,T05] (  2,  1   )     ref  ->  rax         single-def "field V06._baseAddress (fldOffset=0x0)" P-INDEP
-;  V16 tmp13        [V16,T06] (  2,  1   )     int  ->  rsi         "field V06.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
+;  V15 tmp12        [V15,T05] (  2,  1   )     ref  ->  rbx         single-def "field V06._baseAddress (fldOffset=0x0)" P-INDEP
+;  V16 tmp13        [V16,T06] (  2,  1   )     int  ->  rax         "field V06.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 0
 
 G_M37668_IG01:
        push     rbp
-       mov      rbp, rsp
-						;; size=4 bbWeight=1 PerfScore 1.25
+       push     r15
+       push     rbx
+       lea      rbp, [rsp+0x10]
+						;; size=9 bbWeight=1 PerfScore 3.50
 G_M37668_IG02:
        test     rsi, rsi
-       je       SHORT G_M37668_IG06
+       je       SHORT G_M37668_IG10
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M37668_IG03:
        mov      rax, 0xD1FFAB1E      ; System.Net.IPNetwork
        cmp      qword ptr [rsi], rax
-       jne      SHORT G_M37668_IG06
+       jne      SHORT G_M37668_IG10
 						;; size=15 bbWeight=0.25 PerfScore 1.06
 G_M37668_IG04:
-       mov      rax, gword ptr [rsi+0x08]
-       mov      esi, dword ptr [rsi+0x10]
-       cmp      dword ptr [rdi+0x08], esi
+       mov      rbx, gword ptr [rsi+0x08]
+       mov      eax, dword ptr [rsi+0x10]
+       cmp      dword ptr [rdi+0x08], eax
+       jne      SHORT G_M37668_IG10
+       mov      r15, gword ptr [rdi]
+       test     r15, r15
        jne      SHORT G_M37668_IG06
-       mov      rdi, gword ptr [rdi]
-       mov      rsi, 0xD1FFAB1E      ; const ptr
-       test     rdi, rdi
-       cmove    rdi, gword ptr [rsi]
-       mov      rsi, rax
-       mov      rax, 0xD1FFAB1E      ; const ptr
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M37668_IG12
+						;; size=35 bbWeight=0.50 PerfScore 7.75
+G_M37668_IG05:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      r15, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M37668_IG06:
+       mov      rsi, rbx
        test     rsi, rsi
-       cmove    rsi, gword ptr [rax]
+       jne      SHORT G_M37668_IG08
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M37668_IG13
+						;; size=23 bbWeight=0.50 PerfScore 2.88
+G_M37668_IG07:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      rsi, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M37668_IG08:
+       mov      rdi, r15
        mov      rax, 0xD1FFAB1E      ; code for System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
        cmp      dword ptr [rdi], edi
-						;; size=64 bbWeight=0.50 PerfScore 10.25
-G_M37668_IG05:
+						;; size=15 bbWeight=0.50 PerfScore 1.75
+G_M37668_IG09:
+       pop      rbx
+       pop      r15
        pop      rbp
        tail.jmp [rax]System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
-						;; size=4 bbWeight=0.50 PerfScore 1.25
-G_M37668_IG06:
+						;; size=7 bbWeight=0.50 PerfScore 1.75
+G_M37668_IG10:
        xor      eax, eax
 						;; size=2 bbWeight=0.50 PerfScore 0.12
-G_M37668_IG07:
+G_M37668_IG11:
+       pop      rbx
+       pop      r15
        pop      rbp
        ret      
-						;; size=2 bbWeight=0.50 PerfScore 0.75
+						;; size=5 bbWeight=0.50 PerfScore 1.25
+G_M37668_IG12:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M37668_IG05
+						;; size=24 bbWeight=0 PerfScore 0.00
+G_M37668_IG13:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M37668_IG07
+						;; size=24 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 96, prolog size 4, PerfScore 15.94, instruction count 26, allocated bytes for code 96 (MethodHash=bd736cdb) for method System.Net.IPNetwork:Equals(System.Object):ubyte:this (FullOpts)
+; Total bytes of code 190, prolog size 9, PerfScore 23.56, instruction count 49, allocated bytes for code 190 (MethodHash=bd736cdb) for method System.Net.IPNetwork:Equals(System.Object):ubyte:this (FullOpts)
 ; ============================================================
94 (37.90 % of base) - System.Net.Sockets.UdpClient:ReceiveAsync(System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this
 ; Assembly listing for method System.Net.Sockets.UdpClient:ReceiveAsync(System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rbp based frame
 ; partially interruptible
 ; No PGO data
 ; 0 inlinees with PGO data; 2 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 this         [V00,T00] (  8,  8   )     ref  ->  rbx         this class-hnd single-def <System.Net.Sockets.UdpClient>
 ;  V01 RetBuf       [V01,T02] (  4,  4   )    long  ->  r15         single-def
 ;* V02 arg1         [V02    ] (  0,  0   )  struct ( 8) zero-ref    single-def <System.Threading.CancellationToken>
 ;  V03 OutArgs      [V03    ] (  1,  1   )  struct (32) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <UNNAMED>
-;  V04 tmp1         [V04,T06] (  2,  4   )     ref  ->  rdi         class-hnd single-def "impAppendStmt" <System.Net.Sockets.Socket>
+;  V04 tmp1         [V04,T06] (  2,  4   )     ref  ->  r13         class-hnd single-def "impAppendStmt" <System.Net.Sockets.Socket>
 ;* V05 tmp2         [V05    ] (  0,  0   )  struct (16) zero-ref    "impAppendStmt" <System.Memory`1[ubyte]>
 ;* V06 tmp3         [V06    ] (  0,  0   )     ref  ->  zero-ref    single-def
 ;* V07 tmp4         [V07    ] (  0,  0   )     ref  ->  zero-ref    single-def
 ;* V08 tmp5         [V08    ] (  0,  0   )  struct (16) zero-ref    <System.Memory`1[ubyte]>
 ;* V09 tmp6         [V09    ] (  0,  0   )     int  ->  zero-ref   
 ;* V10 tmp7         [V10    ] (  0,  0   )     ref  ->  zero-ref    single-def
 ;* V11 tmp8         [V11    ] (  0,  0   )     ref  ->  zero-ref    single-def
 ;* V12 tmp9         [V12    ] (  0,  0   )  struct (16) zero-ref    <System.Memory`1[ubyte]>
 ;* V13 tmp10        [V13    ] (  0,  0   )     int  ->  zero-ref   
 ;  V14 tmp11        [V14,T09] (  3,  2   )     ref  ->   r9        
-;  V15 tmp12        [V15,T08] (  2,  4   )  struct (32) [rbp-0x38]  do-not-enreg[HS] must-init hidden-struct-arg "spilled call-like call argument" <System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]>
-;  V16 tmp13        [V16,T01] (  4,  6   )     ref  ->  rdx         class-hnd single-def "Inlining Arg" <ubyte[]>
-;  V17 tmp14        [V17    ] (  8, 10   )  struct (16) [rbp-0x48]  do-not-enreg[SFA] multireg-arg must-init ld-addr-op "NewObj constructor temp" <System.Memory`1[ubyte]>
+;  V15 tmp12        [V15,T08] (  2,  4   )  struct (32) [rbp-0x40]  do-not-enreg[HS] must-init hidden-struct-arg "spilled call-like call argument" <System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]>
+;  V16 tmp13        [V16,T01] (  4,  6   )     ref  ->  rdi         class-hnd single-def "Inlining Arg" <ubyte[]>
+;  V17 tmp14        [V17    ] (  8, 10   )  struct (16) [rbp-0x50]  do-not-enreg[SFA] multireg-arg must-init ld-addr-op "NewObj constructor temp" <System.Memory`1[ubyte]>
 ;  V18 tmp15        [V18,T07] (  2,  2   )     ref  ->  r14         single-def "field V02._source (fldOffset=0x0)" P-INDEP
 ;* V19 tmp16        [V19    ] (  0,  0   )     ref  ->  zero-ref    single-def "field V05._object (fldOffset=0x0)" P-INDEP
 ;* V20 tmp17        [V20    ] (  0,  0   )     int  ->  zero-ref    "field V05._index (fldOffset=0x8)" P-INDEP
 ;* V21 tmp18        [V21    ] (  0,  0   )     int  ->  zero-ref    "field V05._length (fldOffset=0xc)" P-INDEP
 ;* V22 tmp19        [V22    ] (  0,  0   )     ref  ->  zero-ref    single-def "field V08._object (fldOffset=0x0)" P-INDEP
 ;* V23 tmp20        [V23    ] (  0,  0   )     int  ->  zero-ref    "field V08._index (fldOffset=0x8)" P-INDEP
 ;* V24 tmp21        [V24    ] (  0,  0   )     int  ->  zero-ref    "field V08._length (fldOffset=0xc)" P-INDEP
 ;* V25 tmp22        [V25    ] (  0,  0   )     ref  ->  zero-ref    single-def "field V12._object (fldOffset=0x0)" P-INDEP
 ;* V26 tmp23        [V26    ] (  0,  0   )     int  ->  zero-ref    "field V12._index (fldOffset=0x8)" P-INDEP
 ;* V27 tmp24        [V27    ] (  0,  0   )     int  ->  zero-ref    "field V12._length (fldOffset=0xc)" P-INDEP
-;  V28 tmp25        [V28,T03] (  4,  5   )     ref  ->  [rbp-0x48]  do-not-enreg[] "field V17._object (fldOffset=0x0)" P-DEP
-;  V29 tmp26        [V29,T04] (  4,  5   )     int  ->  [rbp-0x40]  do-not-enreg[] "field V17._index (fldOffset=0x8)" P-DEP
-;  V30 tmp27        [V30,T05] (  4,  5   )     int  ->  [rbp-0x3C]  do-not-enreg[] "field V17._length (fldOffset=0xc)" P-DEP
+;  V28 tmp25        [V28,T03] (  4,  5   )     ref  ->  [rbp-0x50]  do-not-enreg[] "field V17._object (fldOffset=0x0)" P-DEP
+;  V29 tmp26        [V29,T04] (  4,  5   )     int  ->  [rbp-0x48]  do-not-enreg[] "field V17._index (fldOffset=0x8)" P-DEP
+;  V30 tmp27        [V30,T05] (  4,  5   )     int  ->  [rbp-0x44]  do-not-enreg[] "field V17._length (fldOffset=0xc)" P-DEP
 ;
-; Lcl frame size = 88
+; Lcl frame size = 80
 
 G_M25660_IG01:
        push     rbp
        push     r15
        push     r14
+       push     r13
        push     rbx
-       sub      rsp, 88
+       sub      rsp, 80
        lea      rbp, [rsp+0x70]
-       xor      eax, eax
-       mov      qword ptr [rbp-0x48], rax
        vxorps   xmm8, xmm8, xmm8
-       vmovdqu  ymmword ptr [rbp-0x40], ymm8
-       mov      qword ptr [rbp-0x20], rax
+       vmovdqu  ymmword ptr [rbp-0x50], ymm8
+       vmovdqa  xmmword ptr [rbp-0x30], xmm8
        mov      rbx, rdi
        mov      r15, rsi
        mov      r14, rdx
-						;; size=44 bbWeight=1 PerfScore 10.08
+						;; size=41 bbWeight=1 PerfScore 10.83
 G_M25660_IG02:
        movzx    rdi, byte  ptr [rbx+0x1D]
        mov      rsi, rbx
        mov      rax, 0xD1FFAB1E      ; code for System.ObjectDisposedException:ThrowIf(ubyte,System.Object)
        call     [rax]System.ObjectDisposedException:ThrowIf(ubyte,System.Object)
-       mov      rdi, gword ptr [rbx+0x08]
-       mov      rdx, gword ptr [rbx+0x10]
-       test     rdx, rdx
+       mov      r13, gword ptr [rbx+0x08]
+       mov      rdi, gword ptr [rbx+0x10]
+       test     rdi, rdi
        je       SHORT G_M25660_IG04
 						;; size=33 bbWeight=1 PerfScore 10.75
 G_M25660_IG03:
-       mov      gword ptr [rbp-0x48], rdx
-       xor      ecx, ecx
-       mov      dword ptr [rbp-0x40], ecx
-       mov      edx, dword ptr [rdx+0x08]
-       mov      dword ptr [rbp-0x3C], edx
+       mov      gword ptr [rbp-0x50], rdi
+       xor      eax, eax
+       mov      dword ptr [rbp-0x48], eax
+       mov      edi, dword ptr [rdi+0x08]
+       mov      dword ptr [rbp-0x44], edi
        jmp      SHORT G_M25660_IG07
 						;; size=17 bbWeight=0.50 PerfScore 3.62
 G_M25660_IG04:
-       xor      rdx, rdx
-       mov      gword ptr [rbp-0x48], rdx
+       xor      rdi, rdi
+       mov      gword ptr [rbp-0x50], rdi
 						;; size=6 bbWeight=0.50 PerfScore 0.62
 G_M25660_IG05:
-       mov      dword ptr [rbp-0x40], edx
+       mov      dword ptr [rbp-0x48], edi
 						;; size=3 bbWeight=0.50 PerfScore 0.50
 G_M25660_IG06:
-       mov      dword ptr [rbp-0x3C], edx
+       mov      dword ptr [rbp-0x44], edi
 						;; size=3 bbWeight=0.50 PerfScore 0.50
 G_M25660_IG07:
        cmp      dword ptr [rbx+0x18], 2
-       jne      SHORT G_M25660_IG09
+       jne      SHORT G_M25660_IG10
 						;; size=6 bbWeight=1 PerfScore 4.00
 G_M25660_IG08:
-       mov      rdx, 0xD1FFAB1E      ; const ptr
-       mov      r9, gword ptr [rdx]
-       jmp      SHORT G_M25660_IG10
-						;; size=15 bbWeight=0.50 PerfScore 2.12
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       G_M25660_IG15
+						;; size=19 bbWeight=0.50 PerfScore 2.12
 G_M25660_IG09:
-       mov      r9, 0xD1FFAB1E      ; const ptr
-       mov      r9, gword ptr [r9]
-						;; size=13 bbWeight=0.50 PerfScore 1.12
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPEndPointStatics:Any
+       mov      r9, gword ptr [rdi]
+       jmp      SHORT G_M25660_IG12
+						;; size=15 bbWeight=0.50 PerfScore 2.12
 G_M25660_IG10:
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       G_M25660_IG14
+						;; size=19 bbWeight=0.50 PerfScore 2.12
+G_M25660_IG11:
+       mov      rdx, 0xD1FFAB1E      ; data for System.Net.IPEndPointStatics:IPv6Any
+       mov      r9, gword ptr [rdx]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M25660_IG12:
        mov      gword ptr [rsp], r14
-       mov      rdx, gword ptr [rbp-0x48]
-       mov      rcx, qword ptr [rbp-0x40]
-       lea      rsi, [rbp-0x38]
+       mov      rdx, gword ptr [rbp-0x50]
+       mov      rcx, qword ptr [rbp-0x48]
+       lea      rsi, [rbp-0x40]
+       mov      rdi, r13
        xor      r8d, r8d
        mov      rax, 0xD1FFAB1E      ; code for System.Net.Sockets.Socket:ReceiveFromAsync(System.Memory`1[ubyte],int,System.Net.EndPoint,System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]:this
        cmp      dword ptr [rdi], edi
        call     [rax]System.Net.Sockets.Socket:ReceiveFromAsync(System.Memory`1[ubyte],int,System.Net.EndPoint,System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]:this
        lea      rdi, [rsp]
-       lea      rsi, [rbp-0x38]
+       lea      rsi, [rbp-0x40]
        mov      rcx, gword ptr [rsi]
        mov      gword ptr [rsp], rcx
        add      rsi, 8
        add      rdi, 8
        movsq    
        mov      rcx, gword ptr [rsi]
        mov      gword ptr [rsp+0x10], rcx
        add      rsi, 8
        add      rdi, 8
        movsq    
        mov      rdi, rbx
        mov      rsi, r15
        mov      rax, 0xD1FFAB1E      ; code for System.Net.Sockets.UdpClient:<ReceiveAsync>g__WaitAndWrap|62_0(System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this
        call     [rax]System.Net.Sockets.UdpClient:<ReceiveAsync>g__WaitAndWrap|62_0(System.Threading.Tasks.ValueTask`1[System.Net.Sockets.SocketReceiveFromResult]):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this
        mov      rax, r15
-						;; size=97 bbWeight=1 PerfScore 24.00
-G_M25660_IG11:
-       add      rsp, 88
+						;; size=100 bbWeight=1 PerfScore 24.25
+G_M25660_IG13:
+       add      rsp, 80
        pop      rbx
+       pop      r13
        pop      r14
        pop      r15
        pop      rbp
        ret      
-						;; size=11 bbWeight=1 PerfScore 3.25
+						;; size=13 bbWeight=1 PerfScore 3.75
+G_M25660_IG14:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPEndPointStatics
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      G_M25660_IG11
+						;; size=27 bbWeight=0 PerfScore 0.00
+G_M25660_IG15:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPEndPointStatics
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      G_M25660_IG09
+						;; size=27 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 248, prolog size 35, PerfScore 60.58, instruction count 70, allocated bytes for code 248 (MethodHash=0f3d9bc3) for method System.Net.Sockets.UdpClient:ReceiveAsync(System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this (FullOpts)
+; Total bytes of code 342, prolog size 32, PerfScore 66.33, instruction count 85, allocated bytes for code 342 (MethodHash=0f3d9bc3) for method System.Net.Sockets.UdpClient:ReceiveAsync(System.Threading.CancellationToken):System.Threading.Tasks.ValueTask`1[System.Net.Sockets.UdpReceiveResult]:this (FullOpts)
 ; ============================================================
91 (108.33 % of base) - System.Net.IPNetwork:op_Inequality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte
 ; Assembly listing for method System.Net.IPNetwork:op_Inequality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rbp based frame
 ; partially interruptible
 ; No PGO data
 ; 0 inlinees with PGO data; 3 single block inlinees; 3 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;* V00 arg0         [V00    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.Net.IPNetwork>
 ;* V01 arg1         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.Net.IPNetwork>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
 ;* V03 tmp1         [V03    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.Net.IPNetwork>
-;  V04 tmp2         [V04,T06] (  3,  2   )   ubyte  ->  rbx         "Inline return value spill temp"
+;  V04 tmp2         [V04,T06] (  3,  2   )   ubyte  ->  r15         "Inline return value spill temp"
 ;* V05 tmp3         [V05    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.Net.IPNetwork>
 ;* V06 tmp4         [V06    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
 ;* V07 tmp5         [V07    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "impAppendStmt" <System.Net.IPAddress>
 ;* V08 tmp6         [V08    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V09 tmp7         [V09,T04] (  5,  2.50)     ref  ->  rdi        
+;  V09 tmp7         [V09,T04] (  4,  2   )     ref  ->  r15        
 ;* V10 tmp8         [V10    ] (  0,  0   )     ref  ->  zero-ref    class-hnd single-def "dup spill" <System.Net.IPAddress>
-;  V11 tmp9         [V11,T05] (  5,  2.50)     ref  ->  rsi        
+;  V11 tmp9         [V11,T05] (  4,  2   )     ref  ->  rsi        
 ;  V12 tmp10        [V12,T02] (  2,  1.50)     ref  ->  rdi         single-def "field V00._baseAddress (fldOffset=0x0)" P-INDEP
 ;  V13 tmp11        [V13,T00] (  2,  2   )     int  ->  rsi         single-def "field V00.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
-;  V14 tmp12        [V14,T03] (  2,  1.50)     ref  ->  rdx         single-def "field V01._baseAddress (fldOffset=0x0)" P-INDEP
+;  V14 tmp12        [V14,T03] (  2,  1.50)     ref  ->  rbx         single-def "field V01._baseAddress (fldOffset=0x0)" P-INDEP
 ;  V15 tmp13        [V15,T01] (  2,  2   )     int  ->  rcx         single-def "field V01.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
 ;* V16 tmp14        [V16    ] (  0,  0   )     ref  ->  zero-ref    single-def "field V03._baseAddress (fldOffset=0x0)" P-INDEP
 ;* V17 tmp15        [V17    ] (  0,  0   )     int  ->  zero-ref    "field V03.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
 ;* V18 tmp16        [V18    ] (  0,  0   )     ref  ->  zero-ref    single-def "field V05._baseAddress (fldOffset=0x0)" P-INDEP
 ;* V19 tmp17        [V19    ] (  0,  0   )     int  ->  zero-ref    "field V05.<PrefixLength>k__BackingField (fldOffset=0x8)" P-INDEP
 ;
-; Lcl frame size = 8
+; Lcl frame size = 0
 
 G_M22682_IG01:
        push     rbp
+       push     r15
        push     rbx
-       push     rax
        lea      rbp, [rsp+0x10]
-						;; size=8 bbWeight=1 PerfScore 3.50
+       mov      rbx, rdx
+						;; size=12 bbWeight=1 PerfScore 3.75
 G_M22682_IG02:
        cmp      esi, ecx
        je       SHORT G_M22682_IG04
 						;; size=4 bbWeight=1 PerfScore 1.25
 G_M22682_IG03:
-       xor      ebx, ebx
-       jmp      SHORT G_M22682_IG05
-						;; size=4 bbWeight=0.50 PerfScore 1.12
+       xor      r15d, r15d
+       jmp      SHORT G_M22682_IG09
+						;; size=5 bbWeight=0.50 PerfScore 1.12
 G_M22682_IG04:
-       mov      rsi, 0xD1FFAB1E      ; const ptr
-       test     rdi, rdi
-       cmove    rdi, gword ptr [rsi]
-       mov      rsi, rdx
-       mov      rax, 0xD1FFAB1E      ; const ptr
+       mov      r15, rdi
+       test     r15, r15
+       jne      SHORT G_M22682_IG06
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M22682_IG11
+						;; size=23 bbWeight=0.50 PerfScore 2.88
+G_M22682_IG05:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      r15, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M22682_IG06:
+       mov      rsi, rbx
        test     rsi, rsi
-       cmove    rsi, gword ptr [rax]
+       jne      SHORT G_M22682_IG08
+       mov      rdi, 0xD1FFAB1E      ; global ptr
+       test     byte  ptr [rdi], 1
+       je       SHORT G_M22682_IG12
+						;; size=23 bbWeight=0.50 PerfScore 2.88
+G_M22682_IG07:
+       mov      rdi, 0xD1FFAB1E      ; data for System.Net.IPAddress:Any
+       mov      rsi, gword ptr [rdi]
+						;; size=13 bbWeight=0.50 PerfScore 1.12
+G_M22682_IG08:
+       mov      rdi, r15
        mov      rax, 0xD1FFAB1E      ; code for System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
        cmp      dword ptr [rdi], edi
        call     [rax]System.Net.IPAddress:Equals(System.Net.IPAddress):ubyte:this
-       mov      ebx, eax
-						;; size=53 bbWeight=0.50 PerfScore 6.88
-G_M22682_IG05:
-       test     ebx, ebx
+       mov      r15d, eax
+						;; size=20 bbWeight=0.50 PerfScore 3.38
+G_M22682_IG09:
+       test     r15d, r15d
        sete     al
        movzx    rax, al
-						;; size=8 bbWeight=1 PerfScore 1.50
-G_M22682_IG06:
-       add      rsp, 8
+						;; size=9 bbWeight=1 PerfScore 1.50
+G_M22682_IG10:
        pop      rbx
+       pop      r15
        pop      rbp
        ret      
-						;; size=7 bbWeight=1 PerfScore 2.25
+						;; size=5 bbWeight=1 PerfScore 2.50
+G_M22682_IG11:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M22682_IG05
+						;; size=24 bbWeight=0 PerfScore 0.00
+G_M22682_IG12:
+       mov      rdi, 0xD1FFAB1E      ; System.Net.IPAddress
+       mov      rax, 0xD1FFAB1E      ; code for CORINFO_HELP_GET_GCSTATIC_BASE
+       call     [rax]CORINFO_HELP_GET_GCSTATIC_BASE
+       jmp      SHORT G_M22682_IG07
+						;; size=24 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 84, prolog size 8, PerfScore 16.50, instruction count 26, allocated bytes for code 84 (MethodHash=500ea765) for method System.Net.IPNetwork:op_Inequality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
+; Total bytes of code 175, prolog size 9, PerfScore 21.50, instruction count 45, allocated bytes for code 175 (MethodHash=500ea765) for method System.Net.IPNetwork:op_Inequality(System.Net.IPNetwork,System.Net.IPNetwork):ubyte (FullOpts)
 ; ============================================================

Note: some changes were skipped as they were too large to fit into a comment.

Larger list of diffs: https://gist.github.com/MihuBot/28f98a625005d40b4163da4c4c1415e5

@MihuBot
Copy link
Owner Author

MihuBot commented Mar 1, 2025

Top method improvements

-14 (-16.09 % of base) - System.Net.IPAddress:ReadUInt16NumbersFromBytes(System.ReadOnlySpan`1[ubyte]):ushort[]
 ; Assembly listing for method System.Net.IPAddress:ReadUInt16NumbersFromBytes(System.ReadOnlySpan`1[ubyte]):ushort[] (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
-; rbp based frame
+; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 1 single block inlinees; 2 inlinees without PGO data
+; 0 inlinees with PGO data; 0 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;* V00 arg0         [V00    ] (  0,  0   )  struct (16) zero-ref    multireg-arg ld-addr-op single-def <System.ReadOnlySpan`1[ubyte]>
 ;  V01 loc0         [V01,T02] (  3,  3   )     ref  ->  rax         class-hnd exact single-def <ushort[]>
 ;  V02 loc1         [V02,T03] (  5,  5   )  simd16  ->  mm0         <System.Runtime.Intrinsics.Vector128`1[ushort]>
 ;* V03 loc2         [V03    ] (  0,  0   )     int  ->  zero-ref   
 ;# V04 OutArgs      [V04    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
 ;* V05 tmp1         [V05    ] (  0,  0   )     ref  ->  zero-ref    class-hnd exact "NewArr temp" <ushort[]>
-;* V06 tmp2         [V06    ] (  0,  0   )  simd16  ->  zero-ref    "spilled call-like call argument"
-;* V07 tmp3         [V07    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[ubyte]>
-;* V08 tmp4         [V08    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[ubyte]>
-;  V09 tmp5         [V09,T00] (  2,  2   )   byref  ->  rbx         single-def "field V00._reference (fldOffset=0x0)" P-INDEP
-;  V10 tmp6         [V10,T01] (  2,  2   )     int  ->  r15         single-def "field V00._length (fldOffset=0x8)" P-INDEP
-;* V11 tmp7         [V11    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V07._reference (fldOffset=0x0)" P-INDEP
-;* V12 tmp8         [V12    ] (  0,  0   )     int  ->  zero-ref    "field V07._length (fldOffset=0x8)" P-INDEP
-;* V13 tmp9         [V13    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V08._reference (fldOffset=0x0)" P-INDEP
-;* V14 tmp10        [V14    ] (  0,  0   )     int  ->  zero-ref    "field V08._length (fldOffset=0x8)" P-INDEP
+;  V06 tmp2         [V06,T00] (  2,  2   )   byref  ->  rbx         single-def "field V00._reference (fldOffset=0x0)" P-INDEP
+;  V07 tmp3         [V07,T01] (  2,  2   )     int  ->  r15         single-def "field V00._length (fldOffset=0x8)" P-INDEP
 ;
-; Lcl frame size = 0
+; Lcl frame size = 8
 
 G_M25290_IG01:
-       push     rbp
        push     r15
        push     rbx
-       lea      rbp, [rsp+0x10]
+       push     rax
        mov      rbx, rdi
        mov      r15d, esi
-						;; size=15 bbWeight=1 PerfScore 4.00
+						;; size=10 bbWeight=1 PerfScore 3.50
 G_M25290_IG02:
        mov      rdi, 0xD1FFAB1E      ; ushort[]
        mov      esi, 8
        call     CORINFO_HELP_NEWARR_1_VC
        cmp      r15d, 16
-       jl       SHORT G_M25290_IG04
+       jbe      SHORT G_M25290_IG04
        vmovups  xmm0, xmmword ptr [rbx]
        vpsllw   xmm1, xmm0, 8
        vpsrlw   xmm0, xmm0, 8
        vpor     xmm0, xmm0, xmm1
        vmovups  xmmword ptr [rax+0x10], xmm0
 						;; size=49 bbWeight=1 PerfScore 11.08
 G_M25290_IG03:
+       add      rsp, 8
        pop      rbx
        pop      r15
-       pop      rbp
        ret      
-						;; size=5 bbWeight=1 PerfScore 2.50
+						;; size=8 bbWeight=1 PerfScore 2.25
 G_M25290_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 87, prolog size 9, PerfScore 19.46, instruction count 24, allocated bytes for code 87 (MethodHash=f3099d35) for method System.Net.IPAddress:ReadUInt16NumbersFromBytes(System.ReadOnlySpan`1[ubyte]):ushort[] (FullOpts)
+; Total bytes of code 73, prolog size 4, PerfScore 16.83, instruction count 21, allocated bytes for code 73 (MethodHash=f3099d35) for method System.Net.IPAddress:ReadUInt16NumbersFromBytes(System.ReadOnlySpan`1[ubyte]):ushort[] (FullOpts)
 ; ============================================================
-12 (-32.43 % of base) - System.Numerics.Vector4:.ctor(System.ReadOnlySpan`1[float]):this
 ; Assembly listing for method System.Numerics.Vector4:.ctor(System.ReadOnlySpan`1[float]):this (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 2 single block inlinees; 1 inlinees without PGO data
+; 0 inlinees with PGO data; 1 single block inlinees; 0 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 this         [V00,T00] (  3,  3   )   byref  ->  rdi         this single-def
 ;* V01 arg1         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.ReadOnlySpan`1[float]>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;* V03 tmp1         [V03    ] (  0,  0   )  simd16  ->  zero-ref    "spilled call-like call argument"
-;* V04 tmp2         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[float]>
-;* V05 tmp3         [V05    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[float]>
-;  V06 tmp4         [V06,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
-;  V07 tmp5         [V07,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
-;* V08 tmp6         [V08    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V04._reference (fldOffset=0x0)" P-INDEP
-;* V09 tmp7         [V09    ] (  0,  0   )     int  ->  zero-ref    "field V04._length (fldOffset=0x8)" P-INDEP
-;* V10 tmp8         [V10    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V05._reference (fldOffset=0x0)" P-INDEP
-;* V11 tmp9         [V11    ] (  0,  0   )     int  ->  zero-ref    "field V05._length (fldOffset=0x8)" P-INDEP
+;  V03 tmp1         [V03,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
+;  V04 tmp2         [V04,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 8
 
 G_M26767_IG01:
        push     rax
 						;; size=1 bbWeight=1 PerfScore 1.00
 G_M26767_IG02:
        cmp      edx, 4
-       jl       SHORT G_M26767_IG04
+       jbe      SHORT G_M26767_IG04
        vmovups  xmm0, xmmword ptr [rsi]
        vmovups  xmmword ptr [rdi], xmm0
 						;; size=13 bbWeight=1 PerfScore 7.25
 G_M26767_IG03:
        add      rsp, 8
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M26767_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 37, prolog size 1, PerfScore 11.38, instruction count 11, allocated bytes for code 37 (MethodHash=27c19770) for method System.Numerics.Vector4:.ctor(System.ReadOnlySpan`1[float]):this (FullOpts)
+; Total bytes of code 25, prolog size 1, PerfScore 9.50, instruction count 9, allocated bytes for code 25 (MethodHash=27c19770) for method System.Numerics.Vector4:.ctor(System.ReadOnlySpan`1[float]):this (FullOpts)
 ; ============================================================
-12 (-32.43 % of base) - System.Numerics.Vector4:Create(System.ReadOnlySpan`1[float]):System.Numerics.Vector4
 ; Assembly listing for method System.Numerics.Vector4:Create(System.ReadOnlySpan`1[float]):System.Numerics.Vector4 (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 1 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;* V00 arg0         [V00    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.ReadOnlySpan`1[float]>
 ;# V01 OutArgs      [V01    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;  V02 tmp1         [V02,T02] (  2,  4   )  simd16  ->  mm0         multireg-ret "spilled call-like call argument"
-;* V03 tmp2         [V03    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[float]>
-;* V04 tmp3         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[float]>
-;  V05 tmp4         [V05,T00] (  2,  2   )   byref  ->  rdi         single-def "field V00._reference (fldOffset=0x0)" P-INDEP
-;  V06 tmp5         [V06,T01] (  2,  2   )     int  ->  rsi         single-def "field V00._length (fldOffset=0x8)" P-INDEP
-;* V07 tmp6         [V07    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V03._reference (fldOffset=0x0)" P-INDEP
-;* V08 tmp7         [V08    ] (  0,  0   )     int  ->  zero-ref    "field V03._length (fldOffset=0x8)" P-INDEP
-;* V09 tmp8         [V09    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V04._reference (fldOffset=0x0)" P-INDEP
-;* V10 tmp9         [V10    ] (  0,  0   )     int  ->  zero-ref    "field V04._length (fldOffset=0x8)" P-INDEP
+;  V02 tmp1         [V02,T02] (  2,  4   )  simd16  ->  mm0         multireg-ret "Return value temp for multireg return" <System.Numerics.Vector4>
+;  V03 tmp2         [V03,T00] (  2,  2   )   byref  ->  rdi         single-def "field V00._reference (fldOffset=0x0)" P-INDEP
+;  V04 tmp3         [V04,T01] (  2,  2   )     int  ->  rsi         single-def "field V00._length (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 8
 
 G_M17785_IG01:
        push     rax
 						;; size=1 bbWeight=1 PerfScore 1.00
 G_M17785_IG02:
        cmp      esi, 4
-       jl       SHORT G_M17785_IG04
+       jbe      SHORT G_M17785_IG04
        vmovups  xmm0, xmmword ptr [rdi]
        vmovhlps xmm1, xmm1, xmm0
 						;; size=13 bbWeight=1 PerfScore 6.25
 G_M17785_IG03:
        add      rsp, 8
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M17785_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 37, prolog size 1, PerfScore 10.38, instruction count 11, allocated bytes for code 37 (MethodHash=c7afba86) for method System.Numerics.Vector4:Create(System.ReadOnlySpan`1[float]):System.Numerics.Vector4 (FullOpts)
+; Total bytes of code 25, prolog size 1, PerfScore 8.50, instruction count 9, allocated bytes for code 25 (MethodHash=c7afba86) for method System.Numerics.Vector4:Create(System.ReadOnlySpan`1[float]):System.Numerics.Vector4 (FullOpts)
 ; ============================================================
-12 (-30.00 % of base) - System.Runtime.Intrinsics.Vector128`1[double]:System.Runtime.Intrinsics.ISimdVector,T>.Create(System.ReadOnlySpan`1[double]):System.Runtime.Intrinsics.Vector128`1[double]
 ; Assembly listing for method System.Runtime.Intrinsics.Vector128`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[double]):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 1 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 RetBuf       [V00,T00] (  4,  4   )    long  ->  rdi         single-def
 ;* V01 arg0         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.ReadOnlySpan`1[double]>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;* V03 tmp1         [V03    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[double]>
-;* V04 tmp2         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[double]>
-;  V05 tmp3         [V05,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
-;  V06 tmp4         [V06,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
-;* V07 tmp5         [V07    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V03._reference (fldOffset=0x0)" P-INDEP
-;* V08 tmp6         [V08    ] (  0,  0   )     int  ->  zero-ref    "field V03._length (fldOffset=0x8)" P-INDEP
-;* V09 tmp7         [V09    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V04._reference (fldOffset=0x0)" P-INDEP
-;* V10 tmp8         [V10    ] (  0,  0   )     int  ->  zero-ref    "field V04._length (fldOffset=0x8)" P-INDEP
+;  V03 tmp1         [V03,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
+;  V04 tmp2         [V04,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 8
 
 G_M21567_IG01:
        push     rax
 						;; size=1 bbWeight=1 PerfScore 1.00
 G_M21567_IG02:
        cmp      edx, 2
-       jl       SHORT G_M21567_IG04
+       jbe      SHORT G_M21567_IG04
        vmovups  xmm0, xmmword ptr [rsi]
        vmovups  xmmword ptr [rdi], xmm0
        mov      rax, rdi
 						;; size=16 bbWeight=1 PerfScore 7.50
 G_M21567_IG03:
        add      rsp, 8
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M21567_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 40, prolog size 1, PerfScore 11.62, instruction count 12, allocated bytes for code 40 (MethodHash=6168abc0) for method System.Runtime.Intrinsics.Vector128`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[double]):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts)
+; Total bytes of code 28, prolog size 1, PerfScore 9.75, instruction count 10, allocated bytes for code 28 (MethodHash=6168abc0) for method System.Runtime.Intrinsics.Vector128`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[double]):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts)
 ; ============================================================
-12 (-30.00 % of base) - System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector,T>.Create(System.ReadOnlySpan`1[int]):System.Runtime.Intrinsics.Vector128`1[int]
 ; Assembly listing for method System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[int]):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 1 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 RetBuf       [V00,T00] (  4,  4   )    long  ->  rdi         single-def
 ;* V01 arg0         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.ReadOnlySpan`1[int]>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;* V03 tmp1         [V03    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[int]>
-;* V04 tmp2         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[int]>
-;  V05 tmp3         [V05,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
-;  V06 tmp4         [V06,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
-;* V07 tmp5         [V07    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V03._reference (fldOffset=0x0)" P-INDEP
-;* V08 tmp6         [V08    ] (  0,  0   )     int  ->  zero-ref    "field V03._length (fldOffset=0x8)" P-INDEP
-;* V09 tmp7         [V09    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V04._reference (fldOffset=0x0)" P-INDEP
-;* V10 tmp8         [V10    ] (  0,  0   )     int  ->  zero-ref    "field V04._length (fldOffset=0x8)" P-INDEP
+;  V03 tmp1         [V03,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
+;  V04 tmp2         [V04,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 8
 
 G_M17145_IG01:
        push     rax
 						;; size=1 bbWeight=1 PerfScore 1.00
 G_M17145_IG02:
        cmp      edx, 4
-       jl       SHORT G_M17145_IG04
+       jbe      SHORT G_M17145_IG04
        vmovups  xmm0, xmmword ptr [rsi]
        vmovups  xmmword ptr [rdi], xmm0
        mov      rax, rdi
 						;; size=16 bbWeight=1 PerfScore 7.50
 G_M17145_IG03:
        add      rsp, 8
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M17145_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 40, prolog size 1, PerfScore 11.62, instruction count 12, allocated bytes for code 40 (MethodHash=8c3bbd06) for method System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[int]):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
+; Total bytes of code 28, prolog size 1, PerfScore 9.75, instruction count 10, allocated bytes for code 28 (MethodHash=8c3bbd06) for method System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[int]):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
 ; ============================================================
-12 (-30.00 % of base) - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector,T>.Create(System.ReadOnlySpan`1[long]):System.Runtime.Intrinsics.Vector128`1[long]
 ; Assembly listing for method System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[long]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 1 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 RetBuf       [V00,T00] (  4,  4   )    long  ->  rdi         single-def
 ;* V01 arg0         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.ReadOnlySpan`1[long]>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;* V03 tmp1         [V03    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[long]>
-;* V04 tmp2         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[long]>
-;  V05 tmp3         [V05,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
-;  V06 tmp4         [V06,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
-;* V07 tmp5         [V07    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V03._reference (fldOffset=0x0)" P-INDEP
-;* V08 tmp6         [V08    ] (  0,  0   )     int  ->  zero-ref    "field V03._length (fldOffset=0x8)" P-INDEP
-;* V09 tmp7         [V09    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V04._reference (fldOffset=0x0)" P-INDEP
-;* V10 tmp8         [V10    ] (  0,  0   )     int  ->  zero-ref    "field V04._length (fldOffset=0x8)" P-INDEP
+;  V03 tmp1         [V03,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
+;  V04 tmp2         [V04,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 8
 
 G_M11520_IG01:
        push     rax
 						;; size=1 bbWeight=1 PerfScore 1.00
 G_M11520_IG02:
        cmp      edx, 2
-       jl       SHORT G_M11520_IG04
+       jbe      SHORT G_M11520_IG04
        vmovups  xmm0, xmmword ptr [rsi]
        vmovups  xmmword ptr [rdi], xmm0
        mov      rax, rdi
 						;; size=16 bbWeight=1 PerfScore 7.50
 G_M11520_IG03:
        add      rsp, 8
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M11520_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 40, prolog size 1, PerfScore 11.62, instruction count 12, allocated bytes for code 40 (MethodHash=8530d2ff) for method System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[long]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
+; Total bytes of code 28, prolog size 1, PerfScore 9.75, instruction count 10, allocated bytes for code 28 (MethodHash=8530d2ff) for method System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[long]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
 ; ============================================================
-12 (-30.00 % of base) - System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector,T>.Create(System.ReadOnlySpan`1[short]):System.Runtime.Intrinsics.Vector128`1[short]
 ; Assembly listing for method System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 1 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 RetBuf       [V00,T00] (  4,  4   )    long  ->  rdi         single-def
 ;* V01 arg0         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.ReadOnlySpan`1[short]>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;* V03 tmp1         [V03    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[short]>
-;* V04 tmp2         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[short]>
-;  V05 tmp3         [V05,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
-;  V06 tmp4         [V06,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
-;* V07 tmp5         [V07    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V03._reference (fldOffset=0x0)" P-INDEP
-;* V08 tmp6         [V08    ] (  0,  0   )     int  ->  zero-ref    "field V03._length (fldOffset=0x8)" P-INDEP
-;* V09 tmp7         [V09    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V04._reference (fldOffset=0x0)" P-INDEP
-;* V10 tmp8         [V10    ] (  0,  0   )     int  ->  zero-ref    "field V04._length (fldOffset=0x8)" P-INDEP
+;  V03 tmp1         [V03,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
+;  V04 tmp2         [V04,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 8
 
 G_M59224_IG01:
        push     rax
 						;; size=1 bbWeight=1 PerfScore 1.00
 G_M59224_IG02:
        cmp      edx, 8
-       jl       SHORT G_M59224_IG04
+       jbe      SHORT G_M59224_IG04
        vmovups  xmm0, xmmword ptr [rsi]
        vmovups  xmmword ptr [rdi], xmm0
        mov      rax, rdi
 						;; size=16 bbWeight=1 PerfScore 7.50
 G_M59224_IG03:
        add      rsp, 8
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M59224_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 40, prolog size 1, PerfScore 11.62, instruction count 12, allocated bytes for code 40 (MethodHash=fd8018a7) for method System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
+; Total bytes of code 28, prolog size 1, PerfScore 9.75, instruction count 10, allocated bytes for code 28 (MethodHash=fd8018a7) for method System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
 ; ============================================================
-12 (-30.00 % of base) - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector,T>.Create(System.ReadOnlySpan`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte]
 ; Assembly listing for method System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX512 - Unix
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 1 single block inlinees; 1 inlinees without PGO data
 ; Final local variable assignments
 ;
 ;  V00 RetBuf       [V00,T00] (  4,  4   )    long  ->  rdi         single-def
 ;* V01 arg0         [V01    ] (  0,  0   )  struct (16) zero-ref    multireg-arg single-def <System.ReadOnlySpan`1[ubyte]>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace" <Empty>
-;* V03 tmp1         [V03    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[ubyte]>
-;* V04 tmp2         [V04    ] (  0,  0   )  struct (16) zero-ref    ld-addr-op "Inlining Arg" <System.ReadOnlySpan`1[ubyte]>
-;  V05 tmp3         [V05,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
-;  V06 tmp4         [V06,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
-;* V07 tmp5         [V07    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V03._reference (fldOffset=0x0)" P-INDEP
-;* V08 tmp6         [V08    ] (  0,  0   )     int  ->  zero-ref    "field V03._length (fldOffset=0x8)" P-INDEP
-;* V09 tmp7         [V09    ] (  0,  0   )   byref  ->  zero-ref    single-def "field V04._reference (fldOffset=0x0)" P-INDEP
-;* V10 tmp8         [V10    ] (  0,  0   )     int  ->  zero-ref    "field V04._length (fldOffset=0x8)" P-INDEP
+;  V03 tmp1         [V03,T01] (  2,  2   )   byref  ->  rsi         single-def "field V01._reference (fldOffset=0x0)" P-INDEP
+;  V04 tmp2         [V04,T02] (  2,  2   )     int  ->  rdx         single-def "field V01._length (fldOffset=0x8)" P-INDEP
 ;
 ; Lcl frame size = 8
 
 G_M10869_IG01:
        push     rax
 						;; size=1 bbWeight=1 PerfScore 1.00
 G_M10869_IG02:
        cmp      edx, 16
-       jl       SHORT G_M10869_IG04
+       jbe      SHORT G_M10869_IG04
        vmovups  xmm0, xmmword ptr [rsi]
        vmovups  xmmword ptr [rdi], xmm0
        mov      rax, rdi
 						;; size=16 bbWeight=1 PerfScore 7.50
 G_M10869_IG03:
        add      rsp, 8
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 G_M10869_IG04:
-       mov      edi, 6
-       mov      rax, 0xD1FFAB1E      ; code for System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
-       call     [rax]System.ThrowHelper:ThrowArgumentOutOfRangeException(int)
+       call     CORINFO_HELP_RNGCHKFAIL
        int3     
-						;; size=18 bbWeight=0.50 PerfScore 1.88
+						;; size=6 bbWeight=0 PerfScore 0.00
 
-; Total bytes of code 40, prolog size 1, PerfScore 11.62, instruction count 12, allocated bytes for code 40 (MethodHash=b94bd58a) for method System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
+; Total bytes of code 28, prolog size 1, PerfScore 9.75, instruction count 10, allocated bytes for code 28 (MethodHash=b94bd58a) for method System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
 ; ============================================================

@MihuBot
Copy link
Owner Author

MihuBot commented Mar 1, 2025

@EgorBo

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant