[JIT] Asm difference for F# and C# methods #58941

En3Tho · 2021-09-10T13:52:04Z

I'm not sure whether this might be JIt imporvement or F#.. Basically I will duplicate this in fsharp repo (dotnet/fsharp#12138) too.

Consider these 2 methods:

[<MethodImpl(MethodImplOptions.AggressiveInlining)>]
let fold initial folder (enumerator: #IEnumerator<'i>) =
    let folder = OptimizedClosures.FSharpFunc<_,_,_>.Adapt folder
    let mutable enumerator = enumerator
    let mutable result = initial
    while enumerator.MoveNext() do
        result <- folder.Invoke(result, enumerator.Current)
    result

[MethodImpl(MethodImplOptions.AggressiveInlining)]
public static TResult Fold<TResult, TItem, TEnumerator>(TResult result, FSharpFunc<TResult, FSharpFunc<TItem, TResult>> folder, TEnumerator enumerator)
            where TEnumerator : IEnumerator<TItem>
{
    var fSharpFunc = OptimizedClosures.FSharpFunc<TResult, TItem, TResult>.Adapt(folder);
    var enumerator2 = enumerator;
    var result2 = result;
    while (enumerator2.MoveNext())
        result2 = fSharpFunc.Invoke(result2, enumerator2.Current);

    return result2;
}

They look very similar but there is an importnat il emit difference:

C# method is compiled to this basically:

[MethodImpl(MethodImplOptions.AggressiveInlining)]
public static TResult FoldRoslynVersion<TResult, TItem, TEnumerator>(TResult result, FSharpFunc<TResult, FSharpFunc<TItem, TResult>> folder, TEnumerator enumerator)
            where TEnumerator : IEnumerator<TItem>
{
    var fSharpFunc = OptimizedClosures.FSharpFunc<TResult, TItem, TResult>.Adapt(folder);
    var enumerator2 = enumerator;
    var result2 = result;
    goto movenext;

    logic:
    result2 = fSharpFunc.Invoke(result2, enumerator2.Current);

    movenext:
    if (!enumerator2.MoveNext())
        return result2;

    goto logic;
}

While F# is compiled to this basically:

[MethodImpl(MethodImplOptions.AggressiveInlining)]
public static TResult FoldFSharpVersion<TResult, TItem, TEnumerator>(TResult result, FSharpFunc<TResult, FSharpFunc<TItem, TResult>> folder, TEnumerator enumerator)
            where TEnumerator : IEnumerator<TItem>
{
    var fSharpFunc = OptimizedClosures.FSharpFunc<TResult, TItem, TResult>.Adapt(folder);
    var enumerator2 = enumerator;
    var result2 = result;

    movenext:
    if (!enumerator2.MoveNext())
        goto exit;

    result2 = fSharpFunc.Invoke(result2, enumerator2.Current);
    goto movenext;

    exit:
    return result2;
}

While difference might be non obvious, C# version with condition at the end of the method results in 10-15% perf imporvement while having the same assembly size.

Can JIT compiler regonize these patterns better and ideally emit the same code for both variants?

category:cq
theme:loop-opt
skill-level:expert
cost:large
impact:large

dotnet-issue-labeler · 2021-09-10T13:52:07Z

I couldn't figure out the best area label to add to this issue. If you have write-permissions please help me learn by adding exactly one area label.

EgorBo · 2021-09-10T14:00:28Z

@BruceForstall a loop optimization for you I guess:

static void Test()
{
start:
    if (Cond())
    {
        DoWork();
        goto start;
    }
}

should be transformed into the same as:

static void Caller2()
{
    goto condition;
start:
    DoWork();
condition:
    if (Cond())
        goto start;
}

it might make code a bit bigger but loop body will be more efficient.

PS: this transformation is made in Roslyn for while loops for us, but F# seems doesn't do it.

EgorBo · 2021-09-10T14:03:15Z

Proof that Roslyn does it:

EgorBo · 2021-09-10T14:29:03Z

ghost · 2021-09-11T13:42:17Z

Tagging subscribers to this area: @JulieLeeMSFT
See info in area-owners.md if you want to be subscribed.

Issue Details

I'm not sure whether this might be JIt imporvement or F#.. Basically I will duplicate this in fsharp repo (dotnet/fsharp#12138) too.

Consider these 2 methods:

[<MethodImpl(MethodImplOptions.AggressiveInlining)>]
let fold initial folder (enumerator: #IEnumerator<'i>) =
    let folder = OptimizedClosures.FSharpFunc<_,_,_>.Adapt folder
    let mutable enumerator = enumerator
    let mutable result = initial
    while enumerator.MoveNext() do
        result <- folder.Invoke(result, enumerator.Current)
    result

[MethodImpl(MethodImplOptions.AggressiveInlining)]
public static TResult Fold<TResult, TItem, TEnumerator>(TResult result, FSharpFunc<TResult, FSharpFunc<TItem, TResult>> folder, TEnumerator enumerator)
            where TEnumerator : IEnumerator<TItem>
{
    var fSharpFunc = OptimizedClosures.FSharpFunc<TResult, TItem, TResult>.Adapt(folder);
    var enumerator2 = enumerator;
    var result2 = result;
    while (enumerator2.MoveNext())
        result2 = fSharpFunc.Invoke(result2, enumerator2.Current);

    return result2;
}

They look very similar but there is an importnat il emit difference:

C# method is compiled to this basically:

[MethodImpl(MethodImplOptions.AggressiveInlining)]
public static TResult FoldRoslynVersion<TResult, TItem, TEnumerator>(TResult result, FSharpFunc<TResult, FSharpFunc<TItem, TResult>> folder, TEnumerator enumerator)
            where TEnumerator : IEnumerator<TItem>
{
    var fSharpFunc = OptimizedClosures.FSharpFunc<TResult, TItem, TResult>.Adapt(folder);
    var enumerator2 = enumerator;
    var result2 = result;
    goto movenext;

    logic:
    result2 = fSharpFunc.Invoke(result2, enumerator2.Current);

    movenext:
    if (!enumerator2.MoveNext())
        return result2;

    goto logic;
}

While F# is compiled to this basically:

[MethodImpl(MethodImplOptions.AggressiveInlining)]
public static TResult FoldFSharpVersion<TResult, TItem, TEnumerator>(TResult result, FSharpFunc<TResult, FSharpFunc<TItem, TResult>> folder, TEnumerator enumerator)
            where TEnumerator : IEnumerator<TItem>
{
    var fSharpFunc = OptimizedClosures.FSharpFunc<TResult, TItem, TResult>.Adapt(folder);
    var enumerator2 = enumerator;
    var result2 = result;

    movenext:
    if (!enumerator2.MoveNext())
        goto exit;

    result2 = fSharpFunc.Invoke(result2, enumerator2.Current);
    goto movenext;

    exit:
    return result2;
}

While difference might be non obvious, C# version with condition at the end of the method results in 10-15% perf imporvement while having the same assembly size.

Can JIT compiler regonize these patterns better and ideally emit the same code for both variants?

Author:	En3Tho
Assignees:	-
Labels:	`tenet-performance`, `area-CodeGen-coreclr`, `untriaged`
Milestone:	-

BruceForstall · 2021-09-17T23:57:41Z

The F# generated IL doesn't match the patterns the JIT looks for to do loop inversion or loop recognition, so the loop doesn't get added to our loop table for consideration for future optimization. We do mark the blocks with loop weights because we use a different logic for determining that.

Fixing this is part of a more general task to improve RyuJIT loop recognition.

jakobbotsch · 2024-01-12T13:07:58Z

Fixed by PRs listed in #93144 (comment)

En3Tho · 2024-01-12T13:23:39Z

@jakobbotsch Thank you!

En3Tho added the tenet-performance Performance related issue label Sep 10, 2021

dotnet-issue-labeler bot added the untriaged New issue has not been triaged by the area owner label Sep 10, 2021

En3Tho mentioned this issue Sep 10, 2021

[IL Emit] Improve il code emitting when working with loops dotnet/fsharp#12138

Closed

jeffschwMSFT added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Sep 11, 2021

JulieLeeMSFT assigned BruceForstall Sep 11, 2021

JulieLeeMSFT added needs-further-triage Issue has been initially triaged, but needs deeper consideration or reconsideration and removed untriaged New issue has not been triaged by the area owner labels Sep 11, 2021

JulieLeeMSFT added this to the Future milestone Sep 11, 2021

BruceForstall modified the milestones: Future, 7.0.0 Sep 17, 2021

This was referenced Sep 17, 2021

Certain loops do not get recorded in optLoopTable #43713

Closed

Improve JIT loop optimizations (.NET 7) #55235

Closed

BruceForstall removed the needs-further-triage Issue has been initially triaged, but needs deeper consideration or reconsideration label Sep 18, 2021

BruceForstall mentioned this issue Feb 15, 2022

Improve JIT loop optimizations #65342

Open

20 tasks

BruceForstall modified the milestones: 7.0.0, 8.0.0 May 24, 2022

BruceForstall mentioned this issue Oct 13, 2022

Improve JIT loop optimizations (.NET 8) #77032

Closed

4 tasks

BruceForstall modified the milestones: 8.0.0, Future Jun 16, 2023

BruceForstall mentioned this issue Oct 6, 2023

Improve JIT loop optimizations (.NET 9) #93144

Closed

21 tasks

jakobbotsch mentioned this issue Dec 5, 2023

JIT: Port loop cloning to the new loop representation #95326

Merged

jakobbotsch assigned jakobbotsch and unassigned BruceForstall Dec 5, 2023

jakobbotsch closed this as completed Jan 12, 2024

github-actions bot locked and limited conversation to collaborators Feb 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[JIT] Asm difference for F# and C# methods #58941

[JIT] Asm difference for F# and C# methods #58941

En3Tho commented Sep 10, 2021 •

edited by BruceForstall

Loading

dotnet-issue-labeler bot commented Sep 10, 2021

EgorBo commented Sep 10, 2021 •

edited

Loading

EgorBo commented Sep 10, 2021

EgorBo commented Sep 10, 2021

ghost commented Sep 11, 2021

BruceForstall commented Sep 17, 2021

jakobbotsch commented Jan 12, 2024

En3Tho commented Jan 12, 2024

[JIT] Asm difference for F# and C# methods #58941

[JIT] Asm difference for F# and C# methods #58941

Comments

En3Tho commented Sep 10, 2021 • edited by BruceForstall Loading

dotnet-issue-labeler bot commented Sep 10, 2021

EgorBo commented Sep 10, 2021 • edited Loading

EgorBo commented Sep 10, 2021

EgorBo commented Sep 10, 2021

ghost commented Sep 11, 2021

BruceForstall commented Sep 17, 2021

jakobbotsch commented Jan 12, 2024

En3Tho commented Jan 12, 2024

En3Tho commented Sep 10, 2021 •

edited by BruceForstall

Loading

EgorBo commented Sep 10, 2021 •

edited

Loading