Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove static exchange evaluation (see) #649

Merged
merged 1 commit into from
Nov 15, 2024
Merged

Remove static exchange evaluation (see) #649

merged 1 commit into from
Nov 15, 2024

Conversation

brunocodutra
Copy link
Owner

@brunocodutra brunocodutra commented Nov 15, 2024

Gauntlet

cutechess-cli -tournament gauntlet -games 2 -rounds 1500 -openings file=engines/openings-6ply-1000.pgn plies=6 policy=round -concurrency 8 -ratinginterval 10 -resultformat wide -recover -tb engines/syzygy/ -engine conf=dev stderr=stderr.log -engine conf=Frozenight-6.0 -engine conf=Halogen-11.0 -engine conf=Marvin-6.2 -each tc=3+0.025 option.Hash=32 option.Threads=1

Rank Name                          Elo     +/-   Games    Wins  Losses   Draws   Points   Score    Draw
   0 dev                            12       5    9000    2642    2324    4034   4659.0   51.8%   44.8%
   1 Frozenight-6.0                  2       9    3000     808     790    1402   1509.0   50.3%   46.7%
   2 Marvin-6.2                    -18       9    3000     722     873    1405   1424.5   47.5%   46.8%
   3 Halogen-11.0                  -21      10    3000     794     979    1227   1407.5   46.9%   40.9%

STS1-STS15_LAN_v6.epd

python sts_rating.py -f "./epd/STS1-STS15_LAN_v6.epd" -e dev -t 8 -h 256 --movetime 100 --maxpoint 100

STS Rating v14.2
Engine: chessboard
Hash: 256, Threads: 8, time/pos: 0.100s

Number of positions in ./epd/STS1-STS15_LAN_v6.epd: 1188
Max score = 1188 x 100 = 118800
Test duration: 00h:02m:30s
Expected time to finish: 00h:02m:34s

  STS ID   STS1   STS2   STS3   STS4   STS5   STS6   STS7   STS8   STS9  STS10  STS11  STS12  STS13  STS14  STS15    ALL
  NumPos     85     80     86     89     85     80     82     80     71     79     70     74     75     79     73   1188
 BestCnt     70     58     61     67     70     55     61     56     47     61     48     51     62     60     48    875
   Score   7960   6817   7526   8234   7877   7589   7309   7026   5989   7295   6100   6574   6981   7039   6618 106934
Score(%)   93.6   85.2   87.5   92.5   92.7   94.9   89.1   87.8   84.4   92.3   87.1   88.8   93.1   89.1   90.7   90.0

:: STS ID and Titles ::
STS 01: Undermining
STS 02: Open Files and Diagonals
STS 03: Knight Outposts
STS 04: Square Vacancy
STS 05: Bishop vs Knight
STS 06: Re-Capturing
STS 07: Offer of Simplification
STS 08: Advancement of f/g/h Pawns
STS 09: Advancement of a/b/c Pawns
STS 10: Simplification
STS 11: Activity of the King
STS 12: Center Control
STS 13: Pawn Play in the Center
STS 14: Queens and Rooks to the 7th rank
STS 15: Avoid Pointless Exchange

:: Top 5 STS with high result ::
1. STS 06, 94.9%, "Re-Capturing"
2. STS 01, 93.6%, "Undermining"
3. STS 13, 93.1%, "Pawn Play in the Center"
4. STS 05, 92.7%, "Bishop vs Knight"
5. STS 04, 92.5%, "Square Vacancy"

:: Top 5 STS with low result ::
1. STS 09, 84.4%, "Advancement of a/b/c Pawns"
2. STS 02, 85.2%, "Open Files and Diagonals"
3. STS 11, 87.1%, "Activity of the King"
4. STS 03, 87.5%, "Knight Outposts"
5. STS 08, 87.8%, "Advancement of f/g/h Pawns"

Rank Name                          Elo     +/-   Games    Wins  Losses   Draws   Points   Score    Draw
   0 dev                            12       5    9000    2642    2324    4034   4659.0   51.8%   44.8%
   1 Frozenight-6.0                  2       9    3000     808     790    1402   1509.0   50.3%   46.7%
   2 Marvin-6.2                    -18       9    3000     722     873    1405   1424.5   47.5%   46.8%
   3 Halogen-11.0                  -21      10    3000     794     979    1227   1407.5   46.9%   40.9%
@brunocodutra brunocodutra enabled auto-merge (rebase) November 15, 2024 19:19
Copy link

codecov bot commented Nov 15, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.26%. Comparing base (5ac58d2) to head (b62d986).
Report is 1 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #649      +/-   ##
==========================================
- Coverage   97.47%   97.26%   -0.22%     
==========================================
  Files          46       46              
  Lines        3212     3141      -71     
==========================================
- Hits         3131     3055      -76     
- Misses         81       86       +5     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@brunocodutra brunocodutra merged commit e1af234 into master Nov 15, 2024
17 of 18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant