Forum: >>> Magnum BBS <<<

54 to play

From Timothy Chow@21:1/5 to All on Mon Dec 13 22:27:57 2021

XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10

X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54

---
Tim Chow

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From [email protected]@21:1/5 to Tim Chow on Tue Dec 14 02:36:01 2021

On Tuesday, December 14, 2021 at 3:27:59 AM UTC, Tim Chow wrote:

XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10

X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54

Make our ace point. Soon we will have targets, and putting our opponent on the bar will help us get more gammons.
In this scenario, we certainly want a 5 point board.
Some might escape the rear checker, but that checker isn't under any pressure. We don't want the opponent to be able to hit
us without us getting returns.

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From MK@21:1/5 to Tim Chow on Tue Dec 14 03:00:26 2021

On December 13, 2021 at 8:27:59 PM UTC-7, Tim Chow wrote:

XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10
X to play 54

Will this position be in your next bg book?

You never written a previous bg book? Uh, sorry.
It's hard to believe you never written a bg book.
I would've guessed that you wrote many.

At least some of your opinions here should be
worth 99 cents if not 99 dollar or euros...??

Make it a hard cover. I don't mean cardboard.
I mean plywood. You will enjoy it more... ;)

MK

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Timothy Chow@21:1/5 to All on Tue Dec 14 23:58:56 2021

On 12/14/2021 6:00 AM, MK wrote:

On December 13, 2021 at 8:27:59 PM UTC-7, Tim Chow wrote:

XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10
X to play 54

Will this position be in your next bg book?

You never written a previous bg book? Uh, sorry.
It's hard to believe you never written a bg book.
I would've guessed that you wrote many.

At least some of your opinions here should be
worth 99 cents if not 99 dollar or euros...??

Make it a hard cover. I don't mean cardboard.
I mean plywood. You will enjoy it more... ;)

MK

Quoting for archival purposes.

---
Tim Chow

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Timothy Chow@21:1/5 to All on Wed Dec 15 20:55:20 2021

XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10

X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54

The rollout below agrees with Paul's play. However, XG 3-ply plays
24/20 6/1. For Stick's benefit, I should explain that when I say
that XG 3-ply plays 24/20 6/1, what I actually mean is that XG 3-ply
plays 24/20 6/1. But it's also true that the play with the highest
XG 3-ply evaluation---by a narrow margin---is 24/20 6/1.

1. Rollout¹ 6/1 5/1 eq:+0.615
Player: 72.32% (G:32.32% B:0.79%)
Opponent: 27.68% (G:4.64% B:0.12%)
Confidence: ±0.007 (+0.608..+0.623) - [100.0%]

2. Rollout¹ 24/20 6/1 eq:+0.555 (-0.061)
Player: 71.46% (G:30.52% B:0.63%)
Opponent: 28.54% (G:6.90% B:0.22%)
Confidence: ±0.009 (+0.546..+0.563) - [0.0%]

3. Rollout¹ 24/15 eq:+0.517 (-0.098)
Player: 72.35% (G:23.38% B:0.51%)
Opponent: 27.65% (G:5.46% B:0.14%)
Confidence: ±0.007 (+0.511..+0.524) - [0.0%]

¹ 1296 Games rolled with Variance Reduction.
Dice Seed: 271828
Moves: 3-ply, cube decisions: XG Roller

eXtreme Gammon Version: 2.19.207.pre-release

---
Tim Chow

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From [email protected]@21:1/5 to Tim Chow on Thu Dec 16 04:44:51 2021

On Thursday, December 16, 2021 at 1:55:23 AM UTC, Tim Chow wrote:

XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10

X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54
The rollout below agrees with Paul's play. However, XG 3-ply plays
24/20 6/1. For Stick's benefit, I should explain that when I say
that XG 3-ply plays 24/20 6/1, what I actually mean is that XG 3-ply
plays 24/20 6/1. But it's also true that the play with the highest
XG 3-ply evaluation---by a narrow margin---is 24/20 6/1.

I missed this episode of the Stick-and-Tim show. (Maybe I'll treat myself to the Netflix box set for Christmas.) Is it really the case that XG n-ply doesn't always
make the play with the highest XG n-ply evaluation. Could you possibly explain this
XGPly Paradox please?

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From [email protected]@21:1/5 to Tim Chow on Thu Dec 16 04:47:02 2021

On Thursday, December 16, 2021 at 1:55:23 AM UTC, Tim Chow wrote:

XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10

X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54
The rollout below agrees with Paul's play. However, XG 3-ply plays
24/20 6/1. For Stick's benefit, I should explain that when I say
that XG 3-ply plays 24/20 6/1, what I actually mean is that XG 3-ply
plays 24/20 6/1. But it's also true that the play with the highest
XG 3-ply evaluation---by a narrow margin---is 24/20 6/1.

1. Rollout¹ 6/1 5/1 eq:+0.615
Player: 72.32% (G:32.32% B:0.79%)
Opponent: 27.68% (G:4.64% B:0.12%)
Confidence: ±0.007 (+0.608..+0.623) - [100.0%]

2. Rollout¹ 24/20 6/1 eq:+0.555 (-0.061)
Player: 71.46% (G:30.52% B:0.63%)
Opponent: 28.54% (G:6.90% B:0.22%)
Confidence: ±0.009 (+0.546..+0.563) - [0.0%]

3. Rollout¹ 24/15 eq:+0.517 (-0.098)
Player: 72.35% (G:23.38% B:0.51%)
Opponent: 27.65% (G:5.46% B:0.14%)
Confidence: ±0.007 (+0.511..+0.524) - [0.0%]

¹ 1296 Games rolled with Variance Reduction.
Dice Seed: 271828
Moves: 3-ply, cube decisions: XG Roller

Interesting. I didn't consider 24/20 6/1.
24/15 was my second choice by far.

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Timothy Chow@21:1/5 to [email protected] on Thu Dec 16 22:18:11 2021

On 12/16/2021 7:44 AM, [email protected] wrote:

I missed this episode of the Stick-and-Tim show. (Maybe I'll treat myself to the Netflix box set for Christmas.) Is it really the case that XG n-ply doesn't always
make the play with the highest XG n-ply evaluation. Could you possibly explain this
XGPly Paradox please?

First, as a bit of background, there are two main ways you'll
encounter XG 3-ply playing. First, if you want to play a match
against the bot and you select "eXtremeGammon" as your opponent,
then the bot will play at the XG 3-ply level. Second, if you
perform a rollout, the default setting is 3-ply/XGR, which means
that during a rollout trial, the bot will play the checkers at
the 3-ply level and the cube at the XGR level.

But what exactly does it mean to play the checkers at the 3-ply
level? What the bot first does is to evaluate every legal play
at the 1-ply level, by directly consulting the neural net. Then
it will select some subset of these to evaluate at the 2-ply
level, and finally it will select some subset of *those* to
evaluate at the 3-ply level. In particular, for most moves, it
will *not* actually evaluate every legal play at the 3-ply level.
It first filters out a bunch of moves whose 1-ply evaluation is
too low, and then it filters out more moves whose 2-ply evaluation
is too low. Finally, it evaluates the surviving candidates at the
3-ply level, picks the one with the highest equity, and plays it.

This procedure is usually quite sensible. Most of the time, if
a play's 1-ply evaluation is extremely far below that of the play
with the highest 1-ply evaluation, a 3-ply evaluation will not
overturn that verdict (it won't come up with the same equity
estimate as 1-ply does, but it will usually agree on which play
is better). So you save a lot of time by quickly filtering out
most of the moves immediately, and only investing further compute
time on the "promising" moves. (If you're worried that the bot
is filtering out moves too aggressively, then one thing you can
do, at least for rollouts, is to increase the size of the move
filter from "Normal" to "Large" or "Huge" or "Gigantic.")

You can see this visually if you set up a position with a lot of
legal plays, and then click on the 3-ply button near the lower
left corner of the screen. You'll see that next to each move is
an icon with 5 bars, and usually you'll see some plays at the
top with 3 bars filled in, followed by some plays with 2 bars
filled in, followed by yet more plays with only 1 bar filled in.

However, once in a while, a move that is filtered out at the 1-ply
or 2-ply level will, if forcibly evaluated at 3-ply, come out ahead
of the move that XG 3-ply plays.

Clear?

I have always said "XG 3-ply plays..." when I mean that that's
what XG 3-ply plays, either when it's playing at the eXtremeGammon
level against a human opponent, or when it's playing out a rollout
trial under the standard rollout settings. It's hard for me to
imagine a more natural meaning for "XG 3-ply plays." However, Stick
insists that the natural interpretation of "XG 3-ply plays..." is
*not* what XG 3-ply plays, but rather what move has the highest
XG 3-ply evaluation. To me, the natural way to refer to the move
with the highest XG 3-ply evaluation is "the move with the highest
XG 3-ply evaluation." But Stick continues to insist that I'm using
language in a very confusing manner.

---
Tim Chow

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From [email protected]@21:1/5 to Tim Chow on Fri Dec 17 04:44:51 2021

On Friday, December 17, 2021 at 3:18:15 AM UTC, Tim Chow wrote:

On 12/16/2021 7:44 AM, [email protected] wrote:

I missed this episode of the Stick-and-Tim show. (Maybe I'll treat myself to
the Netflix box set for Christmas.) Is it really the case that XG n-ply doesn't always
make the play with the highest XG n-ply evaluation. Could you possibly explain this
XGPly Paradox please?

First, as a bit of background, there are two main ways you'll
encounter XG 3-ply playing. First, if you want to play a match
against the bot and you select "eXtremeGammon" as your opponent,
then the bot will play at the XG 3-ply level. Second, if you
perform a rollout, the default setting is 3-ply/XGR, which means
that during a rollout trial, the bot will play the checkers at
the 3-ply level and the cube at the XGR level.

But what exactly does it mean to play the checkers at the 3-ply
level? What the bot first does is to evaluate every legal play
at the 1-ply level, by directly consulting the neural net. Then
it will select some subset of these to evaluate at the 2-ply
level, and finally it will select some subset of *those* to
evaluate at the 3-ply level. In particular, for most moves, it
will *not* actually evaluate every legal play at the 3-ply level.
It first filters out a bunch of moves whose 1-ply evaluation is
too low, and then it filters out more moves whose 2-ply evaluation
is too low. Finally, it evaluates the surviving candidates at the
3-ply level, picks the one with the highest equity, and plays it.

This procedure is usually quite sensible. Most of the time, if
a play's 1-ply evaluation is extremely far below that of the play
with the highest 1-ply evaluation, a 3-ply evaluation will not
overturn that verdict (it won't come up with the same equity
estimate as 1-ply does, but it will usually agree on which play
is better). So you save a lot of time by quickly filtering out
most of the moves immediately, and only investing further compute
time on the "promising" moves. (If you're worried that the bot
is filtering out moves too aggressively, then one thing you can
do, at least for rollouts, is to increase the size of the move
filter from "Normal" to "Large" or "Huge" or "Gigantic.")

You can see this visually if you set up a position with a lot of
legal plays, and then click on the 3-ply button near the lower
left corner of the screen. You'll see that next to each move is
an icon with 5 bars, and usually you'll see some plays at the
top with 3 bars filled in, followed by some plays with 2 bars
filled in, followed by yet more plays with only 1 bar filled in.

However, once in a while, a move that is filtered out at the 1-ply
or 2-ply level will, if forcibly evaluated at 3-ply, come out ahead
of the move that XG 3-ply plays.

Clear?

I have always said "XG 3-ply plays..." when I mean that that's
what XG 3-ply plays, either when it's playing at the eXtremeGammon
level against a human opponent, or when it's playing out a rollout
trial under the standard rollout settings. It's hard for me to
imagine a more natural meaning for "XG 3-ply plays." However, Stick
insists that the natural interpretation of "XG 3-ply plays..." is
*not* what XG 3-ply plays, but rather what move has the highest
XG 3-ply evaluation. To me, the natural way to refer to the move
with the highest XG 3-ply evaluation is "the move with the highest
XG 3-ply evaluation." But Stick continues to insist that I'm using
language in a very confusing manner.

I do think that what you say is clear.
I'd like to make some additional remarks.
The definition of an "error" is somewhat clear.
But it's less clear what makes an error "big" or "major" etc.
Usually, the definition will be based on MWC lost or lost equity
or by how much PR the play would lose.
However, in most human observational contexts, another definition
is very prevalent and totally unrelated with the above definitions.
Many would consider an error to be "big" if the non-optimality of the
play is obvious, even if the lost equity is small.
By this more human definition, I have seen XG's approach outlined above
lead to the most shocking play that no human would play who is not a total beginner. For example, I once had a single checker remaining on my two point, with XG having all 15 checkers left, and XG owning my ace point.
If XG doesn't roll big enough to escape my inner board, XG obviously needs
to maintain my ace point because otherwise it just concedes the backgammon.
But it didn't realise that.

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Timothy Chow@21:1/5 to All on Sun Dec 19 22:53:53 2021

Here's a fairly dramatic example of the distinction between
"XG 3-ply plays X" and "X is the play with the highest XG 3-ply
evaluation." First let's take a look at what XG 3-ply plays,
or equivalently what play comes out on top if you set up this
position in XG and click on the 3-ply evaluation button.

XGID=-B-aBBCCB------A----bcdbc-:0:0:1:61:5:0:0:7:10

X:Player 2 O:Player 1
Score is X:5 O:0 7 pt.(s) match.
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X | | O O O O O |
| | | O O O O O |
| | | O O O |
| | | O |
| | | |
| |BAR| |
| | | |
| | | |
| X | | X |
| X X | | X X X X |
| X X | | X X X O X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 90 O: 63 X-O: 5-0/7
Cube: 1
X to play 61

1. 3-ply 7/1 6/5 eq:+0.154
Player: 54.89% (G:1.32% B:0.05%)
Opponent: 45.11% (G:2.62% B:0.00%)

2. 3-ply 15/14 7/1 eq:+0.134 (-0.019)
Player: 53.32% (G:1.22% B:0.05%)
Opponent: 46.68% (G:2.05% B:0.00%)

3. 1-ply 15/8 eq:-0.101 (-0.254)
Player: 46.46% (G:1.48% B:0.04%)
Opponent: 53.54% (G:0.66% B:0.00%)

4. 1-ply 15/9 6/5 eq:-0.119 (-0.273)
Player: 47.68% (G:1.52% B:0.05%)
Opponent: 52.32% (G:3.32% B:0.00%)

5. 1-ply 15/9 7/6 eq:-0.127 (-0.281)
Player: 47.14% (G:1.48% B:0.05%)
Opponent: 52.86% (G:2.90% B:0.00%)

You can see that 15/9 6/5 and 15/9 7/6 look like massive blunders.
But now suppose we force an XG 3-ply evaluation of all five of the
above plays.

XGID=-B-aBBCCB------A----bcdbc-:0:0:1:61:5:0:0:7:10

X:Player 2 O:Player 1
Score is X:5 O:0 7 pt.(s) match.
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X | | O O O O O |
| | | O O O O O |
| | | O O O |
| | | O |
| | | |
| |BAR| |
| | | |
| | | |
| X | | X |
| X X | | X X X X |
| X X | | X X X O X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 90 O: 63 X-O: 5-0/7
Cube: 1
X to play 61

1. 3-ply 15/9 6/5 eq:+0.308
Player: 57.17% (G:1.95% B:0.07%)
Opponent: 42.83% (G:3.48% B:0.01%)

2. 3-ply 15/9 7/6 eq:+0.292 (-0.017)
Player: 56.92% (G:1.90% B:0.07%)
Opponent: 43.08% (G:2.88% B:0.01%)

3. 3-ply 7/1 6/5 eq:+0.154 (-0.155)
Player: 54.89% (G:1.32% B:0.05%)
Opponent: 45.11% (G:2.62% B:0.00%)

4. 3-ply 15/8 eq:+0.135 (-0.174)
Player: 52.67% (G:1.76% B:0.07%)
Opponent: 47.33% (G:1.04% B:0.00%)

5. 3-ply 15/14 7/1 eq:+0.134 (-0.174)
Player: 53.32% (G:1.22% B:0.05%)
Opponent: 46.68% (G:2.05% B:0.00%)

All of a sudden it's 7/1 6/5 that is rated a huge blunder. The
equity estimate of 15/9 6/5 went up from -0.119 to +0.308.

If we back up and look at the 1-ply evaluations of these five moves,
we can see why XG might decide that it's not worth considering further
any moves besides 7/1 6/5 and 15/14 7/1.

XGID=-B-aBBCCB------A----bcdbc-:0:0:1:61:5:0:0:7:10

X:Player 2 O:Player 1
Score is X:5 O:0 7 pt.(s) match.
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X | | O O O O O |
| | | O O O O O |
| | | O O O |
| | | O |
| | | |
| |BAR| |
| | | |
| | | |
| X | | X |
| X X | | X X X X |
| X X | | X X X O X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 90 O: 63 X-O: 5-0/7
Cube: 1
X to play 61

1. 1-ply 7/1 6/5 eq:+0.062
Player: 52.09% (G:1.38% B:0.04%)
Opponent: 47.91% (G:1.24% B:0.00%)

2. 1-ply 15/14 7/1 eq:+0.022 (-0.039)
Player: 50.87% (G:1.28% B:0.05%)
Opponent: 49.13% (G:1.28% B:0.00%)

3. 1-ply 15/8 eq:-0.101 (-0.163)
Player: 46.46% (G:1.48% B:0.04%)
Opponent: 53.54% (G:0.66% B:0.00%)

4. 1-ply 15/9 6/5 eq:-0.119 (-0.181)
Player: 47.68% (G:1.52% B:0.05%)
Opponent: 52.32% (G:3.32% B:0.00%)

5. 1-ply 15/9 7/6 eq:-0.127 (-0.189)
Player: 47.14% (G:1.48% B:0.05%)
Opponent: 52.86% (G:2.90% B:0.00%)

eXtreme Gammon Version: 2.19.207.pre-release, MET: Kazaross XG2

---
Tim Chow

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Who's Online
Recent Visitors
- Rixter
  Wed Jul 29 02:00:40 2026
  from Madison, Nc via Telnet
- Centurion
  Tue Jul 28 22:54:59 2026
  from Berea, Ohio via Telnet
- Bob Worm
  Tue Jul 28 16:01:18 2026
  from Wales, Uk via Telnet
- Rixter
  Tue Jul 28 13:42:46 2026
  from Madison, Nc via Telnet
- Krenn
  Tue Jul 28 11:59:57 2026
  from Sydney, Nsw via Telnet
- Rixter
  Tue Jul 28 01:23:48 2026
  from Madison, Nc via Telnet
- Centurion
  Mon Jul 27 22:50:42 2026
  from Berea, Ohio via Telnet
- Ataricrypt
  Mon Jul 27 19:19:17 2026
  from England via Telnet

System Info

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	741
Nodes:	16 (2 / 14)
Uptime:	56:56:36
Calls:	12,446
Calls today:	1
Files:	15,192
Messages:	6,537,374

54 to play

Who's Online

Recent Visitors

System Info