|Table of contents|
Lizzie 0.6 May 2019
See Uberdude's videos on youtube:
PJT: Hallo Uberdude, thanks for these videos. The second one breaks off with you starting to explain about changing networks: does that mean there is a third one due?
- Main board:
- Colored spots indicate moves LZ has analysed or is analysing
- The number in the top of the circle is the winning percentage for that move based on all the games it has explored from there
- The number at the bottom of the circle is the number of games it has explored from the move
- The color of the spot shows how good it finds the move:
- Cyan means: this is the move it has explored most, hence is its favorite move and it also has a red ring around it
- Green means: these are moves with high win rates, so it's going to explore these some more
- Red means: it has explored these moves as initial candidates but they give comparatively low win rates, so it has stopped exploring them
- A blue ring around one or more spots highlights those with the highest win rate, but these are not necessarily those it finds best (PJT: really? why?)
- Hovering over each candidate shows the most likely variation it finds from that move, i.e. best play for both (in terms of winning percentages)
- Upper left:
- Black and white prisoners
- Current win-rate estimate for Black and White (the broad horizontal black/white bar)
- The result of the last move
- The win-rate graph, showing how the balance has shifted during the game
- Red marks on the 50% line indicate blunders
- Click on the graph to go to the corresponding move
- Press ‘h’ to toggle between win-rate and handicap stones
- Lower left:
- The variation that LZ currently considers best
- Bottom left: Whether “pondering” is turned on or off; name of the file containing the neural network weights.
- Upper right: The game tree – click on a node to display the position on the board
- Lower right: comment on the current position, as loaded from an SGF file.
Hold down the ‘x’ key for a list of command keys (though not for the possibilities of clicking on the screen)
Looking at the source reveals a few commands that are not in the help text:
- ‘d’ toggles ‘dynamic komi’ if your LZ configuration supports it.
- ‘h’ toggles between showing win-rate and showing the advantage in handicap stones. This applies to the win-rate graph and the evaluations of the moves on the board, and can be particularly useful for handicap games.
- etc. To do: find out more from code.
- ‘,’ ‘Play best move’
- ‘q’ ‘Open online dialogue’
- Press ‘o’ to open an SGF-file.
- If there was a handicap of several stones, press ‘h’ to show the advantage in handicap stones.
- You can let LZ analyse the whole game while you do something else, or move for move while you play through the game:
- To analyse move for move:
- Press the space bar to start “pondering”.
- Press the down arrow to move forward through the game and the up arrow to go back.
- Wait long enough at each position for LZ to make up its mind; this is very dependent on your hardware, but you should at least wait for the win rates to stabilise. (PJT can anyone suggest how many lines it should play out to reach say 1d or superhuman strength?)
- Once LZ has analysed a position, it saves the results, but it takes a while for Lizzie/LZ to consider and retrieve them if you return to the position.
- To analyse the entire game:
- Press ‘a’ to start the analysis.
- Say how many playouts you want for each move.
- You may want to experiment a little to find a reasonable value for this, which depends heavily on your hardware and the network installed.
- If you stay to watch, you should see Lizzie going through the moves; it goes on to the next move when the playouts in all moves considered reaches the limit you specified.
- If you want to stop it, just press ‘a’ again, otherwise wait till it reaches the last move.
- Once it reaches the end, it continues trying moves in the final position until you stop it (which is fairly futile but also fairly harmless).
- Once you stop it, you can click on big swings in the win-rate graph to jump to those positions and see the mistakes that caused the swings.
- Analysis always starts at the position currently shown, so you can analyse part of the game more deeply by specifying more playouts/move.
- Since LZ remembers its results, it only has to do the difference in playouts. E.g. if you analysed at 1,000 playouts and ask for 1,500 it only has to do 500 more for each move.
- If you click on the win-rate graph while it is analysing, it jumps to that position and continues analysing from there.
- You may also want to stop it before the end and finish the analysis with fewer playouts/move.
- To analyse move for move:
- Once you have analysed enough, you may want to save the SGF-file, as Lizzie includes (some) analysis results for each move.
- Though you might want to save the entire analysis to save time later, this does not seem possible: if your restart Lizzie, you need to reanalyse all positions you want to know about.
- Warning You may lose information from the original SGF, so consider saving it under a different name
- Press ‘s’ to save
- This is the (rather verbose) information included per move:
- The win-rate after the move and swing (98.7% (0.1%)) are replace by handicap stones if ‘h’ was used.
- The long name indicates the file containing the network used.
Move 2 98.7% (0.1%) (f178a6e01bcb3d611963da9a9ee29920904ed148ca095928fc9a0f936d162d4e.gz / 219 playouts)
Both analysing and playing handicap games can give confusing results, as LZ has learnt to play level games.
When analysing it is also likely to suggest lines leading to a complicated fight which the weaker player would prefer to avoid. It sometimes also chooses or suggests plays on the first line although it is unclear if they really are worthwhile or LZ has just lost the plot.
LZ, since it aims to maximise the win rate rather than the score, may also happily suggest sacrificing substantial groups in order to be even more sure of a narrow win, where the weaker player would be much better advised to hang on to all the points they can.
PJT: Are there perhaps networks trained so as to make them more helpful to weaker players (if that is possible)?
For analysing games between very unequal players on high handicaps, the win rate is often not very useful: Black will start with a very high probability of winning (above 99% for 6 stones), and gradually lose his advantage, but this will have relatively little effect on the win rate unless and until the game becomes roughly level. At this point the win rate will plunge dramatically to about 50% and, unless Black is able to hold his own, it will continue down to near 0%. To deal with this, use the ‘h’ command to show the advantage in handicap stones.
Some remarks (about LeelaZero)
- White always starts with a 55-45 advantage, presumably because of komi being too high (?)
- LZ maximizes the probability of winning, not the score
- It is generally accepted that small differences in the winning percentages do not indicate that a move is definitely better and only drops of 3-5% are significant enough to speak of a mistake
- As Uberdude convincingly shows, LZ sometimes doesn't consider a candidate, which, if shown by the human analyst, changes her mind about a certain position
- A known defect of AI like LZ, are ladders, which are a one way street for humans, but the random exploration of continuations tends to obscure the AI's judgment
- When a game is loaded, the function ‘i’ = “edit game information” does not display the information from the SGF-file.
- Should be able to display expected score, especially for handicap games.
- When reviewing a game, Lizzie (or LZ?) should always do some analysis of the next move actually played, for comparison purposes.
- One should be able to add the current best variation for some point to the game as a variation.
- Should be possible to roll out and unroll the variations for each move one move at a time.
- Should be possible to hide all the coloured circles (to make reading easier).
Check the article on LeelaZero for tips on installing it and setting it up: especially on Linux, you may need to install a driver for your GPU (not without risks) or to install or build a CPU-only version.
You may also want to use a stronger network.