Brian Lee issues

Results 9 issues of


                                            Brian Lee

Reduce history length requirement

Nothing actionable here, just recording some thoughts: Training from the last ~50 generations seems like an awfully long window. Early on in the training process, this seems like it would...

Consider active learning approach

Instead of training on all game data indiscriminately, use the prior-Q and prior-policy recorded values to deduce how "unexpected" the actual played move was. If we filter for all the...

Automate export of data as a BigQuery table

This will probably start as a private BQ dataset; will have to consult to figure out how to offer data publicly/how to allow the general public to query over a...

Test for "symmetry divergence" during training

One of the symptoms of value net overfitting that we found was that the 8 board symmetries would yield wildly different results when put through the value net. So instead...

Add garden client and replace cluster implementation

frontend

backend

client lib

Brian Lee

Reduce history length requirement

Consider active learning approach

Automate export of data as a BigQuery table

Test for "symmetry divergence" during training

Add garden client and replace cluster implementation

Null fields in items don't update when row changes

Detect filesystem permissions issues and raise friendlier error messages

Reduce polling frequency of tasks when no task active

Add -u usage flag to `llm chat`