Comments on Computational Backgammon: "Player 3: training with supervised learning"

Mark Higgins (2012-02-15):

One interesting point: Joseph noted that when he ran supervised learning he used huge alphas, like 20, with minimum alphas around 0.5 or 1. That's not what I'm seeing: with big alphas it doesn't converge. I started with alpha=1 and went down from there, without cycling back up to large alphas when performance stops improving (as Joseph did).
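A minimal sketch of the schedule the comment describes: start at alpha=1 and only ever decay, never cycling back up. The function name, the halving factor, and the error sequence below are illustrative assumptions, not the post's actual code.

```python
def decayed_alpha(alpha, best_error, current_error, decay=0.5, min_alpha=0.01):
    """Halve alpha (down to a floor) whenever validation error fails to improve.

    Note there is no branch that ever increases alpha again -- unlike the
    cycle-back-up scheme attributed to Joseph in the comment above.
    """
    if current_error >= best_error:
        return max(alpha * decay, min_alpha)
    return alpha

# Simulated training run: made-up per-epoch errors; stalls trigger decay steps.
alpha, best = 1.0, float("inf")
for err in [0.9, 0.8, 0.85, 0.7, 0.72, 0.71]:
    alpha = decayed_alpha(alpha, best, err)
    best = min(best, err)
```

After this toy run alpha has been halved three times (on the three non-improving epochs), ending at 0.125 with a best error of 0.7.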
Øystein Schønning-Johansen (2012-02-14):

This goes forward!

It is Ian's and my experience that the best nets are those that are started with TD training and then have their training extended with supervised training.

"Epoch" is not only Joseph's term; it's the common term in the ML literature.

-Øystein
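The two-phase recipe in the comment above (TD self-training first, then supervised training on top) can be sketched on a toy problem. The tiny three-state chain, the step sizes, and the supervised targets are all made-up assumptions; a real backgammon net would use TD(lambda) self-play followed by supervised training on rollout labels.

```python
# Toy two-phase training: TD(0) bootstrapping first, then a supervised pass
# toward fixed targets (standing in for rollout labels). Illustrative only.

# Episodic chain: state 0 -> 1 -> 2 (terminal), reward 1 on termination.
V = [0.0, 0.0, 0.0]  # value estimates; V[2] is terminal and stays 0

def td_phase(V, episodes=2000, alpha=0.1):
    """TD(0): bootstrap each state's value from its successor's estimate."""
    for _ in range(episodes):
        for s in (0, 1):
            reward = 1.0 if s == 1 else 0.0
            target = reward + V[s + 1]
            V[s] += alpha * (target - V[s])

def supervised_phase(V, targets, epochs=2000, alpha=0.1):
    """Supervised pass: pull values toward fixed (e.g. rollout-style) labels."""
    for _ in range(epochs):
        for s, t in targets.items():
            V[s] += alpha * (t - V[s])

td_phase(V)
supervised_phase(V, {0: 1.0, 1: 1.0})  # hypothetical supervised labels
```

Both phases use the same delta-rule update; only the source of the target changes, which is why extending a TD-trained net with supervised training is a natural continuation rather than a restart.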