pkg.gl

Categorygithub.com/timpalpant/go-cfrdeepcfr

package

0.0.0-20230208023716-2c859fbaa0a5

Repository: https://github.com/timpalpant/go-cfr.git

Documentation: pkg.go.dev

# Functions

NewCircularBuffer

No description provided by the author

NewExperienceTuple

No description provided by the author

NewRegretSample

No description provided by the author

NewReservoirBuffer

NewBuffer returns an empty Buffer with the given max size.

NewSingleDeepCFR

New returns a new SingleDeepCFR policy with the given model and sample buffer.

NewVRSingleDeepCFR

New returns a new VRSingleDeepCFR policy with the given model and sample buffer.

# Structs

No description provided by the author

No description provided by the author

ExperienceTuple

ExperienceTuple is a single sample of action-result collected for training.

RegretSample is a single sample of instantaneous advantages collected for training.

ReservoirBuffer

ReservoirBuffer is a collection of samples held in memory.

SingleDeepCFR implements cfr.StrategyProfile, and uses function approximation to estimate strategies rather than accumulation of regrets for all infosets.

VRSingleDeepCFR

VRSingleDeepCFR implements cfr.StrategyProfile, and uses function approximation to estimate strategies rather than accumulation of regrets for all infosets.

# Interfaces

Buffer collects samples of infoset action advantages to train a Model.

Model is a regression model that can be used to fit the given samples.

Samples must be binary marshalable, but embedding that interface breaks gob decoding.

TrainedModel is a regression model to use in DeepCFR that predicts a vector of advantages for a given InfoSet.