jaclearn.rl.algo package

Submodules

jaclearn.rl.algo.advantage module

class jaclearn.rl.algo.advantage.AdvantageComputerBase[source]

Bases: object

class jaclearn.rl.algo.advantage.DiscountedAdvantageComputer(gamma)[source]

Bases: jaclearn.rl.algo.advantage.AdvantageComputerBase

class jaclearn.rl.algo.advantage.GAEComputer(gamma, lambda_)[source]

Bases: jaclearn.rl.algo.advantage.AdvantageComputerBase

jaclearn.rl.algo.math module

class jaclearn.rl.algo.math.LinearValueRegressor[source]

Bases: object

coeffs = None
fit(states, steps, returns)[source]
predict(states, steps)[source]
register_snapshot_parts(env)[source]
class jaclearn.rl.algo.math.ObservationNormalizer(filter_mean=True)[source]

Bases: object

normalize(o)[source]
jaclearn.rl.algo.math.compute_gae(rewards, values, next_val, gamma, lambda_)[source]
jaclearn.rl.algo.math.discount_cumsum(x, gamma)[source]

Compute the discounted cumulative summation of an 1-d array. From https://github.com/rll/rllab/blob/master/rllab/misc/special.py

jaclearn.rl.algo.math.discount_return(x, discount)[source]

Compute the discounted return summation of an 1-d array. From https://github.com/rll/rllab/blob/master/rllab/misc/special.py

jaclearn.rl.algo.math.normalize_advantage(adv)[source]