algorithms.naive

A “naive” algorithm that does not learn (random policy).

This algorithm is provided as an example, and can be used to test that the environment is working correctly. It can also be used as a baseline for other learning algorithms.

members:

Modules

random_model

Model that returns purely random actions.