smartgrid.rewards¶

This package contains various reward functions, used to compute a reward signal for each agent, based on their action and the resulting state of the environment.

Reward functions indicate to which degree the agent’s actions were appropriate, or in this case, ethical. By ethical, we mean that they are aligned with one or several moral values. These moral values are encoded in the reward function itself, which guides the learning of agents.

Traditionally, reward functions in the Reinforcement Learning literature are purely numerical, i.e., based on mathematical expressions. Such functions can be found in the smartgrid.rewards.numeric package.

Other functions can also be based on symbolic reasoning, such as argumentation: see the smartgrid.rewards.argumentation package.

members:

Modules

`argumentation`	Argumentation-based reward functions.
`numeric`	Numeric reward functions are purely based on mathematical expressions.
`reward`	The Reward abstract class defines a common (standard) API for reward functions.
`reward_collection`	The RewardCollection is responsible for computing rewards from reward functions.
`reward_constraints`	RewardConstraints allow to activate a reward function only when certain conditions are met.