smartgrid.rewards.reward_constraints¶
RewardConstraints allow to activate a reward function only when certain conditions are met.
Two intuitive type of constraints are:
Time-based constraints: enabling a reward function only during certain
time steps. This allows, for example, adding progressively new reward functions,
by enabling some only after a given number of time steps. This simulates a kind
of “evolving” ethical considerations, through this addition of reward functions.
See the TimeConstrainedReward
for details.
Agent-based constraints: enabling a reward function only for certain agents.
This allows, for example, comparing the behaviour of two populations of
agents in the same simulation, driven by different objectives.
See the AgentConstrainedReward
for details.
Constraints can be combined, e.g., a reward function can be both time-constrained and agent-constrained, and extended, e.g., users can add their own constraints.
- members:
Classes
|
Enable or disable a reward function based on learning agents. |
|
Enable or disable a reward function based on the current time step. |