Epsilon Greedy

class rlforge.policies.epsilonGreedy(q_values, epsilon=0.1)

Select an action using the epsilon-greedy exploration strategy.

With probability epsilon, a random action is chosen (exploration). With probability 1 - epsilon, the action with the highest estimated value in q_values is selected (exploitation).
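The selection rule described above can be sketched in a few lines of standard-library Python. This is an illustrative sketch, not the rlforge implementation: the function name `epsilon_greedy_sketch` and the `rng` parameter are hypothetical, and it assumes that q_values is a non-empty sequence of numbers indexed by action, that exploration samples uniformly over all actions, and that ties in the greedy step go to the lowest index.

```python
import random


def epsilon_greedy_sketch(q_values, epsilon=0.1, rng=None):
    """Return an action index chosen by the epsilon-greedy rule.

    Hypothetical helper for illustration only; not part of rlforge.
    """
    if rng is None:
        rng = random.Random()
    if len(q_values) == 0:
        raise ValueError("q_values must contain at least one action")
    if not 0.0 <= epsilon <= 1.0:
        raise ValueError("epsilon must be in [0, 1]")

    # Explore: with probability epsilon, pick any action uniformly at random
    # (uniform sampling is an assumption of this sketch).
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))

    # Exploit: otherwise pick the action with the highest estimated value.
    # max() returns the first maximal index, so ties go to the lowest index.
    return max(range(len(q_values)), key=lambda a: q_values[a])


# Usage: with epsilon=0.0 the choice is always the greedy action.
q = [0.2, 0.9, 0.4]
greedy_action = epsilon_greedy_sketch(q, epsilon=0.0)
```

Under the uniform-exploration assumption, the greedy action can also be drawn during exploration, so with n actions it is selected with total probability 1 - epsilon + epsilon / n rather than exactly 1 - epsilon.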