Agent (markov.markov.Markov.Agent)

Sourcemodule type MarkovCompressorType = sig ... end

Handle the continuous-time stream of information from a system and compress the information into a Markovian state representation such that the sequence of states returned by sequential calls to observe have the Markov property.

Sourcemodule type RewardType = sig ... end

A reward function which is a map from a state to a Reward.t option.

Sourcemodule type RLPolicyType = sig ... end

A policy for infering an action and an observer given a state and optionally a reward.

Sourcemodule type S = sig ... end

An MDP Agent. The output signature of the functor Make.

Source

module Make
  (MarkovCompressor : MarkovCompressorType)
  (Reward : RewardType with type state = MarkovCompressor.state)
  (Policy : 
    RLPolicyType
      with type state = MarkovCompressor.state
      with type reward = Reward.t) : 
  S with type policy = Policy.t

A functor. Make MarkovCompressor Reward Policy returns an Agent module. For example, Policy must be a type that includes the interface RLPolicyType (e.g. it may be of type RLPolicyType or a 'super-type' of RLPolicyType).