Scheduler
All Classes Namespaces Files Functions Variables Typedefs Enumerations Enumerator Friends Macros Pages
Public Member Functions | Protected Member Functions | Protected Attributes | List of all members
Mdp::EligibilityTraceAlgo Class Referenceabstract

#include <eligibilityTraceAlgo.h>

Inheritance diagram for Mdp::EligibilityTraceAlgo:
Mdp::RlBackupAlgorithm Mdp::DelayedQLearning Mdp::NaiveQLambda Mdp::SarsaLambda Mdp::WatkinsQLambda

Public Member Functions

 EligibilityTraceAlgo (std::shared_ptr< Context > c, TabularActionValues *av)
 
virtual void init ()
 
virtual void end ()
 
virtual void updateActionValues (state_t previousState, state_t nextState, action_t previousAction, double reward)=0
 
- Public Member Functions inherited from Mdp::RlBackupAlgorithm
 RlBackupAlgorithm (std::shared_ptr< Context > c, ActionValuesFunction *av)
 
virtual ~RlBackupAlgorithm ()
 
virtual double getMaxQ (state_t state)
 
virtual std::pair< action_t, double > getBestActionAndQ (state_t state)
 
virtual action_t getBestAction (state_t state)
 
virtual void updateBestActionAndQ (state_t state)
 
virtual void notifyUpdateNeeded ()
 

Protected Member Functions

void updateState (state_t previousState, action_t previousAction, double reward)
 
- Protected Member Functions inherited from Mdp::RlBackupAlgorithm
virtual void initAlpha ()
 
virtual void updateAlpha ()
 
virtual void updateIfNeeded (state_t state)
 

Protected Attributes

TabularActionValuestabularAv {nullptr}
 
state_t previousPreviousState {0}
 
action_t previousPreviousAction {0}
 
double previousReward {0.0}
 
std::vector< std::vector< double > > e
 
double lambda {0.5}
 
double discountFactor {0.5}
 
size_t stateSize {0}
 
size_t actionSize {0}
 
- Protected Attributes inherited from Mdp::RlBackupAlgorithm
std::shared_ptr< Contextcontext {nullptr}
 
ActionValuesFunctionactionValues {nullptr}
 
double alpha {-1.0}
 
double alpha0 {0.1}
 
double alphaCounter {1.0}
 
double alphaDecaySpeed {1.0}
 
bool hyperbolic {false}
 
bool stepwise {false}
 
unsigned long long stepwiseCounter {0}
 
unsigned long long int stepLength {0}
 
std::vector< double > bestQ
 
std::vector< action_tbestAction
 
std::vector< bool > needsUpdate
 

Detailed Description

Definition at line 19 of file eligibilityTraceAlgo.h.

Constructor & Destructor Documentation

Mdp::EligibilityTraceAlgo::EligibilityTraceAlgo ( std::shared_ptr< Context c,
TabularActionValues av 
)
inline

Definition at line 22 of file eligibilityTraceAlgo.h.

Member Function Documentation

void EligibilityTraceAlgo::end ( )
virtual

Implements Mdp::RlBackupAlgorithm.

Definition at line 32 of file eligibilityTraceAlgo.cpp.

void EligibilityTraceAlgo::init ( )
virtual

Implements Mdp::RlBackupAlgorithm.

Definition at line 21 of file eligibilityTraceAlgo.cpp.

virtual void Mdp::EligibilityTraceAlgo::updateActionValues ( state_t  previousState,
state_t  nextState,
action_t  previousAction,
double  reward 
)
pure virtual
void EligibilityTraceAlgo::updateState ( state_t  previousState,
action_t  previousAction,
double  reward 
)
protected

Definition at line 36 of file eligibilityTraceAlgo.cpp.

Member Data Documentation

size_t Mdp::EligibilityTraceAlgo::actionSize {0}
protected

Definition at line 39 of file eligibilityTraceAlgo.h.

double Mdp::EligibilityTraceAlgo::discountFactor {0.5}
protected

Definition at line 37 of file eligibilityTraceAlgo.h.

std::vector<std::vector<double> > Mdp::EligibilityTraceAlgo::e
protected

Definition at line 35 of file eligibilityTraceAlgo.h.

double Mdp::EligibilityTraceAlgo::lambda {0.5}
protected

Definition at line 36 of file eligibilityTraceAlgo.h.

action_t Mdp::EligibilityTraceAlgo::previousPreviousAction {0}
protected

Definition at line 33 of file eligibilityTraceAlgo.h.

state_t Mdp::EligibilityTraceAlgo::previousPreviousState {0}
protected

Definition at line 32 of file eligibilityTraceAlgo.h.

double Mdp::EligibilityTraceAlgo::previousReward {0.0}
protected

Definition at line 34 of file eligibilityTraceAlgo.h.

size_t Mdp::EligibilityTraceAlgo::stateSize {0}
protected

Definition at line 38 of file eligibilityTraceAlgo.h.

TabularActionValues* Mdp::EligibilityTraceAlgo::tabularAv {nullptr}
protected

Definition at line 31 of file eligibilityTraceAlgo.h.


The documentation for this class was generated from the following files: