Sample Efficient Feature Selection for Factored MDPs

[1703.03454] Sample Efficient Feature Selection for Factored MDPs

Title:Sample Efficient Feature Selection for Factored MDPs ... Abstract:In reinforcement learning, the state of the real world is often ...

Sample Efficient Learning with Feature Selection for Factored MDPs

In reinforcement learning, state is often represented by feature vectors. Prior sample complexity bounds scale with the complexity of all features.

[PDF] Sample Efficient Feature Selection for Factored MDPs

This work proposes Feature Selection Explore and Exploit (FS-EE), an algorithm that automatically selects the necessary features while ...

Sample Efficient Feature Selection for Factored MDPs - arXiv

Sample Efficient Feature Selection for Factored MDPs. Zhaohan Daniel Guo∗, Emma Brunskill†. Abstract. In reinforcement learning, the state of ...

Sample Efficient Feature Selection for Factored MDPs - ResearchGate

Download Citation | Sample Efficient Feature Selection for Factored MDPs | In reinforcement learning, the state of the real world is often represented by ...

Sample Efficient Feature Selection for Factored MDPs - DeviantPadam

Sample Efficient Feature Selection for Factored MDPs. id. 1703.03454v1; By Zhaohan Daniel Guo and Emma Brunskill; Year - 2017; 1. Machine Learning 2. Machine ...

Sample Efficient Feature Selection for Factored MDPs | DeepAI

Sample Efficient Feature Selection for Factored MDPs. 03/09/2017. ∙. by Zhaohan Daniel Guo, et al. ∙. 0. ∙. share. In reinforcement learning, the state of ...

[PDF] Sample Efficient Learning with Feature Selection for Factored ...

2 Citations · Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes · Improved Exploration in Factored Average-Reward MDPs.

Efficient Structure Learning in Factored-State MDPs

Although DBNs are quite useful, they fail to succinctly represent certain dependencies. For example, in the taxi do- main, the value of the passenger variable ...

Efficient Solution Algorithms for Factored MDPs - CiteSeerX

We offer some simple methods for selecting good features for MDPs in Section 11, but it is ... In this small example with only four state variables, our factored ...

Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement ...

The selection of K and the estimation of full transition kernel P will be discussed in the next section. Plug-in Solver Approach In the empirical MDP c. M, we ...

Sample-Efficient Reinforcement Learning for Linearly ... - OpenReview

and γ ∈ (0, 1) is the discount factor of the MDP. The results is applicable to the tabular MDPs by taking the coordinate basis with K = |S| × |A|. Both sample.

View of Efficient Solution Algorithms for Factored MDPs

First, we provide a new approach for approximately solving MDPs using a linear valuefunction. Previous approaches to linear function approximation typically ...

Improved Exploration in Factored Average-Reward MDPs

Zhaohan Daniel Guo and Emma Brunskill. Sample efficient learning with feature selection for factored MDPs. In. Proceedings of the 14th European Workshop on Rein ...

Sample-Efficient Reinforcement Learning for Linearly ...

This matches the minimax optimal lower bound (up to a logarithm factor) established in [YW19, Theorem 1] for feature-based MDP. In comparison, for tabular MDP ...

Efficient approximate linear programming for factored MDPs

In the following, we give an example to show the role of the separator constraints. Example 2. Consider the constraints on one variable x:(15) ...

EFFICIENT REINFORCEMENT LEARNING IN FACTORED MDPS ...

sample efficient for infinite-horizon MDP. arXiv preprint arXiv:1901.09311, 2019. Yonathan Efroni, Shie Mannor, and Matteo Pirotta. Exploration-exploitation ...

Learning to Generate Context-Specific Abstractions for Factored MDPs

Note: this part discusses our work “CAMPs: Learning Context-Specific Abstractions for Efficient Planning in Factored MDPs ... samples of variable ...

Sample-Efficient Reinforcement Learning Is Feasible for Linearly ...

Sparse feature selection makes batch reinforcement learning more sample efficient. ... MDPs under linear realizability of the optimal state-value function ...

Efficient Reinforcement Learning in Factored MDPs - UPenn CIS

our sampling process may be forced to give them different values, decoupling them again. As this example clearly illustrates, it is not enough for a variable ...