Reading PAGE
Peer Evaluation activity
| Trusted by | 1 |
| Views | 4 |
Total impact ?
Send a 
Lihong has...
| Trusted | 0 |
| Reviewed | 0 |
| Emailed | 0 |
| Shared/re-used | 0 |
| Discussed | 0 |
| Invited | 0 |
| Collected | 0 |
This was brought to you by:
Followblock this user Lihong Li Trusted member
Research Fellow / lihongli.cs@gmail.com
Yahoo! Research
Batch reinforcement learning with state importance
Oh la la
Your session has expired but don’t worry, your message
has been saved.Please log in and we’ll bring you back
to this page. You’ll just need to click “Send”.
Your evaluation is of great value to our authors and readers. Many thanks for your time.
Your mailing list is currently empty.
It will build up as you send messages
and links to your peers.
Enter the e-mail addresses of your recipients in the box below. Note: Peer Evaluation will NOT store these email addresses log in
Your message has been sent.
Description
Title : Batch reinforcement learning with state importance
Area : Computer Science
Language : English
Url : http://www.cs.ualberta.ca/~greiner/PAPERS/ecml04-lihong-abs.pdf
Doi : 10.1.1.73.434
Abstract : Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions. High classification accuracy is usually deemed to correlate with high policy quality. But this is not necessarily the case as increasing classification accuracy can actually decrease the policy’s quality. This phenomenon takes place when the learning process begins to focus on classifying less “important ” states. In this paper, we introduce a measure of state’s decision-making importance that can be used to improve policy learning. As a result, the focused learning process is shown to converge faster to better policies 1. 1 Problem Formulation and Related Work Reinforcement learning (RL) [11] provide a general framework for many sequential decision-making problems and has succeeded in a number of important applications. Let S be the state space, A the action set, and D the start-state distribution. A policy is a mapping from states to actions: ?: S ? ? A. The state- and action-value functions are denoted by V ? (s) and Q ? (s, a), respectively [11]. The quality of a policy ? is measured
Subject : unspecifiedArea : Computer Science
Language : English
| Affiliations : |
Doi : 10.1.1.73.434
Leave a comment
This contribution has not been reviewed yet. review?
You may receive the Trusted member label after :
• Reviewing 10 uploads, whatever the media type.
• Being trusted by 10 peers.
• If you are blocked by 10 peers the "Trust label" will be suspended from your page. We encourage you to contact the administrator to contest the suspension.
Please select an affiliation to sign your evaluation:
Please select an affiliation:
Lihong's Peer Evaluation activity
| Trusted by | 1 |
- FPeer Evaluation, Publisher, Peer Evaluation.
| Views | 4 |
- 1A Novel Benchmark Methodology and Data Repository for Real-life Reinforcement Learning
- 1An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning
- 1Analyzing feature generation for valuefunction approximation
- 1Batch reinforcement learning with state importance
Lihong has...
| Trusted | 0 |
| Reviewed | 0 |
| Emailed | 0 |
| Shared/re-used | 0 |
| Discussed | 0 |
| Invited | 0 |
| Collected | 0 |
Full Text request
Your request will be sent.
Please enter your email address to be notified
when this article becomes available
Your email