Reading PAGE

Peer Evaluation activity

Trusted by 1
Views 4

Total impact ?

    Send a

    Lihong has...

    Trusted 0
    Reviewed 0
    Emailed 0
    Shared/re-used 0
    Discussed 0
    Invited 0
    Collected 0

     

    This was brought to you by:

    block this user Lihong Li Trusted member

    Research Fellow / lihongli.cs@gmail.com

    Yahoo! Research

    A Novel Benchmark Methodology and Data Repository for Real-life Reinforcement Learning

    Export to Mendeley

    Although reinforcement learning (RL) defines the learning problem in a more general setting than does supervised learning, making it a better fit to a broader spectrum of real-life learning tasks, it has been far less successful in gaining attention from other scientific disciplines and commercial ventures beyond machine learning. In an effort to help coax reinforcement learning out of the labs, we introduce a novel benchmark methodology and problem repository. Our approach involves collection and distribution of static training/testing datasets, similar to efforts in other disciplines such as the UCI repository for supervised learning [4]. The new framework allows for a fresh set of benchmarks grounded in measured data that are challenging, consistent, and realistic. We believe that the new approach can succeed as a focal point for empirical developments in the field. The current situation in reinforcement learning is akin to the pre-UCI era in supervised learning. New algorithms are validated with one or two artificial benchmark problems and often require problem-specific tweaking. Recently however, some impressive attempts have been made to grow reinforcement learning out of this model. The RL repository at the University of Massachusetts 1, the University of Alberta’s RL-Glue project 2 and the RL Competition are examples that have helped unify the evaluation of RL algorithms. However, the nature of the evaluation schemes of these methods have made it difficult to include actual measured data from real-life domains, like those found in the supervised-learning benchmarks. Creating an RL benchmark is not straightforward. For example, it is not apparent what data format should be used. In supervised learning, data can easily be provided as text files, usually in the form of a mapping from input space X to an output space Y. However, RL environ-

    Oh la laClose

    Your session has expired but don’t worry, your message
    has been saved.Please log in and we’ll bring you back
    to this page. You’ll just need to click “Send”.

    Your evaluation is of great value to our authors and readers. Many thanks for your time.

    Review Close

    Short review
    Select a comment
    Select a grade
    You and the author
    Anonymity My review is anonymous( Log in  or  Register )
    publish
    Close

    When you're done, click "publish"

    Only blue fields are mandatory.

    Relation to the author*
    Overall Comment*
    Anonymity* My review is anonymous( Log in  or  Register )
     

    Focus & Objectives*

    Have the objectives and the central topic been clearly introduced?

    Novelty & Originality*

    Do you consider this work to be an interesting contribution to knowledge?

    Arrangement, Transition and Logic

    Are the different sections of this work well arranged and distributed?

    Methodology & Results

    Is the author's methodology relevant to both the objectives and the results?

    Data Settings & Figures

    Were tables and figures appropriate and well conceived?

    References and bibliography

    Is this work well documented and has the bibliography been properly established?

    Writing

    Is this work well written, checked and edited?

    Write Your Review (you can paste text as well)
    Please be civil and constructive. Thank you.


    Grade (optional, N/A by default)

    N/A 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9 9.5 10
    Close

    Your mailing list is currently empty.
    It will build up as you send messages
    and links to your peers.

     No one besides you has access to this list.
    Close
    Enter the e-mail addresses of your recipients in the box below.  Note: Peer Evaluation will NOT store these email addresses   log in
    Your recipients

    Your message:

    Your email : Your email address will not be stored or shared with others.

    Your message has been sent.

    Description

    Title : A Novel Benchmark Methodology and Data Repository for Real-life Reinforcement Learning
    Abstract : Although reinforcement learning (RL) defines the learning problem in a more general setting than does supervised learning, making it a better fit to a broader spectrum of real-life learning tasks, it has been far less successful in gaining attention from other scientific disciplines and commercial ventures beyond machine learning. In an effort to help coax reinforcement learning out of the labs, we introduce a novel benchmark methodology and problem repository. Our approach involves collection and distribution of static training/testing datasets, similar to efforts in other disciplines such as the UCI repository for supervised learning [4]. The new framework allows for a fresh set of benchmarks grounded in measured data that are challenging, consistent, and realistic. We believe that the new approach can succeed as a focal point for empirical developments in the field. The current situation in reinforcement learning is akin to the pre-UCI era in supervised learning. New algorithms are validated with one or two artificial benchmark problems and often require problem-specific tweaking. Recently however, some impressive attempts have been made to grow reinforcement learning out of this model. The RL repository at the University of Massachusetts 1, the University of Alberta’s RL-Glue project 2 and the RL Competition are examples that have helped unify the evaluation of RL algorithms. However, the nature of the evaluation schemes of these methods have made it difficult to include actual measured data from real-life domains, like those found in the supervised-learning benchmarks. Creating an RL benchmark is not straightforward. For example, it is not apparent what data format should be used. In supervised learning, data can easily be provided as text files, usually in the form of a mapping from input space X to an output space Y. However, RL environ-
    Subject : unspecified
    Area : Computer Science
    Language : English
    Affiliations
    Url : http://www.research.rutgers.edu/~lihong/pub/Nouri09NovelMsrl.pdf
    Doi : 10.1.1.155.5180

    Leave a comment

    This contribution has not been reviewed yet. review?

    You may receive the Trusted member label after :

    • Reviewing 10 uploads, whatever the media type.
    • Being trusted by 10 peers.
    • If you are blocked by 10 peers the "Trust label" will be suspended from your page. We encourage you to contact the administrator to contest the suspension.

    Does this seem fair to you? Please make your suggestions.

    Please select an affiliation to sign your evaluation:

    Cancel Evaluation Save

    Please select an affiliation:

    Cancel   Save

    Lihong's Peer Evaluation activity

    Lihong has...

    Trusted 0
    Reviewed 0
    Emailed 0
    Shared/re-used 0
    Discussed 0
    Invited 0
    Collected 0
    Invite this peer to...
    Title
    Start date (dd/mm/aaaa)
    Location
    URL
    Message
    send
    Close

    Full Text request

    Your request will be sent.

    Please enter your email address to be notified
    when this article becomes available

    Your email


     
    Your email address will not be shared or spammed.