Welcome to GameHourz.com!
FAQFAQ   SearchSearch      ProfileProfile    Private MessagesPrivate Messages   Log in/Register/PasswordLog in/Register/Password

Reinforcement Learning - Picking the Right Parameter Values

 
   Game Forums (Home) -> AI Games RSS
Related Topics:
TBC Fast Package(1-70) - Any Class Free 2000G - Wow per level per level. Dear Sir or Madam Hot Sale!For all of our news and are some Special Package! We now provide measured by..

A* and multi-goals - Hello, I am using A* to find the shortest path in a 3D I have waypoints at each rooms, points to create the graph. At the moment I can select a start and goal node, the A* algorithm do the rest and my

bigtest - bigtest

AI and C# - HI all... does anybody know a book about AI, which includes examples in c#? thx for reading ;)

2006 Chatterbox Challenge - The online voting for the 2006 Challenge at has begun. Visit the site and vote for the 3 best bots. No or anything required to vote. Voting ends 4/30/06. Wendell
Next:  AI Games: What are the different types of game AIs?  
Author Message
kartoun

External


Since: Dec 13, 2007
Posts: 1



(Msg. 1) Posted: Thu Dec 13, 2007 11:37 am
Post subject: Reinforcement Learning - Picking the Right Parameter Values
Archived from groups: comp>ai>games (more info?)

Several Q and Q(lambda) parameters include:

Lambda (ë) represents the eligibility decay rate. The greater it is,
the longer the sequence of values of state-action pairs updated.

Alpha (á) is the learning rate - how much the new state-action value
tends towards the new reward and value of the next state-action pair.
The greater alpha, the more
the state-action value tends towards new information. High values of
alpha makes learning faster, but ending up receiving slightly lower
rewards.

The discount rate parameter (gamma) describes how foreseeing the
agent
is. Small values of gamma (e.g., close to zero) make the agent giving
immediate events higher significance. Gamma describes how foreseeing
the agent is. If gamma = 0 only the now matters. If gamma = 1 the
agent
takes a timeless view at rewards. If gamma = 0.5 a reward now has the
double Value of the same reward in the next time step. For practical
purposes we want gamma close to but not quite 1.

A reasonable decision for choosing RL parameters might be:

1) to choose high values of alpha at the first iterations and
decrease
it over time (from 0.95 at the beginning to 0.05 at the end).

2) to pick a high (let's say 0.99) for gamma - to have a far view
over
future rewards.

3) to pick a moderate value of lambda (like 0.5) - first, to
accelerate
learning performance and second, not to harm the one's computer
memory
limitation.

Please give me your opinion and let's discuss it.

Thanks,

Uri.

 >> Stay informed about: Reinforcement Learning - Picking the Right Parameter Values 
Back to top
Login to vote
Display posts from previous:   
   Game Forums (Home) -> AI Games All times are: Ekaterinburg, Islamabad, Karachi, Tashkent (change)
Page 1 of 1

 
You can post new topics in this forum
You can reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum



[ Contact us | Terms of Service/Privacy Policy ]