Here is a nice informative PhD thesis on Reinforcement Learning. Really does a nice job in Chapter 2.2.3 explaining the use of Boltzmann distribution and Q-values.
Where we stack words and maybe rocks
Here is a nice informative PhD thesis on Reinforcement Learning. Really does a nice job in Chapter 2.2.3 explaining the use of Boltzmann distribution and Q-values.