I’m sure someone has thought of this, but I didn’t look. So, what if we delayed the update of the q-value and the policy for x number of time steps. We would keep track of the history and create an average reward. Initialize the policy for each action to be 1/|A| so that we can… Continue reading Q-Learning with delayed updates
ScraperWiki
I found this cool service call ScraperWiki. It allows you to write python, ruby or php scraper as well as a view. You can make the data public so others can view or private. You can also make a view for your data! So I think this is awesome!!! I think for my Adv. AI class… Continue reading ScraperWiki
Code.org
My brother showed me this video. Pretty cool.
Proof that the counting subset sum is in #P (Sharp P)
Yesterday I gave a presentation proving that in theory group. Presenting a proof is a lot harder than it seems when proving it to yourself on paper. Suggestion for presentations in the future: say what I am going to do then write it up on the board. Then after writing point to the different equations… Continue reading Proof that the counting subset sum is in #P (Sharp P)
Bayesian game?
What if we have a stochastic game S and a set of agents A and in each game s we might have a different subset of A playing? What type of game is that? This of course is only relevant when dealing with |A| > 2 agents. Also, the agents that play in each s… Continue reading Bayesian game?
Metropolis-Hastings algorithm
This algorithm looks useful.
Stochastic Coalitional Game Theory
In my MAS class I am creating a stochastic normal-form game engine and one of the suggested elaborations is to look at coalitional games. So, I was like what if the coalition games were played in a stochastic game setting? So what is that? Well based on these people at USC it is: A real-world adversary… Continue reading Stochastic Coalitional Game Theory
MGS Markov Game Simulator
I found a really cool paper that basically benchmarks a lot of MARL algorithms using MGS, a stochastic game simulator written in Java. This simulator is like my MALSIM except I only implemented features for repeated games not stochastic games.
Swarm of text
Can we use the Shapley idea to determine if there is a swarm on twitter based on tweets? A swarm could be thought as a lot of retweets or a tweet with similar words. We could find old tweet swarms characterize them and use them to predict whether certain tweets will become swarms. I think… Continue reading Swarm of text
Twitter: @tech5py
Well I got a twitter account. https://twitter.com/tech5py I doubt I will ever tweet anything, but I am planning on collecting data from twitter for use in my Adv. AI class. Cool thing is that if you google tech5py there is not a single hit! So, if I become popular 😉 then I will have a… Continue reading Twitter: @tech5py