brain:arcade_games
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
brain:arcade_games [2017/01/05 00:19] – peter | brain:arcade_games [2020/07/15 09:30] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 19: | Line 19: | ||
ale> make -j 4 | ale> make -j 4 | ||
</ | </ | ||
- | |||
- | |||
- | |||
- | ===== Ideas ===== | ||
- | |||
- | Brain learns from images, i.e. image snapshots, or differences between one image and another. | ||
- | |||
- | Images therefore need to be readable by the Brain. | ||
- | |||
- | Standardise all images to a standard size to standardise learning. | ||
- | |||
- | Perhaps different type neurons for different size images with some sort of standardisation comparison between unequal image sizes. | ||
Line 67: | Line 55: | ||
This should display some details about the game. | This should display some details about the game. | ||
+ | READ the manual at / | ||
<code bash> | <code bash> | ||
Line 102: | Line 91: | ||
./ale -display_screen true -game_controller fifo roms/ | ./ale -display_screen true -game_controller fifo roms/ | ||
</ | </ | ||
+ | |||
+ | ===== Install ImageMagick and supporting libs ===== | ||
+ | |||
+ | apt-get install imagemagick | ||
+ | apt-get install libmagick++-dev | ||
+ | |||
+ | |||
+ | |||
+ | ===== Ideas ===== | ||
+ | |||
+ | Brain learns from images, i.e. image snapshots, or differences between one image and another. | ||
+ | |||
+ | Images therefore need to be readable by the Brain. | ||
+ | |||
+ | Standardise all images to a standard size to standardise learning. | ||
+ | |||
+ | Perhaps different type neurons for different size images with some sort of standardisation comparison between unequal image sizes. | ||
+ | |||
+ | Images - Convolutions | ||
+ | |||
+ | update resolver' | ||
+ | # epsilon decreases over time | ||
+ | current_epsilon = 0.05 + 0.95 * np.exp(-epoch / 2500.) | ||
+ | resolver.epsilon.set_value(np.float32(current_epsilon)) | ||
+ | |||
+ | |||
+ | # gamma - delayed reward coefficient - what fraction of reward is retained if it is obtained one tick later | ||
+ | gamma = theano.shared(np.float32(1), | ||
Line 109: | Line 126: | ||
https:// | https:// | ||
+ | |||
+ | http:// | ||
+ | |||
brain/arcade_games.1483575583.txt.gz · Last modified: 2020/07/15 09:30 (external edit)