Ruben-FreddyLoafers
|
f623d1375c
|
finally fucking did pacman
|
2025-12-09 11:05:37 +01:00 |
Ruben-FreddyLoafers
|
3fb0afd80e
|
stuff persistent
|
2025-12-08 15:15:42 +01:00 |
Ruben-FreddyLoafers
|
0798236e26
|
stared at the screen for 2h
|
2025-12-08 11:30:42 +01:00 |
Ruben-FreddyLoafers
|
a53583b1d7
|
actually implemented that RL
|
2025-12-02 12:02:04 +01:00 |
Ruben-FreddyLoafers
|
a891d51ca9
|
commencing with actual reinforcement learning
|
2025-12-01 15:16:34 +01:00 |
Ruben Seitz
|
48a351518d
|
good enough i guess
|
2025-11-27 16:09:47 +01:00 |
Ruben Seitz
|
85f81e5f23
|
pacman works; commencing finetuning
|
2025-11-27 16:01:04 +01:00 |
Ruben Seitz
|
8049bfe29f
|
removed 0's; set q[s][a]=-10 at the right place
|
2025-11-24 21:56:57 +01:00 |
Ruben-FreddyLoafers
|
ad40c248d3
|
Keep fighting
|
2025-11-24 21:03:19 +01:00 |
Ruben Seitz
|
a76d2c41d3
|
added max iterations
|
2025-11-24 11:24:27 +01:00 |
Ruben Seitz
|
1453fd930a
|
debugging reward system
|
2025-11-24 10:22:34 +01:00 |
Ruben Seitz
|
8bd97eb9ef
|
done did it again not
|
2025-11-20 18:57:35 +01:00 |
Ruben-FreddyLoafers
|
6c9a096b61
|
mental breakdown
|
2025-11-20 15:32:28 +01:00 |
Ruben-FreddyLoafers
|
6f7dcb8326
|
debugging
|
2025-11-19 20:03:21 +01:00 |
Ruben Seitz
|
ee04e00627
|
tried a couple things out; Balancing reward system
|
2025-11-19 13:59:41 +01:00 |
Ruben-FreddyLoafers
|
24714fca0e
|
removed impossible states; better distance consideration; bug fixing
|
2025-11-18 14:42:43 +01:00 |
Ruben Seitz
|
a7b43c9037
|
logic done; debugging commencing
|
2025-11-17 14:52:36 +01:00 |
Ruben-FreddyLoafers
|
469d1d1a47
|
ASs 4 started
|
2025-11-13 13:54:32 +01:00 |