Commit Graph

19 Commits (476b67fa71d175c172c7d341a347b8c12a61370f)

Author SHA1 Message Date
Ruben-FreddyLoafers 476b67fa71 status quo 2025-12-10 12:09:15 +01:00
Ruben-FreddyLoafers f623d1375c finally fucking did pacman 2025-12-09 11:05:37 +01:00
Ruben-FreddyLoafers 3fb0afd80e stuff persistent 2025-12-08 15:15:42 +01:00
Ruben-FreddyLoafers 0798236e26 stared at the screen for 2h 2025-12-08 11:30:42 +01:00
Ruben-FreddyLoafers a53583b1d7 actually implemented that RL 2025-12-02 12:02:04 +01:00
Ruben-FreddyLoafers a891d51ca9 commencing with actual reinforcement learning 2025-12-01 15:16:34 +01:00
Ruben Seitz 85f81e5f23 pacman works; commencing finetuning 2025-11-27 16:01:04 +01:00
Ruben Seitz 8049bfe29f removed 0's; set q[s][a]=-10 at the right place 2025-11-24 21:56:57 +01:00
Ruben-FreddyLoafers ad40c248d3 Keep fighting 2025-11-24 21:03:19 +01:00
Ruben Seitz a76d2c41d3 added max iterations 2025-11-24 11:24:27 +01:00
Ruben Seitz 1453fd930a debugging reward system 2025-11-24 10:22:34 +01:00
Ruben Seitz 8bd97eb9ef done did it again not 2025-11-20 18:57:35 +01:00
Ruben-FreddyLoafers 6c9a096b61 mental breakdown 2025-11-20 15:32:28 +01:00
Ruben-FreddyLoafers 6f7dcb8326 debugging 2025-11-19 20:03:21 +01:00
Ruben Seitz ee04e00627 tried a couple things out; Balancing reward system 2025-11-19 13:59:41 +01:00
Ruben-FreddyLoafers 24714fca0e removed impossible states; better distance consideration; bug fixing 2025-11-18 14:42:43 +01:00
Ruben Seitz a7b43c9037 logic done; debugging commencing 2025-11-17 14:52:36 +01:00
Ruben-FreddyLoafers 469d1d1a47 ASs 4 started 2025-11-13 13:54:32 +01:00
Ruben-FreddyLoafers c9869979d3 assignment 3 done? 2025-11-05 10:09:22 +01:00