2211567/MLE - MLE - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Ruben-FreddyLoafers	476b67fa71	status quo	2025-12-10 12:09:15 +01:00
Ruben-FreddyLoafers	f623d1375c	finally fucking did pacman	2025-12-09 11:05:37 +01:00
Ruben-FreddyLoafers	3fb0afd80e	stuff persistent	2025-12-08 15:15:42 +01:00
Ruben-FreddyLoafers	0798236e26	stared at the screen for 2h	2025-12-08 11:30:42 +01:00
Ruben-FreddyLoafers	a53583b1d7	actually implemented that RL	2025-12-02 12:02:04 +01:00
Ruben-FreddyLoafers	a891d51ca9	commencing with actual reinforcement learning	2025-12-01 15:16:34 +01:00
Ruben Seitz	85f81e5f23	pacman works; commencing finetuning	2025-11-27 16:01:04 +01:00
Ruben Seitz	8049bfe29f	removed 0's; set q[s][a]=-10 at the right place	2025-11-24 21:56:57 +01:00
Ruben-FreddyLoafers	ad40c248d3	Keep fighting	2025-11-24 21:03:19 +01:00
Ruben Seitz	a76d2c41d3	added max iterations	2025-11-24 11:24:27 +01:00
Ruben Seitz	1453fd930a	debugging reward system	2025-11-24 10:22:34 +01:00
Ruben Seitz	8bd97eb9ef	done did it again not	2025-11-20 18:57:35 +01:00
Ruben-FreddyLoafers	6c9a096b61	mental breakdown	2025-11-20 15:32:28 +01:00
Ruben-FreddyLoafers	6f7dcb8326	debugging	2025-11-19 20:03:21 +01:00
Ruben Seitz	ee04e00627	tried a couple things out; Balancing reward system	2025-11-19 13:59:41 +01:00
Ruben-FreddyLoafers	24714fca0e	removed impossible states; better distance consideration; bug fixing	2025-11-18 14:42:43 +01:00
Ruben Seitz	a7b43c9037	logic done; debugging commencing	2025-11-17 14:52:36 +01:00
Ruben-FreddyLoafers	469d1d1a47	ASs 4 started	2025-11-13 13:54:32 +01:00
Ruben-FreddyLoafers	c9869979d3	assignment 3 done?	2025-11-05 10:09:22 +01:00

19 Commits (476b67fa71d175c172c7d341a347b8c12a61370f)