init project, first data exploration

main
Felix Jan Michael Mucha 2024-11-19 14:42:25 +01:00
parent 294616e4bc
commit 1998cc29d7
5 changed files with 1250539 additions and 3 deletions

View File

@ -4,3 +4,7 @@
Use NLP techniques you learned so far (N-gram models, basic machine learn- Use NLP techniques you learned so far (N-gram models, basic machine learn-
ing, no neural nets) to analyse texts or to build an application. Document ing, no neural nets) to analyse texts or to build an application. Document
your approach. your approach.
# Data Source
https://github.com/taivop/joke-dataset/tree/master

1167320
data/reddit_jokes.json 100644

File diff suppressed because one or more lines are too long

22640
data/stupidstuff.json 100644

File diff suppressed because one or more lines are too long

60116
data/wocka.json 100644

File diff suppressed because one or more lines are too long

File diff suppressed because one or more lines are too long