# Master MDS Use NLP techniques to analyse texts or to build an application. Document your approach.
 
 
Felix Jan Michael Mucha 4469f55889 added bootstrap avg / ensemble preds 2025-02-16 03:56:50 +01:00
data !!!WARNING!!! Nuclear refactoring bomb in coming (Now 90% more confusing but 100% cleaner) 2025-02-15 17:16:34 +01:00
histories !!!WARNING!!! Nuclear refactoring bomb in coming (Now 90% more confusing but 100% cleaner) 2025-02-15 17:16:34 +01:00
.gitignore added helpfull functionality 2025-02-09 15:33:01 +01:00
BERT.py added bootstrap avg / ensemble preds 2025-02-16 03:56:50 +01:00
CNN.py added bootstrap avg / ensemble preds 2025-02-16 03:56:50 +01:00
Datasets.py !!!WARNING!!! Nuclear refactoring bomb in coming (Now 90% more confusing but 100% cleaner) 2025-02-15 17:16:34 +01:00
EarlyStopping.py !!!WARNING!!! Nuclear refactoring bomb in coming (Now 90% more confusing but 100% cleaner) 2025-02-15 17:16:34 +01:00
LICENSE Initial commit 2025-01-17 20:26:51 +01:00
LSTM.py !!!WARNING!!! Nuclear refactoring bomb in coming (Now 90% more confusing but 100% cleaner) 2025-02-15 17:16:34 +01:00
README.md added glove embeddings 2025-01-27 20:55:22 +01:00
Transformer.py added bootstrap avg / ensemble preds 2025-02-16 03:56:50 +01:00
cnn_bootstrap_agg.py refactored bootstrap 2025-02-16 00:42:57 +01:00
data_exploration.ipynb Merge branch 'main' of https://gitty.informatik.hs-mannheim.de/3016498/ANLP_WS24_CA2 2025-02-15 17:16:42 +01:00
dataset_helper.py added bootstrap avg / ensemble preds 2025-02-16 03:56:50 +01:00
ml_helper.py !!!WARNING!!! Nuclear refactoring bomb in coming (Now 90% more confusing but 100% cleaner) 2025-02-15 17:16:34 +01:00
ml_history.py added bootstrap avg / ensemble preds 2025-02-16 03:56:50 +01:00
ml_plots.py added: single model eval plots 2025-02-15 22:39:51 +01:00
ml_train.py added bootstrap avg / ensemble preds 2025-02-16 03:56:50 +01:00
model_comparison.ipynb added: single model eval plots 2025-02-15 22:39:51 +01:00
model_evaluation.ipynb added: single model eval plots 2025-02-15 22:39:51 +01:00
transformer_bootstrap_agg.py refactored bootstrap 2025-02-16 00:42:57 +01:00


ANLP_WS24_CA2


TODOS

data

  • maybe add a buffer zone between good and bad jokes (trade-off: less data)

  • maybe not binary classification

  • maybe change to humor detection (more data available)

  • dataset shape doesn't work correctly

  • history: integrate validation loss
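
A rough sketch of the buffer-zone idea from the data TODOs, to make the trade-off concrete: jokes whose rating falls between two thresholds are dropped, so the "good" and "bad" classes are separated more clearly at the cost of fewer samples. The function names, the `(text, rating)` sample layout, and the thresholds `1.5` and `2.5` are assumptions made for illustration only. They are not taken from `Datasets.py` or `dataset_helper.py`.

```python
from typing import List, Optional, Tuple


def label_with_buffer(rating: float, low: float = 1.5, high: float = 2.5) -> Optional[int]:
    """Return 0 (bad joke), 1 (good joke), or None if the rating is inside the buffer zone.

    The thresholds are hypothetical and would need tuning on the real rating distribution.
    """
    if rating <= low:
        return 0
    if rating >= high:
        return 1
    return None  # inside the buffer zone: neither clearly good nor clearly bad


def apply_buffer(
    samples: List[Tuple[str, float]], low: float = 1.5, high: float = 2.5
) -> List[Tuple[str, int]]:
    """Keep only the (text, label) pairs whose rating lies outside the buffer zone."""
    labelled = []
    for text, rating in samples:
        label = label_with_buffer(rating, low, high)
        if label is not None:
            labelled.append((text, label))
    return labelled


if __name__ == "__main__":
    # Toy data, not real dataset entries.
    toy = [("joke a", 0.8), ("joke b", 2.0), ("joke c", 3.4)]
    kept = apply_buffer(toy)
    print(kept)
    print(f"dropped {len(toy) - len(kept)} of {len(toy)} samples")
```

Widening the gap between `low` and `high` removes more borderline jokes, which is the data loss noted in the TODO.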

Data

https://competitions.codalab.org/competitions/27446

https://aclanthology.org/2021.semeval-1.9.pdf#:~:text=HaHackathon%20is%20the%20first%20shared%20task%20to%20combine,its%20average%20ratings%20for%20both%20humor%20and%20offense.

Data embeddings

Not Prioritised (Pun data)