# Master MDS: Use NLP techniques to analyse texts or to build an application. Document your approach.
 
 
arman db3a3474a2 Merge branch 'main' of https://gitty.informatik.hs-mannheim.de/3016498/ANLP_WS24_CA2 2025-02-15 13:55:18 +01:00
data test 2025-02-13 11:52:28 +01:00
puns init structure, added data exploration hack, added init transformer 2025-01-23 21:28:45 +01:00
.gitignore added helpfull functionality 2025-02-09 15:33:01 +01:00
BalancedCELoss.py custom loss function 2025-02-09 11:10:34 +01:00
BertFine.ipynb removed Main and transfered to notebook 2025-02-15 13:22:49 +01:00
EarlyStopping.py added helpfull functionality 2025-02-09 15:33:01 +01:00
HumorDataset.py from long to float for regression 2025-02-14 19:23:17 +01:00
LICENSE Initial commit 2025-01-17 20:26:51 +01:00
README.md added glove embeddings 2025-01-27 20:55:22 +01:00
TEST_CNN_2.py from long to float for regression 2025-02-14 19:23:17 +01:00
bert_no_ernie.py removed Main and transfered to notebook 2025-02-15 13:22:49 +01:00
cnn.py update 2025-01-27 13:56:00 +01:00
cnn_bootstrap_agg.py subset update+plots 2025-02-14 23:54:47 +01:00
cnn_class.ipynb added CNN Reg 2025-02-11 23:27:46 +01:00
cnn_reg.ipynb added CNN Reg 2025-02-11 23:27:46 +01:00
cnn_reg_test.ipynb test with hyperparameter 2025-02-13 12:15:17 +01:00
data_explore_hack.ipynb added analysis for humor rating 2025-01-27 07:09:29 +01:00
data_explore_hack_rating.ipynb added analysis for humor rating 2025-01-27 07:09:29 +01:00
data_explore_hack_reg.ipynb update 2025-02-14 22:28:34 +01:00
dataset_generator.py added helpfull functionality 2025-02-09 15:33:01 +01:00
lstm_1b.py lstm update 2025-02-09 11:35:41 +01:00
ml_evaluation.py added helpfull functionality 2025-02-09 15:33:01 +01:00
ml_helper.py test 2025-02-13 11:52:28 +01:00
ml_history.py added helpfull functionality 2025-02-09 15:33:01 +01:00
test_cnn.py test 2025-02-13 11:52:28 +01:00
transformer.ipynb updated transformer models 2025-02-09 15:31:47 +01:00
transformer_bootstrap_agg.py transformer with bootstrap agg 2025-02-15 13:55:06 +01:00
transformer_reg.ipynb updated transformer models 2025-02-09 15:31:47 +01:00

README.md

ANLP_WS24_CA2

Master MDS: Use NLP techniques to analyse texts or to build an application. Document your approach.

TODOS

data

  • maybe add a buffer zone between good and bad jokes (trade-off: less training data)

  • maybe not binary classification

  • maybe change to humor detection (more data available)

  • dataset shape doesn't work correctly

  • history: integrate validation loss
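
The buffer-zone idea from the data TODOs could be sketched roughly as follows. This is only an illustration under assumptions: the function names `label_with_buffer` and `apply_buffer`, the `(text, rating)` sample layout, and the thresholds `1.5` and `2.5` are hypothetical and not taken from this repository.

```python
from typing import List, Optional, Tuple


def label_with_buffer(rating: float, low: float, high: float) -> Optional[int]:
    """Map a humor rating to a binary label, leaving a gap between the classes.

    Returns 0 for ratings at or below `low`, 1 for ratings at or above `high`,
    and None for ratings inside the buffer zone (these samples are dropped).
    """
    if low >= high:
        raise ValueError("low must be smaller than high")
    if rating <= low:
        return 0
    if rating >= high:
        return 1
    return None


def apply_buffer(
    samples: List[Tuple[str, float]], low: float, high: float
) -> List[Tuple[str, int]]:
    """Keep only samples outside the buffer zone and attach binary labels."""
    kept = []
    for text, rating in samples:
        label = label_with_buffer(rating, low, high)
        if label is not None:
            kept.append((text, label))
    return kept


# Hypothetical toy data; thresholds are placeholders, not tuned values.
toy = [("joke a", 1.0), ("joke b", 2.0), ("joke c", 3.0)]
filtered = apply_buffer(toy, low=1.5, high=2.5)
print(f"kept {len(filtered)} of {len(toy)} samples")
```

A wider gap between `low` and `high` should give cleaner class separation but discards more samples, which is the trade-off noted in the TODO.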

Data

https://competitions.codalab.org/competitions/27446

https://aclanthology.org/2021.semeval-1.9.pdf#:~:text=HaHackathon%20is%20the%20first%20shared%20task%20to%20combine,its%20average%20ratings%20for%20both%20humor%20and%20offense.

Data embeddings

Not Prioritised (Pun data)