Commit Graph

89 Commits (594eb4ea48a3ec5f8217953220f789b3f3f3c91b)

Author SHA1 Message Date
Jaronim Pracht 594eb4ea48 Add: OCR sends PDF async to coordinator
Add progress tracking and storage lock for PDF processing
Refactor OCR service to handle PDF processing asynchronously
2025-06-07 12:40:32 +02:00
Jaronim Pracht 26d945e7eb Add dockerignore files to backend services
Add .dockerignore files to coordinator, ocr-service, and
validate-service to exclude virtual environments and bytecode
compilation cache from Docker builds
2025-06-07 12:35:17 +02:00
Abdulrahman Dabbagh 77d8ca8b35 Merge pull request '#13-OCR-Service' (#51) from #13-OCR-Service into main
Reviewed-on: #51
2025-06-05 08:40:54 +02:00
s8613 63882d77c0 Merge conflict resolved 2025-06-04 20:25:54 +02:00
s8613 96ad5fd15c Merge remote-tracking branch 'origin/main' into #13-OCR-Service
# Conflicts:
#	project/frontend/src/components/pdfViewer.tsx
2025-06-04 20:19:46 +02:00
Jaronim Pracht f81624b8ab Merge pull request 'PDF page can be jumped to from the key figures table' (#50) from #21-seite-anspringen into main
Reviewed-on: #50
2025-06-04 19:45:07 +02:00
s8613 8ed5f7c114 Working flow 2025-06-04 19:20:57 +02:00
s8613 3992cac54f Working ocr, exxeta, spacy to validate 2025-06-04 09:28:16 +02:00
s8613 af75439270 added return ocrd pdf 2025-06-03 22:34:07 +02:00
s8613 f3bee2b62b fixed small errors 2025-06-03 22:11:15 +02:00
Zainab2604 93334898c9 PDF page can be jumped to from the key figures table 2025-06-03 22:03:21 +02:00
s8613 86b74ff844 gunicorn time increased 2025-06-03 21:42:52 +02:00
s8613 1165bbbf08 Fixed json format 2025-06-03 21:17:30 +02:00
s8613 5f8580d1da Increased timeout 2025-06-03 18:43:54 +02:00
s8613 c65bbbdf1c First integration of flow 2025-06-03 18:25:29 +02:00
Anastasia Hanna Ougolnikova b9d7f425e5 Merge pull request 'Progress - Frontend' (#48) from #16-progress into main
Reviewed-on: #48
2025-06-03 13:44:57 +02:00
Jaronim Pracht 05d4289902 Merge pull request '#15-spacy-finetuning' (#49) from #15-spacy-finetuning into main
Reviewed-on: #49
2025-06-03 12:52:20 +02:00
Jaronim Pracht b1433e0c0b Fix typo in spacy-service README filename 2025-06-03 12:51:32 +02:00
Jaronim Pracht 5fc226f4fc Merge branch 'main' into #16-progress 2025-06-03 12:34:48 +02:00
Jaronim Pracht b1cf30c40e Merge branch 'main' into #16-progress 2025-06-03 12:25:28 +02:00
Anastasia Hanna Ougolnikova 566dacd179 Merge pull request 'Implement table for displaying key figures (Ticket #18)' (#46) from frontend/18-kennzahl-tabelle into main
Reviewed-on: #46
2025-06-03 11:16:53 +02:00
Zainab MohamedBasheer 4fcfcb856e Merge branch 'main' into #15-spacy-finetuning 2025-06-02 22:55:01 +02:00
Zainab2604 59dde98dcb Add last part of new training data for spacy 2025-06-02 22:51:47 +02:00
Jaronim Pracht d412d5741b Add Dockerfile for coordinator service and progress controller
Add progress tracking functionality to frontend and backend
- Add progress controller endpoint to handle progress updates
- Implement socket.io progress updates in UploadPage
- Update import path for CircularProgressWithLabel component
2025-06-02 19:09:16 +02:00
Jaronim Pracht 1b06867d88 Fix showing pdfs in production
Removed redundant PDF.js worker initialization from
PDFViewer component and updated the worker source path in main.tsx.

Downgraded react-pdf to v8.0.2 to resolve compatibility issues and
fixed missing newline in nginx.conf.
2025-06-02 19:06:25 +02:00
Jaronim Pracht 9aa6c8be87 add progress and file-upload to frontend 2025-06-02 15:03:39 +02:00
Abdulrahman Dabbagh 76a060a563 Fix: corrected hyphen in LTV key figure 2025-06-02 11:10:36 +02:00
Abdulrahman Dabbagh 23a06d6518 Implement table for displaying key figures (Ticket #18) 2025-06-01 18:11:44 +02:00
Zainab2604 2f159d8c8d Add second part of new training data for spacy 2025-06-01 17:55:53 +02:00
Zainab MohamedBasheer 7180db773e Merge pull request 'Init validate service' (#45) from #12-init-validate-service into main
Reviewed-on: #45
2025-06-01 12:51:30 +02:00
Zainab2604 420e21e8c4 Add Port to COORNATOR_URL 2025-06-01 12:49:23 +02:00
Zainab2604 5ff98ef137 Add part of new training data for spacy 2025-05-31 20:45:00 +02:00
Jaronim Pracht df5ac605c2 Add validate service with entity merging and validation
Implements a Flask microservice that receives entities from SpaCy and
Exxeta services, merges them based on normalized text matching, and
forwards validated results to coordinator. Also updates gunicorn
configuration with timeout and worker settings.
2025-05-30 13:44:13 +02:00
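The merge step described in the commit above (entities from the SpaCy and Exxeta services combined by normalized text matching) could look roughly like the sketch below. It is not the repository's code: the function names `normalize` and `merge_entities`, the entity dict shape (`label`, `text`), and the `status` values are assumptions made for illustration only.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Apply NFKC, casefold, and collapse runs of whitespace (assumed rules)."""
    text = unicodedata.normalize("NFKC", text).casefold()
    return re.sub(r"\s+", " ", text).strip()


def merge_entities(spacy_entities, exxeta_entities):
    """Merge two entity lists; entries match when label and normalized text agree.

    Hypothetical input shape: a list of dicts with "label" and "text" keys.
    """
    merged = {}
    for source, entities in (("spacy", spacy_entities), ("exxeta", exxeta_entities)):
        for entity in entities:
            key = (entity["label"], normalize(entity["text"]))
            # Keep the first spelling seen and remember which services reported it.
            record = merged.setdefault(
                key, {"label": entity["label"], "text": entity["text"], "sources": []}
            )
            if source not in record["sources"]:
                record["sources"].append(source)
    for record in merged.values():
        # Assumed rule: an entity counts as validated when both services found it.
        both = len(record["sources"]) == 2
        record["status"] = "validated" if both else "single-source"
    return list(merged.values())
```

Keying on the label together with the normalized text keeps identical strings under different labels apart. Whether the actual service does the same is not stated in the commit message.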
Anastasia Hanna Ougolnikova ba191dd0a6 Update project/backend/exxetaGPT/services/extractExxeta.py
Fixed incorrect characters
2025-05-30 09:37:38 +02:00
Abdulrahman Dabbagh 416c2ceefd Merge pull request '#24-PDF-Anzeigen' (#41) from #24-PDF-Anzeigen into main
Reviewed-on: #41
2025-05-30 07:36:32 +02:00
s8613 efcf4fb831 Added error handling with pdf 2025-05-29 09:30:01 +02:00
s8613 f99700c696 Made PDF a bit more responsive. 2025-05-29 09:19:02 +02:00
Abdulrahman Dabbagh 74d08d3c39 Merge pull request '#26-init-db' (#40) from #26-init-db into main
Reviewed-on: #40
2025-05-28 16:14:54 +02:00
s8613 676728021e Added PDFViewer as component and extractedResults as page that uses PDFViewer. 2025-05-27 16:01:56 +02:00
Jaronim Pracht 908050a2fb Containerize project
add compose for all services to start full project
2025-05-27 13:11:32 +02:00
Jaronim Pracht 141abc725f Refactor coordinator/app.py and add new controllers and models
closes #29
add persistence for pitch-books, spacy model and settings
2025-05-27 13:10:21 +02:00
Jaronim Pracht c5f3224c68 Merge branch 'main' of gitMannheim:PSE2_FF/pse2_ff 2025-05-26 19:15:26 +02:00
Jaronim Pracht cc321fea4a Merge pull request 'backend/flask-setup' (#38) from backend/flask-setup into main
Reviewed-on: #38
2025-05-26 18:20:44 +02:00
Abdulrahman Dabbagh f504cc87e8 Adjusted folder structure: moved Flask backend to backend/coordinator 2025-05-26 18:08:47 +02:00
Jaronim Pracht fd06fc1821 Merge branch 'main' of gitMannheim:PSE2_FF/pse2_ff 2025-05-26 16:19:27 +02:00
Jaronim Pracht d7528b07bc add gitignore 2025-05-26 16:19:23 +02:00
Anastasia Hanna Ougolnikova 3ddb35e51e Merge pull request '#6-spacy-service-aufbau' (#39) from #6-spacy-service-aufbau into main
Reviewed-on: #39
2025-05-26 07:57:19 +02:00
Zainab2604 377647db8b Removed __pycache__ 2025-05-25 17:45:43 +02:00
Abdulrahman Dabbagh af3eed2bdc Incorporated review comments 2025-05-25 16:48:07 +02:00
Zainab MohamedBasheer 31f894a194 Merge pull request '#14 File upload (UI)' (#36) from #14-upload-ui-frontend into main
Reviewed-on: #36
2025-05-25 15:31:08 +02:00