Commit Graph

10 Commits (594eb4ea48a3ec5f8217953220f789b3f3f3c91b)

Author SHA1 Message Date
Jaronim Pracht 594eb4ea48 Add: OCR sends pdf async to coordinator
Add progress tracking and storage lock for PDF processing
Refactor OCR service to handle PDF processing asynchronously
2025-06-07 12:40:32 +02:00
s8613 3992cac54f Working ocr, exxeta, spacy to validate 2025-06-04 09:28:16 +02:00
s8613 f3bee2b62b fixed small errors 2025-06-03 22:11:15 +02:00
s8613 1165bbbf08 Fixed json format 2025-06-03 21:17:30 +02:00
s8613 c65bbbdf1c First integration of flow 2025-06-03 18:25:29 +02:00
Jaronim Pracht b1cf30c40e Merge branch 'main' into #16-progress 2025-06-03 12:25:28 +02:00
Jaronim Pracht 1b06867d88 Fix showing pdfs in production
Removed redundant PDF.js worker initialization from
PDFViewer component and updated the worker source path in main.tsx.

Downgraded react-pdf to v8.0.2 to resolve compatibility issues and
fixed missing newline in nginx.conf.
2025-06-02 19:06:25 +02:00
Jaronim Pracht 9aa6c8be87 add progress and file-upload to frontend 2025-06-02 15:03:39 +02:00
Jaronim Pracht df5ac605c2 Add validate service with entity merging and validation
Implements a Flask microservice that receives entities from SpaCy and
Exxeta services, merges them based on normalized text matching, and
forwards validated results to coordinator. Also updates gunicorn
configuration with timeout and worker settings.
2025-05-30 13:44:13 +02:00
Jaronim Pracht 908050a2fb Containerize project
add compose for all services to start full project
2025-05-27 13:11:32 +02:00
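
Note on df5ac605c2: the commit message says the validate service merges entities from the SpaCy and Exxeta services "based on normalized text matching". The log does not show how this is done. Below is a minimal sketch of what such a merge could look like. It assumes each entity is a dict with hypothetical "text" and "label" keys; the normalization rule (NFKC, casefold, collapsed whitespace) and the function names are assumptions for illustration, not the service's actual code.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Assumed normalization: NFKC, case-insensitive, collapsed whitespace."""
    text = unicodedata.normalize("NFKC", text).casefold()
    return re.sub(r"\s+", " ", text).strip()


def merge_entities(spacy_entities, exxeta_entities):
    """Merge two entity lists, treating entities with equal normalized text as one.

    Each merged record keeps the first-seen surface text and label, plus the
    list of sources ("spacy", "exxeta") that reported it. Field names are
    hypothetical.
    """
    merged = {}
    for source, entities in (("spacy", spacy_entities), ("exxeta", exxeta_entities)):
        for entity in entities:
            key = normalize(entity["text"])
            record = merged.setdefault(
                key,
                {"text": entity["text"], "label": entity.get("label"), "sources": []},
            )
            if source not in record["sources"]:
                record["sources"].append(source)
    return list(merged.values())


if __name__ == "__main__":
    spacy_out = [{"text": "Acme  GmbH", "label": "ORG"}]
    exxeta_out = [
        {"text": "acme gmbh", "label": "ORG"},
        {"text": "Berlin", "label": "LOC"},
    ]
    for record in merge_entities(spacy_out, exxeta_out):
        print(record)
```

In this sketch, an entity reported by both sources ends up with two entries in "sources", which a validation step could use as a simple agreement signal before forwarding results to the coordinator.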