From the Chair of Systems Design, ETHZ and the Swiss Data Science Center.
The goal is to develop an open-source, user-friendly tool for technical and non-technical users that performs page, block, and text-line segmentation and combines manual and automatic annotation.
- Structure your work into projects and case studies.
- Upload your PDF files.
- Annotate the results of the segmentation algorithm using the interactive dashboard.
- Automate the training of a classification algorithm.
- Export your results for further analysis.
Start the message broker:
docker-compose up
Start the backend:
source server/venv/bin/activate
make run
Start the worker:
source server/venv/bin/activate
make worker
Start the client:
cd client
yarn start
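The four steps above can also be collected into one script. The following is only a sketch, assuming it is run from the repository root: the `run_step` and `start_all` helpers, the `DRY_RUN` variable, and the `--start` flag are hypothetical names introduced here, while the commands themselves are the ones listed above.

```shell
#!/usr/bin/env bash
# Sketch: start the broker, backend, worker, and client from the repository root.
# run_step, start_all, DRY_RUN, and --start are hypothetical names, not part of the project.
set -eu

# Print the labelled command; unless DRY_RUN=1, also run it in the background.
run_step() {
  label="$1"
  shift
  echo "[$label] $*"
  if [ "${DRY_RUN:-0}" != "1" ]; then
    bash -c "$*" &
  fi
}

# Launch the services in the same order as the manual steps.
start_all() {
  run_step broker  "docker-compose up"
  run_step backend "source server/venv/bin/activate && make run"
  run_step worker  "source server/venv/bin/activate && make worker"
  run_step client  "cd client && yarn start"
}

# Only start anything when explicitly asked to.
if [ "${1:-}" = "--start" ]; then
  start_all
  wait  # keep the script alive while the services run
fi
```

If the block is saved as, say, `start_all.sh`, then `source start_all.sh` followed by `DRY_RUN=1 start_all` prints the four commands without running them, and `bash start_all.sh --start` launches them. The sketch does not wait for the message broker to be ready before starting the backend and the worker, so the separate manual steps remain the safer route when a service fails to connect.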
- Node.js and React.js deliver the interactive dashboard.
- Tornado runs the data backend.
- Celery, with RabbitMQ as the message broker, executes asynchronous tasks.