bertsky / ocrd_wrap Goto Github PK
View Code? Open in Web Editor NEWOCR-D wrapper for arbitrary coords-preserving image operations
License: MIT License
OCR-D wrapper for arbitrary coords-preserving image operations
License: MIT License
I am using the latest ocrd_all maximum image. Workspace used: https://gdz.sub.uni-goettingen.de/mets/PPN1023134829.mets.xml
N E X T F L O W ~ version 21.04.3
Launching `/scratch1/users/mmustaf/operandi/slurm_workspaces/a7752ccc-3908-4d6a-917c-036cf9ffef6c/user_workflow.nf` [hopeful_plateau] - revision: 4d3b00d56e
O P E R A N D I - H P C - D E F A U L T P I P E L I N E
===========================================
input_file_group : MAX
mets : /scratch1/users/mmustaf/operandi/slurm_workspaces/a7752ccc-3908-4d6a-917c-036cf9ffef6c/7ed688de-482d-439a-816f-75b2226c60db/mets.xml
volume_map_dir : /scratch1/users/mmustaf/operandi/slurm_workspaces/a7752ccc-3908-4d6a-917c-036cf9ffef6c
models_mapping : /scratch1/users/mmustaf/ocrd_models:/usr/local/share
sif_path : /scratch1/users/mmustaf/ocrd_all_maximum_image.sif
singularity_wrapper : singularity exec --bind /scratch1/users/mmustaf/operandi/slurm_workspaces/a7752ccc-3908-4d6a-917c-036cf9ffef6c --bind /scratch1/users/mmustaf/ocrd_models:/usr/local/share --env OCRD_METS_CACHING=true /scratch1/users/mmustaf/ocrd_all_maximum_image.sif
[2e/fe535b] Submitted process > ocrd_cis_ocropy_binarize
[66/3a142e] Submitted process > ocrd_anybaseocr_crop
[fe/6339fd] Submitted process > ocrd_skimage_denoise
Error executing process > 'ocrd_skimage_denoise'
Caused by:
Process `ocrd_skimage_denoise` terminated with an error exit status (1)
Command executed:
singularity exec --bind /scratch1/users/mmustaf/operandi/slurm_workspaces/a7752ccc-3908-4d6a-917c-036cf9ffef6c --bind /scratch1/users/mmustaf/ocrd_models:/usr/local/share --env OCRD_METS_CACHING=true /scratch1/users/mmustaf/ocrd_all_maximum_image.sif ocrd-skimage-denoise -m mets.xml -I OCR-D-CROP -O OCR-D-BIN-DENOISE -p '{"level-of-operation": "page"}'
Command exit status:
1
Command output:
(empty)
Command error:
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/PIL/Image.py", line 3089, in fromarray
mode, rawmode = _fromarray_typemap[typekey]
KeyError: ((1, 1, 2), '|b1')
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/build/core/ocrd/ocrd/processor/helpers.py", line 128, in run_processor
processor.process()
File "/usr/local/lib/python3.8/site-packages/ocrd_wrap/skimage_denoise.py", line 90, in process
self._process_segment(page, page_image, page_coords, dpi,
File "/usr/local/lib/python3.8/site-packages/ocrd_wrap/skimage_denoise.py", line 166, in _process_segment
image = Image.fromarray(~array2)
File "/usr/local/lib/python3.8/dist-packages/PIL/Image.py", line 3092, in fromarray
raise TypeError(msg) from e
TypeError: Cannot handle this data type: (1, 1, 2), |b1
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/PIL/Image.py", line 3089, in fromarray
mode, rawmode = _fromarray_typemap[typekey]
KeyError: ((1, 1, 2), '|b1')
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/local/bin/ocrd-skimage-denoise", line 8, in <module>
sys.exit(ocrd_skimage_denoise())
File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 1130, in __call__
return self.main(*args, **kwargs)
File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 1055, in main
rv = self.invoke(ctx)
File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 760, in invoke
return __callback(*args, **kwargs)
File "/usr/local/lib/python3.8/site-packages/ocrd_wrap/cli.py", line 33, in ocrd_skimage_denoise
return ocrd_cli_wrap_processor(SkimageDenoise, *args, **kwargs)
File "/build/core/ocrd/ocrd/decorators/__init__.py", line 116, in ocrd_cli_wrap_processor
run_processor(processorClass, mets_url=mets, workspace=workspace, **kwargs)
File "/build/core/ocrd/ocrd/processor/helpers.py", line 131, in run_processor
raise err
File "/build/core/ocrd/ocrd/processor/helpers.py", line 128, in run_processor
processor.process()
File "/usr/local/lib/python3.8/site-packages/ocrd_wrap/skimage_denoise.py", line 90, in process
self._process_segment(page, page_image, page_coords, dpi,
File "/usr/local/lib/python3.8/site-packages/ocrd_wrap/skimage_denoise.py", line 166, in _process_segment
image = Image.fromarray(~array2)
File "/usr/local/lib/python3.8/dist-packages/PIL/Image.py", line 3092, in fromarray
raise TypeError(msg) from e
TypeError: Cannot handle this data type: (1, 1, 2), |b1
The release 0.1.4 is missing here on GitHub!
Line 180 in 84db8bf
e.g.
mets.xml: Line 86: Element '{http://www.loc.gov/METS/}file', attribute 'ID': 'DESKEW_1586.IMG-BINARIZED,DESKEWED' is not a valid value of the atomic type 'xs:ID'.
Candidates for low-effort wrappers:
convert
with ocrd-preprocess-image
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.