linuxserver / docker-paperless-ngx Goto Github PK
View Code? Open in Web Editor NEWLicense: GNU General Public License v3.0
License: GNU General Public License v3.0
Files in consume folder should be automatically processed by paperless-ngx
Nothing happens after saving pdf files into to consume folder
**OS: Ubunti 20.04
**CPU architecture: x86_64
How docker service was installed:
Using the provided docker-compose.yml with some changes regarding the port and paths.
docker-compose up -d
paperless ngx docker not working on unraid,
connection reset when opening webui.
1.install paperless ngx docker from CA
2.
3.
4.
OS: Unraid
CPU architecture: x86_64
How docker service was installed:
From CA using default parameters, except port mapping
21:02:06 [Q] INFO Process-1:1 ready for work at 439
21:02:06 [Q] INFO Process-1:2 ready for work at 440
21:02:06 [Q] INFO Process-1:3 ready for work at 441
21:02:06 [Q] INFO Process-1:4 ready for work at 442
21:02:06 [Q] INFO Process-1:5 monitoring at 443
21:02:06 [Q] INFO Process-1 guarding cluster zulu-hot-jupiter-four
21:02:06 [Q] INFO Process-1:6 pushing tasks at 444
21:02:06 [Q] INFO Q Cluster zulu-hot-jupiter-four running.
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
21:02:36 [Q] INFO Enqueued 1
21:02:36 [Q] INFO Process-1 created a task from schedule [Train the classifier]
21:02:36 [Q] INFO Process-1:1 processing [eight-pluto-nuts-montana]
21:02:36 [Q] INFO Enqueued 1
21:02:36 [Q] INFO Process-1 created a task from schedule [Optimize the index]
21:02:36 [Q] INFO Process-1:2 processing [papa-oranges-arizona-earth]
21:02:36 [Q] INFO Enqueued 1
21:02:36 [Q] INFO Process-1 created a task from schedule [Perform sanity check]
21:02:36 [Q] INFO Enqueued 1
21:02:36 [Q] INFO Process-1 created a task from schedule [Check all e-mail accounts]
21:02:36 [Q] INFO Process-1:3 processing [leopard-echo-bacon-uniform]
21:02:36 [Q] INFO Process-1:4 processing [oklahoma-cat-seventeen-jersey]
21:02:36 [Q] INFO Process-1:4 stopped doing work
21:02:36 [Q] INFO Processed [oklahoma-cat-seventeen-jersey]
21:02:36 [Q] INFO Process-1:1 stopped doing work
[2022-04-05 21:02:36,490] [INFO] [paperless.sanity_checker] Sanity checker detected no issues.
21:02:36 [Q] INFO Process-1:3 stopped doing work
21:02:36 [Q] INFO Process-1:2 stopped doing work
21:02:36 [Q] INFO Processed [eight-pluto-nuts-montana]
21:02:36 [Q] INFO Processed [leopard-echo-bacon-uniform]
21:02:36 [Q] INFO Processed [papa-oranges-arizona-earth]
21:02:36 [Q] INFO recycled worker Process-1:1
21:02:36 [Q] INFO Process-1:7 ready for work at 451
21:02:36 [Q] INFO recycled worker Process-1:3
21:02:36 [Q] INFO Process-1:8 ready for work at 452
21:02:37 [Q] INFO recycled worker Process-1:2
21:02:37 [Q] INFO Process-1:9 ready for work at 453
21:02:37 [Q] INFO recycled worker Process-1:4
21:02:37 [Q] INFO Process-1:10 ready for work at 454
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
invalid HTTP request size (max 4096)...skip
Redis service config should be removed by 50-config
if the environment variable REDIS_URL
is set
The redis service is started anyway because of the superfluous "$" that fails the check if REDIS_URL was set. I prepared a quick-fix PR but opened this issue as your PR template suggested (typo in code).
OS: n/a
CPU architecture: n/a
How docker service was installed: n/a
name: Bug report
about: Create a report to help us improve
UI upload should work
UI upload fails
Missing libzbar0 package. Running
apt install libzbar0
solves the issue.
Fresh install.
OS: Ubuntu 21.10
CPU architecture: amd64
How docker service was installed: Installed from official docker repo
Copied compose text from dockerhub and ran from Portainer stacks.
Process Process-1:1:
Traceback (most recent call last):
File "/usr/lib/python3.8/pydoc.py", line 347, in safeimport
module = import(path)
File "/app/paperless/src/documents/tasks.py", line 25, in
from pyzbar import pyzbar
File "/usr/local/lib/python3.8/dist-packages/pyzbar/pyzbar.py", line 7, in
from .wrapper import (
File "/usr/local/lib/python3.8/dist-packages/pyzbar/wrapper.py", line 151, in
zbar_version = zbar_function(
File "/usr/local/lib/python3.8/dist-packages/pyzbar/wrapper.py", line 148, in zbar_function
return prototype((fname, load_libzbar()))
File "/usr/local/lib/python3.8/dist-packages/pyzbar/wrapper.py", line 127, in load_libzbar
libzbar, dependencies = zbar_library.load()
File "/usr/local/lib/python3.8/dist-packages/pyzbar/zbar_library.py", line 65, in load
raise ImportError('Unable to find zbar shared library')
ImportError: Unable to find zbar shared library
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/usr/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap
self.run()
File "/usr/lib/python3.8/multiprocessing/process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python3.8/dist-packages/django_q/cluster.py", line 424, in worker
f = pydoc.locate(f)
File "/usr/lib/python3.8/pydoc.py", line 1632, in locate
nextmodule = safeimport('.'.join(parts[:n+1]), forceload)
File "/usr/lib/python3.8/pydoc.py", line 362, in safeimport
raise ErrorDuringImport(path, sys.exc_info())
pydoc.ErrorDuringImport: problem in documents.tasks - ImportError: Unable to find zbar shared library
The latest docker-paperless-ngx image should pull the latest redis server.
The default, ancient redis is installed from ubuntu's repo.
This old version has severe performance issues, increasing the idling CPU resource of paperless-ngx. See also paperless-ngx/paperless-ngx#1084
Create a new container
Please add the following core (official way to install redis) before this line
curl -fsSL https://packages.redis.io/gpg | gpg --dearmor -o /usr/share/keyrings/redis-archive-keyring.gpg
echo "deb [signed-by=/usr/share/keyrings/redis-archive-keyring.gpg] https://packages.redis.io/deb $(lsb_release -cs) main" | tee /etc/apt/sources.list.d/redis.list
The rest is fine - the apt-get update
then apt-get install redis
will fetch it from the official redis repo rather than the outdated ubuntu repo.
The defaults changed, so you'll also need to add this (disable persistence) to the redis config
appendonly no
save ""
Hello. Thank you for creating an image for paperless-ngx. It will be very useful to me and I'm sure a lot of people.
I set up the containers using the docker run method (since I'm not that familiar with docker-compose).
Everything looks like it worked like it should, but I can't figure out how to login or create the default admin account. I've read through the docs and tried a few things but nothing worked. I found a method using docker-compose but since I'm using the docker run method wasn't able to try it. Do you have any suggestions or ideas?
Thanks in advance,
Adit
Pulling "lscr.io/linuxserver/paperless-ngx:arm64v8-latest" should get the latest image.
It works with dockerhub image, but fails with ghcr ones (Head "https://ghcr.io/v2/linuxserver/paperless-ngx/manifests/arm32v7-latest": unauthorized)
OS: Linux
CPU architecture: aarch64
How docker service was installed: Using Home Assistant OS
When configured to run against a Postgresql host, it should be able to connect to the host.
Paperless-ngx fails to start with the error in the title.
Compose configuration:
paperless-ng:
image: lscr.io/linuxserver/paperless-ngx:latest
container_name: paperless-ng
environment:
- PUID=1001
- PGID=100
- TZ=Europe/London
- PAPERLESS_DBHOST=postgres
- PAPERLESS_OCR_MODE=skip_noarchive
- PAPERLESS_FILENAME_FORMAT={document_type}/{title}
# - DOCKER_MODS=chooban/universal-pip3-install:latest
# - INSTALL_PACKAGES=psycopg2-binary
volumes:
- /share/CACHEDEV1_DATA/Config/paperlessng:/config
- /share/CACHEDEV2_DATA/documents:/data
ports:
- 8000:8000
restart: unless-stopped
postgres:
image: postgres:13
restart: unless-stopped
container_name: postgres
volumes:
- /share/CACHEDEV1_DATA/Config/postgres/:/var/lib/postgresql/data
environment:
POSTGRES_DB: paperless
POSTGRES_USER: paperless
POSTGRES_PASSWORD: paperless
Running the container with that fails to start the paperless service. I have a small mod that I use to add python modules in a few containers and by using that (see commented lines above) the service starts, so it seems to be a missing dependancy.
OS: Linux
CPU architecture: x86_64
How docker service was installed: QNAP NAS provided
Waiting for redis to become available...
[2022-05-08 09:49:02 +0000] [965] [INFO] Starting gunicorn 20.1.0
[2022-05-08 09:49:02 +0000] [965] [INFO] Listening at: http://0.0.0.0:8000 (965)
[2022-05-08 09:49:02 +0000] [965] [INFO] Using worker: paperless.workers.ConfigurableWorker
[2022-05-08 09:49:02 +0000] [965] [INFO] Server is ready. Spawning workers
[2022-05-08 09:49:02 +0000] [965] [WARNING] Worker with pid 981 was terminated due to signal 15
[2022-05-08 09:49:02 +0000] [965] [INFO] Shutting down: Master
[2022-05-08 09:49:02 +0000] [965] [INFO] Reason: Worker failed to boot.
Waiting for redis to become available...
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/django/db/backends/postgresql/base.py", line 24, in <module>
import psycopg2 as Database
File "/usr/local/lib/python3.8/dist-packages/psycopg2/__init__.py", line 51, in <module>
from psycopg2._psycopg import ( # noqa
ImportError: libpq.so.5: cannot open shared object file: No such file or directory
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "manage.py", line 11, in <module>
execute_from_command_line(sys.argv)
File "/usr/local/lib/python3.8/dist-packages/django/core/management/__init__.py", line 446, in execute_from_command_line
utility.execute()
File "/usr/local/lib/python3.8/dist-packages/django/core/management/__init__.py", line 420, in execute
django.setup()
File "/usr/local/lib/python3.8/dist-packages/django/__init__.py", line 24, in setup
apps.populate(settings.INSTALLED_APPS)
File "/usr/local/lib/python3.8/dist-packages/django/apps/registry.py", line 116, in populate
app_config.import_models()
File "/usr/local/lib/python3.8/dist-packages/django/apps/config.py", line 304, in import_models
self.models_module = import_module(models_module_name)
File "/usr/lib/python3.8/importlib/__init__.py", line 127, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
File "<frozen importlib._bootstrap>", line 1014, in _gcd_import
File "<frozen importlib._bootstrap>", line 991, in _find_and_load
File "<frozen importlib._bootstrap>", line 975, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 671, in _load_unlocked
File "<frozen importlib._bootstrap_external>", line 848, in exec_module
File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
File "/usr/local/lib/python3.8/dist-packages/django/contrib/auth/models.py", line 3, in <module>
from django.contrib.auth.base_user import AbstractBaseUser, BaseUserManager
File "/usr/local/lib/python3.8/dist-packages/django/contrib/auth/base_user.py", line 49, in <module>
class AbstractBaseUser(models.Model):
File "/usr/local/lib/python3.8/dist-packages/django/db/models/base.py", line 141, in __new__
new_class.add_to_class("_meta", Options(meta, app_label))
File "/usr/local/lib/python3.8/dist-packages/django/db/models/base.py", line 369, in add_to_class
value.contribute_to_class(cls, name)
File "/usr/local/lib/python3.8/dist-packages/django/db/models/options.py", line 235, in contribute_to_class
self.db_table, connection.ops.max_name_length()
File "/usr/local/lib/python3.8/dist-packages/django/utils/connection.py", line 15, in __getattr__
return getattr(self._connections[self._alias], item)
File "/usr/local/lib/python3.8/dist-packages/django/utils/connection.py", line 62, in __getitem__
conn = self.create_connection(alias)
File "/usr/local/lib/python3.8/dist-packages/django/db/utils.py", line 208, in create_connection
backend = load_backend(db["ENGINE"])
File "/usr/local/lib/python3.8/dist-packages/django/db/utils.py", line 113, in load_backend
return import_module("%s.base" % backend_name)
File "/usr/lib/python3.8/importlib/__init__.py", line 127, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
File "/usr/local/lib/python3.8/dist-packages/django/db/backends/postgresql/base.py", line 28, in <module>
raise ImproperlyConfigured("Error loading psycopg2 module: %s" % e)
django.core.exceptions.ImproperlyConfigured: Error loading psycopg2 module: libpq.so.5: cannot open shared object file: No such file or directory
Waiting for redis to become available...
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/django/db/backends/postgresql/base.py", line 24, in <module>
import psycopg2 as Database
File "/usr/local/lib/python3.8/dist-packages/psycopg2/__init__.py", line 51, in <module>
from psycopg2._psycopg import ( # noqa
ImportError: libpq.so.5: cannot open shared object file: No such file or directory
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "manage.py", line 11, in <module>
execute_from_command_line(sys.argv)
File "/usr/local/lib/python3.8/dist-packages/django/core/management/__init__.py", line 446, in execute_from_command_line
utility.execute()
File "/usr/local/lib/python3.8/dist-packages/django/core/management/__init__.py", line 420, in execute
django.setup()
File "/usr/local/lib/python3.8/dist-packages/django/__init__.py", line 24, in setup
apps.populate(settings.INSTALLED_APPS)
File "/usr/local/lib/python3.8/dist-packages/django/apps/registry.py", line 116, in populate
app_config.import_models()
File "/usr/local/lib/python3.8/dist-packages/django/apps/config.py", line 304, in import_models
self.models_module = import_module(models_module_name)
File "/usr/lib/python3.8/importlib/__init__.py", line 127, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
File "<frozen importlib._bootstrap>", line 1014, in _gcd_import
File "<frozen importlib._bootstrap>", line 991, in _find_and_load
File "<frozen importlib._bootstrap>", line 975, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 671, in _load_unlocked
File "<frozen importlib._bootstrap_external>", line 848, in exec_module
File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
File "/usr/local/lib/python3.8/dist-packages/django/contrib/auth/models.py", line 3, in <module>
from django.contrib.auth.base_user import AbstractBaseUser, BaseUserManager
File "/usr/local/lib/python3.8/dist-packages/django/contrib/auth/base_user.py", line 49, in <module>
class AbstractBaseUser(models.Model):
File "/usr/local/lib/python3.8/dist-packages/django/db/models/base.py", line 141, in __new__
new_class.add_to_class("_meta", Options(meta, app_label))
File "/usr/local/lib/python3.8/dist-packages/django/db/models/base.py", line 369, in add_to_class
value.contribute_to_class(cls, name)
File "/usr/local/lib/python3.8/dist-packages/django/db/models/options.py", line 235, in contribute_to_class
self.db_table, connection.ops.max_name_length()
File "/usr/local/lib/python3.8/dist-packages/django/utils/connection.py", line 15, in __getattr__
return getattr(self._connections[self._alias], item)
File "/usr/local/lib/python3.8/dist-packages/django/utils/connection.py", line 62, in __getitem__
conn = self.create_connection(alias)
File "/usr/local/lib/python3.8/dist-packages/django/db/utils.py", line 208, in create_connection
backend = load_backend(db["ENGINE"])
File "/usr/local/lib/python3.8/dist-packages/django/db/utils.py", line 113, in load_backend
return import_module("%s.base" % backend_name)
File "/usr/lib/python3.8/importlib/__init__.py", line 127, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
File "/usr/local/lib/python3.8/dist-packages/django/db/backends/postgresql/base.py", line 28, in <module>
raise ImproperlyConfigured("Error loading psycopg2 module: %s" % e)
django.core.exceptions.ImproperlyConfigured: Error loading psycopg2 module: libpq.so.5: cannot open shared object file: No such file or directory
Waiting for redis to become available...
[2022-05-08 09:49:13 +0000] [983] [INFO] Starting gunicorn 20.1.0
[2022-05-08 09:49:13 +0000] [983] [INFO] Listening at: http://0.0.0.0:8000 (983)
[2022-05-08 09:49:13 +0000] [983] [INFO] Using worker: paperless.workers.ConfigurableWorker
[2022-05-08 09:49:13 +0000] [983] [INFO] Server is ready. Spawning workers
[2022-05-08 09:49:13 +0000] [983] [WARNING] Worker with pid 998 was terminated due to signal 15
[2022-05-08 09:49:14 +0000] [983] [INFO] Shutting down: Master
[2022-05-08 09:49:14 +0000] [983] [INFO] Reason: Worker failed to boot.
Waiting for redis to become available...
Today I discovered that my e-mail jobs were not executed while listed as successfully in the task overview. Also deleting or modifying rules or accounts for e-mails resulted in a server 500 error. In both cases, the log complained about a missing db table.
Filing a bug in the paperless-ngx project (paperless-ngx/paperless-ngx#936) about this lead to a better understanding what's going on here.
Somehow when the lsio image in the 50-config
job finds an error (ocr language is not installed), it does not actually perform the migration steps.
The multilang ocr mod is applied after this script is being run. So probably the docker mod could be loaded earlier?
However once started with the missing ocr language disabled, the container successgfully performed the migrations. I can add the language through the dockermod again afterwards and apparently now it is all back to normal.
Beside being not. Because now everytime you need the migrations, you have to temporarily disable the ocr multilang mod once.
Migrations should be executed even when a missing ocr language is detected / the ocr multilang mod is enabled
[cont-init.d] 50-config: executing...
paperless-ngx | No changes detected
paperless-ngx | Operations to perform:
paperless-ngx | Apply all migrations: admin, auth, authtoken, contenttypes, django_q, documents, paperless_mail, sessions
paperless-ngx | Running migrations:
paperless-ngx | No migrations to apply.
paperless-ngx | [cont-init.d] 50-config: exited 0.
Migrations are not run:
[cont-init.d] 50-config: executing...
paperless-ngx | SystemCheckError: System check identified some issues:
paperless-ngx |
paperless-ngx | ERRORS:
paperless-ngx | ?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
paperless-ngx | SystemCheckError: System check identified some issues:
paperless-ngx |
paperless-ngx | ERRORS:
paperless-ngx | ?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
paperless-ngx | [cont-init.d] 50-config: exited 0.
OS: debian buster
CPU architecture: x86_64
How docker service was installed:
official repo
env:
DOCKER_MODS=linuxserver/mods:papermerge-multilangocr
OCRLANG=deu
USERMAP_UID=995
USERMAP_GID=995
PAPERLESS_OCR_LANGUAGE=deu+eng
PAPERLESS_FILENAME_FORMAT={correspondent}-{created_year}{created_month}{created_day}-{document_type}-{tag_list}
PAPERLESS_TIME_ZONE=Europe/Berlin
compose:
---
version: "2.1"
services:
paperless-ngx:
image: lscr.io/linuxserver/paperless-ngx
container_name: paperless-ngx
environment:
- PUID=${USERMAP_UID}
- PGID=${USERMAP_GID}
- TZ=${PAPERLESS_TIME_ZONE}
- PAPERLESS_TIME_ZONE=${PAPERLESS_TIME_ZONE}
- DOCKER_MODS=${DOCKER_MODS}
- OCRLANG=${OCRLANG}
- PAPERLESS_OCR_LANGUAGE=${PAPERLESS_OCR_LANGUAGE}
- PAPERLESS_FILENAME_FORMAT=${PAPERLESS_FILENAME_FORMAT}
volumes:
- config:/config
- data:/data
ports:
- 8001:8000
restart: unless-stopped
See in the bug report over at paperless-ngx, there is a full log
An uploaded image should be stored and processed by OCR
The upload crashes on certain images (maybe of a certain size?)
OS: Synology
CPU architecture: linux x86_64
How docker service was installed:
Docker-compose:
version: "3"
services:
paperless:
container_name: paperless
cpu_percent: 25
mem_limit: 256m
image: linuxserver/paperless-ngx:latest
network_mode: "bridge"
restart: always
ports:
- 6688:8000 # host:container
volumes:
- ${LOCAL_SOC_LOCATION}/paperless/data:/data
- ${APP_DATA_LOCATION}/paperless-ngx:/config
environment:
- PAPERLESS_DATA_DIR=/config
- PAPERLESS_MEDIA_ROOT=/data/media
- PAPERLESS_CONSUMPTION_DIR=/data/consume
[2022-07-08 06:29:09,422] [DEBUG] [paperless.parsing.tesseract] Deleting directory /tmp/paperless/paperless-xlzgcu_k
[2022-07-08 06:29:09,454] [ERROR] [paperless.consumer] Error while consuming document crop_5547.jpeg: TypeError: shutdown() got an unexpected keyword argument 'cancel_futures'
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_exec/tesseract.py", line 226, in get_deskew
p = run(args_tesseract, stdout=PIPE, stderr=STDOUT, timeout=timeout, check=True)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/subprocess/__init__.py", line 53, in run
proc = subprocess_run(args, env=env, **kwargs)
File "/usr/lib/python3.8/subprocess.py", line 516, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['tesseract', '-l', 'eng', '--psm', '2', '/tmp/ocrmypdf.io._d8p80vg/000001_rasterize.png', 'stdout']' died with <Signals.SIGKILL: 9>.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/builtin_plugins/concurrency.py", line 135, in _execute
result = future.result()
File "/usr/lib/python3.8/concurrent/futures/_base.py", line 437, in result
return self.__get_result()
File "/usr/lib/python3.8/concurrent/futures/_base.py", line 389, in __get_result
raise self._exception
File "/usr/lib/python3.8/concurrent/futures/thread.py", line 57, in run
result = self.fn(*self.args, **self.kwargs)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_sync.py", line 192, in exec_page_sync
ocr_image, preprocess_out = make_intermediate_images(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_sync.py", line 135, in make_intermediate_images
preprocess_out = preprocess(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_sync.py", line 106, in preprocess
image = preprocess_deskew(image, page_context)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_pipeline.py", line 488, in preprocess_deskew
deskew_angle_degrees = ocr_engine.get_deskew(input_file, page_context.options)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/builtin_plugins/tesseract_ocr.py", line 167, in get_deskew
return tesseract.get_deskew(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_exec/tesseract.py", line 237, in get_deskew
raise SubprocessOutputError() from e
ocrmypdf.exceptions.SubprocessOutputError
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/app/paperless/src/paperless_tesseract/parsers.py", line 261, in parse
ocrmypdf.ocr(**args)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/api.py", line 337, in ocr
return run_pipeline(options=options, plugin_manager=plugin_manager, api=True)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_sync.py", line 385, in run_pipeline
exec_concurrent(context, executor)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_sync.py", line 274, in exec_concurrent
executor(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_concurrent.py", line 82, in __call__
self._execute(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/builtin_plugins/concurrency.py", line 148, in _execute
executor.shutdown(wait=False, cancel_futures=True)
TypeError: shutdown() got an unexpected keyword argument 'cancel_futures'
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/app/paperless/src/documents/consumer.py", line 261, in try_consume_file
document_parser.parse(self.path, mime_type, self.filename)
File "/app/paperless/src/paperless_tesseract/parsers.py", line 314, in parse
raise ParseError(f"{e.__class__.__name__}: {str(e)}")
documents.parsers.ParseError: TypeError: shutdown() got an unexpected keyword argument 'cancel_futures'
I assume this issue is related to an upgrade of Python to 3.8. A similar issue was reported on this project here: DiscordTime/ggdrive#56
When uploading a document, this error turns up:
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/pikepdf/__init__.py", line 13, in <module>
from . import _qpdf
ImportError: libqpdf.so.28: cannot open shared object file: No such file or directory
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/lib/python3.8/pydoc.py", line 347, in safeimport
module = __import__(path)
File "/app/paperless/src/documents/tasks.py", line 24, in <module>
from pikepdf import Pdf
File "/usr/local/lib/python3.8/dist-packages/pikepdf/__init__.py", line 16, in <module>
raise ImportError(_msg) from _e
ImportError: pikepdf's extension library failed to import
The document can be imported without problems.
When uploading a document, the UI hangs. In the logs, I can find the above error.
OS: Manjaro
CPU architecture: arm64
How docker service was installed: From distro repo
docker create -e DOCKER_MODS=linuxserver/mods:papermerge-multilangocr -e OCRLANG="deu,eng" linuxserver/paperless-ngx
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/pikepdf/__init__.py", line 13, in <module>
from . import _qpdf
ImportError: libqpdf.so.28: cannot open shared object file: No such file or directory
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/lib/python3.8/pydoc.py", line 347, in safeimport
module = __import__(path)
File "/app/paperless/src/documents/tasks.py", line 24, in <module>
from pikepdf import Pdf
File "/usr/local/lib/python3.8/dist-packages/pikepdf/__init__.py", line 16, in <module>
raise ImportError(_msg) from _e
ImportError: pikepdf's extension library failed to import
I try to upload a simple pdf file via the gui and i get the an error message. I already tried installing afterwards some packages that could be neccessary, from similar issues, but it didn't fix it for me.
I'm working with portainer on a raspberry pi 4. The error message i get:
Upload a file without issues
Upload a file with issues
OS: openmediavault
CPU architecture: ARMv8
How docker service was installed: via openmediavault package management, portainer aswell
version: "2.1"
services:
paperless-ngx:
image: lscr.io/linuxserver/paperless-ngx:latest
container_name: paperless
environment:
- PUID=1000
- PGID=1000
- TZ=Europe/Berlin
volumes:
- /appdata/paperless:/config
- /srv/dev-disk-by-uuid-34797fac-e6fb-410c-b5f9-bf6ac6a50b9e/paperless:/data
ports:
- 8001:8000
restart: unless-stopped
[2022-05-29 20:54:39,365] [INFO] [paperless.management.consumer] Using inotify to watch directory for changes: /data/consume
[2022-05-29 20:55:05,878] [INFO] [paperless.consumer] Consuming kfz.pdf
[2022-05-29 20:55:05,882] [DEBUG] [paperless.consumer] Detected mime type: application/pdf
[2022-05-29 20:55:05,890] [DEBUG] [paperless.consumer] Parser: RasterisedDocumentParser
[2022-05-29 20:55:05,898] [DEBUG] [paperless.consumer] Parsing kfz.pdf...
[2022-05-29 20:55:11,091] [INFO] [paperless.sanity_checker] Sanity checker detected no issues.
[2022-05-29 20:55:11,496] [DEBUG] [paperless.parsing.tesseract] Extracted text from PDF file /tmp/paperless/paperless-upload-m25o3eqb
[2022-05-29 20:55:11,659] [DEBUG] [paperless.parsing.tesseract] Calling OCRmyPDF with args: {'input_file': '/tmp/paperless/paperless-upload-m25o3eqb', 'output_file': '/tmp/paperless/paperless-w60t_auu/archive.pdf', 'use_threads': True, 'jobs': 2, 'language': 'eng', 'output_type': 'pdfa', 'progress_bar': False, 'skip_text': True, 'clean': True, 'deskew': True, 'rotate_pages': True, 'rotate_pages_threshold': 12.0, 'sidecar': '/tmp/paperless/paperless-w60t_auu/sidecar.txt'}
[2022-05-29 20:55:15,716] [DEBUG] [paperless.parsing.tesseract] Deleting directory /tmp/paperless/paperless-w60t_auu
[2022-05-29 20:55:15,728] [ERROR] [paperless.consumer] Error while consuming document kfz.pdf: TypeError: shutdown() got an unexpected keyword argument 'cancel_futures'
Traceback (most recent call last):
File "/usr/lib/python3.8/concurrent/futures/thread.py", line 57, in run
result = self.fn(*self.args, **self.kwargs)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 638, in _pdf_pageinfo_sync
page = PageInfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 716, in __init__
self._gather_pageinfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 762, in _gather_pageinfo
for ci in _process_content_streams(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 565, in _process_content_streams
yield from _find_regular_images(container, contentsinfo)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 483, in _find_regular_images
yield ImageInfo(name=draw.name, pdfimage=pdfimage, shorthand=draw.shorthand)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 353, in __init__
pim_icc = pim.icc
File "/usr/local/lib/python3.8/dist-packages/pikepdf/models/image.py", line 479, in icc
self._icc = ImageCms.ImageCmsProfile(iccbytesio)
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageCms.py", line 202, in __init__
self._set(core.profile_frombytes(profile.read()))
File "/usr/local/lib/python3.8/dist-packages/PIL/_util.py", line 19, in __getattr__
raise self.ex
File "/usr/lib/python3.8/concurrent/futures/thread.py", line 57, in run
result = self.fn(*self.args, **self.kwargs)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 638, in _pdf_pageinfo_sync
page = PageInfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 716, in __init__
self._gather_pageinfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 762, in _gather_pageinfo
for ci in _process_content_streams(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 565, in _process_content_streams
yield from _find_regular_images(container, contentsinfo)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 483, in _find_regular_images
yield ImageInfo(name=draw.name, pdfimage=pdfimage, shorthand=draw.shorthand)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 353, in __init__
pim_icc = pim.icc
File "/usr/local/lib/python3.8/dist-packages/pikepdf/models/image.py", line 479, in icc
self._icc = ImageCms.ImageCmsProfile(iccbytesio)
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageCms.py", line 202, in __init__
self._set(core.profile_frombytes(profile.read()))
File "/usr/local/lib/python3.8/dist-packages/PIL/_util.py", line 19, in __getattr__
raise self.ex
File "/usr/lib/python3.8/concurrent/futures/thread.py", line 57, in run
result = self.fn(*self.args, **self.kwargs)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 638, in _pdf_pageinfo_sync
page = PageInfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 716, in __init__
self._gather_pageinfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 762, in _gather_pageinfo
for ci in _process_content_streams(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 565, in _process_content_streams
yield from _find_regular_images(container, contentsinfo)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 483, in _find_regular_images
yield ImageInfo(name=draw.name, pdfimage=pdfimage, shorthand=draw.shorthand)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 353, in __init__
pim_icc = pim.icc
File "/usr/local/lib/python3.8/dist-packages/pikepdf/models/image.py", line 479, in icc
self._icc = ImageCms.ImageCmsProfile(iccbytesio)
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageCms.py", line 202, in __init__
self._set(core.profile_frombytes(profile.read()))
File "/usr/local/lib/python3.8/dist-packages/PIL/_util.py", line 19, in __getattr__
raise self.ex
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/builtin_plugins/concurrency.py", line 135, in _execute
result = future.result()
File "/usr/lib/python3.8/concurrent/futures/_base.py", line 437, in result
return self.__get_result()
File "/usr/lib/python3.8/concurrent/futures/_base.py", line 389, in __get_result
raise self._exception
File "/usr/lib/python3.8/concurrent/futures/thread.py", line 57, in run
result = self.fn(*self.args, **self.kwargs)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 638, in _pdf_pageinfo_sync
page = PageInfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 716, in __init__
self._gather_pageinfo(pdf, pageno, infile, check_pages, detailed_analysis)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 762, in _gather_pageinfo
for ci in _process_content_streams(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 565, in _process_content_streams
yield from _find_regular_images(container, contentsinfo)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 483, in _find_regular_images
yield ImageInfo(name=draw.name, pdfimage=pdfimage, shorthand=draw.shorthand)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 353, in __init__
pim_icc = pim.icc
File "/usr/local/lib/python3.8/dist-packages/pikepdf/models/image.py", line 479, in icc
self._icc = ImageCms.ImageCmsProfile(iccbytesio)
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageCms.py", line 202, in __init__
self._set(core.profile_frombytes(profile.read()))
File "/usr/local/lib/python3.8/dist-packages/PIL/_util.py", line 19, in __getattr__
raise self.ex
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageCms.py", line 25, in <module>
from PIL import _imagingcms
ImportError: cannot import name '_imagingcms' from 'PIL' (/usr/local/lib/python3.8/dist-packages/PIL/__init__.py)
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/app/paperless/src/paperless_tesseract/parsers.py", line 261, in parse
ocrmypdf.ocr(**args)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/api.py", line 337, in ocr
return run_pipeline(options=options, plugin_manager=plugin_manager, api=True)
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_sync.py", line 370, in run_pipeline
pdfinfo = get_pdfinfo(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_pipeline.py", line 163, in get_pdfinfo
return PdfInfo(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 904, in __init__
self._pages = _pdf_pageinfo_concurrent(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/pdfinfo/info.py", line 681, in _pdf_pageinfo_concurrent
executor(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/_concurrent.py", line 82, in __call__
self._execute(
File "/usr/local/lib/python3.8/dist-packages/ocrmypdf/builtin_plugins/concurrency.py", line 148, in _execute
executor.shutdown(wait=False, cancel_futures=True)
TypeError: shutdown() got an unexpected keyword argument 'cancel_futures'
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/app/paperless/src/documents/consumer.py", line 261, in try_consume_file
document_parser.parse(self.path, mime_type, self.filename)
File "/app/paperless/src/paperless_tesseract/parsers.py", line 314, in parse
raise ParseError(f"{e.__class__.__name__}: {str(e)}")
documents.parsers.ParseError: TypeError: shutdown() got an unexpected keyword argument 'cancel_futures'
On a fresh install, when you try to add a .txt document (probably any document), via the web or ./data/consume folder, you get a traceback in the logs and the file is not added.
Adding a file should add the file :)
Adding a file via web or ./data/consume you get the following error
8:49:31 [Q] ERROR Failed [test.txt] - cannot open resource : Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/django_q/cluster.py", line 432, in worker
res = f(*task["args"], **task["kwargs"])
File "/app/paperless/src/documents/tasks.py", line 298, in consume_file
document = Consumer().try_consume_file(
File "/app/paperless/src/documents/consumer.py", line 265, in try_consume_file
thumbnail = document_parser.get_optimised_thumbnail(
File "/app/paperless/src/documents/parsers.py", line 320, in get_optimised_thumbnail
thumbnail = self.get_thumbnail(document_path, mime_type, file_name)
File "/app/paperless/src/paperless_text/parsers.py", line 28, in get_thumbnail
font = ImageFont.truetype(
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageFont.py", line 878, in truetype
return freetype(font)
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageFont.py", line 875, in freetype
return FreeTypeFont(font, size, index, encoding, layout_engine)
File "/usr/local/lib/python3.8/dist-packages/PIL/ImageFont.py", line 226, in __init__
self.font = core.getfont(
OSError: cannot open resource
OS:
CPU architecture: x86_64
How docker service was installed:
dockerhub, via docker run ...
docker run -d --name paperless-ngx --restart=always \
-p 8844:8000 \
-v /docker/paperless-ngx/config:/config \
-v /docker/paperless-ngx/data:/data \
--env PUID=1000 --env PGID=1000 \
--env TZ=America/Los_Angeles \
-l "diun.enable=true"
linuxserver/paperless-ngx:latest
see above
See the config for PAPERLESS_THUMBNAIL_FONT_NAME
https://paperless-ngx.readthedocs.io/en/latest/configuration.html?highlight=font#software-tweaks
PAPERLESS_THUMBNAIL_FONT_NAME=<filename>
Paperless creates thumbnails for plain text files by rendering the content of the file on an image and uses a predefined font for that. This font can be changed here.
Note that this won’t have any effect on already generated thumbnails.
Defaults to /usr/share/fonts/liberation/LiberationSerif-Regular.ttf.
Adding the missing file /usr/share/fonts/liberation/LiberationSerif-Regular.ttf allows import to function
Process mail-attachments in PDF format.
paperless.log reports: Error while processing mail account
mail.log: Error while authenticating account
Every 10 minutes I get an error in the logs.
e-mail-account configured in paperless-ngx:
Name: [email protected]
IMAP-Server: imap.gmail.com
IMAP-Port: 993
IMAP-Security: Use SSL
Username: [email protected]
Password: *********
OS: Ubuntu 20.04 LTS server
CPU architecture: x86_64
How docker service was installed:
docker-compose.yml:
version: "2.1"
services:
paperless-ngx:
image: lscr.io/linuxserver/paperless-ngx:latest
container_name: paperless-ngx
volumes:
- /home/john/docker/paperless-ngx:/config
- /mnt/nas/paperless-ngx:/data
env_file: docker-compose.env
environment:
- PUID=1000
- PGID=1000
ports:
- 8601:8000
restart: unless-stopped
paperless.log:
[2022-09-02 16:45:50,875] [ERROR] [paperless.mail.tasks] Error while processing mail account [email protected]
Traceback (most recent call last):
File "/app/paperless/src/paperless_mail/mail.py", line 180, in handle_mail_account
M.login(account.username, account.password)
File "/usr/local/lib/python3.8/dist-packages/imap_tools/mailbox.py", line 65, in login
check_command_status(login_result, MailboxLoginError)
File "/usr/local/lib/python3.8/dist-packages/imap_tools/utils.py", line 46, in check_command_status
raise exception(command_result=command_result, expected=expected)
imap_tools.errors.MailboxLoginError: Response status "OK" expected, but "NO" received. Data: [b'[AUTHENTICATIONFAILED] Invalid credentials (Failure)']
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/app/paperless/src/paperless_mail/tasks.py", line 15, in process_mail_accounts
total_new_documents += MailAccountHandler().handle_mail_account(account)
File "/app/paperless/src/paperless_mail/mail.py", line 212, in handle_mail_account
raise MailError(
paperless_mail.mail.MailError: Error while authenticating account [email protected]
mail.log:
[2022-09-02 16:45:50,598] [DEBUG] [paperless_mail] Processing mail account [email protected]
[2022-09-02 16:45:50,854] [ERROR] [paperless_mail] Error while authenticating account [email protected]: Response status "OK" expected, but "NO" received. Data: [b'[AUTHENTICATIONFAILED] Invalid credentials (Failure)']
As mentioned in the official documentation, ngx should be a "drop-in replacement for Paperless-ng and thus upgrading should be trivial for most users". Sadly it isn't for me 🙃
I am using unraid as the docker host and haven't found an option to set PAPERLESS_OCR_LANGUAGE or PAPERLESS_OCR_LANGUAGES via variables. This is the error:
The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE
OS: Unraid OS 6.9.2
CPU architecture: x86_64
How docker service was installed: out-of-the-box OS provided
/usr/local/emhttp/plugins/dynamix.docker.manager/scripts/docker create --name='paperless-ngx' --net='bridge' --privileged=true -e TZ="Europe/Berlin" -e HOST_OS="Unraid" -e 'PAPERLESS_REDIS'='redis://:[email protected]:6379' -e 'PAPERLESS_OCR_LANGUAGE'='deu' -e 'PAPERLESS_OCR_LANGUAGES'='deu' -e 'PAPERLESS_FILENAME_FORMAT'='{created}-{correspondent}-{title}' -e 'PAPERLESS_TIME_ZONE'='Europe/Berlin' -e 'PAPERLESS_IGNORE_DATES'='' -e 'PAPERLESS_CONSUMER_POLLING'='0' -e 'PAPERLESS_SECRET_KEY'='notimportant' -e 'USERMAP_UID'='99' -e 'USERMAP_GID'='100' -p '8301:8000/tcp' -v '/mnt/user/appdata/paperless-ng/data':'/usr/src/paperless/data':'rw' -v '/mnt/user/appdata/paperless-ng/media':'/usr/src/paperless/media':'rw' -v '/mnt/user/scannedDocuments/':'/usr/src/paperless/consume':'rw' -v '/mnt/user/appdata/paperless-ng/export':'/usr/src/paperless/export':'rw' 'lscr.io/linuxserver/paperless-ngx'
os: Linux-5.10.28-Unraid #1 SMP Wed Apr 7 08:23:18 PDT 2021
nodename: c1f640325733
machine: x86_64
clock source: unix
pcre jit disabled
detected number of CPU cores: 8
current working directory: /app/paperless/src
detected binary path: /usr/bin/uwsgi-core
your processes number limit is 256306
your memory page size is 4096 bytes
detected max file descriptor number: 40960
building mime-types dictionary from file /etc/mime.types...567 entry found
lock engine: pthread robust mutexes
thunder lock: disabled (you can enable it with --thunder-lock)
uwsgi socket 0 bound to TCP address :8000 fd 3
Python version: 3.8.10 (default, Nov 26 2021, 20:14:08) [GCC 9.3.0]Python main interpreter initialized at 0x55cd0c9801d0
python threads support enabled
your server socket listen backlog is limited to 100 connections
your mercy for graceful operations on workers is 60 seconds
mapped 145840 bytes (142 KB) for 1 cores
*** Operational MODE: single process ***
running "exec:/usr/bin/python3 /app/paperless/src/manage.py collectstatic --noinput" (pre app)...166 static files copied to '/app/paperless/static', 277 unmodified.
running "exec:/usr/bin/python3 /app/paperless/src/manage.py compilemessages" (pre app)...
File “/app/paperless/src/locale/km_KH/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/sl_SI/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/no_NO/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/pl_PL/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/cs_CZ/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/th_TH/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/nb_NO/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/ru_RU/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/ko_KR/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/da_DK/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/ar_EG/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/lb_LU/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/it_IT/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/ar_BH/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/en_GB/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/es_ES/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/xh_ZA/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/la_LA/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/ar_SA/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/en_US/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/nl_NL/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/he_IL/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/pt_PT/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/zh_TW/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/el_GR/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/pt_BR/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/fr_FR/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/hu_HU/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/sv_SE/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/ro_RO/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/zh_CN/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/ar_YE/LC_MESSAGES/django.po” is already compiled and up to date.
File “/app/paperless/src/locale/de_DE/LC_MESSAGES/django.po” is already compiled and up to date.
WSGI app 0 (mountpoint='') ready in 0 seconds on interpreter 0x55cd0c9801d0 pid: 355 (default app)
*** uWSGI is running in multiple interpreter mode ***
spawned uWSGI master process (pid: 355)
spawned uWSGI worker 1 (pid: 390, cores: 1)
[uwsgi-daemons] spawning "/usr/bin/python3 /app/paperless/src/manage.py document_consumer" (uid: 911 gid: 911)
[uwsgi-daemons] spawning "/usr/bin/python3 /app/paperless/src/manage.py qcluster" (uid: 911 gid: 911)
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
[uwsgi-daemons] throttling "/usr/bin/python3 /app/paperless/src/manage.py document_consumer" for 1 seconds
[uwsgi-daemons] throttling "/usr/bin/python3 /app/paperless/src/manage.py qcluster" for 1 seconds
[uwsgi-daemons] respawning "/usr/bin/python3 /app/paperless/src/manage.py document_consumer" (uid: 911 gid: 911)
[uwsgi-daemons] respawning "/usr/bin/python3 /app/paperless/src/manage.py qcluster" (uid: 911 gid: 911)
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
[uwsgi-daemons] throttling "/usr/bin/python3 /app/paperless/src/manage.py document_consumer" for 1 seconds
[uwsgi-daemons] throttling "/usr/bin/python3 /app/paperless/src/manage.py qcluster" for 1 seconds
[uwsgi-daemons] respawning "/usr/bin/python3 /app/paperless/src/manage.py document_consumer" (uid: 911 gid: 911)
[uwsgi-daemons] respawning "/usr/bin/python3 /app/paperless/src/manage.py qcluster" (uid: 911 gid: 911)
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
[uwsgi-daemons] throttling "/usr/bin/python3 /app/paperless/src/manage.py document_consumer" for 1 seconds
[uwsgi-daemons] throttling "/usr/bin/python3 /app/paperless/src/manage.py qcluster" for 1 seconds
SystemCheckError: System check identified some issues:ERRORS:
?: The selected ocr language deu is not installed. Paperless cannot OCR your documents without it. Please fix PAPERLESS_OCR_LANGUAGE.
There is a bug that requires an environment variable to be set:
PAPERLESS_URL
If you do not set this variable then you can't use a reverse proxy.
paperless-ngx/paperless-ngx#817
Settings this environment variable does not resolve the issue, possible something else has to be done additionally.
Import PDFs via Drag&Drop when running on a Rhaspberry Pi ARMv7.
Import fails without an error message.
OS: Linux / Debian
CPU architecture: ARMv7
How docker service was installed:
docker-ce
using debian system-tools.I started it via ansible. The task is:
- name: Create paperless-ng container
community.docker.docker_container:
name: paperless-ng
image: linuxserver/paperless-ngx:1.7.0
restart_policy: unless-stopped
env:
PUID: "1000"
PGID: "1000"
TZ: UTC
volumes:
- paperless-ngx-config:/config
- paperless-ngx-data:/data
container_default_behavior: compatibility
Process Process-1:9:
Traceback (most recent call last):
File "/usr/lib/python3.8/pydoc.py", line 347, in safeimport
module = __import__(path)
File "/app/paperless/src/documents/tasks.py", line 24, in <module>
from pikepdf import Pdf
File "/usr/local/lib/python3.8/dist-packages/pikepdf/__init__.py", line 55, in <module>
from .models import (
File "/usr/local/lib/python3.8/dist-packages/pikepdf/models/__init__.py", line 20, in <module>
from .metadata import PdfMetadata
File "/usr/local/lib/python3.8/dist-packages/pikepdf/models/metadata.py", line 29, in <module>
from lxml import etree
ImportError: libxslt.so.1: cannot open shared object file: No such file or directory
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/usr/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap
self.run()
File "/usr/lib/python3.8/multiprocessing/process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python3.8/dist-packages/django_q/cluster.py", line 424, in worker
f = pydoc.locate(f)
File "/usr/lib/python3.8/pydoc.py", line 1632, in locate
nextmodule = safeimport('.'.join(parts[:n+1]), forceload)
File "/usr/lib/python3.8/pydoc.py", line 362, in safeimport
raise ErrorDuringImport(path, sys.exc_info())
pydoc.ErrorDuringImport: problem in documents.tasks - ImportError: libxslt.so.1: cannot open shared object file: No such file or directory
16:23:55 [Q] ERROR reincarnated worker Process-1:9 after death
paperless-ngx / ghostscript consumes PDF and correctly rasterizes content
This particular PDF does not contain embedded fonts or CID fonts (bad practice, but happens). GS tries to find an alternative, finally attempts fallback to CID font "CIDFallBack" but is unable due to to a broken symlink:
/usr/share/ghostscript/9.50/Resource/CIDFSubst/DroidSansFallback.ttf -> ../../../../fonts/truetype/droid/DroidSansFallbackFull.ttf
gs /path/to/document.pdf
FIX: confirmed issue is resolved by staging a copy of DroidSansFallbackFull.ttf
into /usr/share/fonts/truetype/droid
Incidentally, "droid" dir does not exist in docker container, had to create manually, prior to staging the ttf file
OS: Unraid 6.10.0
CPU architecture: x86_64
How docker service was installed: Community Apps / Unraid Docker Config
Start container via Unraid dockerman from gui
I want to use https://github.com/linuxserver/docker-mods/tree/papermerge-multilangocr as described in #5
After trying to add the dockermod I got the following error during the start of the container
[mod-init] Curl/JQ was not found on this system for Docker mods installing
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.