ianzhao05 / textshot Goto Github PK

View Code? Open in Web Editor NEW

1.7K 1.7K 255.0 63 KB

Python tool for grabbing text via screenshot

License: MIT License

AutoHotkey 3.32% Python 96.68%

ocr ocr-recognition python python-3 python-script python3 screenshot script tesseract tesseract-ocr

textshot's Introduction

TextShot

Take a screenshot and copy its text content to the clipboard. Works on Windows, macOS, and most modern Linux distros.

Use

textshot -h prints the available command line options:

usage: textshot [-h] [-i INTERVAL] [langs]

Take a screenshot and copy its text content to the clipboard.

positional arguments:
langs                 languages passed to tesseract, eg. "eng+fra" (default: eng)

optional arguments:
-h, --help            show this help message and exit
-i INTERVAL, --interval INTERVAL
                        select a screen region then take textshots every INTERVAL milliseconds

Examples

Basic usage: textshot opens an overlay where a rectangle can be drawn around the text to be copied.
Alternate languages: textshot eng+fra specifies use of English as the primary language and French as the secondary language. Make sure that the appropriate data files for Tesseract are installed for other languages. A list of all supported languages can be found here.
Continuously copy text content: textshot --interval 200 draw a rectangle at a screen region then copy text from it every 200ms.

Hotkeys

It is recommended to attach a global hotkey to this tool, so you can run it without opening a console and typing in the command.

On Windows, one can accomplish this by using an AutoHotkey script; textshot.ahk contains a sample AHK script that can be used.

On Ubuntu, open the Keyboard Settings, which shows you all the Gnome shortcuts. At the bottom there is a + button to add your own shortcuts. Click it and set the command to textshot. In case you are using a virtual environment, the textshot path above should point to the environment's textshot.

The process on other operating systems can be found by searching how to run a shell command with a keyboard shortcut.

Installation

Prerequisites

Install Google's Tesseract OCR Engine, and ensure that tesseract can be reached from the command line by adding the directory to your system path.

Installation with `pip`

$ pip install textshot
$ textshot

You may wish to use a virtual environment if the dependencies conflict with others on your machine.

Installation from source

Clone this repository... git clone https://github.com/ianzhao05/textshot.git
...and cd into it: cd textshot
Run pip install . (for development, you may install with pip install -e . which will allow you to test your modifications without reinstall)
You may now run textshot

From repository

@rigred has added this to the AUR, so Arch Linux users can install the package textshot-git with their AUR helper. For example, yay -S textshot-git. This may not be up to date, so if you encounter issues, use the normal installation method above.

Troubleshooting

macOS

You may need to give permission to capture the screen. You can do so by going to System Preferences > Security & Privacy > Privacy > Screen Recording, then checking the box for Terminal/iTerm.

Linux

If the text shows up correctly in the notification, but you cannot paste it, install xclip (e.g. with sudo apt install xclip).

textshot's People

Contributors

Stargazers

Watchers

Forkers

rhurta krithikvaidya haoict leonardofreua sayedsahbeni behyeefatt techiewasp shshahab hanzcoder cosito-bonito blongcha mewiecat halfk1ng schwenkd alanbosco 0xflotus kaimunchi hadryan cy-dev-tex brunorochax tianlajiangzhaji smart-patrol trendingtechnology krzemienski ioplock-zz aidev42 dataorz thatpolishboy13 jorgeavilacartes hybridego nbswords adwinwhite xrosliang hidannyxu rainly cv-ip linsong8208 7more0 leuojn leedaga lfs119 huangshizhi 76782875 williamrjw yuhonghong95721 d-danielyang cqray1990 eujenz gaosq0604 spencertruett shivamnamdeo0101 jackhappy lthomiso binalmehta hasantahir aryansharmaa fakegit kingctan thehornydaddy renchao7060 jkruigu chagge wming404 makarbaderko myccfoo xinxi-blip ashu-cybertron xingcxb tangli-1987 t0ny1974 ashyglim kousun henuguyu sunqiang25 mayurmorin nishanthkadapakonda lunker2019 rkrishna116 caswml ycj0808 osamafrougi ashimroy88 mukeshkumar2617 fengtaijun rajivnr yanjing2407 ashkin2 shanhedian2017 stuti24m bid-tools cimszw komal7209 lijiasheng1984 rus0wes simstems 1164513233 zxstar7789 youtang1993 zhongqianli alading241

textshot's Issues

You need to install AutoHotkey, which can be found at https://www.autohotkey.com/

Consider adding pyproject.toml for package installation

Here is a pyproject.toml example

[build-system]
requires = ["hatchling"]
build-backend = "hatchling.build"

[project]
name = "textshot"
version = "0.0.1"
authors = [
  { name="Ian ianzhao05", email="[email protected]" },
]
description = "Python tool for grabbing text via screenshot"
readme = "README.md"
requires-python = ">=3.7"
classifiers = [
    "Programming Language :: Python :: 3",
    "License :: OSI Approved :: MIT License",
    "Operating System :: OS Independent",
]
dynamic = ["dependencies"]

[project.urls]
"Homepage" = "https://github.com/ianzhao05/textshot"
"Bug Tracker" = "https://github.com/ianzhao05/textshot/issues/"

[project.scripts]
textshot = "textshot.textshot:main"

[tool.setuptools.dynamic]
dependencies = {file = ["requirements.txt"]}

[tool.setuptools.packages.find]
where = ["textshot"]

It may need some tweaking and modifying the project to relocate python files in textshot and changing the way import is done by using something like from .ocr import … for example.

It would be helpful to make a working package for linux distributions.

You then build and install the package with:

python3 -m build --wheel
python -m installer dist/*.whl

Does this repo support Chinese?

Screen turns black when opening textshot

When I run textshot opposite this problem. I show in video. Sorry bad english if I mistake anywhere.

simplescreenrecorder-2022-08-31_01.18.00.mp4

Segmentation fault, (core dumped)

I tried to run textshot on my fedora Linux machine and I got a segmentation fault error:
Traceback (most recent call last): File "/home/maerqin/PycharmProjects/Screenshot_To_Text/venv/lib/python3.12/site-packages/textshot/textshot.py", line 11, in <module> from .logger import log_copied, log_ocr_failure ImportError: attempted relative import with no known parent package [1] 82030 segmentation fault (core dumped) python textshot.py -h

License

Please add a license file.

[Feature suggestion] Add an auto magnification

Sometimes the target is too small on the screen and I can't capture it accurately. Maybe it is a good idea to add a magnified image based on what is around the cursor when users are capturing the screen.

Selection not from upper left corner

When I make a selection starting not from the upper left corner, but from any other corner, instead of the text in selection it returns some long random text, which seems to be the text from all screen.

To reproduce, make a selection of some text from, for example, lower right corner to upper left corner.

2022-02-09.11.12.21.mp4

Produces a grey screen when called

Everything works except the starting window when textshot is called, which is fully grey.
OS : Manjaro Linux KDE

Not considering dpi scaling results in wrong positions of start and end points on Linux

Here is an example.
The positions captured by the program:

start:1227,695
end:1272,715

Their real positions:

start:3681,2085
end:3816,2145

Thus pyscreenshot grabs the wrong image.

My scale factor:

GDK_DPI_SCALE=0.333
GDK_SCALE=3
QT_AUTO_SCREEN_SCALE_FACTOR=0
QT_SCREEN_SCALE_FACTORS=eDP1=3;DP1=3;DP2=3;HDMI1=3;HDMI2=3;VIRTUAL1=3;

Hotkey opening failed, but can be performed in CMD

Use the textShot.ahk script that comes with it
Please guide

Error

ERROR: An error occurred when trying to process the image: (1, "Tesseract Open Source OCR Engine v3.05.00dev with Leptonica read_params_file: Can't open txt Warning in pixReadMemPng: work-around: writing to a temp file libpng warning: Application built with libpng-1.4.3 but running with 1.5.14 Error in pixReadStreamPng: png_ptr not made Error in pixReadMemPng: pix not read Error in pixReadMem: png: no pix returned Error during processing.")

ERROR: Unable to read text from image, did not copy

On recent Arch linux with i3wm window manager I often get ERROR: Unable to read text from image, did not copy.

I have :

python-pyqt5 : 5.15.8
python-pyqt5-sip: 12.11.1
python-pillow: 9.4.0
python-pytesseract: 0.3.10

Doesn't support MacBook Fullscreen

This tool could only take a shot on its current desktop. However, MacBook has a multi-desktop feature, and you can't ask this tool to take a shot on the desktop where the terminal is opened. Hope the author can support multi-desktop screenshots.

Windows defender started recognizing executable file from AutoHotkey script as trojan.

Size limitation for text, and sometimes prior OCR conversion in clipboard is not replaced with new OCR conversion

Hi
I think this is a really cool idea to make OCR simple to do and allows for correcting OCR mistakes very easily.
I am on a Windows machine and I find that I need to OCR a large text image in parts because it doesn't handle
a lot of text well. Is there a recommended maximum amount of text that should be selected for conversion?
But even doing OCR in parts, some areas appear to be captured and the "spinning wheel" indicates that
a conversion is being done. But when pasting the text that is in the clipboard to notepad++, it is the text from a prior conversion.
If there is an error in the conversion process, I can't find where it is displayed. Can you please give me some pointers
on getting around these issues?
Thanks!

doesn't work in macos

很遗憾

Windows defender started recognizing executable file from AutoHotkey script as trojan.

Anyone encountered this phenomena?

能不能集成中文的OCR识别能力呀？

好项目呀，感觉很实用，不过我看现在应该是只有英文，之前chineseocr有17M中文识别模型模型，
还有最近百度飞桨新发的，https://github.com/PaddlePaddle/PaddleOCR 只有9M的模型，效果好像还更好一些，
不知道几位大佬，最近有没有计划把中文识别能力集成进去呀？

Doesn't work in multi-monitor setup

I have two screens (let's name them Main and Side). When I open type textshot in a terminal in Side, the Main monitor starts mirroring Side monitor's content.

So, to copy text from Main, I have to open the terminal in Main. This is not a good experience

Does not pause desktop while snipping

The screen overlay does not pause the desktop while snipping. For example, videos and GIFs continue to play in the background. This is inconsistent with Windows's screenshot tools, Snipping Tool and the newer Snip & Sketch.

Raised by @rigred in #12

can't work when using multi screen

I use two monitors , this program can't work

Added your package to arch linux aur

Just a friendly heads up that I've added your package to the archlinux aur and it will keep itself updated based on the latest git commits to the github repo.
https://aur.archlinux.org/packages/textshot-git

So for arch users it's as easy as installing textshot-git with their favourite aur helper.
yay -S textshot-git

Also is there a way to make textshot pause the desktop (animations like the gif on this page)?
Currently it keeps on animating while in box select mode.

Thanks for the great tool!

Cropping depends on screen resolution

Cropping is incorrect at times. It depends on screen resolution. I have a 4k display and it wasn't performing as expected

macOS Big Sur opens new screen

I just downloaded text shot on my Mac and installed all the dependencies but have been experiencing this weird behavior where as soon as I run it it will open a new screen to the right with no open apps and would only allow me to screenshot there. Did anyone else encounter this or have a fix?

2021-02-27 19:08:54.739 Python[952:13732] ApplePersistenceIgnoreState: Existing state will not be touched. New state will be written to /var/folders/ld/wjmpqdpj1pq2j4j_svh1j8740000gn/T/org.python.python.savedState

Text not being copied to clipboard and sometimes words are translated to french language !

I have created a shortcut in my Ubuntu for textshot and whenever I use it (some times not all the time) the text is copied in French and Not being copied to clipboard at all(this is main issue), I knew it was in French because it was shown in notification :/

Failed when have multiple monitors

Once I unplugged from external monitors it worked well. The problem is raised from line 29 in textshot.py,
self.screen = QtWidgets.QApplication.screenAt(QtGui.QCursor.pos()).grabWindow(0)

My system is MacOS 10.15.3, with python 3.6.9. Thank you.

textshot on macOS Big Sur

Hi,

On macOS 11.1, invoking python textshot.py throws a Qt GUI error:

QPixmap::fromImage: QPixmap cannot be created without a QGuiApplication
QPixmap: Must construct a QGuiApplication before a QPixmap

Any suggestions? This worked fine before updating to Big Sur. Thank you!

EDIT: This has been tried with a virtual environment.

screenshot "E:\>cd github" output -->"INFO: Copied "AR" to the clipboard"

screenshot "E:>cd github" output -->"INFO: Copied "AR" to the clipboard"

the command line test info:

E:\github>cd..

E:>cd github

E:\github>cd textshot

E:\github\textshot>python textshot.py
**INFO: Copied "E:\github>cd. .
AR

E:\github>cd textshot" to the clipboard**

E:\github\textshot>python textshot.py chi_sim
**INFO: Copied "ET
E:N>cd github

E:Ngithub>cd textshot" to the clipboard**

E:\github\textshot>

Tesseract Process Timeout

Appears to not work on more than 5 words at a time, presents with error

"TextShot"
"An error occurred when trying to process the image: Tesseract process timeout"

how to get facebook information

could not work on MacOS

on Mac get

INFO: Unable to read text from image, did not copy

seems pyperclip do not work properly on MacOS.

I'm too dumb and autistic to make this thing work pls help

Hello the issue is my brain, I can't make it work pls help.
Basically, I can make textshot work via cmd but am too dumb to understand the greatness of your coding skill and btw how autokhey works.
Pls help