如何解决在tesseract上使用图像串的问题
大家好,我在使用tesseract的python中编写了简单的代码,但我认为这是与版本或类似问题相关的问题,请看一下代码:
from PIL import Image
import pytesseract
file = '/home/gxs/Downloads/a.png'
img = Image.open(file)
text = pytesseract.image_to_string(Image.open(file))
为此,我有以下输出(错误):
TesseractError Traceback (most recent call last)
<ipython-input-1-65b8cbea5fe0> in <module>
4 img = Image.open(file)
5 #display(img)
----> 6 text = pytesseract.image_to_string(Image.open(file))
~/.local/lib/python3.8/site-packages/pytesseract/pytesseract.py in image_to_string(image,lang,config,nice,output_type,timeout)
368 args = [image,'txt',timeout]
369
--> 370 return {
371 Output.BYTES: lambda: run_and_get_output(*(args + [True])),372 Output.DICT: lambda: {'text': run_and_get_output(*args)},~/.local/lib/python3.8/site-packages/pytesseract/pytesseract.py in <lambda>()
371 Output.BYTES: lambda: run_and_get_output(*(args + [True])),--> 373 Output.STRING: lambda: run_and_get_output(*args),374 }[output_type]()
375
~/.local/lib/python3.8/site-packages/pytesseract/pytesseract.py in run_and_get_output(image,extension,timeout,return_bytes)
280 }
281
--> 282 run_tesseract(**kwargs)
283 filename = kwargs['output_filename_base'] + extsep + extension
284 with open(filename,'rb') as output_file:
~/.local/lib/python3.8/site-packages/pytesseract/pytesseract.py in run_tesseract(input_filename,output_filename_base,timeout)
256 with timeout_manager(proc,timeout) as error_string:
257 if proc.returncode:
--> 258 raise TesseractError(proc.returncode,get_errors(error_string))
259
260
TesseractError: (-11,'Tesseract Open Source OCR Engine v3.03 with Leptonica actual_tessdata_num_entries_ <= TESSDATA_NUM_ENTRIES:Error:Assert failed:in file tessdatamanager.cpp,line 53')
PS:我的发行版是arch linux,我尝试卸载所有依赖项,但根本无法工作。
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。