2016-12-29 58 views
0

我在Windows上使用10 x64 pytesseract错误打开数据文件,蟒蛇是3.5.2 x64,正方体是4.0,代码如下:pytesseract在Windows 10:

# -*- coding: utf-8 -*- 

try: 
    import Image 
except ImportError: 
    from PIL import Image 
import pytesseract 


print(pytesseract.image_to_string(Image.open('d:/testimages/name.gif'), lang='chi_sim')) 

错误:

Traceback (most recent call last): 
    File "D:/test.py", line 10, in <module> 
    print(pytesseract.image_to_string(Image.open('d:/testimages/name.gif'), lang='chi_sim')) 
    File "C:\Users\dell\AppData\Local\Programs\Python\Python35\lib\site-packages\pytesseract\pytesseract.py", line 165, in image_to_string 
    raise TesseractError(status, errors) 
pytesseract.pytesseract.TesseractError: (1, 'Error opening data file \\Program Files (x86)\\Tesseract-OCR\\tessdata/chi_sim.traineddata') 

C:\Program Files (x86)\Tesseract-OCR\tessdata,像这样:

enter image description here

为什么?

+0

是否对其他语言的工作? – sayan

+0

相关:http://stackoverflow.com/questions/24672531/annoying-python-tesseract-error-error-opening-data-file-tessdata-eng-trainedda – sayan

回答

0

设置TESSDATA_PREFIX环境变量C:\Program Files (x86)\Tesseract-OCR\

0

如果你有一个像tessdata错误:“错误打开数据文件...”

tessdata_dir_config = '--tessdata-dir "<replace_with_your_tessdata_dir_path>"' 
# Example config: '--tessdata-dir "C:\\Program Files (x86)\\Tesseract-OCR\\tessdata"' 
# It's important to add double quotes around the dir path. 

pytesseract.image_to_string(image, lang='chi_sim', config=tessdata_dir_config) 
+0

tessdata_dir_config ='--tessdata-dir“C:\\ Tesseract-OCR \\ tessdata“' – dubinglin

+0

你是否知道你可以编辑你的答案? – Yunnosch