快捷導(dǎo)航

Python利用wxPython實現(xiàn)ocr識別圖片漢字程序

更新時間：2023年08月07日 10:25:53 作者：winfredzhang

在這篇博客中，我們將介紹一個如何使用wxPython構(gòu)建的簡單OCR識別圖片漢字應(yīng)用程序，文章的示例代碼講解詳細(xì)，感興趣的小伙伴可以學(xué)習(xí)一下

當(dāng)你需要構(gòu)建一個簡單的圖形用戶界面（GUI）應(yīng)用程序，并在其中實現(xiàn)光學(xué)字符識別（OCR）功能時，wxPython是一個強大而靈活的選擇。wxPython是一個基于Python的跨平臺GUI開發(fā)框架，結(jié)合了wxWidgets C++庫和Python語言的優(yōu)勢。結(jié)合pytesseract和OpenCV等庫，你可以輕松地創(chuàng)建一個具有OCR功能的應(yīng)用程序。

在這篇博客中，我們將介紹一個使用wxPython構(gòu)建的簡單OCR應(yīng)用程序示例。我們將使用wxPython創(chuàng)建一個框架，并在其中添加一個選擇圖像的按鈕和一個用于顯示識別文本的文本控件。當(dāng)用戶選擇圖像后，我們將使用pytesseract和OpenCV對圖像進行處理和OCR，并將識別到的文本顯示在應(yīng)用程序中。

下載tesseract安裝盤：

https://digi.bib.uni-mannheim.de/tesseract/tesseract-ocr-w64-setup-5.3.1.20230401.exe

安裝tesseract程序。

設(shè)置環(huán)境變量。

安裝pytesseract模塊：

pip install pytesseract

測試：

tesseract C:\myimages\1.png result -l chi_sim

源代碼：

import wx
import pytesseract
import cv2
class MyFrame(wx.Frame):
    def __init__(self, parent, title):
        super(MyFrame, self).__init__(parent, title=title, size=(400, 300))
        panel = wx.Panel(self)
        self.text_ctrl = wx.TextCtrl(panel, style=wx.TE_MULTILINE)
        self.button = wx.Button(panel, label="Select Image")
        self.button.Bind(wx.EVT_BUTTON, self.on_select_image)
        sizer = wx.BoxSizer(wx.VERTICAL)
        sizer.Add(self.text_ctrl, proportion=1, flag=wx.EXPAND | wx.ALL, border=10)
        sizer.Add(self.button, flag=wx.ALIGN_CENTER | wx.ALL, border=10)
        panel.SetSizer(sizer)
    def on_select_image(self, event):
        wildcard = "JPEG files (*.jpg)|*.jpg|PNG files (*.png)|*.png"
        dialog = wx.FileDialog(self, "Select Image", wildcard=wildcard, style=wx.FD_OPEN | wx.FD_FILE_MUST_EXIST)
        if dialog.ShowModal() == wx.ID_CANCEL:
            return
        image_path = dialog.GetPath()
        dialog.Destroy()
        # Perform OCR on the selected image
        text = self.perform_ocr(image_path)
        # Display the recognized text in the text control
        self.text_ctrl.SetValue(text)
    def perform_ocr(self, image_path):
        # Load the image using OpenCV
        image = cv2.imread(image_path)
        # Preprocess the image (you may need to modify this based on your requirements)
        gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
        gray = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)[1]
        # Perform OCR using pytesseract
        text = pytesseract.image_to_string(gray)
        return text
if __name__ == "__main__":
    app = wx.App()
    frame = MyFrame(None, "OCR with wxPython")
    frame.Show()
    app.MainLoop()

代碼說明：

結(jié)果如下：