该插件尚未通过WordPress的最新3个主要版本进行测试。 当与较新版本的WordPress一起使用时,可能不再受到维护或支持,并且可能会存在兼容性问题。

WP Tesseract

描述

A plugin for extracting text from attached images using OCR via Tesseract.
This plugin adds a new post named for each image upload containing any recognized text characters within the file.
This text can then be edited for accuracy and used elsewhere on the site.

The OCR plugin requires a supported version of PHP with the GD extension and the following command line utility:
* Tesseract for the actual OCR
This utility must be manually installed on your server and executable by PHP.
This process, and consequently this plugin, is recommended only for advanced users.

屏幕截图

安装

  1. Install Tesseract OCR on your server (Tesseract wiki)
  2. Search and add the plugin from WordPress, or upload a copy of the source to your /wp-content/plugins/ directory
  3. Activate the plugin through the Plugins menu in WordPress
  4. Configure the plugin through the Plugins > OCR link in the sidebar menu in WordPress

常见问题

What is Tesseract OCR and where do I get it?

Tesseract OCR is an open source optical character recognition library that
the WordPress OCR plugin uses to extract text from images.
The library as well as installation instructions can be found at
https://github.com/tesseract-ocr/tesseract/wiki/.

How do I know if / where I have Tesseract installed on my server?

Linux:

  1. SSH into your server and type which tesseract.
  2. If Tesseract is installed and in your shell environment PATH the terminal should return a path similar to /opt/local/bin/tesseract.
  3. Place this path in the configuration of the OCR plugin through the Settings > Tesseract link in the sidebar menu in WordPress

Where is the detected text stored?

The text detected by the OCR plugin is added as a new post, named after the image file.

What is the ‘Resize percentage’ configuration option?

The OCR plugin is tailored to detect text in images with ~12pt text at 72dpi.
GD is used to upscale the temporary TIFF images fed to Tesseract as Tesseract is generally more accurate with
larger type, even if it’s been upscaled from a smaller source. If you wish to disable this option simply set this
configuration option to 100% and no resizing will occur.

What if I just want to use the plugin but not install anything?

Hosting options are available. See https://tattersoftware.com
for contact info.

How about that great banner photo?

The plugin’s banner photo is by Ekrulila from Pexels.

评价

此插件暂无评价。

贡献者及开发者

「WP Tesseract」是开源软件。 以下人员对此插件做出了贡献。

贡献者

帮助将「WP Tesseract」翻译成简体中文。

对开发感兴趣吗?

您可以浏览代码,查看SVN仓库,或通过RSS订阅开发日志

更新日志

0.1.0

Initial Release.

1.0.0

Complete rewrite: updates for PHP 7, ImageMagick replaced by GD, added language support

1.0.1

Actions for automated publication, License updates, Name bugfix

1.0.2

Clean up docs, remove legacy references

1.0.3

Add uploaded file as featured image, bump PHP requirement for EOL

1.0.4

Backend tweaks for WordPress 5.4