2018-02-08
1.1
Takasi Moriya
All
.NET
Components, Libraries, Applications
[Overview]
Simple OCR includes extension and sample module for simple OCR function. This application requires .NET stack. The OCR function is powered by Tesseract .net NuGet library. https://www.nuget.org/packages/tesseract.net/
[Detail]
Simple OCR extension provides simple OCR function by using Tesseract 4.0.0 via Tesseract.net 4.0.0.6 NuGet Library.
The extension has an action:
ExtractTextFromImage
Input:
ImagePath/Text: Path to a image file. e.g. "C:\Users\VO80825\Desktop\sample.png"
Language/Text: Language specification for Tesseract 4.0.x. e.g. "eng"
DataPath/Text: Path to a directory contains language trained data. e.g. "C:\Users\VO80825\Desktop\tessdata"
Output:
Output/Text: Extracted text.
You can download other language trained data from https://github.com/tesseract-ocr/tesseract/wiki/Data-Files .