OCR Image Reader Simple powerful OCR without server iteration
Support Development
PayPal ● 
Bitcoin Address: 1sM2BrTH8BRgt3quiASK8TmYSafutNvDo
 ● 
Ether Address: 0xCf9eaAc56992e12EB61fD46342172d4EEff5C8e4

Advertisement
Screenshot
"OCR Image Reader" extension aims to ease the optical character recognition process in your browser. After installation, the extension adds a new button to the toolbar area of your browser. When this button is pressed, the current window goes to the area selection mode. You can skip this mode by pressing the Escape key. This tool is used to select an area on the current page. This area then is sent to the OCR engine of the extension and all the text content of it will be extracted. The process of text extraction will be displayed in a popup like a floating window on each page. If you have multiple jobs, you will get multiple floating windows. The OCR engine of this extension is Tesseract.js which supports more than 100 languages and is written purely in JavaScript language. Note that on the first usage, the extension fetches the proper language database from the server, though on future use, since your browser has already cached this resource, the OCR process takes shorter. The progress of both the fetching and the OCR extraction is displayed in the popup window.

Features

  1. What is the "OCR - Image Reader" add-on and how can I use it?

    This extension is about having a simple yet powerful in-page OCR application on hand without installing a native one. The extension is pretty simple to use. Whenever you need to extract the content of an image or a text that cannot be selected, simply press the toolbar button to switch to the area selection mode. Now hold your mouse down and select the area of interest, then release the mouse pointer. At this point, the extension captures the screen area and send this image to the OCR engine. This process happens inside the page in a frame element. You can see the progress of the entire process in a popup window. The first step is to fetch the language training database from the server, then extract the text content out of the image area. Both of these processes have a progress bar. The fetch step might take some time on the first run, but should be fast for all the subsequent calls as your browser should have already cached the resource once.

  2. What's new in this version?

    Please check the Logs section.

  3. Does this extension uses an online service for dong the text recognition?

    No the process of extracting the text content out of the image all happens locally. However, note that this extension gets the training data from a remote server since the database is about 30Mbytes and cannot be packed with the extension itself. This extension does not interact with any remote services at all except the database fetching part.

  4. Can I send a very large image to the OCR engine?

    Theoretically, you can send large images too, but it is going to take a long time for the extension to process the image. It might even require too much CPU resources to be able to extract the content. It is recommended to use the area selection tool properly to only select the required area instead of having a large image which large empty area around.

  5. What is the OCR engine of this extension?

    This extension uses the powerful Tesseract.js with online language training resources to have the latest database

Matched Content

Reviews

Please keep reviews clean, avoid the use of improper language and do not post any personal information.
  • <a> Defines an anchor.

    Example: <a href="http://add0n.com">a sample link</a>

  • <pre><code> Syntax Highlighting (Supported languages: Bash, JSON, HTML, JavaScript, and CSS).

    Example: <pre><code class="javascript">var foo = 'bar';</code></pre>

  • <strong> Defines bold text
  • <blockquote> Defines a long quotation
  • <caption> Defines a table caption
  • <cite> Defines a citation
  • <em> Defines italic text
  • <p> Defines a paragraph
  • <span> Defines a section in a document
  • <s> Defines strikethrough text
  • <strike> Defines strikethrough text
  • <u> Defines underlined text
  • <br> Defines a single line break; can be used alone and don't need an ending tag

What's new in this version

Version--
Published--/--/--
Change Logs:
    Last 10 commits on GitHub
    Hover over a node to see more details

    Need help?

    If you have questions about the extension, or ideas on how to improve it, please post them on the  support site. Don't forget to search through the bug reports first as most likely your question/bug report has already been reported or there is a workaround posted for it.

    Open IssuesIssuesForks

    Permissions are explained

    PermissionDescription
    storageto keep the internal preferences
    activeTabto inject area select script into the active page after a user action
    notificationsto display possible warnings during the OCR process

    Recent Blog Posts on add0n.com