Convert PDF & Image Text to Notepad Format

Text in PDF or photos is by default just for user eyes to read and not for editing or indexing. However, this situation is changed and now it is possible to convert text in PDF file to plain content using free tools. This tutorial explains how to convert words written on PDF document and image to plain text. I use free OCR tools to perform this conversion. Full form of OCR is Optical Character Recognition and this technology is used to identify image text and convert it to machine encoded text. There are many OCR services available and some handpicked services are introduced in this guide.


Convert Image to Text

Here I take the screenshot of "about me" page of CoreNetworkZ.com and try to take words in the screenshot to real words automatically. I have saved the screenshot in different formats like JPEG, GIF,bmp, TIFF and PNG. The sample document is uploaded to Free OCR services.
CoreNetworkZ

I have the same document saved in different formats and each one is shown exactly as after the conversion.

  1. Result : Convert bmp image to plain text

    Check the result and I must say the result would be better if I used a picture with higher quality.
    bmp

    Though this tool failed to convert with 100 percent accuracy, it has satisfactory output.

  2. Convert JPEG to real words

    This time I have uploaded a JPEG file to this free OCR service. Look at the output.
    jpeg


  3. Converting GIF to Plain Content
    For some unknown reasons I have failed to get a converted file. I tried 3 times with this gif version but it didn't work.
    changed


  4. Convert PNG File

    After running PNG file on this free OCR tool, I received following output.
    PDF to Notepad


  5. Convert TIF file


    This time, I have run the file in TIF format using this service. See the notepad version of the output.


The tool I have used for above experiment is: http://www.free-ocr.com/


Convert PDF Text to Notepad Content

Now let us check how to copy the sentences displayed in a PDF file to notepad. Here I am using  http://www.ocrconvert.com/ to perform this task. Just like the previous tool, this one too a free online service. Steps to extract content from PDF version to notepad are given below.
  1. Visit http://www.ocrconvert.com/

  2. Click on browse button to upload PDF file
    Changing

  3. Conversion starts once we click Process button
    test

though these tools are helpful, some data entry job centers prefer not to use them because of the high percentage of error while copying letters from PDF files. So until a high accuracy program is developed, they need work force to complete data entry works.

Recent Topics
  1. Tor Proxy Review

  2. Hide specific computer Hard Disk Drive

  3. How to Turn off IE default browser setting message

Technology Blog

3 comments:

nithinvg said...

its great pretty useful i guess

thanks for sharing...

Admin said...

So far I’ve found OCRconvert.com the best online optical character recognition service, they have no limit on the number of files that you can convert also their conversion has so far been very accurate for me, may be its because I only convert files in English language. But that works well.

Siju George said...

Thanks Admin, for your valuable addition. I read your blog and it is full of posts regarding text conversions using OCR services.