How to scan and OCR like a pro with open source tools


 
Thread Tools Search this Thread
Special Forums News, Links, Events and Announcements UNIX and Linux RSS News How to scan and OCR like a pro with open source tools
# 1  
Old 06-24-2008
How to scan and OCR like a pro with open source tools

Tue, 24 Jun 2008 18:00:00 GMT
With optical character recognition (OCR), you can scan the contents of a document into a single file of editable text. This article, which focuses on scanning books, describes the steps you need to take to prepare pages for optimal OCR results, and compares various free OCR tools to determine which is the best at extracting the text.


Source...
Login or Register to Ask a Question

Previous Thread | Next Thread

1 More Discussions You Might Find Interesting

1. Programming

Developing Open Source tools

Hi Gurus, I am very much interested in developing and publishing a small piece of open source code which would be the best place to start off. I am just a beginner in unix ,c. Please provide me some suggestions. Thanks. (3 Replies)
Discussion started by: ennstate
3 Replies
Login or Register to Ask a Question
GUESSNET-SCAN(8)					      System Manager's Manual						  GUESSNET-SCAN(8)

NAME
guessnet-scan - guess network configuration data by looking at network traffic SYNOPSIS
guessnet-scan [options] [ethernet_interface] DESCRIPTION
Guessnet-scan tries to deduce network configuration data by watching network traffic at a given Ethernet interface. After scanning network traffic for some time, guessnet-scan prints a configuration string suitable for inclusion in /etc/network/inter- faces. Note that guessnet-scan uses heuristics and wild guesses and that the resulting data is not guaranteed to be accurate. The program is intended to be used as a first try at getting network configuration data without bothering anyone. OPTIONS
Options follow the usual GNU conventions, --debug Print debugging messages. --help Show a brief summary of commandline options. --init-time=int Time in seconds to wait for the interface to initialize when it is not found already up at program startup. Default: 3 seconds. -t, --timeout=int Time in seconds to watch for network traffic. Default: 5 seconds. -v, --verbose Operate verbosely. --version Show the version number of the program. SCANNING REQUIREMENTS
To correctly identify all data of the local network, guessnet-scan needs to see traffic related to a host in the local network and to the local gateway, if any. To be able to identify the network gateway, guessnet-scan also needs to see some traffic directed to the external network: you can help the detection by generating some outbound IP traffic during the scan, for example by browsing a web page (without proxy) or using telnet to open a connection to some remote host. Note that if you are connected to a switch, guessnet-scan won't probably be able to work, since the switch will isolate it from the network traffic that the other machines are generating. SEE ALSO
guessnet(8), interfaces(5). AUTHOR
Guessnet-scan was written by Enrico Zini <enrico@debian.org>. 10 October 2004 GUESSNET-SCAN(8)