Linked by Thom Holwerda on Thu 31st Aug 2006 18:07 UTC, submitted by diegocg
Google Google has announced the release of the source of an old OCR software called Tesseract in source. "In a nutshell, we are all about making information available to users, and when this information is in a paper document, OCR is the process by which we can convert the pages of this document into text that can then be used for indexing."
Order by: Score:
Good job, Google.
by mike hess (2.2) on Thu 31st Aug 2006 19:39 UTC
mike hess
Member since:
2005-08-22
Fans: 0

On the surface, this seems like a very nice contribution, that could be useful in lots of other applications.

But Sourceforge doesn't have anything listed under "License", so hopefully, it'll get sorted out.

License
by KugelKurt (2.68) on Fri 1st Sep 2006 20:11 UTC in reply to "Good job, Google."
KugelKurt Member since:
2005-07-06
Fans: 0

It's licensed under the Apache License 2.0. See http://tesseract-ocr.cvs.sourceforge.net/tesseract-ocr/tesseract/RE...

Smart move
by Ronald Vos (1.64) on Thu 31st Aug 2006 19:45 UTC
Ronald Vos
Member since:
2005-07-06
Fans: 0

Google has stated before they want to organise the world's information, and now they gain mindshare with people who would add substantive globs of information to the internet.

Interesting...
by JCooper (3.44) on Thu 31st Aug 2006 19:55 UTC
JCooper
Member since:
2005-07-06
Fans: 1

...I haven't had a chance to look at the potential of the code here, but could this be leveraged to provide another string to the bow of desktop search? OCR of images (png, jpg, gif etc) by Beagle would be fantastic - picking out signs and all sorts of text would be a sweet feature! ;)

users helping google
by Adurbe (2.76) on Thu 31st Aug 2006 20:27 UTC
Adurbe
Member since:
2005-07-06
Fans: 0

get people to ocr their own works, then google can help u share it with he world :-)

Nice one Google
by twenex (2.56) on Thu 31st Aug 2006 21:14 UTC
twenex
Member since:
2006-04-21
Fans: 14

Nice to know there's a big company that knows what FOSS is all about.

RE: Nice one Google
by NotParker (-2) on Thu 31st Aug 2006 22:01 UTC in reply to "Nice one Google"
NotParker Member since:
2006-06-01
Fans: 4

The Chinese like them too.

RE: Nice one Google
by Soulbender (3.48) on Fri 1st Sep 2006 06:51 UTC in reply to "Nice one Google"
Soulbender Member since:
2005-08-18
Fans: 15

"Nice to know there's a big company that knows what FOSS is all about."

Really? I thought this article was about Google?

First FOSS OCR?
by CaptainPinko (3.36) on Thu 31st Aug 2006 22:09 UTC
CaptainPinko
Member since:
2005-07-21
Fans: 0

I'm not aware of any other.

RE: First FOSS OCR?
by Anonymous Coward (1.52) on Thu 31st Aug 2006 23:25 UTC in reply to "First FOSS OCR?"
Anonymous Coward Member since:
2005-07-06
Fans: 1

I dunno...

Synaptic reveals: Clara, gocr, and ocrad

but I have all of the repositories enabled... so I'm not sure how free they are...

A couple of things ...
by kadymae (1.68) on Fri 1st Sep 2006 21:12 UTC
kadymae
Member since:
2005-08-02
Fans: 6

"The University of Nevada in Las Vegas".

::headdesk::

It's The University of Nevada, Las Vegas.

---

Apache 2.0 license ... interesting. It's nearly as flexible as the BSD license in terms of what it permits.

---

And in the meantime, I'm interested in seeing who grabs the technology and runs with it and what interesting projects it spawns.

Edited 2006-09-01 21:16