Finding simple Java class for implementation of text recognition

C

cumlibrisnonsolus

Hello you all.

I am in a hobby project and would need a easy-to-use Java class or
component I can use more or less out of the box in my project for
simple text recognition. I want to compare one or more pieces of text
with one or more pieces of text to get some sort of indication if (1)
the information is auto-generated and (2) the writer is the same
person.

Do not look for a high probability, have from a small to medium amount
of text material to work with, just need something that can raise a
red flag and merit human attention in some cases.

Any hints on where I should go looking?

As a small side-question, I'd also appriciate info on similar open-
source class/component to scramble (or rather perform simple
encryption).

/Thanks
 
A

Abhijat Vatsyayan

Hello you all.

I am in a hobby project and would need a easy-to-use Java class or
component I can use more or less out of the box in my project for
simple text recognition. I want to compare one or more pieces of text
with one or more pieces of text to get some sort of indication if (1)
the information is auto-generated and (2) the writer is the same
person.

Do not look for a high probability, have from a small to medium amount
of text material to work with, just need something that can raise a
red flag and merit human attention in some cases.

Any hints on where I should go looking?

As a small side-question, I'd also appriciate info on similar open-
source class/component to scramble (or rather perform simple
encryption).

/Thanks
I have tried using tesseract-ocr , a project hosted at code.google . I
installed it on a linux box and called it using runtime.exec from Java
but the results of tesseract-ocr were pretty bad. I suspect it was me; I
probably needed to look more into configuration (training and testing).
I would suggest looking into it if you do not mind invoking it through
exec.
 
A

Abhijat Vatsyayan

Hello you all.

I am in a hobby project and would need a easy-to-use Java class or
component I can use more or less out of the box in my project for
simple text recognition. I want to compare one or more pieces of text
with one or more pieces of text to get some sort of indication if (1)
the information is auto-generated and (2) the writer is the same
person.

Do not look for a high probability, have from a small to medium amount
of text material to work with, just need something that can raise a
red flag and merit human attention in some cases.

Any hints on where I should go looking?

As a small side-question, I'd also appriciate info on similar open-
source class/component to scramble (or rather perform simple
encryption).

/Thanks
What do you mean by auto-generated? Do you mean machine generated as
opposed to hand-written? I am also confused about what you mean by
"writer is the same person". Are you trying to do handwriting
recognition to figure out who the writer is (or figure out all text
written by the same person)?

Abhijat
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,999
Messages
2,570,243
Members
46,836
Latest member
login dogas

Latest Threads

Top