Corn Holio
I'm currently writing a POP3 proxy in Java to act as a spam filter.
Does anyone know of any good Java text classification tools? I want
to start with basic spam filtering.
I tried using Classifier4J's BayesianClassifier. I tested it
by exporting about 6300 emails from Outlook and using the subjects
and bodies of these messages to "teach" the classifier what spam looks
like.
The problem is that Classifier4J now thinks EVERY email that comes
through is spam and filters it into a separate spam inbox.
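To make the setup concrete, here is a minimal sketch of word-count Bayesian scoring in plain Java. It is not Classifier4J's API: the class TinyBayes, its methods, and the sample messages are all made up for illustration, and the scoring is a simplified per-word ratio rather than whatever Classifier4J does internally. Under those assumptions it shows one plausible reason for the behaviour above: if the classifier has only ever been taught spam, every known word looks "spammy", so ordinary mail scores high too.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration class; this is NOT Classifier4J's API.
class TinyBayes {
    private final Map<String, Integer> spamCounts = new HashMap<>();
    private final Map<String, Integer> hamCounts = new HashMap<>();

    void teachSpam(String text) { count(spamCounts, text); }
    void teachHam(String text) { count(hamCounts, text); }

    // Splits on anything that is not a letter or digit and counts each word.
    private static void count(Map<String, Integer> counts, String text) {
        for (String word : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!word.isEmpty()) {
                counts.merge(word, 1, Integer::sum);
            }
        }
    }

    // Per-word spam probability: spam / (spam + ham), clamped to [0.01, 0.99].
    // Words never seen in training are treated as neutral (0.5).
    private double wordProbability(String word) {
        int s = spamCounts.getOrDefault(word, 0);
        int h = hamCounts.getOrDefault(word, 0);
        if (s + h == 0) {
            return 0.5;
        }
        double p = (double) s / (s + h);
        return Math.max(0.01, Math.min(0.99, p));
    }

    // Combines the per-word probabilities into one score between 0 and 1.
    // Sums logs instead of multiplying so long messages do not underflow.
    double spamScore(String text) {
        double logSpam = 0.0;
        double logHam = 0.0;
        for (String word : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (word.isEmpty()) {
                continue;
            }
            double p = wordProbability(word);
            logSpam += Math.log(p);
            logHam += Math.log(1.0 - p);
        }
        return 1.0 / (1.0 + Math.exp(logHam - logSpam));
    }

    public static void main(String[] args) {
        String ordinaryMail = "lunch meeting moved, see you at noon";

        // Taught with spam only, as in the setup described above.
        TinyBayes spamOnly = new TinyBayes();
        spamOnly.teachSpam("buy cheap pills now for you");
        spamOnly.teachSpam("win money now, click here for your prize");
        System.out.println("spam-only training: " + spamOnly.spamScore(ordinaryMail));

        // Taught with both spam and non-spam examples.
        TinyBayes both = new TinyBayes();
        both.teachSpam("buy cheap pills now for you");
        both.teachSpam("win money now, click here for your prize");
        both.teachHam("see you at the lunch meeting");
        both.teachHam("meeting notes for the project are attached");
        System.out.println("spam + ham training: " + both.spamScore(ordinaryMail));
    }
}
```

In this toy version the ordinary message scores high after spam-only training because the single shared word "you" has only ever been seen in spam, and it scores low once some non-spam examples are taught as well. Whether the same thing explains the Classifier4J result would need checking against how that library is actually trained.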
Anyone have any suggestions? I'm looking for a Java API
for accomplishing this.
Thanks