Java Design For A News Filter

Ben Jessel · Jul 26, 2004

I am doing the technical design for a news syndication system that:

1) Reads news feeds ( xml-rss ) from user defined sources.
2) Filters out the news feeds based on applying user defined search
expressions in the subject, and body xml portions.
3) Stores this in a database so that people can view the filtered
news.

I've had a look at the options:

1) Write the whole thing from scratch; devise an algorithm for text
searching. This would have to deal with logic ( i.e "must match Java
AND Programmer but not Coffee" OR "must match java AND UML" ) and
possible regular expressions ( can be dropped out of scope ).

Advantages
Totally meets requirements.

Disadvantage
Complex coding.
Time intensive

2) Use XPath - this would involve stylesheets to be created
on-the-fly, which has the appropriate logic. Some translation between
XPath's search and what the user enters may be required.

Advantages
Less Flexible

Disadvantages
May not be flexible enough ( could you do "must match Java AND
Programmer but not Coffee" OR "must match java AND UML" in XPath ).

3) Save the whole lot to the database and use database Full Text
Retrieval.

Advantages
Simple And Easy

Disadvantages
May be slow.
But of a hacky workaround.
Databases are not Search engines!

I'd really appreciate some comments as going down the wrong route
could be a world of pain!

Thanks,

Ben

Seeking co-founders for my company.	3	Sep 8, 2024
Java API news letter	0	Nov 11, 2003
Java News	0	Jul 10, 2003
object oriented design question in context of Java program	2	Jun 21, 2012
Seeking a Java / UI Developer for a contract opportunity in Albany, NY	0	Jun 29, 2012
[For Beginners)Difference between Java and JavaScript	3	Nov 7, 2013
Python-URL! - weekly Python news and links (Mar 31)	4	Mar 31, 2012
design pattern for a file converter...	14	Dec 9, 2010

Java Design For A News Filter

Ben Jessel

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads