automatically identify patterns. (RegEX)

I

Idan.Yael

hi all,
i would like to know if there a software on the net i can download
which can identify text patterns between text files...
meanning:

if i have 10 files and i want to compare those files and find their
tamplate translated to a regular expression string.

for eample,
lets say i got a news site with holds an index of all the news
published.
i want to take 5-10 pages and declare the sections that i know supposed
to be diffrent between the files.
the software will then run on the files and give me the regular
expression to identify the pages i got and get the "changing text" in a
group ( "(<text>)" ).
so that way i can just scan the site and know about all the urls that
match this regular expression tamplate.

i know this is alot to ask for but i though i might give it a try and
ask before i start coding myself :)

thanks in advance for any answer,
Idan.
 
M

Matt Garrish

hi all,
i would like to know if there a software on the net i can download
which can identify text patterns between text files...

Don't do software requests. I think every decent text editor has some kind
of file compare functionality, though. Or you can always just get diff.
if i have 10 files and i want to compare those files and find their
tamplate translated to a regular expression string.

Even assuming you mean a template, what does "translated to a regular
expression string" mean to you? Clickety-click regex trick? I'd like
something to write my regexes for me, too. In fact, it would be nice if it
did all my coding so I could nap some more during the day... : )
for eample,
lets say i got a news site with holds an index of all the news
published.
i want to take 5-10 pages and declare the sections that i know supposed
to be diffrent between the files.
the software will then run on the files and give me the regular
expression to identify the pages i got and get the "changing text" in a
group ( "(<text>)" ).
so that way i can just scan the site and know about all the urls that
match this regular expression tamplate.

If they all have the same template, then why not just use diff? It's
certainly not a hard task to run the command, read the output, parse out the
lines you need and then extract them from a file (finding the start and end
delimiters for the changed text will be the trick).

Matt
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,184
Messages
2,570,978
Members
47,561
Latest member
gjsign

Latest Threads

Top