Yet another RE question

Bogdan Marinescu · Jan 10, 2004

Hello all,

First, I want to apologize if this has already been discussed; I can't find an answer anywhere right now. I'm writing a simple compiler for a small language using Spark (http://pages.cpsc.ucalgary.ca/~aycock/spark/). I just found out that regular expressions in Python follow the Perl semantics (first-then-longest) instead of the POSIX semantics (longest match). This is quite annoying for me; while some solutions to this problem exist and are shown in the Spark documentation, I have some background with lex/yacc and I would really like to use the "lex" semantics (longest match). Is there a package for Python that implements this behaviour?
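To make the difference concrete, here is a small sketch using only the standard re module. The pattern "a|ab" and the input "ab" are illustrative choices, not taken from any real grammar.

```python
import re

# Perl-style alternation: alternatives are tried left to right and the
# first one that matches wins, even if a later one would match more text.
first_wins = re.match(r"a|ab", "ab").group()

# A POSIX-style (longest match) engine would return "ab" for the same
# pattern and input; Python's re returns the shorter "a".
print(first_wins)
```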
Thank you,

Bogdan

Christos TZOTZIOY Georgiou · Jan 10, 2004

[snip: this is about a simple compiler for a small language, and Python
follows Perl re semantics instead of POSIX: a|ab always matches 'a' even
if 'ab' would match in the search string]

This is quite annoying for me; while some solutions to this problem exist and are shown in the Spark documentation, I have some background with lex/yacc and I would really like to use the "lex" semantics (longest match). Is there a package for Python that implements this behaviour?

AFAIK no, there is no such package. However, you can do the following
things:

- reorder alternations (sp?) to be longest first

Substitute "ab|a" for "a|ab"

- the (?!...) operator might help

The re "if(?![a-z_0-9])" would match the 'if' and would ignore all
identifiers starting with 'if'.
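Both suggestions can be sketched with the standard re module. This is only an illustration: the sample inputs and the helper name longest_first_pattern are hypothetical, and the helper only handles literal alternatives, not arbitrary sub-patterns.

```python
import re

# 1. Reorder the alternation so the longest alternative comes first.
shortest_first = re.match(r"a|ab", "ab").group()  # first alternative wins
longest_first = re.match(r"ab|a", "ab").group()   # longest one is now first


def longest_first_pattern(literals):
    """Hypothetical helper: join literal strings, longest first."""
    ordered = sorted(literals, key=len, reverse=True)
    return "|".join(re.escape(s) for s in ordered)


# 2. Negative lookahead: accept the keyword 'if' only when it is not
#    followed by another identifier character.
keyword_if = re.compile(r"if(?![a-z_0-9])")

matches_keyword = keyword_if.match("if x") is not None     # keyword
matches_identifier = keyword_if.match("iffy") is not None  # identifier
```

Note that the character class [a-z_0-9] does not cover uppercase letters, so with this exact pattern an input such as "ifX" would still be treated as the keyword; widening the class (for example to [A-Za-z_0-9]) is one possible adjustment if the language allows such identifiers.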

HTH.

