DRY fanatics?

Giles Bowkett · Oct 22, 2006

Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

a literal regex with a subpattern repeated three times

I could probably split on the ', but it seems that might have unwanted
side effects.

Joel VanderWerf · Oct 22, 2006

Giles said:
Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

a literal regex with a subpattern repeated three times

I could probably split on the ', but it seems that might have unwanted
side effects.

This doesn't help much, unless the 3 comes from a variable, but ...

case "a, 'b', 'c', 'd'"
when /^([A-Za-z0-9,]+)((?:, '[^']+'){3,3})/
p $1, $2.scan(/'([^']+)'/)
# when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/
# p $1, $2, $3, $4
end

Eero Saynatkari · Oct 22, 2006

--83Y2sXKo4f/n2njO
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

Anybody know a way to make this DRYer?
=20
when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

\1, \2 etc. may be used to refer to previous numbered groups.

I wish the same could be applied to the egregious overuse of
the term DRY by the Rails club.

--83Y2sXKo4f/n2njO
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (FreeBSD)

iD4DBQFFOtAp7Nh7RM4TrhIRAmNgAKCELP6PUPu+2Tt2ET2zrD6/YE3ttwCY+n9w
EsNCgmAV7brkrf99u2vdnA==
=X9tB
-----END PGP SIGNATURE-----

--83Y2sXKo4f/n2njO--

Ken Bloom · Oct 22, 2006

Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

a literal regex with a subpattern repeated three times

I could probably split on the ', but it seems that might have unwanted
side effects.

That's fine. I see no reason to make it more obfuscated. A couple tips
though:

* Use .+? instead of [^']+
.+? does a non-greedy match, which is what you're really trying to say
with the [^']+

* If you want to match the *same* text three times, for example
"a, '1', '1', '1'" but not "a, '1', '2', '3'", then you should use a
backsubstitution in the match, using \2 twice, instead of the second
two groups, so the pattern becomes
/^([A-Za-z0-9,]+), '([^']+)', '\2', '\2'/

--Ken

James Britt · Oct 22, 2006

Eero said:
Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

Click to expand...

\1, \2 etc. may be used to refer to previous numbered groups.

I wish the same could be applied to the egregious overuse of
the term DRY by the Rails club.

Indeed. Irony at its best.

--
James Britt

"Trying to port the desktop metaphor to the Web is like working
on how to fuel your car with hay because that is what horses eat."
- Dare Obasanjo

Robert Klemme · Oct 22, 2006

Eero said:
Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

Click to expand...

\1, \2 etc. may be used to refer to previous numbered groups.

Which does not help in this case because that makes it match the same
stuff again:

irb(main):031:0> %w{aaa abc bcd}.map {|s| /(\w)\1\1/ =~ s}
=> [0, nil, nil]

Other suggestions

when /^([A-Za-z0-9,]+)((?:, '([^']+)'){3})/
# and then apply a second match to group 2

when /^([A-Za-z0-9,]+)#{", '([^']+)'" * 3}/o

irb(main):032:0> /^([A-Za-z0-9,]+)#{", '([^']+)'" * 3}/o
=> /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

Kind regards

robert

Eero Saynatkari · Oct 23, 2006

--9hzSyicXuByfNYJd
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

Anybody know a way to make this DRYer?
=20
when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

Click to expand...

=20
\1, \2 etc. may be used to refer to previous numbered groups.
=20
I wish the same could be applied to the egregious overuse of
the term DRY by the Rails club.

Giles, if the above came across snide--it was meant to. I do,
however, apologise that you had to bear the mighty brunt of
it. I have grown somewhat tired of the sheer number of DRY
this and DRY that I have seen recently and you are most likely
not behind all of it at least

--9hzSyicXuByfNYJd
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (FreeBSD)

iD8DBQFFO/8P7Nh7RM4TrhIRAs9fAJ9C+0buOCnbTeMja3e2ys3/pDnSuACgnOr0
Zndgwj+vHBXiMK/v7t2eJ7E=
=Edwt
-----END PGP SIGNATURE-----

--9hzSyicXuByfNYJd--

Kevin Jackson · Oct 23, 2006

I wish the same could be applied to the egregious overuse of
I'm actually grateful that the Rails people have pushed the idea of
Don't repeat yourself - it was one of those 'good idea, wish I'd have
thought of it' moments that I had when starting with ruby last year.

Not sure about the acronym, but the concept is (although about as
basic as it gets) worthwhile

Kev

Jacob Fugal · Oct 23, 2006

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

Click to expand...

* Use .+? instead of [^']+
.+? does a non-greedy match, which is what you're really trying to say
with the [^']+

Actually, no, the way he had it is better. Check this out:

http://perlmonks.org/?node=Death to Dot Star!

Using /.+?/ (which is really equivalent to /..*?/ and thus in the same
camp as the article) can be incorrect and also slower. In this case --
I think -- the '.+?' would yield correct results since there's a one
character terminator, but the speed is still an issue. For a simple
string and pair of regexes, it's not much:

$ cat regex-bm.rb
require 'benchmark'

TIMES = 10_000_000
REGEX1 = /'([^']+)'/
REGEX2 = /'(.+?)'/
STRING = "'Woah,' John said, 'there're multiple quotes!'"

Benchmark.bmbm do |x|
x.report("[^']+") { TIMES.times{ STRING =~ REGEX1 } }
x.report(".+?") { TIMES.times{ STRING =~ REGEX2 } }
end

$ ruby -v regex-bm.rb
ruby 1.8.4 (2005-12-24) [powerpc-darwin7.9.0]
Rehearsal -----------------------------------------
[^']+ 24.130000 0.050000 24.180000 ( 24.261712)
.+? 25.490000 0.040000 25.530000 ( 25.799146)
------------------------------- total: 49.710000sec

user system total real
[^']+ 24.160000 0.070000 24.230000 ( 24.729463)
.+? 25.410000 0.060000 25.470000 ( 25.572987)

But it is present. And it gets worse as both the regex being used and
the string being matched get more complex. For a simple case like
Giles, I wouldn't worry too much about the performance difference
between .+? and [^']+, and the correctness is fine. So you can use .+?
-- it *is* more readable. But it also hides subtleties, which raises
flags for me.

Jacob Fugal

Giles Bowkett · Oct 23, 2006

Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

a literal regex with a subpattern repeated three times

I could probably split on the ', but it seems that might have unwanted
side effects.

Click to expand...

That's fine. I see no reason to make it more obfuscated.

Well, see, I used to be a Perl guy. I'm used to thinking of
obfuscation as its own reward.

A couple tips though:

* Use .+? instead of [^']+
.+? does a non-greedy match, which is what you're really trying to say
with the [^']+

Cheers. You're absolutely right there. I knew there was a way to do
non-greedy matching but couldn't recall it.

* If you want to match the *same* text three times, for example
"a, '1', '1', '1'" but not "a, '1', '2', '3'", then you should use a
backsubstitution in the match, using \2 twice, instead of the second
two groups, so the pattern becomes
/^([A-Za-z0-9,]+), '([^']+)', '\2', '\2'/

ah ok. that is a crucial difference. the data was in the same pair of
' -s each time, but it was different data each time also. so that \2
wouldn't actually have worked for me in this case.

Giles Bowkett · Oct 23, 2006

Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

Click to expand...

\1, \2 etc. may be used to refer to previous numbered groups.

I wish the same could be applied to the egregious overuse of
the term DRY by the Rails club.

Click to expand...

Giles, if the above came across snide--it was meant to.

well DUH

I do,
however, apologise that you had to bear the mighty brunt of
it. I have grown somewhat tired of the sheer number of DRY
this and DRY that I have seen recently and you are most likely
not behind all of it at least

OK, first off, you don't need to apologize to me for making fun of the
Rails club. I got kicked out of that club for being intolerant of
script kiddies. As much as I admire Rails for its brilliance and
elegance, you can go ahead and say anything you want about that
community for all I care.

Second, no offense, but although I did bear the brunt, I don't think
the brunt was as mighty as all that. It wasn't a runt of a brunt but
it wasn't mighty either. I'll live.

Third! There was a bit of tongue-in-cheek going on there. I mean you'd
have to be insane, or a Perl coder, to think that a straightforward
regex could genuinely **benefit** from being compacted into something
terser merely for the sake of compacting things. I really just wanted
to see if it was possible at all.

Giles Bowkett · Oct 23, 2006

I'm actually grateful that the Rails people have pushed the idea of
Don't repeat yourself - it was one of those 'good idea, wish I'd have
thought of it' moments that I had when starting with ruby last year.

Not sure about the acronym, but the concept is (although about as
basic as it gets) worthwhile

The acronym actually comes from "The Pragmatic Programmer" and if the
Rails thing inspires even one programmer to read that book it'll make
the world a better place.

Morus Walter · Oct 24, 2006

Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

a literal regex with a subpattern repeated three times

I could probably split on the ', but it seems that might have unwanted
side effects.

Click to expand...

That's fine. I see no reason to make it more obfuscated. A couple tips
though:

* Use .+? instead of [^']+
.+? does a non-greedy match, which is what you're really trying to say
with the [^']+

Really?
What about input like
"bla, 'bl'ub', 'foo', 'bar'"

You'll easily see that the non-greedy version matches, whereas the
original regex doesn't.
You have to be *very* careful if you use non-greedy matches instead of
explicit exclusion, when the match is followed by further rules.
/'[^']+'/ and /'.+?'/ are equivalent, but /'[^']+',/ and /'.+?',/ are not.

My rule of thumb is to avoid non greedy matches in complex regexes.

M.

Martin DeMello · Oct 24, 2006

Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

grp = %{ '([^']+)'}
rx = /^([A-Za-z0-9,]+),#{grp},#{grp},#{grp}/

martin

Rick DeNatale · Oct 25, 2006

Anybody know a way to make this DRYer?

when /^([A-Za-z0-9,]+), '([^']+)', '([^']+)', '([^']+)'/

Click to expand...

grp = %{ '([^']+)'}
rx = /^([A-Za-z0-9,]+),#{grp},#{grp},#{grp}/

Still too WET!

grp = %{,'([^']+)'}
rx = /^([A-Za-z0-9,]+)#{grp*3}/

Now, can someone how to make this DRYer?

x = 111

Seriously, every good idea can be taken to evil extremes.

Moderation in everything, including moderation.

Regular expression problem	13	Mar 10, 2013
regex \w allows non english characters	7	May 10, 2007
How to capture repeated subpatterns?	7	Nov 1, 2006
Simple web framework - improvements to makefile	0	Feb 1, 2023
Crazy gsub/regex scheme - can this be done better?	3	Aug 11, 2006
C++ Student, need help with very simple concept	2	Mar 13, 2023
Tasks	1	Nov 29, 2022
Seeking co-founders for my company.	3	Sep 8, 2024

DRY fanatics?

Giles Bowkett

Joel VanderWerf

Eero Saynatkari

Ken Bloom

James Britt

Robert Klemme

Eero Saynatkari

Kevin Jackson

Jacob Fugal

Giles Bowkett

Giles Bowkett

Giles Bowkett

Morus Walter

Martin DeMello

Rick DeNatale

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads