Matching

Shandy Nantz · Nov 12, 2008

I am trying to parse out some characters from a string and I am having
some trouble because the string may have a space or it may not.
Basically I have a string that looks like this:

Seattle, WA[SEA] or
Colorado Springs, CO[COS] or
Denver, CO[DEN]

What I have so far to match the Seattle and Denver strings looks like
this:

/[A-Z]{1,1}[a-z]+[,]\s[A-Z]+\[{1,1}/

This regular expression is trying to grab everything within the []; in
the cases above this would be SEA, COS, DEN.

How do I make that same match if the initial string has whitespace
include, such as the Colorado Springs. I could add a \w, but what if the
section of string before the "," has more then one space; what if it has
three, or four, or ten.

If there is a better way to do this I all ears, regular expressions
isn't my cup of tea. Thanks,

-S

Peter Szinek · Nov 12, 2008

I am trying to parse out some characters from a string and I am having

some trouble because the string may have a space or it may not.
Basically I have a string that looks like this:

Seattle, WA[SEA] or
Colorado Springs, CO[COS] or
Denver, CO[DEN]

What I have so far to match the Seattle and Denver strings looks like
this:

/[A-Z]{1,1}[a-z]+[,]\s[A-Z]+\[{1,1}/

This regular expression is trying to grab everything within the []; in
the cases above this would be SEA, COS, DEN.

If you want to grab everything in brackets, why do you care about the
text before them?
You could do just:

"Seattle, WA[SEA]"[/\[(.+)\]/,1] => "SEA"
"Colorado Springs, CO[COS]"[/\[(.+)\]/,1]

Click to expand...

=> "COS"

etc.

Cheers,
Peter

Peter Szinek · Nov 12, 2008

[Note: parts of this message were removed to make it a legal post.]

I am trying to parse out some characters from a string and I am
having
some trouble because the string may have a space or it may not.
Basically I have a string that looks like this:

Seattle, WA[SEA] or
Colorado Springs, CO[COS] or
Denver, CO[DEN]

What I have so far to match the Seattle and Denver strings looks like
this:

/[A-Z]{1,1}[a-z]+[,]\s[A-Z]+\[{1,1}/

This regular expression is trying to grab everything within the [];
in
the cases above this would be SEA, COS, DEN.

Click to expand...

Here is a 'full' solution that should cover these cases:

"Colorado Springs, CO[COS]"[/(\w|\s)+, [A-Z]{2}\[(.+?)\]/,2] => "COS"
"Denver, CO[DEN]"[/(\w|\s)+, [A-Z]{2}\[(.+?)\]/,2]

Click to expand...

=> "DEN"

Cheers,
Peter

Shandy Nantz · Nov 13, 2008

Peter said:
If you want to grab everything in brackets, why do you care about the
text before them?
You could do just:

I realized that after I made this post - that I was over thinking the
problem way to much. Thanks,

-S

Shandy Nantz · Nov 13, 2008

Peter said:
Here is a 'full' solution that should cover these cases:

"Colorado Springs, CO[COS]"[/(\w|\s)+, [A-Z]{2}\[(.+?)\]/,2] => "COS"
"Denver, CO[DEN]"[/(\w|\s)+, [A-Z]{2}\[(.+?)\]/,2]

Click to expand...

Click to expand...

=> "DEN"

I have had to refine my search a little, mostly because getting the last
four characters turned out not to be a simple thing when using regular
expressions. I have people entering text via an auto_complete_text_field
in a rails app; my issues is that on my local machine there parameters
get passed like this:

Denver, CO[DEN] or Colorado Springs, CO[COS]

However, up on the server the parameters get passed like this:

" Denver, CO[DEN] " or " Colorado Springs, CO[COS] "

and for whatever reason (my guess being because regex is a pain in the
a#%), simply replacing the spaces dident work and cause massive amounts
of havoic. So I used the code found up above and all was well in the
world....

Then a new hick-up was encountered:

Toronto, Canada[YYZ]

As you can imagine the regex wouldn't ketch this expression. So I tried
this:

params[:airport][:city].gsub(/$\s/, "")
[/(\w|\s)+, [A-Z]{2}|[A-Z][a-z]{1,}\[(.+?)\]/,2]

And al' bee it caught the Torono, Canada[YYZ]. Now, however, it fails to
get the Denver, CO[DEN]. I must have a complete lack of understanding
when it comes to regex because I always thought that the "|" in the
above statement meant "or", as in, try and match TWO A-Z OR one A-Z
FOLLOWED BY one or more a-z. I guess I am drawing a total blank because
I look at the revised code above and I think it should match all the
above cases and I cannot figure out for the life of me where it is
failing. Can someon ehelp, thanks

-S

Rob Biedenharn · Nov 13, 2008

Peter said:
Peter said:

Here is a 'full' solution that should cover these cases:

"Colorado Springs, CO[COS]"[/(\w|\s)+, [A-Z]{2}\[(.+?)\]/,2] => "COS"
"Denver, CO[DEN]"[/(\w|\s)+, [A-Z]{2}\[(.+?)\]/,2]

Click to expand...

=> "DEN"

Click to expand...

I have had to refine my search a little, mostly because getting the
last
four characters turned out not to be a simple thing when using regular
expressions. I have people entering text via an
auto_complete_text_field
in a rails app; my issues is that on my local machine there parameters
get passed like this:

Denver, CO[DEN] or Colorado Springs, CO[COS]

However, up on the server the parameters get passed like this:

" Denver, CO[DEN] " or " Colorado Springs, CO[COS] "

and for whatever reason (my guess being because regex is a pain in the
a#%), simply replacing the spaces dident work and cause massive
amounts
of havoic. So I used the code found up above and all was well in the
world....

Then a new hick-up was encountered:

Toronto, Canada[YYZ]

As you can imagine the regex wouldn't ketch this expression. So I
tried
this:

params[:airport][:city].gsub(/$\s/, "")
[/(\w|\s)+, [A-Z]{2}|[A-Z][a-z]{1,}\[(.+?)\]/,2]

And al' bee it caught the Torono, Canada[YYZ]. Now, however, it
fails to
get the Denver, CO[DEN]. I must have a complete lack of understanding
when it comes to regex because I always thought that the "|" in the
above statement meant "or", as in, try and match TWO A-Z OR one A-Z
FOLLOWED BY one or more a-z. I guess I am drawing a total blank
because
I look at the revised code above and I think it should match all the
above cases and I cannot figure out for the life of me where it is
failing. Can someon ehelp, thanks

-S

OK, you're trying to pull the three-letter airport code from within
the square brackets at the end of the location.

airports = [ "Denver, CO[DEN]",
"Colorado Springs, CO[COS]",
" Denver, CO[DEN] ",
" Colorado Springs, CO[COS] ",
"Toronto, Canada[YYZ]",
"Erlanger, KY[CVG]",
]

airports.each do |airport|
location, code = airport.strip.match(/(.*)\[(.{3})\]/).captures
puts "#{code} is in '#{location}'"
end

DEN is in 'Denver, CO'
COS is in 'Colorado Springs, CO'
DEN is in 'Denver, CO'
COS is in 'Colorado Springs, CO'
YYZ is in 'Toronto, Canada'
CVG is in 'Erlanger, KY'

Got it?

-Rob

Rob Biedenharn http://agileconsultingllc.com
(e-mail address removed)

Rob Biedenharn · Nov 14, 2008

As you can imagine the regex wouldn't ketch this expression. So I
tried
this:

params[:airport][:city].gsub(/$\s/, "")
[/(\w|\s)+, [A-Z]{2}|[A-Z][a-z]{1,}\[(.+?)\]/,2]

And al' bee it caught the Torono, Canada[YYZ]. Now, however, it
fails to
get the Denver, CO[DEN]. I must have a complete lack of understanding
when it comes to regex because I always thought that the "|" in the
above statement meant "or", as in, try and match TWO A-Z OR one A-Z
FOLLOWED BY one or more a-z. I guess I am drawing a total blank
because
I look at the revised code above and I think it should match all the
above cases and I cannot figure out for the life of me where it is
failing. Can someon ehelp, thanks

-S

Oh, I see part of your problem: You need to group the alternation to
see what you want.
=> An UPPER with one or more lowers,
followed by some characters within
square brackets
v---------------------v
/(\w|\s)+, [A-Z]{2}|[A-Z][a-z]{1,}\[(.+?)\]/
^----------------^
=> Some word or
space characters,
a comma,
and 2 letters

I think you intended to have:
v------v
/(\w|\s)+, (?:[A-Z]{2}|[A-Z][a-z]{1,})\[(.+?)\]/
^------------^
Grouped with (?: ) which is a non-capturing group.

(But my earlier message still holds.)

-Rob

Rob Biedenharn http://agileconsultingllc.com
(e-mail address removed)

Shandy Nantz · Nov 14, 2008

Rob said:
Oh, I see part of your problem: You need to group the alternation to
see what you want.
=> An UPPER with one or more lowers,
followed by some characters within
square brackets
v---------------------v
/(\w|\s)+, [A-Z]{2}|[A-Z][a-z]{1,}\[(.+?)\]/
^----------------^
=> Some word or
space characters,
a comma,
and 2 letters

I think you intended to have:
v------v
/(\w|\s)+, (?:[A-Z]{2}|[A-Z][a-z]{1,})\[(.+?)\]/
^------------^
Grouped with (?: ) which is a non-capturing group.

(But my earlier message still holds.)

-Rob

Rob Biedenharn http://agileconsultingllc.com
(e-mail address removed)

Can you direct me to some good online resources where some Ruby methods
ar listed? I didn't know that there was a method for a string named
"match" that would return a string and not just chomp of various
characters. I have a book on Ruby an it is well writen but a list on
methods broken down to what methods are for which classes would be
helpful? Thanks,

-S

Rob Biedenharn · Nov 14, 2008

Rob said:
Rob said:

Oh, I see part of your problem: You need to group the alternation to
see what you want.
=> An UPPER with one or more lowers,
followed by some characters within
square brackets
v---------------------v
/(\w|\s)+, [A-Z]{2}|[A-Z][a-z]{1,}\[(.+?)\]/
^----------------^
=> Some word or
space characters,
a comma,
and 2 letters

I think you intended to have:
v------v
/(\w|\s)+, (?:[A-Z]{2}|[A-Z][a-z]{1,})\[(.+?)\]/
^------------^
Grouped with (?: ) which is a non-capturing group.

(But my earlier message still holds.)

-Rob

Rob Biedenharn http://agileconsultingllc.com
(e-mail address removed)

Click to expand...

Can you direct me to some good online resources where some Ruby
methods
ar listed? I didn't know that there was a method for a string named
"match" that would return a string and not just chomp of various
characters. I have a book on Ruby an it is well writen but a list on
methods broken down to what methods are for which classes would be
helpful? Thanks,

-S

If you don't have a copy of "Programming Ruby"[1] aka, the Pickaxe
book, it's well worth getting. If you are new to Ruby, you can get
pretty far with the online first edition[2] which covers Ruby 1.6
(second edition is 1.8, anticipated 3rd edition is 1.9 and currently a
beta-book).

You can locally use ri (or get fri, the fastri alternative) to get docs.

ri String
ri String#match
ri Regexp#match
ri MatchData

-Rob

[1] http://www.pragprog.com/titles/ruby/programming-ruby
[2] http://www.ruby-doc.org/docs/ProgrammingRuby/

Rob Biedenharn http://agileconsultingllc.com
(e-mail address removed)

Matching multiple line reg exp	3	Nov 22, 2010
What is the most astounding C++ syntax construct?	0	Dec 22, 2022
regular expressions and matching delimeters	17	May 21, 2014
regex matching a state in a string	2	Mar 15, 2012
I am learning code...	2	Oct 31, 2020
Did you know that there is a match-case function in python?	4	Dec 17, 2023
Problem with displaying character that code number is 219 (after SetConsoleTextAttribute)?	3	Jan 9, 2023
Algorithm for fuzzy string matching	1	Mar 23, 2011

Matching

Shandy Nantz

Peter Szinek

Peter Szinek

Shandy Nantz

Shandy Nantz

Rob Biedenharn

Rob Biedenharn

Shandy Nantz

Rob Biedenharn

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads