[QUIZ] Unit Conversion (#183)

R

Robert Dober

If your main issue is that you think "cheat" implies dishonesty, then
understand I mean no such thing. gsub(/cheat/, "reuse") and you'll be
happier.
No problem at all, I have however taken some time to explain why I was
somehow shocked OL.
I apologize if this is kind of wasting bandwidth on the Quiz Thread.
 
P

Peter Szinek

[Note: parts of this message were removed to make it a legal post.]

I just wanted to submit my first Ruby-quiz solution ever - but I was
waiting for the other solutions, being unsure when does the 48-hour
deadline end - but by now I am pretty sure it's over. So what's going
on?



Cheers,
Peter
___
http://www.rubyrailways.com
http://scrubyt.org
 
M

Matthew Moss

I just wanted to submit my first Ruby-quiz solution ever - but I was
waiting for the other solutions, being unsure when does the 48-hour
deadline end - but by now I am pretty sure it's over. So what's
going on?

Yes, the quiz is (usually) posted Friday, so submissions typically
start rolling in Sunday. And Martin did provide his alcohol submission
yesterday. So feel free to post.
 
P

Peter Szinek

[Note: parts of this message were removed to make it a legal post.]
Yes, the quiz is (usually) posted Friday, so submissions typically
start rolling in Sunday. And Martin did provide his alcohol
submission yesterday. So feel free to post.

OK. <drumroll> my first submission to Ruby-quiz:

(After all those discussion on cheating, I am not sure if this
solution is not considered cheating of some sort ;-)

===============================================================
require 'rubygems'
require 'cgi'
require 'scrubyt'

begin
google_converter = Scrubyt::Extractor.define do
fetch "http://www.google.com/search?q=#{CGI::escape(ARGV[0])}
+#{CGI::escape(ARGV[1])}+to+#{CGI::escape(ARGV[2])}"

google_result "//td[@dir='ltr']" do
final_result(/= (.+) /)
end
end
puts google_converter.to_hash[0][:final_result]
rescue
puts "Sorry, even *google* can't translate that!"
end
===============================================================

ex:
ruby converter.rb 10 "meter per second" "mile per hour"
22.3693629

ruby converter.rb 10 USD EUR
7.91201836

ruby converter.rb 7 "ruby gems" "python eggs"
Sorry, even *google* can't translate that!
etc.

disadvantage: you need to be online
advantage: it's quite rich, robust and up-to date (e.g. currency
conversions)

I am wondering if this solution is (at least somewhat) OK - I have a
similar one, though that doesn't require you to be on-line. However, I
am not going to implement it if the above solution is BS :)

Cheers,
Peter
___
http://www.rubyrailways.com
http://scrubyt.org
 
T

Trans

Yes, the quiz is (usually) posted Friday, so submissions typically =A0
start rolling in Sunday. And Martin did provide his alcohol =A0
submission yesterday. So feel free to post.

OK. <drumroll> my first submission to Ruby-quiz:

(After all those discussion on cheating, I am not sure if this =A0
solution is not considered cheating of some sort ;-)

=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D
require 'rubygems'
require 'cgi'
require 'scrubyt'

begin
google_converter =3D Scrubyt::Extractor.define do
=A0 =A0fetch "http://www.google.com/search?q=3D#{CGI::escape(ARGV[0])}
+#{CGI::escape(ARGV[1])}+to+#{CGI::escape(ARGV[2])}"

=A0 =A0google_result "//td[@dir=3D'ltr']" do
=A0 =A0 =A0final_result(/=3D (.+) /)
=A0 =A0end
end
=A0 =A0puts google_converter.to_hash[0][:final_result]
rescue
=A0 =A0puts "Sorry, even *google* can't translate that!"
end
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D

ex:
ruby converter.rb 10 "meter per second" "mile per hour"
22.3693629

ruby converter.rb 10 USD EUR
7.91201836

ruby converter.rb 7 "ruby gems" "python eggs"
Sorry, even *google* can't translate that!
etc.

disadvantage: you need to be online
advantage: it's quite rich, robust and up-to date (e.g. currency =A0
conversions)

I am wondering if this solution is (at least somewhat) OK - I have a =A0
similar one, though that doesn't require you to be on-line. However, I = =A0
am not going to implement it if the above solution is BS :)


I think this is an interesting solution b/c it is prescient of a
future we are likely to see. When average broadband connections
approach 1gbs or so this kind of thing will be the norm. We are seeing
this evolution now, with software not being installed by physical
media any longer, but via online package management. And now with
things like RubyGems it is becoming even more fine grained.

Ultimately "usr/lib" will be on the wire.

So what is your similar solution? Taking a guess... is it this with a
cache?

T.
 
T

Trans

Given the conversation about "cheating", and what I know about the
potential complexities of a good solution, having written the original
prototype of units.rb, I asked Peter Vanbroekhoven, who is the master
mind behind Stick's current units system, if he'd be willing to write
up a brief overview of how it works. Thankfully he has obliged us.

Also, I just switched the Stick repo to Git. So the code can be
browsed here:

http://stick.rubyforge.org/git?p=stick.git;a=tree

The main units code part being here:

http://stick.rubyforge.org/git?p=st...aac9685963df5436718c704e1a2bb159c1f87;hb=HEAD

Especially look at base.rb.

The notation this uses is not exacty like that requested by the quiz,
rather it would look like:

50.mi.to(km)

But I think that's close enough --it wouldn't be very hard to add a
cli front-end to translate.

Anyway, cheating or not, I hope this proves helpful to anyone
interested in the subject.

Here's Peter's overview:

== The Basics

There are two types of units: base units, which are not expressed as a
function of other units, and derived units which are expressed as a
function of other units. The base units are represented by the
Stick::Units::BaseUnit class. Derived units are implemented by
Stick::Units::Unit. Derived units are represented as the product of
powers of other units that can be base units or derived units. These
powers are kept in a Hash. BaseUnit is never exposed to the user; if
you need a base unit, is represented as a Unit that's the product of a
single BaseUnit to the power 1. To be able to work with units, it's
often necessary to normalize them in some way, i.e., express them in
function of base units only. This is done by the Unit#simplify method.
This normalization is not performed automatically; rather we have
chosen to have the user of units.rb initiate any conversion so he or
she can have more control over rounding errors and such. Units can be
multiplied, divided and exponentiated.

Units by itself are not very interesting unless they are combined with
some numeric value. This is what the Stick::Units::Value class does.
It holds a Unit, and a value which can be integers, float,
BigDecimals, complex numbers, etc. Values can be multiplied, divided,
and added and subtracted if the units are compatible. This is checked
by normalizing the units and then transforming the Value with the
larger unit to the smaller unit. For instance, adding inches and feet
will result in inches because an inch is the smaller unit. Value
supports most other numeric operators.

== Converters

units.rb has the notion of converters. This notion has been introduced
because sometimes units have the same name but differ in value
depending on location (e.g., a hundredweight in the UK and the US),
they can have the same symbol though they are different, or in the
case of currency you might want to use different services with
possibly slightly different exchange rates. A unit belongs
unambiguously to a single converter. Units from different converters
can be used together, so it is possible to use US and UK
hundredweights in the same expression. What happens here is that a
unit is not only determined by a name but also by a converter. That's
all there is to it really.

There's always the notion of a current converter. When constructing a
unit, the current converter is used by default unless specified
otherwise. This current converter can be changed with
Stick::Units.with_unit_converter which takes a block. The current
converter is stored in a thread-local variable for the duration of the
block and is taken from there. This gives the user means to specify
what unit systems to use in a more granular way. Converter can include
other converter. This allows a converter to extend another converter
and override some of the names. This is actually what
Stick::Units.with_unit_converter does: internally it creates an
anonymous converter that includes both the previous "current
converter" and the new one which gets stacked on top and thus takes
precedence.

== Syntactic Sugar

Units are specified using symbols like :mile and :in, but units.rb
offers a lot of syntactic sugar to make it easier to use. After
including Stick::Units, you can use mile and in directly, or even do
1.in and 2.miles. This is implemented through method_missing and
const_missing (for units that start with a capital letter).

== Loading Config Files

units.rb uses a DSL to specify converters and units. These DSLs are
conveniently used in the config files to preload a large number of
units.
 
M

Matthew Moss

Anyway, cheating or not, I hope this proves helpful to anyone
interested in the subject.

Ack... Fine, nothing is cheating; forget I ever mentioned it. Since
the word seems to be problematic all of a sudden, I'll refrain from
using it in the future.


In any case, thanks for the overview of stick/units. I was unaware of
the library before this week, and it's always good to know some the
the quality libs out there.
 
M

Michael Ulm

Martin said:
My converter converts alcoholic beverages by effect *). In addition to
the source and destination beverage it converts the used containers,
too. Google can't do this yet ;-)

Example:

$ ruby convert.rb 1 bottle of wine into glasses of beer
1 bottle (0.75l) of wine (VOL 12%) effects equal to 6.06 glasses (2.00l)
of beer (VOL 4%)

$ ruby convert.rb 1 bucket of beer into cups of martini
1 bucket (10.00l) of beer (VOL 4%) effects equal to 26.47 cups (2.65l)
of martini (VOL 17%)

Either your glasses and cups are way bigger than mine, there seems to be
a bug in your output code.

Regards,

Michael
 
M

Michael Ulm

Michael said:
Either your glasses and cups are way bigger than mine, there seems to be
a bug in your output code.

Oops, I understand the output now, the amount in the brackets is the total
amount in liters, not the amount per container. Sorry for the noise.

Regards,

Michael
 
T

Trans

Ack... Fine, nothing is cheating; forget I ever mentioned it. Since =A0
the word seems to be problematic all of a sudden, I'll refrain from =A0
using it in the future.

Don't sweat it. I thought of it as "cheating" myself at first.
Thankfully, some wise members of this list made me realize, whether it
technically qualifies as a quiz solution or not, it could at least
contribute to the conversation.
In any case, thanks for the overview of stick/units. I was unaware of =A0
the library before this week, and it's always good to know some the =A0
the quality libs out there.

Your welcome, and I'll extend that to Peter.

T.
 
T

Trans

Well time is difficult to get but I just had to supply a Ruby Quiz
submission *again*.
Was so much fun. Tried to implemnt Tom's syntax driven by a data file
(tiny :(, see above)

If this were a a contest, looks like you get the blue ribbon :)

# file: units.txt
#
# Units data file for Ruby Quiz #183
#
# -----------------------------------

Impressed you got that much functionality from just this little bit of
code.

What up with "=B5" =3D> 0.000_001 ;-)

T.
 
M

Matthew Moss

## Unit Conversion (#183)


Summary will be posted tomorrow; new quiz on Saturday. Apologies...
 
R

Robert Dober

If this were a a contest, looks like you get the blue ribbon :)



Impressed you got that much functionality from just this little bit of
code.

What up with "=B5" =3D> 0.000_001 ;-)

T.
I do not really merit this praise, though I appreciate it;). Actually
the metaprogramming part is about ok, but the
datastructure is quite primitive, well if only I had more time (or
were smarter LOL).
Cheers
R.
 
M

Matthew Moss

## Unit Conversion (#183)


The right way, generally, to do a task such as unit conversion is to
see if someone has already done all the hard work for you. As was
pointed out, there are several options in this respect:

* The [Stick] library for Ruby; a [brief summary] was provided.
Stick provides a value class (i.e. quantity with units), conversions,
syntactic sugar and more.
* Google's search engine can act as a calculator, including unit
conversions. Using Google's API is one option; another is screen-
scraping, as was done by _Peter Szinek_. (Of course, as noted, you
must have an activate Internet connection to use this solution.)
* As was pointed out by _Ryan Davis_, there is a BSD/Un*x command
and library called `units` which does this same task. Transform the
arguments, pass them to a shell, and capture the output.

Many thanks to _Martin Boese_, whose solution had to be empirically
confirmed. Repeatedly.

But I'm going to look at the solution from _Robert Dober_. While it is
limited, as posted, his data driven approach could be expanded to
include more conversions.

To understand how the expression `1.0.in.to.mm` will generate the
string "25.4mm", I'll trace it a step at a time, looking at the
relevant bits of code.

First, we have the float value `1.0`, but where does the method `in`
come from? Clearly, class `Float` gets something by way of extension:

class Float
include Conversion
end

Module `Conversion` only defines one method that will extend `Float`
(with the rest of `Conversion` being helper classes and code executed
when `Conversion` is first evaluated). That method is `method_missing`:

def method_missing unit_name
pc = ProxyClasses[ unit_name.to_s ] || super( unit_name )
pc::new self
end

So we will look for `ProxyClasses["in"]` and, if not found, we just
call to the parent class and hope it knows what to do with method call
`in`. But in this case, we're expecting to find something in
`ProxyClasses`... a Class, in fact, which we attempt to instantiate
immediately using `new`. But where does we fill `ProxyClasses`?

Ah, that would be the code right below `method_missing` in his
solution: the code that makes use of `LineParser`.

conversions = LineParser::new
File::eek:pen "units.txt" do | f |
f.each do | line |
conversions.parse_line line
end
end

Robert provided a minimal `units.txt` data file to show how the code
works. (Note that the line beginning "use SI" is part of the data file
and not a mistake; see `parse_line` for how that is handled.)

1 in = 0.0254 m
1 l = 0.001 m3
use SI prefixes for m g l m3

It could be expanded greatly to support many more units. As each line
is read, the `LineParser` object parses them, keeping track of the
conversion rules -- I'll come back to that later. What I want to look
at first is what gets done with those rules:

conversions.traverse do | src_unit, tgt_unit, conversion |
( ProxyClasses[ src_unit ] ||= Class::new ProxyClass ).module_eval do
define_method tgt_unit do (@value * conversion).to_s + tgt_unit end
end
end

`traverse` is going to enumerate over a number of valid conversions --
source units, target units, and the conversion factor. And here we see
from where the `ProxyClasses` originate... New `ProxyClass` objects
are created through the code `Class::new ProxyClass` (but only if one
didn't exist already for the particular source unit... note the use of
the `||=` operator which only evaluates the right side and assigns
left if the left was initially nil).

After ensuring that the `ProxyClass` corresponding to the source units
exists, we call `module_eval` in order to add methods to the anonymous
class just created. The method name will be the target units, and the
method multiplies in the conversion factor, converts to a string, and
appends the targets units.

So, getting back to our example `1.0.in.to.mm`, we've now found the
`ProxyClass` corresponding to `1.0.in`. And we know that `ProxyClass`
also has methods named by target units, which includes one that
corresponds to the last part of the example: `.mm`.

If you're wondering about `to`, every `ProxyClass` defines that method
to return self: essentially a useless function (in the sense that it
does nothing more than `1.0.in.mm`). It's existence mimics other
libraries, and the point is readability. (An alternative would be a
more traditional call, such as 1.0.convert:)in, :mm) or similar.)

So once these proxy classes exist, there's very little effort going on
to evaluate calls such as our example. And creating the proxy classes
isn't much more difficult, assuming you have a proper conversion
table. Now we come back to `LineParser` and what happens beyond its
`parse_line` method. (I'll skip `parse_line` itself, since it is a
few, simple regular expressions.)

Most of `units.txt` that defines our conversions is going to be
handled by `add_conversion`, which just receives as arguments each
split line of the data file. The conversion table (stored in `@c`) is
two-layered hash -- a hash of hashes -- and is setup with this code:

def add_conversion lhs_value, lhs_unit, equal_dummy, rhs_value,
rhs_unit
@c[ lhs_unit ][ rhs_unit ] = Float( rhs_value ) / Float( lhs_value )
@c[ rhs_unit ][ lhs_unit ] = Float( lhs_value ) / Float( rhs_value )
end

The conversion ratio (and the inverse conversion ratio) are stored in
two places based on the indexing order. By storing both ratios/orders,
we can convert in "both directions". That is, for our example, not
only can we convert inches to millimeters, but millimeters to inches.

The last bit of file parsing is adding appropriate metric prefixes (SI
units). One line in the file indicates which units are worthy of
metric prefixes. In the data file provided, we see that meters can
accept metric prefixes (such as "kilo" and "milli"), but inches will
not. These prefixes are handed by `add_si_unit_for`:

def add_si_unit_for unit
SIUnits.each do | prefix, conversion |
@c[ prefix + unit ][ unit ] = conversion
@c[ unit ][ prefix + unit ] = 1 / conversion
end
end

Here, `unit` is the particular unit we want to support metric
prefixes. `SIUnits` is the hash containing the metric prefixes as
characters and the corresponding orders of magnitude. For every unit
and metric prefix, two more conversions are added, each the inverse of
the other: conversion between the naked unit and the adorned unit
(e.g. between meters and millimeters, and vice-versa).

Finally, `traverse` is an enumerator that will yield (via `blk.call`)
every valid combination of units and the appropriate conversion
factor. It manages this without storing every conversion (e.g. we
store the inches to meters conversion, and the meters to millimeters
conversion, but don't explicitly store inches to millimeters).
Enumerating every possible, valid conversion is done in the private
method `_traverse`:

def _traverse src_unit, unit_conversions, traversed_units, f=1.0, &blk
unit_conversions.each do | new_unit, conversion |
next if traversed_units.include? new_unit
blk.call src_unit, new_unit, f * conversion
_traverse src_unit, @c[ new_unit ], traversed_units + [ new_unit ],
f * conversion, &blk
end
end

The final, recursive step here is what allows us to build a transitive
closure of all units. `src_unit` is, of course, the source unit (e.g.
inches). `unit_conversion` contains all possible immediate conversions
from the source and is the hash of units and conversion factors. And,
you can see, we enumerate those into `new_unit` and `conversion`.

We skip a target unit if it's already been visited (i.e. in
`traversed_units`). Otherwise, we yield to the caller (`blk.call`) and
recurse, now converting the source unit to everything the target unit
can also be converted, making sure to update `traversed_units` so as
to terminate eventually.



[1]: http://stick.rubyforge.org/
[2]: http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-talk/320583
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,999
Messages
2,570,246
Members
46,839
Latest member
MartinaBur

Latest Threads

Top