why is overloading invalid in ruby.

C

Charles Oliver Nutter

Not very quackish.

Neither are case/when statements like this:

case obj
when String ...
when Regexp ...
end

But sometimes that's what you need to do.

- Charlie
 
J

Jörg W Mittag

Adam said:
Not very quackish.

But it's used all over the core and standard libraries.

I tried implementing my own version of Enumerable#inject, which has no
less than *four* overloads and is almost impossible to implement in
Ruby because of that:

Enumerable[$A]#inject($B) {|$B, $A| $B } → $B
Enumerable[$A]#inject {|$A, $A| $A } → $A
Enumerable[$A]#inject($B, Symbol) → $B
Enumerable[$A]#inject(Symbol) → $A

(For obvious reasons, Ruby doesn't have a standardized syntax for
talking about types, so I made one up. I hope you get what I mean.)

For example, the Rubinius implementation of Enumerable#inject contains
about 8 lines of actual code (ignoring comments, blank lines and lines
consisting only of 'end' or '}'). Two implement the actual behavior of
inject, the other six basically implement an ad hoc,
informally-specified, bug-ridden, slow implementation of half of
argument-based dispatch.

In fact, almost every method in kernel/common/enumerable.rb in
Rubinius contains at least one line that does not actually contribute
anything to the logic of the method but instead changes the behavior
of the method in some way, shape or form based on the number or class
of the arguments.

And I've seen that in other code as well, not just Rubinius's or my
own, and not just in code that tries to replicate stdlib behavior.

Several projects I know chose YARD over RDoc as their documentation
tool, because it allows them to at least document their overloads
easily, even if writing them is cumbersome.

Personally, I would enjoy being able to dispatch on arguments in
addition to receivers. Martin Odersky hinted that he is interested in
adding argument-based dispatch to Scala, it will be interesting to see
what he comes up with, although he also said that it's a very hard
problem (because of its interactions with overloading) and will take
many years if it happens at all.

Note that I avoided the use of the term "overloading" and used
"argument-based dispatch" instead. I'm pretty sure that's what the OP
meant, since overloading happens statically and simply doesn't make
sense in Ruby.

jwm
 
E

Eric Christopherson

Whenever the overloading topic comes up, I wonder about what the
future holds regarding true named arguments. Does anyone know?

Specifically, MacRuby has an extension to the usual hash parameters
that seems pretty complicated to me, but allows it to interact fairly
painlessly with Objective-C*. It basically allows you to define a
method with named parameters. It's bewildering because a method
invocation like "person.setFirstName('Gloria', { :lastName=>'Alvarez'
})" actually calls a *different* method from
"person.setFirstName('Gloria', lastName: 'Alvarez')" (the former is
the regular Ruby hash argument type; the latter uses the
MacRuby-specific keyed arguments. If no keyed-argument version of a
method exists, it will just translate it into a hash-argument form.)

Anyway, I wonder if there has been any talk between MacRuby and the
other Ruby implementations on getting this sort of keyed arguments
standardized.



* Objective-C, like Smalltalk, has methods with multi-part names,
where some of the arguments go inside the method name; e.g. you have a
method beginSheetModalForWindow: modalDelegate: didEndSelector:
contextInfo: where each colon marks a place where an argument should
go. Generally the word (or few words) right before the colon indicate
what the argument is supposed to be, semantically.
 
J

Jörg W Mittag

Eric said:
Whenever the overloading topic comes up, I wonder about what the
future holds regarding true named arguments. Does anyone know?

AFAIK, they are on the wishlist for Ruby 2.0, but I don't think there
has been any commitment made. There's no implementation that I know
of, not even a design.
Anyway, I wonder if there has been any talk between MacRuby and the
other Ruby implementations on getting this sort of keyed arguments
standardized.

AFAIK, Laurent Sansonetti did consult with matz to make sure that
MacRuby's extensions would be forward compatible.

So, while there does not exist a design, let alone an implementation
of named arguments for Ruby 2.0, it seems to be clear that whatever
design they come up with, will have to be compatible with MacRuby.

jwm
 
C

Charles Oliver Nutter

2011/1/29 J=C3=B6rg W Mittag said:
AFAIK, Laurent Sansonetti did consult with matz to make sure that
MacRuby's extensions would be forward compatible.

I don't believe that's the case, and if I remember right Matz actually
expressed concern that MacRuby was adding syntax that might later
conflict with MRI.
So, while there does not exist a design, let alone an implementation
of named arguments for Ruby 2.0, it seems to be clear that whatever
design they come up with, will have to be compatible with MacRuby.

MacRuby has taken the risk of future syntax being incompatible. I
don't think their decision to add syntax no other Ruby impl supports
should limit future design of Ruby proper.

FWIW, I understand the justification for the MacRuby syntax (objc
interop), but it's pretty clear to me that adding incompatible syntax
puts MacRuby on its own wrt future standard syntax changes.

I've considered adding syntax to JRuby for some things (like to allow
static dispatch against Java objects, for perf) but in every case I've
only considered options that would be forward-compatible (like
comment-based annotation of types, etc).

- Charlie
 
C

Charles Oliver Nutter

Specifically, MacRuby has an extension to the usual hash parameters
that seems pretty complicated to me, but allows it to interact fairly
painlessly with Objective-C*. It basically allows you to define a
method with named parameters. It's bewildering because a method
invocation like "person.setFirstName('Gloria', { :lastName=>'Alvarez'
})" actually calls a *different* method from
"person.setFirstName('Gloria', lastName: 'Alvarez')" (the former is
the regular Ruby hash argument type; the latter uses the
MacRuby-specific keyed arguments. If no keyed-argument version of a
method exists, it will just translate it into a hash-argument form.)

Laurent can correct me if I'm wrong, but I don't think ObjC's syntax
qualifies as true named arguments. In MacRuby/ObjC, a method call like

foo(bar:1, baz:2)

Is just a way of saying something like

foo_with_bar_with_baz(1, 2)

The method name and the argument names actually resolve to different
method bodies internally. In addition, the order is absolutely
crucial. The following two calls are not the same and will not end up
in the same target method:

foo(bar: 1, baz: 2)
foo(baz: 2, bar: 1)

Named arguments would allow you to call the *same* method with
different groups of names in any order.

- Charlie
 
S

Shadowfirebird

So far I don't believe I've heard anyone make the obvious case against method overloading -- that unless done very carefully indeed, it makes the code much, much more difficult to read. One method does one job is the sane way to go, thanks.

Or, indeed, the practical case - at the very heart of Ruby is the idea of duck typing. Duck typing rules out method overloading, because parameters would have to have set types before you could have a signature. Presumably no-one is suggesting that we should have fixed typing in Ruby?
 
R

Robert Klemme

So far I don't believe I've heard anyone make the obvious case against me=
thod overloading -- that unless done very carefully indeed, it makes the co=
de much, much more difficult to read. =A0One method does one job is the san=
e way to go, thanks.

I had tried to make the point here:

http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-talk/377075
Or, indeed, the practical case - at the very heart of Ruby is the idea of=
duck typing. =A0Duck typing rules out method overloading, because paramete=
rs would have to have set types before you could have a signature. =A0Presu=
mably no-one is suggesting that we should have fixed typing in Ruby?

Erm, actually there _are_ people who believe typing should change in
Ruby to support static typing features. There does not seem to be
much support for this in the community though. Obviously ducks feel
more at home in our community pond than metal skeletons. :)

Kind regards

robert

--=20
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/
 
G

Gary Wright

Or, indeed, the practical case - at the very heart of Ruby is the idea =
of duck typing. Duck typing rules out method overloading, because =
parameters would have to have set types before you could have a =
signature. Presumably no-one is suggesting that we should have fixed =
typing in Ruby?

I think the original poster provided an example of overloading based on =
the number of parameters but not their type.
Even restricting yourself to overloading by arity is a bit problematic =
in Ruby because the arity still has to be determined (in some cases) =
dynamically:

args =3D [1,2]
foo(*args) # two arguments
args << 3
foo(*args) # three arguments

Gary Wright
 
J

Jörg W Mittag

Charles said:
Laurent can correct me if I'm wrong, but I don't think ObjC's syntax
qualifies as true named arguments.

Yes. Objective-C inherits its syntax from Smalltalk, which just has
plain old boring standard positional arguments like every other
language, like C, like Ruby minus optional and splat arguments.

The only peculiarity is that the arguments get written between the
subroutine name instead of at the end. So, if you have a method named
foo:bar:baz: which takes three arguments, the way you would call it in
pretty much every other language is

foo:bar:baz:(1, 2, 3)

whereas in Smalltalk it's

foo: 1 bar: 2 baz: 3

This allows you to achieve some nice readability with clever method
naming, i.e. instead of

dictionary.add(1, 2) # which one's the key and which is the value?

you get

dictionary at: 1 put: 2.

But fundamentally, these are still positional arguments. I could, for
example, do this in Ruby:

dictionary.at_first_put_second(1, 2)
In MacRuby/ObjC, a method call like

foo(bar:1, baz:2)

Is just a way of saying something like

foo_with_bar_with_baz(1, 2)

More precisely, your example (roughly) translates to the following
snippet of Smalltalk:

temp := Dictionary new.
temp at: #bar put: 1. 'No dictionary literals in Smalltalk'

foo: temp baz: 2.

i.e. in Ruby:

send:)'foo:baz:', { :bar => 1 }, 2)

A better example would be

foo(1, bar: 2, baz: 3)

which translates to

foo: 1 bar: 2 baz: 3.

or

send:)'foo:bar:baz:', 1, 2, 3)
The method name and the argument names actually resolve to different
method bodies internally. In addition, the order is absolutely
crucial. The following two calls are not the same and will not end up
in the same target method:

foo(bar: 1, baz: 2)
foo(baz: 2, bar: 1)

Correct.

The only reason why you can say

condition ifTrue: [doThis] ifFalse: [doThat].
condition ifFalse: [doThat] ifTrue: [doThis].
condition ifTrue: [doThis]; ifFalse: [doThat].

is because TrueClass and FalseClass define four methods

ifTrue:ifFalse:
ifFalse:ifTrue:
ifTrue:
ifFalse:

With named arguments (and optional arguments), you would have just one
method:

def if(then=->{}, else=->{})

jwm
 
T

Thomas E Enebo

f duck typing. =A0Duck typing rules out method overloading, because paramet=
ers would have to have set types before you could have a signature. =A0Pres=
umably no-one is suggesting that we should have fixed typing in Ruby?
I think the original poster provided an example of overloading based on t=
he number of parameters but not their type.
Even restricting yourself to overloading by arity is a bit problematic in=
Ruby because the arity still has to be determined (in some cases) dynamica=
lly:
args =3D [1,2]
foo(*args) =A0 =A0 =A0# two arguments
args << 3
foo(*args) =A0 =A0 =A0# three arguments

Actually, arity of callsite is always calculated in Ruby to know if
you should throw an ArgumentError (3 for 0 specified sort of errors)
against the method you are calling. It seems like overloading based
on arity is not such a bad idea to me based on some of the common
arity parsing idioms people do by hand in the first few lines of their
methods. What implementing arity-based overloads would do is get rid
of most of this code we put at the top of methods and perform that
logic in the Ruby implementation itself (in MRI in C vs in Ruby).

Stylistically, I think the biggest issue is not realizing there are n
overloads and then implementing less than n overloads in an overridden
class.

-Tom

--=20
blog: http://blog.enebo.com=A0 =A0 =A0=A0 twitter: tom_enebo
mail: (e-mail address removed)
 
C

Charles Oliver Nutter

n Ruby because the arity still has to be determined (in some cases) dynamic=
ally:
Actually, arity of callsite is always calculated =C2=A0in Ruby to know if
you should throw an ArgumentError (3 for 0 specified sort of errors)
against the method you are calling. =C2=A0It seems like overloading based
on arity is not such a bad idea to me based on some of the common
arity parsing idioms people do by hand in the first few lines of their
methods. =C2=A0What implementing arity-based overloads would do is get ri= d
of most of this code we put at the top of methods and perform that
logic in the Ruby implementation itself (in MRI in C vs in Ruby).

I'd also add that in JRuby, if the source arity matches the target
(non-rest, non-optional) arity, we do no calculation at all (for
arities up to 3). So if Ruby supported multiple overloads, it would
basically just work like our core arity-split methods do right now and
automatically route to the correct body.
Stylistically, I think the biggest issue is not realizing there are n
overloads and then implementing less than n overloads in an overridden
class.

An *excellent* point. This bites overload-supporting static languages
very frequently too.

- Charlie
 
G

Gary Wright

Maybe I'm overlooking something but I wasn't suggesting that you don't need to calculate arity but instead was pointing out that you can't just calculate it at parse time but sometimes need to calculate it at call time.

foo(1,2) # the call arity can be computed at parse time
foo(*a) # the call arity must be computed at call time

So in an 'overload-based-on-arity' scheme there might be some optimization opportunities for some call sites but not for every call site.

Seems like you can get pretty far though with just a little meta-programming with no special language support:

module Arity
def overload(base)
define_method(base) do |*args|
argc = args.size
method_name = "#{base}_#{argc}"
if respond_to?(method_name)
send(method_name, *args)
else
raise ArgumentError, "wrong number of arguments (no method for #{argc} arguments)"
end
end
end
end

class A
extend Arity

overload :foo

def foo_0
puts "foo with no arguments"
end

def foo_1(arg1)
puts "foo with 1 argument: #{arg1.inspect}"
end

end

A.new.foo # dispatches to A#foo_0
A.new.foo(1) # dispatches to A#foo_1
A.new.foo(1,2) # ArgumentError


Gary Wright
 
T

Thomas E Enebo

in Ruby because the arity still has to be determined (in some cases) dynam=
ically:
Maybe I'm overlooking something but I wasn't suggesting that you don't ne=
ed to calculate arity but instead was pointing out that you can't just calc=
ulate it at parse time but sometimes need to calculate it at call time.
foo(1,2) =A0 =A0 # the call arity can be computed at parse time
foo(*a) =A0 =A0 =A0# the call arity must be computed at call time

So in an 'overload-based-on-arity' scheme there might be some optimizatio=
n opportunities for some call sites but not for every call site.

Yeah, that is certainly true in the general sense even in existing
Ruby semantics. In your first example, we know that it is always a
two-arg call (at parse time) so we can go through a two-arity call
path and not 'box' the call parameters into an array. In the second
case we might be able to know arity at parse time if we know 'a' is a
literal array. If we don't then the optimizations we can do get
more limited.

You are correct that adding arity to the mix would make the
optimizations more complex, but I think it depends on the case and
what you get by supporting it. For example, in the second case if we
made our callsite cache lookup the methods foo and cache all of them
at the site, then dispatch to the appropriate one, we would have quite
a bit more complicated callsite cache (since we would need to
invalidate it if any arity version changed), but this would dispatch
much faster than doing things like you are showing in the example
later in your email (actually much faster than any pure-Ruby logic for
arity resolution). In the case where there was only one arity it
would behave more or less like it does currently. The main change
would be on any same-named method we would need to invalidate. [Note:
This is just one way this could be done and callsite invalidation
would be about the same as what it is now since it would invalidate
based on name. We could do it name+arity. We could do it both ways
even (perhaps name+arity for first case and only name for second).

With all this said, it seems like a good idea to me, but OTOH no
feature is without its caveats. I am more worried about Ruby
programming style than performance and though it seems to pass a smell
test for me...I don't think I am in the 'yeah let's do it camp' yet.
I am probably in the 'someone should play with this' camp.
Seems like you can get pretty far though with just a little meta-programm=
ing with no special language support:
module Arity
=A0def overload(base)
=A0 =A0define_method(base) do |*args|
=A0 =A0 =A0argc =3D args.size
=A0 =A0 =A0method_name =3D "#{base}_#{argc}"
=A0 =A0 =A0if respond_to?(method_name)
=A0 =A0 =A0 =A0send(method_name, *args)
=A0 =A0 =A0else
=A0 =A0 =A0 =A0raise ArgumentError, "wrong number of arguments (no method= for #{argc} arguments)"
=A0 =A0 =A0end
=A0 =A0end
=A0end
end

class A
=A0extend Arity

=A0overload :foo

=A0def foo_0
=A0 =A0puts "foo with no arguments"
=A0end

=A0def foo_1(arg1)
=A0 =A0puts "foo with 1 argument: #{arg1.inspect}"
=A0end

end

A.new.foo =A0 =A0 =A0 =A0 =A0 # dispatches to A#foo_0
A.new.foo(1) =A0 =A0 =A0 =A0# dispatches to A#foo_1
A.new.foo(1,2) =A0 =A0 =A0# ArgumentError


Gary Wright



--=20
blog: http://blog.enebo.com=A0 =A0 =A0=A0 twitter: tom_enebo
mail: (e-mail address removed)
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,141
Messages
2,570,813
Members
47,357
Latest member
sitele8746

Latest Threads

Top