Calling functions with the wrong parameters

Kenneth Brody · Jul 18, 2006

The recent thread on "query about main()" got me thinking...

As I recall, calling a function with the wrong parameters causes
undefined behavior.

(These all assume that no prototype of foo() is in scope.)

For example:

==== fileA.c

int foo(int a, double b)
{
...
}

==== fileB.c

...
x = foo("bar",3);

However, what about "extra" parameters to non-variadic functions,
such as:

x = foo(3,5.0,"bar");

Or, what if the function is defined with no parameters, as in:

int main(void)
{
...
}
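
For contrast, here is a minimal sketch of the opposite case, where a prototype of foo() is in scope. The split into "fileA" and "fileB" is only simulated in one translation unit, and the body of foo() and the helper name call_with_prototype() are made up for illustration. With the prototype visible, the int argument is converted to double, and calls such as foo("bar",3) or foo(3,5.0,"bar") would draw a diagnostic instead of silently invoking undefined behavior.

```c
#include <assert.h>

/* Prototype in scope: callers now know foo takes (int, double). */
int foo(int a, double b);

/* Hypothetical caller, standing in for fileB.c. */
int call_with_prototype(void)
{
    /* The int constant 5 is converted to double because the prototype
       is visible; without it, an int would be passed where foo expects
       a double, and the behavior would be undefined. */
    return foo(3, 5);
}

/* Hypothetical body, standing in for fileA.c. */
int foo(int a, double b)
{
    assert(b == 5.0);   /* the argument arrived as a double */
    return a + (int)b;
}
```

Calling call_with_prototype() from a small main() should be enough to try it out.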

--
+-------------------------+--------------------+-----------------------+
| Kenneth J. Brody | www.hvcomputer.com | #include |
| kenbrody/at\spamcop.net | www.fptech.com | <std_disclaimer.h> |
+-------------------------+--------------------+-----------------------+
Don't e-mail me at: <mailto:[email protected]>

pemo · Jul 18, 2006

What's the question - if it's /what will happen/, the answer's more
undefined behavior presumably? Or have I missed some subtlety in your
question?

Ben Pfaff · Jul 18, 2006

Kenneth Brody said:
As I recall, calling a function with the wrong parameters causes
undefined behavior.
[...]

However, what about "extra" parameters to non-variadic functions,
such as:

Still undefined.

Eric Sosman · Jul 18, 2006

Ben Pfaff wrote on 07/18/06 14:37:

As I recall, calling a function with the wrong parameters causes
undefined behavior.

[...]

However, what about "extra" parameters to non-variadic functions,
such as:

Still undefined.

And to answer the next question (I can feel it coming;
me rheumatism allus' gits a-cranky when one o' them li'l
questions is in the wind): No, this doesn't cast doubt on
the status of the main() function. (K.B.'s original post
mentioned that it was prompted by the recent main() thread.)

The Standard prescribes two forms for main(), and allows
an implementation to support additional forms. So it's
sometimes said that main() is unique among C functions in
having "multiple signatures." But this isn't quite right:
In any particular (hosted) program, main() has exactly one
signature, namely, the signature it is defined with. If you
try to call it recursively and provide an argument list that
doesn't match the definition, it's just like calling any
other function with the wrong argument list: undefined.

#include <stdio.h>
int main(int argc, char **argv) {
    static int (*func)(void) = NULL;
    if (func == NULL) {
        puts ("First time");
        func = (int (*)(void))main;
        return func(); /* undefined behavior */
    }
    puts ("Second time");
    return 0;
}

The magical thing about main() is that it can be "called
from the environment" with two (or more) different kinds of
argument lists. What this means is that the environment's
invocation of main() cannot be written in strictly-conforming
C: either it calls main() with the wrong arguments and gets
away with it, or it uses some outside-of-C means to discover
the argument list main() expects and then calls it properly.
But in any one program, there is nothing special about the
argument list of main().
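
As a rough sketch of where the line sits: the cast by itself is not the problem, the call through the mismatched type is. A function pointer may be converted to another function pointer type and back, and calling through the restored, matching type is well defined. The names generic_fn, add_offset and call_round_trip below are invented for this illustration, and add_offset merely stands in for a main-like function.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical "any function" pointer type used only for storage. */
typedef void (*generic_fn)(void);

/* Stand-in for a function with a main-like parameter list. */
int add_offset(int argc, char **argv)
{
    (void)argv;
    return argc + 40;
}

int call_round_trip(void)
{
    /* Converting to a different function pointer type is allowed... */
    generic_fn stored = (generic_fn)add_offset;

    /* ...as long as it is converted back before the call is made. */
    int (*restored)(int, char **) = (int (*)(int, char **))stored;
    assert(restored == add_offset);

    return restored(2, NULL);   /* matches the definition: well defined */
}
```

Calling stored() directly, without converting back first, would be the same kind of mismatch as the func() call in the program above.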

Chris Torek · Jul 18, 2006

And to answer the next question (I can feel it coming;
me rheumatism allus' gits a-cranky when one o' them li'l
questions is in the wind): No, this doesn't cast doubt on
the status of the main() function. (K.B.'s original post
mentioned that it was prompted by the recent main() thread.)

The Standard prescribes two forms for main(), and allows
an implementation to support additional forms. So it's
sometimes said that main() is unique among C functions in
having "multiple signatures." But this isn't quite right:
In any particular (hosted) program, main() has exactly one
signature, namely, the signature it is defined with. ...

And, perhaps not incidentally, here is how it works on one
system on which "return" from a non-variadic function pops
the arguments pushed by the caller: ... um, hang on a moment.

First, let me describe how arguments work, on this machine.

The machine has a conventional stack, and arguments are pushed
onto the stack in the usual way:

extern int foo(char *, int);
...
result = foo("this", 7);

compiles to:

    push $7          # push last argument first, and then
    push $.LC3       # first argument last
    call foo_        # call the function
    mov  r1, -4(sp)  # save result
    ...

and "foo" itself ends with:

ret $8

where the "$8" means "pop 8 bytes off the stack". This makes
function calls slightly smaller and faster, since all the callers
get to omit their "pop bytes off the stack" instructions.

Naturally, this method does not work with variadic functions, so
those end with "ret $0" and the caller, who knows how many bytes
were pushed, pops the pushed bytes.
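
To see why, here is a small sketch of a variadic function in standard C (sum_ints and the two demo callers are hypothetical names). The same compiled body serves both calls, yet the callers pass different amounts of argument data, so on a machine like the one described no single "ret $N" could be correct for it.

```c
#include <assert.h>
#include <stdarg.h>

/* Hypothetical helper: adds up `count` int arguments. */
int sum_ints(int count, ...)
{
    va_list ap;
    int total = 0;

    assert(count >= 0);
    va_start(ap, count);
    for (int i = 0; i < count; i++)
        total += va_arg(ap, int);   /* fetch the next int argument */
    va_end(ap);
    return total;
}

/* Two callers passing different numbers of arguments. */
int demo_two(void)  { return sum_ints(2, 10, 20); }
int demo_four(void) { return sum_ints(4, 1, 2, 3, 4); }
```

Only the call sites know how many arguments were supplied, which is why the cleanup has to happen there.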

Now, the problem lies in handling main(). The compiler sees
that main() is *not* a variadic function -- it does not have
", ..." as its last argument:

int main(void) {
    return 42;
}
/* or
int main(int argc, char **argv) {
    return 0;
}
*/

The first one compiles to:

mov $42, r1
ret $0

which pops nothing at all off the stack; but the second compiles
to:

mov $0, r1
ret $8

which pops 8 bytes off the stack. How can the startup code know
what to do?

There are three "obvious" solutions. The first one is that
the compiler recognizes main() and always uses "ret $0" (or always
uses "ret $8"), so that the startup code can be sure what happened.

Another is to have the startup code examine the value in the
"sp" register upon return, to see whether main() popped 8 bytes
or none (or save and restore it so that it works no matter what).

A third method is to have the compiler recognize "main", and compile
it to one of *two* "link names". Here, the two examples above
actually produce the following:

__0_main: .global __0_main
mov $42, r1
ret $0

and:

__2_main: .global __2_main
mov $0, r1
ret $8

Now, at link time, the compiler simply looks to see whether it
can find the symbol "__0_main" or the symbol "__2_main". Whichever
one it finds, it chooses the appropriate startup code:

void __0_startup() {
    extern int main(void);
    extern void __init_c_library();

    __init_c_library();
    exit(main());
}

or:

void __2_startup() {
    extern int main(int, char **);
    extern void __init_c_library();

    __init_c_library();
    ... find argc and argv ...
    exit(main(argc, argv));
}
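
The same idea can be imitated in portable C, purely as an illustration and not as what any real startup code does: record which form of entry point was found, and call it only through the matching pointer type. Every name here (main_kind, main_entry, run_entry, main0, main2 and the demo functions) is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Which of the two entry-point forms was "found at link time". */
enum main_kind { MAIN_NO_ARGS, MAIN_ARGC_ARGV };

struct main_entry {
    enum main_kind kind;
    union {
        int (*no_args)(void);
        int (*argc_argv)(int, char **);
    } fn;
};

/* Calls the entry point through the pointer type that matches it. */
int run_entry(const struct main_entry *e, int argc, char **argv)
{
    assert(e != NULL);
    switch (e->kind) {
    case MAIN_NO_ARGS:   return e->fn.no_args();
    case MAIN_ARGC_ARGV: return e->fn.argc_argv(argc, argv);
    }
    return -1;   /* unknown kind */
}

/* Stand-ins for the two forms of main(). */
int main0(void) { return 42; }
int main2(int argc, char **argv) { (void)argv; return argc; }

int demo_no_args(void)
{
    struct main_entry e;
    e.kind = MAIN_NO_ARGS;
    e.fn.no_args = main0;
    return run_entry(&e, 3, NULL);
}

int demo_argc_argv(void)
{
    struct main_entry e;
    e.kind = MAIN_ARGC_ARGV;
    e.fn.argc_argv = main2;
    return run_entry(&e, 3, NULL);
}
```

The tag plays the role of the two link names: it is the outside information that tells the caller which argument list the function was defined with.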

Which method does this compiler actually use, on this machine?
The answer is: who cares? It works, just as the Standard requires.
That is all you need to know.

pemo · Jul 19, 2006

Chris Torek wrote:

The machine has a conventional stack, and arguments are pushed
onto the stack in the usual way:

extern int foo(char *, int);
...
result = foo("this", 7);

compiles to:

push $7 # push last argument first, and then
push $.LC3 # first argument last
call foo_ # call the function
mov r1, -4(sp) # save result
...

and "foo" itself ends with:

ret $8

where the "$8" means "pop 8 bytes off the stack". This makes
function calls slightly smaller and faster, since all the callers
get to omit their "pop bytes off the stack" instructions.

<snip>

Interestingly this last bit is /encoded/ in the /PASCAL calling convention/
[recently mentioned on another thread here].

I seem to remember that the Windows 1 [could have been 2] API was re-written
so that each non-variadic function was declared PASCAL, e.g.,

HWND GetWindow(HWND, UINT);

was changed to

HWND PASCAL GetWindow(HWND, UINT);

Early Windows' C programs [the only language available other than assembly]
were able to save a few bytes per call [not having to generate any stack
adjustment code] - important in those days!

pemo · Jul 19, 2006

Now I come to think about it a bit more - the change from CDECL to PASCAL
was made before Windows 1 was released I think.

Kenneth Brody · Jul 19, 2006

pemo wrote:
[...]

Interestingly this last bit is /encoded/ in the /PASCAL calling
convention/ [recently mentioned on another thread here].

I seem to remember that the Windows 1 [could have been 2] API was
re-written so that each non-variadic function was declared PASCAL,
e.g.,
HWND GetWindow(HWND, UINT);

was changed to

HWND PASCAL GetWindow(HWND, UINT);

Early Windows' C programs [the only language available other than
assembly] were able to save a few bytes per call [not having to
generate any stack adjustment code] - important in those days!

Now I come to think about it a bit more - the change from CDECL to PASCAL
was made before Windows 1 was released I think.

And the early SDKs included Pascal examples.
