Wide string initializer syntax

Derrick Coetzee · Sep 11, 2004

Looking through the C90 standard, it occurred to me that the possible
syntaxes for initializers, particularly of wchar_t arrays, are really
bizarre. Consider the following:

wchar_t s1[] = { L"abcdef" };
wchar_t* s2[] = { L"abcdef" };
wchar_t s3[][6] = { L"abcdef" };
wchar_t* s4[][6] = { L"abcdef" };

That's four different types initialized with exactly the same
initializer syntax, but it means four different things. In the first
case, a mutable buffer is being initialized, and the standard lets you
wrap the string intializing the buffer in braces for no apparent reason.
In the second case, an array containing one pointer to a literal string
is declared. In the third case, an array containing one initialized
mutable buffer is declared. In the fourth case, a 1 by 6 two-dimensional
array is declared, with s4[0][0] set to a literal string, and s4[0][1]
through s[0][5] set to a null pointer. I could continue with
larger-dimensional arrays right up to the environment limits.

Thoughts?

Nicolas Pavlidis · Sep 11, 2004

Derrick said:
Looking through the C90 standard, it occurred to me that the possible
syntaxes for initializers, particularly of wchar_t arrays, are really
bizarre. Consider the following:

Are you shure that wchar_t is a build-in TYpe for C? I don't know about
C99, but in C90 ther is defently no wchar_t build-in type!

Kind regards,
Nicolas

Derrick Coetzee · Sep 12, 2004

Nicolas said:
Are you shure that wchar_t is a build-in TYpe for C? I don't know about
C99, but in C90 ther is defently no wchar_t build-in type!

The wchar_t type is not built-in, but is required to be defined in the
standard header stddef.h. Wide string literals are always arrays of
whatever wchar_t is defined to be, even if the type's definition is not
available. The standard mentions wchar_t in several places.

Chris Torek · Sep 15, 2004

Looking through the C90 standard, it occurred to me that the possible
syntaxes for initializers, particularly of wchar_t arrays, are really
bizarre. Consider the following:

wchar_t s1[] = { L"abcdef" };
wchar_t* s2[] = { L"abcdef" };
wchar_t s3[][6] = { L"abcdef" };
wchar_t* s4[][6] = { L"abcdef" };

That's four different types initialized with exactly the same
initializer syntax, but it means four different things. ...

Indeed, this is all correct and true, but it is not special to wide
characters. Replace "wchar_t" with "char", and remove the uppercase
L's, and it is still all correct and true.

(Versions of gcc helpfully warn about incomplete/inconsistent
brace-bracketing of the fourth line, given the appropriate options.)

Derrick Coetzee · Sep 15, 2004

Chris said:
wchar_t s1[] = { L"abcdef" };
wchar_t* s2[] = { L"abcdef" };
wchar_t s3[][6] = { L"abcdef" };
wchar_t* s4[][6] = { L"abcdef" };

Click to expand...

Indeed, this is all correct and true, but it is not special to wide
characters. Replace "wchar_t" with "char", and remove the uppercase
L's, and it is still all correct and true.

Ah, you're right. It was the first one I was unsure of, but:

"An array of character type may be initialized by a character string
literal, optionally enclosed in braces."
"An array with element type compatible with wchar_t may be initialized
by a wide string literal, optionally enclosed in braces."
- C90, 6.5.7

I can't figure out what these optional braces are for. I suppose yet
another concession to existing implementations.

J. J. Farrell · Sep 15, 2004

Derrick Coetzee said:
"An array of character type may be initialized by a character string
literal, optionally enclosed in braces."
"An array with element type compatible with wchar_t may be initialized
by a wide string literal, optionally enclosed in braces."
- C90, 6.5.7

I can't figure out what these optional braces are for. I suppose yet
another concession to existing implementations.

Consistency. In general, initializers for aggregate type are enclosed
in braces.

Michael Wojcik · Sep 16, 2004

Consistency. In general, initializers for aggregate type are enclosed
in braces.

The braces are also optional for initializers for scalar types.

This consistency simplifies things for source-code generators, and
means that {0} is a valid initializer for any object type or any
array of unknown size (in a declaration where initialization is
permitted).

An empty initializer is invalid for an array with unspecified bound	0	Jul 1, 2020
string literal initializer	15	Jun 19, 2010
constant string as controlling expression in _Generic gives error	8	Dec 8, 2013
How to use ufixed when it involves multiplication a number of times?(VHDL question)	0	Aug 22, 2016
Is this String class properly implemented?	96	Apr 18, 2009
Strings as non-type template parameters	11	May 31, 2010
comp.lang.c FAQ list Table of Contents	0	Jan 12, 2008
build error on Linux - initializer element is not computable at load time	7	Oct 16, 2003

Wide string initializer syntax

Derrick Coetzee

Nicolas Pavlidis

Derrick Coetzee

Chris Torek

Derrick Coetzee

J. J. Farrell

Michael Wojcik

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads