C/C++ – Difference Between Initializing and Zeroing an Array

arraysc++

In c (or maybe c++) , what's the difference between

char myarr[16]={0x00};

and

char myarr[16];
memset(myarr, '\0', sizeof(myarr));

edit: I ask this because in vc++ 2005 the result is the same..

edit more :
and

char myarr[16]={0x00,};

?

maybe can get more comprehensive answer and not ambiguous as some answers below refer to this kind of code,ie.
put comma just before closing curly braces. Also the result is the same in vc++ 2005.

Best Answer

The important difference is that the first default initializes the array in an element-specific manner: Pointers will receive a null pointer value, which doesn't need to be 0x00 (as in all-bits-zero), booleans will be false. If the element type is a class type that's not a so-called POD (plain old data-type), then you can only do the first one, because the second one only works for the simplest cases (where you don't have virtual functions, user defined constructors and so on). In contrast, the second way using the memset sets all elements of the array to all-bits-zero. That is not always that what you want. If your array has pointers for example, they won't be set to null-pointers necessarily.

The first will default initialize the elements of the array, except for the first one, which is set to 0 explicitly. If the array is local and on the stack (that is, not a static), the compiler internally often does a memset to clear the array out. If the array is non-local or static, the first version can be considerably more efficient. The compiler can put the initializers already, at compile time, into the generated assembler code, making it require no runtime code at all. Alternatively, the array can be laid out on a section that is automatically zero'd out (also for pointers, if they have a all-bits-zero representation) when the program starts in a fast manner (i.e page-wise).

The second does a memset explicitly over the whole array. Optimizing compilers will usually replace a memset for smaller regions with inline machine code that just loops using labels and branches.

_{Here is assembler-code generated for the first case. My gcc stuff isn't much optimized, so we got a real call to memset (16 bytes at the stack-top are always allocated, even if we got no locals. $n is a register number):}

void f(void) {
    int a[16] = { 42 };
}

sub     $29, $29, 88 ; create stack-frame, 88 bytes
stw     $31, $29, 84 ; save return address
add     $4, $29, 16  ; 1st argument is destination, the array.
add     $5, $0, 0    ; 2nd argument is value to fill
add     $6, $0, 64   ; 3rd argument is size to fill: 4byte * 16
jal     memset       ; call memset
add     $2, $0, 42   ; set first element, a[0], to 42
stw     $2, $29, 16  ;
ldw     $31, $29, 84 ; restore return address
add     $29, $29, 88 ; destroy stack-frame
jr      $31          ; return to caller

The gory details from the C++ Standard. The first case above will default-initialize remaining elements.

8.5:

To zero-initialize storage for an object of type T means:

if T is a scalar type, the storage is set to the value of 0 (zero) converted to T;

if T is a non-union class type, the storage for each nonstatic data member and each base-class subobject is zero-initialized;

if T is a union type, the storage for its first data member is zero-initialized;

if T is an array type, the storage for each element is zero-initialized;

if T is a reference type, no initialization is performed.

To default-initialize an object of type T means:

if T is a non-POD class type, the default constructor for T is called

if T is an array type, each element is default-initialized;

otherwise, the storage for the object is zero-initialized.

8.5.1:

If there are fewer initializers in the list than there are members in the aggregate, then each member not explicitly initialized shall be default-initialized (8.5).

Related Solutions

C++ vs C Array Initialization – Key Differences

No, because the characters "jgkl" from Statement A are used to initialize a, it does not create storage in the executable for a character string (other than the storage you created by declaring a). This declaration creates an array of characters in read-write memory which contain the bytes {'j','g','k','l','\0'}, but the string which was used to initialize it is otherwise not present in the executable result.

In Statement B, the string literal's address is used as an initializer. The variable char *b is a pointer stored in read-write memory. It points to the character string "jhdfjnfnsfnnkjdf". This string is present in your executable image in a segment often called ".sdata", meaning "static data." The string is usually stored in read-only memory, as allowed by the C standard.

That is one key difference between declaring an array of characters and a string constant: Even if you have a pointer to the string constant, you should not modify the contents.

Attempting to modify the string constant is "undefined behavior" according to ANSI C standard section 6.5.7 on initialization.

C++ Array Initialization with {0} and {0,} Explained

Actually, in C++, I personally recommend:

char myArray[MAX] = {};

They all do the same thing, but I like this one better in C++; it's the most succinct. (Unfortunately this isn't valid in C.)

By the way, do note that char myArray[MAX] = {1}; does not initialize all values to 1! It only initializes the first value to 1, and the rest to zero. Because of this, I recommend you don't write char myArray[MAX] = {0}; as it's a little bit misleading for some people, even though it works correctly.

Best Answer

Related Solutions

C++ vs C Array Initialization – Key Differences

C++ Array Initialization with {0} and {0,} Explained

Related Question