Arrays and Strings in C and C++
Essay Preview: Arrays and Strings in C and C++
Report this essay
Chapter 6: Arrays and Strings
Arrays and strings are closely related. In the abstract sense, a string is really just an array (possibly read-only) of characters. Most of the string-manipulation problems youll encounter are therefore based on your understanding of array data types, particularly in languages such as C and C++ in which strings and character arrays are essentially identical. Although other languages – especially the object-oriented ones such as C# and Java – consider strings and character arrays to be separate, theres always a way to convert a string to an array and vice versa. When the two are different, however, its very important to understand where and why they diverge. In addition, not all array problems will involve strings, so understanding how arrays work in the abstract and how theyre implemented by the language youre using is absolutely crucial to answering those array-focused problems.
Arrays
An array is a sequence of variables of the same type arranged contiguously in a block of memory. Because arrays play an important role in every major language used in commercial development, we assume youre at least somewhat familiar with their syntax and usage. With that in mind, this discussion focuses on the theory and application of arrays.
Like a linked list, an array provides an essentially linear form of storage, but its properties are significantly different. In a linked list, lookup is always an O(n) operation, but array lookup is O(1) as long as you know the index of the element you want. The provision regarding the index is important – if you know only the value, lookup is still O(n) in the worst case. For example, suppose you have an array of characters. Locating the sixth character is O(1), but locating the character with value w is O(n).
Tip Of course, multidimensional arrays are not exactly linear, but they are implemented as linear arrays of linear arrays (of linear arrays repeated as needed), so even multidimensional arrays are linear in each dimension.
The price for this improved lookup is significantly decreased efficiency in the insertion and deletion of data in the middle of the array. Because an array is essentially a block of contiguous memory, its not possible to create or eliminate storage between any two elements as it is with a linked list. Instead, you must physically move data within the array to make room for an insertion or to close the gap left by a deletion.
Arrays are not dynamic data structures: They have a finite, fixed number of elements. Memory must be allocated for every element in an array, even if only part of the array is used. Arrays are best used when you know how many elements you need to store before the program executes. When the program needs a variable amount of storage, the size of the array imposes an arbitrary limit on the amount of data that can be stored. Making the array large enough so that the program always operates below the limit doesnt solve the problem: Either memory will be wasted or there wont be enough memory to handle the largest data sizes possible.
Some languages do support dynamic arrays, which are arrays that can change size to efficiently store as much or as little data as necessary. (Note that in this case were referring to dynamic arrays as a language feature. Dynamic arrays can be simulated in code by other languages.) This discussion wont go into the details of implementing a dynamic array, but it is important to know that most dynamic array implementations use static arrays internally. A static array cannot be resized, so dynamic arrays are resized by allocating a new array of the appropriate size, copying every element from the old array into the new array, and freeing the old array. This is an expensive operation that must be done as infrequently as possible. (Many strings are implemented as dynamic arrays of characters.)
Each language handles arrays somewhat differently, giving each language a different set of array programming pitfalls. Lets take a look at array usage across several different languages.
C/C++
Despite the differences between C and C++, they are very similar in their treatment of arrays. In most cases, an array name is equivalent to a pointer constant to the first element of the array.[1] This means that you cant initialize the elements of one array with another array using a simple assignment.
Tip Pointers and constants can be confusing concepts separately; they are often nearly incomprehensible in combination. When we say pointer constant we mean a pointer declared like char *const chrPtr that cannot be altered to point at a different place in memory, but that can be used to change the contents of the memory it points to. This is not the same as the more-commonly seen constant pointer, declared like const char *chrPtr, which can be changed to point at a different memory location but cannot be used to change the contents of a memory location. If you find this confusing, youre certainly not the only one.
For example,
arrayA = arrayB; /* Compile error: arrayA is not an lvalue */
is interpreted as an attempt to make arrayA refer to the same area of memory as arrayB. If arrayA has been declared as an array, this causes a compile error because you cant change the memory location to which arrayA refers. To copy arrayB into arrayA, you have to write a loop that does an element-by-element assignment or use a library function such as memcpy that does the copying for you (and usually much more efficiently).
In C and C++, the compiler tracks only the location of arrays, not their size. The programmer is responsible for tracking array sizes, and there is no bounds checking on array accesses – the language wont complain if you store something in the twentieth element of a ten-element array. As you