The Data World of C Language: A Deep Dive from Basic Types to Memory Layout

I am Feri. In embedded development, the choice of data types directly affects memory usage and runtime efficiency. The power of C language comes from its precise control over data—this article will guide you through the surface of data to understand the underlying computer logic behind types.

1. Data Types: The “Language Rules” for Computers to Understand the World

The C language maps real-world information into binary space through a strict type system. Mastering data types is akin to mastering the grammar rules for conversing with computers.

1.1 Basic Data Types: The Building Blocks of Data Structures

🔥 Integer Family: Precise Representation of Values

Type	Keyword	Byte Size (32-bit)	Value Range (Signed)	Typical Use Cases
Short Integer	`<span>short</span>`	2	-32768 ~ 32767	Counters, small data storage
Integer	`<span>int</span>`	4	-2147483648 ~ 2147483647	General integer operations
Long Integer	`<span>long</span>`	4/8*	At least the same length as`<span>int</span>`, usually 8 bytes on 64-bit systems	Large integer calculations (e.g., file sizes)
Long Long Integer	`<span>long long</span>`	8	-9223372036854775808 ~ 9223372036854775807	High-precision numerical processing

⚠️ Note: The C standard only specifies thatshort <= int <= long <= long long, the specific byte size is determined by the compiler (can be obtained usingsizeof()).

🧮 Floating Point: Approximate Representation of Real Numbers

Single Precisionfloat (4 bytes): Effective digits 6-7, suitable for graphical rendering coordinate calculations
Double Precisiondouble (8 bytes): Effective digits 15-16, preferred for scientific calculations (e.g., physical formula derivation)
Long Double Precisionlong double (8/16 bytes): Ultra-high precision scenarios (e.g., financial calculations, cryptography)

float pi = 3.1415926;  // Actual storage precision only up to the 6th digit, subsequent digits may be distorted

double precisePi = 3.141592653589793;  // Double precision can retain 15 significant digits

📖 Character Type: Binary Encoding of Text

char is essentially a 1-byte integer, storing ASCII codes (0-127) or extended character sets (e.g., GBK’s -128~127)
Signed signed char (default, supports negative numbers) and unsigned char (0-255, used for byte operations)

char ch = 'A';        // Stores ASCII code 65 (decimal)

unsigned char byte = 0xFF;  // Represents 255, commonly used for network data transmission

1.2 Constructed Data Types: Assembly Solutions for Complex Data

🧱 Arrays: Continuous Memory Blocks of the Same Type

int scores[5] = {85, 90, 95};  // Defines an integer array of length 5, uninitialized elements are random values

char name[] = "Feri";  // Automatically calculates length as 5 (including the terminating \0)

Memory addresses are continuous, accessed via index[index]<code> (index starts from 0)
The array name is the address of the first element, which can be manipulated via pointers:int *p = scores; p[0] == scores[0]

🧩 Structures: Custom Data Containers

struct Student {  // Define a student structure
    char name[20];
    int age;
    float score;
};

struct Student tom = {"Tom", 18, 89.5};  // Initialize structure variable

Allows combinations of different types of data, memory allocated in the order of members
Access members via. (tom.score), pointer access uses-> (struct Student *p = &tom; p->age)

🔄 Unions: The Magic of Memory Reuse

union Data {  // Union members share the same memory segment
    int num;
    char ch;
    float f;
};

union Data d;
d.num = 100;    // At this point, ch and f's values are meaningless
d.ch = 'A';     // At this point, num and f's values are overwritten

Memory size equals the size of the largest member (saves space at the cost of type safety)
Suitable for scenarios where only one type of data is used at a time (e.g., variable fields in protocol parsing)

📌 Enumerations: A Collection of Named Constants

enum Color { RED, GREEN, BLUE = 5 };  // Enumeration members start from 0 by default, BLUE is 5

enum Color favorite = GREEN;  // favorite's value is 1

Enhances code readability, avoids magic numbers (e.g., using RED instead of 0)
Essentially integers, can participate in arithmetic operations (not recommended to misuse)

1.3 Pointer Types: Manipulators of Memory Addresses

int num = 10;
int *ptr = &amp;num;  // Pointer ptr stores the address of num (e.g., 0x7ffd5f8e4a20)
*ptr = 20;       // Modify the value of num through dereferencing (num becomes 20)

Pointers are the soul of C language, enabling dynamic memory allocation (malloc), array operations, function parameter passing, and other core functionalities
Beware of dangling pointer risks: uninitialized pointers (int *p; *p = 10; will lead to undefined behavior)

1.4 Void Type: The Universal Type’s Transfer Station

No Return Value Functions:void printHello() { printf("Hello\n"); } (no need forreturn statement)
Generic Pointers:void *buffer = malloc(1024); (can point to any type, requires type casting when used)
No Parameter Declaration:int main(void) (explicitly declares no parameters, more in line with modern C standards)

2. Variables: Dynamic Containers for Data

2.1 Variable Lifecycle and Scope

🌐 Global Variables: Always Present During Program Execution

int globalVar = 10;  // Defined outside all functions
void func() {
    globalVar = 20;  // Correct, global variable scope extends from definition to end of file
}

Disadvantage: Breaks encapsulation, prone to race conditions in multi-threaded environments
Recommendation: Use only for data that must be shared (e.g., configuration parameters)

🏢 Local Variables: Temporary Storage in Function Stack Frames

void calculate() {
    int localVar = 0;  // Visible only within the calculate function
    for (int i=0; i&lt;10; i++) {  // C99 supports variable definition within for loop (i's scope is limited to the loop body)
        localVar += i;
    }
}  // localVar and i are destroyed after function ends

Advantage: Clear scope, avoids naming conflicts
Note: Uninitialized local variables have random values (commonly known as “garbage values”), which may lead to logical errors

2.2 Variable Naming: The First Line of Defense for Code Readability

Rules:

Composed of letters, numbers, and underscores, the first character cannot be a number
Case-sensitive (Count and count are different variables)
Keywords cannot be used (e.g.,if, register, sizeof)

Conventions:

Camel case:studentAge, maxScore
Hungarian notation (commonly used in embedded systems):u8_t age (u8_t represents unsigned 8-bit integer)
Avoid single-letter variables (except for loop indicesi, j, etc.)

3. Constants: The Immutable Foundation of Data

3.1 Literal Constants: Fixed Values Written Directly

📍 Three Representations of Integers

Decimal:123
Octal (starts with 0):0173 (equals decimal 123)
Hexadecimal (starts with 0x):0x7B (equals decimal 123)

🌐 The Essential Difference Between Strings and Characters

char ch = 'A';       // 1 byte, stores ASCII code 65
char str[] = "A";    // 2 bytes, stores 65 and the terminating \0 (ASCII code 0)

The string automatically adds a\0 at the end, which is fundamental for C language text processing

3.2 Symbolic Constants: Aliases that Give Meaning to Values

✨ `#define` Preprocessor Definitions

#define PI 3.141592  // Macro definition, text replacement during preprocessing
#define MAX_SIZE 100

Advantage: Facilitates unified modification (e.g., changing PI precision requires only one change)
Disadvantage: No type checking, may lead to macro expansion errors (recommended to useconst instead)

🛡️ `const` Read-only Variables

const int MAX_SCORE = 100;  // Define a constant, cannot be modified
MAX_SCORE = 200;  // Compilation error!

Has type information, safer (C99 standard supports)
Scope follows variable rules (local/global constants)

4. The Golden Rules for Choosing Data Types

Principle of Sufficiency: Useshort if possible, avoid usingint to save limited RAM in embedded devices
Precision Matching: Usedouble for financial calculations, usefloat for interface coordinates
Avoid Implicit Conversions: Mixingint andfloat in operations may lose precision, use explicit type casting:(float)age
Pointers are Addresses: Ensure pointers point to valid memory before operating on them (check ifmalloc returnsNULL)

Data types are the bridge for C language to communicate with hardware. When you definechar ch = 'F', the computer is inscribing your programming mark in memory with the binary number 65 (01000110). In the next article, we will delve into operators and expressions, learning how to use these data to construct complex logical structures. Follow me to unlock the underlying control power of C language!

// Philosophical Reflection on Data Types:
typedef struct {
    char name[20];
    int experience;
    void (*teach)(void);  // Function pointer, points to teaching method
} Programmer;

Programmer feri = {"Feri", 12, teachC};  // Define a programmer using structure

A Deep Dive into C Language: From Basic Types to Memory Layout

The Data World of C Language: A Deep Dive from Basic Types to Memory Layout

1. Data Types: The “Language Rules” for Computers to Understand the World

1.1 Basic Data Types: The Building Blocks of Data Structures

🔥 Integer Family: Precise Representation of Values

🧮 Floating Point: Approximate Representation of Real Numbers

📖 Character Type: Binary Encoding of Text

1.2 Constructed Data Types: Assembly Solutions for Complex Data

🧱 Arrays: Continuous Memory Blocks of the Same Type

🧩 Structures: Custom Data Containers

🔄 Unions: The Magic of Memory Reuse

📌 Enumerations: A Collection of Named Constants

1.3 Pointer Types: Manipulators of Memory Addresses

1.4 Void Type: The Universal Type’s Transfer Station

2. Variables: Dynamic Containers for Data

2.1 Variable Lifecycle and Scope

🌐 Global Variables: Always Present During Program Execution

🏢 Local Variables: Temporary Storage in Function Stack Frames

2.2 Variable Naming: The First Line of Defense for Code Readability

3. Constants: The Immutable Foundation of Data

3.1 Literal Constants: Fixed Values Written Directly

📍 Three Representations of Integers

🌐 The Essential Difference Between Strings and Characters

3.2 Symbolic Constants: Aliases that Give Meaning to Values

✨ `<span>#define</span>` Preprocessor Definitions

🛡️ `<span>const</span>` Read-only Variables

4. The Golden Rules for Choosing Data Types

Leave a Comment Cancel reply

The Data World of C Language: A Deep Dive from Basic Types to Memory Layout

1. Data Types: The “Language Rules” for Computers to Understand the World

1.1 Basic Data Types: The Building Blocks of Data Structures

🔥 Integer Family: Precise Representation of Values

🧮 Floating Point: Approximate Representation of Real Numbers

📖 Character Type: Binary Encoding of Text

1.2 Constructed Data Types: Assembly Solutions for Complex Data

🧱 Arrays: Continuous Memory Blocks of the Same Type

🧩 Structures: Custom Data Containers

🔄 Unions: The Magic of Memory Reuse

📌 Enumerations: A Collection of Named Constants

1.3 Pointer Types: Manipulators of Memory Addresses

1.4 Void Type: The Universal Type’s Transfer Station

2. Variables: Dynamic Containers for Data

2.1 Variable Lifecycle and Scope

🌐 Global Variables: Always Present During Program Execution

🏢 Local Variables: Temporary Storage in Function Stack Frames

2.2 Variable Naming: The First Line of Defense for Code Readability

3. Constants: The Immutable Foundation of Data

3.1 Literal Constants: Fixed Values Written Directly

📍 Three Representations of Integers

🌐 The Essential Difference Between Strings and Characters

3.2 Symbolic Constants: Aliases that Give Meaning to Values

✨ <span>#define</span> Preprocessor Definitions

🛡️ <span>const</span> Read-only Variables

4. The Golden Rules for Choosing Data Types

Related posts

Leave a Comment Cancel reply

✨ `<span>#define</span>` Preprocessor Definitions

🛡️ `<span>const</span>` Read-only Variables