GitHub - danbev/learning-c

Learning C

The sole purpose of this project is to learn C and experiment with various system calls.

Inheritance

A simple example of how struct inheritance works in C

$ make inherit
$ ./inherit

kqueue example.

A simple example that uses kqueue to monitor a file named test.txt for updates and deletion.

$ make kqueue
$ touch test.txt
$ ./kq

You can now update test.txt and notice that the program logs that this has happened. Deleting the file will cause the program to exit.

pthreads example

A simple example using pthreads

$ make pthreads
$ ./pthreads

variadic example

Example of using variable arguments to a function

$ make variadic
$ ./var

Header files on Mac

If you go searching for header files on Mac and look in /usr/include you might find that directory empty. You'll need to install the command line tools using:

$ xcode-select --install

Socket server and client

$ make server-socket
$ ./server-socket

$ make client-socket
$ ./client-socket

When the client disconnects there are zombies left on the server side:

$ ps -t ttys018 -o pid,ppid,tty,stat,args,wchan
  PID  PPID TTY      STAT ARGS            WCHAN
  15071   381 ttys018  Ss   -/bin/bash      -
  89040 15071 ttys018  S+   ./server-socket -
  89044 89040 ttys018  Z+   (server-socket) -
  89054 89040 ttys018  Z+   (server-socket) -
  89057 89040 ttys018  Z+   (server-socket) -
  89060 89040 ttys018  Z+   (server-socket) -
  89062 89040 ttys018  Z+   (server-socket) -
  89064 89040 ttys018  Z+   (server-socket) -
  89066 89040 ttys018  Z+   (server-socket) -

These need to be cleaned up properly. This output was captured before any signal handlers were added to the example.

Signals

Just to try out signal handling.

  $ make signals
  $ ./signals

From a different shell run:

  $ ps -ef |grep signal
  $ kill -USR1 pid
  $ kill -USR2 pid

c-ares

Just notes about building c-ares

autoreconf --install
./configure
make

attribute((constructor))

This can be used to have code run before the main function is entered. This works for executables and for dynamically linked libraries. For dynamically linked libraries the code is called when the library is loaded, and the equivalent destructor is called when the library gets unloaded (or when main has exited).

One case I came across was that I wanted to link my executable to a static library and still have the constructors/destructors called. But just linking the executable with the static library will not work. There is an example of this called ctor.

Comparing the symbols of both, we see the following:

$ nm ctor-static
0000000100000000 T __mh_execute_header
0000000100000f60 T _main
                 U _printf
                 U dyld_stub_binder
$ nm ctor
0000000100000000 T __mh_execute_header
0000000100000f30 T _main
0000000100000f60 T _pre_func
                 U _printf
                 U dyld_stub_binder

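ctor-static does not include pre_func. This is most probably because nothing references it: the linker only pulls in the members of a static library that are needed to resolve undefined symbols, so unreferenced symbols are not included in the output binary and the constructor never runs.

Is there a way to specify that everything from the static library be included? On macOS the linker's -force_load option does this: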

clang -o ctor-static -Wl,-force_load,libctor.a ctor.c

$ nm ctor-static
0000000100000000 T __mh_execute_header
0000000100000f50 T _main
0000000100000f30 T _pre_func
                 U _printf
                 U dyld_stub_binder

Socket address

A problem arises in how to declare the type of pointer that is passed. With ANSI C, the solution is simple: void * is the generic pointer type. But the socket functions predate ANSI C, and the solution chosen in 1982 was to define a generic socket address structure in the <sys/socket.h> header.

If we take a look at sockaddr in /usr/include/sys/socket.h we can find:

struct sockaddr {
  __uint8_t       sa_len;         /* total length */
  sa_family_t     sa_family;      /* [XSI] address family */
  char            sa_data[14];    /* [XSI] addr value (actually larger) */
};

$ lldb sockadd
(lldb) br s -n main
(lldb) expr sock_add
(sockaddr) $0 = (sa_len = '\0', sa_family = '\0', sa_data = "")

The socket functions are then defined as taking a pointer to the generic socket address structure. For example:

int bind(int, struct sockaddr *, socklen_t);

To call bind the caller must cast the socket address structure to this generic type:

struct sockaddr_in  serv;      /* IPv4 socket address structure */
...//set fields
bind(sockfd, (struct sockaddr *) &serv, sizeof(serv));

Now, take a look at sockaddr_in in netinet/in.h. This is the IPv4 socket address structure, commonly called an “Internet socket address structure”, and is defined by including the <netinet/in.h> header:

struct sockaddr_in {
  __uint8_t       sin_len;
  sa_family_t     sin_family;
  in_port_t       sin_port;
  struct  in_addr sin_addr;
  char            sin_zero[8];
};

So if you have a pointer to a sockaddr_in you could cast it to a sockaddr:

struct sockaddr* addr_;
struct sockaddr_in addr;
int port = 8888;
int r = uv_ip4_addr("127.0.0.1", port, &addr);
addr_ = (struct sockaddr*) &addr;
printf("addr_.sa_family:%d\n", addr_->sa_family);
printf("addr.sin_family:%d\n", addr.sin_family);

A new generic socket address structure was defined as part of the IPv6 sockets API, to overcome some of the shortcomings of the existing struct sockaddr:

struct sockaddr_storage {
  __uint8_t       ss_len;         /* address length */
  sa_family_t     ss_family;      /* [XSI] address family */
  // implementation specific elements below
  char            __ss_pad1[_SS_PAD1SIZE];
  __int64_t       __ss_align;     /* force structure storage alignment */
  char            __ss_pad2[_SS_PAD2SIZE];
};

The storage in the name reflects that it is large enough to store any socket address: IPv4, IPv6, an ISO address, a Unix path name, or anything else.

Pointer to pointer

Let's say you have a function that needs to reassign a pointer that it is passed. If the function only takes a pointer to the object, it only receives a copy of the pointer:

    int* org =  new int{10};
    func(org);

    void func(int* ptr) { ... }

Now, func will get a copy of org (pointing to the same int). But changing that copy to point somewhere else does not affect org. But if we pass a pointer to org then we can update what org points to, which changes our original pointer:

    void func(int** ptr) { ... }

 Pointer        Pointer         Variable
   a              b                int
+-------+      +--------+      +--------+
|address| ---->|address | ---->| value  |
+-------+      +--------+      +--------+

int c = 18;
int* b = &c;
int** a = &b;

std::cout << "c=" << c << '\n';
std::cout << "b=" << b << '\n';
std::cout << "a=" << a << '\n';

std::cout << "a value=" << *a << '\n';
std::cout << "b value=" << *b << '\n';

c=18
b=0x7ffee49c605c
a=0x7ffee49c6050

a value=0x7ffee4ebe05c
b value=18

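Notice that the value in a is the address of b, so dereferencing a (*a) gives the value stored in b, which is the address of c. Within a single run, the value printed for *a is therefore identical to the one printed for b.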

Now take the signature of main:

int main(int argc, char** argv) {

 argv           char*            
+-------+      +--------+      +--------+
|address| ---->|address | ---->|progname|
+-------+      +--------+      +--------+

$ lldb -- ptp one two three
(lldb) expr argv
(char **) $1 = 0x00007ffeefbfefd8
(lldb) memory read -f x -s 8 -c 8 0x00007ffeefbfefb8
0x7ffeefbfefb8: 0x00007ffeefbff288 0x00007ffeefbff2d3
0x7ffeefbfefc8: 0x00007ffeefbff2d7 0x00007ffeefbff2db

(lldb) expr *argv
(char *) $1 = 0x00007ffeefbff288 "/Users/danielbevenius/work/c++/learningc++11/src/fundamentals/pointers/ptp"

We can visualize this:



      char**                      char*            
+------------------+       +------------------+       +--------+
|0x00007ffeefbfefb8| ----> |0x00007ffeefbff288| ----> |progname|
+------------------+       +------------------+       +--------+
                           +------------------+       +--------+
                           |0x00007ffeefbff2d3| ----> | "one"  |
                           +------------------+       +--------+
                           +------------------+       +--------+
                           |0x00007ffeefbff2d7| ----> | "two"  |
                           +------------------+       +--------+
                           +------------------+       +--------+
                           |0x00007ffeefbff2db| ----> | "three"|
                           +------------------+       +--------+

So we can see that argv is a char** that points to a memory location containing 0x00007ffeefbff288. Remember that this memory location only contains pointers, and the type of pointers it stores is char*, which are 8 bytes in size on my machine. We can change argv to point to other char* values, but the nice thing is that we can simply increment it to go through all of them:

(lldb) expr *(argv+1)
(char *) $16 = 0x00007ffeefbff2d3 "one"
(lldb) expr *(argv+2)
(char *) $17 = 0x00007ffeefbff2d7 "two"
(lldb) expr *(argv+3)
(char *) $18 = 0x00007ffeefbff2db "three"

Looking at the memory layout again:

(lldb) memory read -f x -s 8 -c 4 0x00007ffeefbfefb8
0x7ffeefbfefb8: 0x00007ffeefbff288 0x00007ffeefbff2d3
0x7ffeefbfefc8: 0x00007ffeefbff2d7 0x00007ffeefbff2db

Now, argv points to the memory address 0x7ffeefbfefb8, which starts off containing the address 0x00007ffeefbff288. Using addition and subtraction we can change the address argv refers to. By using +1 we are saying that it should point to the current memory address plus 8 (as these are char* pointers of size 8 bytes):

(lldb) expr *(argv+1)
(char *) $28 = 0x00007ffeefbff2d3 "one"
So, we should be able to read `0x00007ffeefbff2d3`:
(lldb) memory read  -f C -s 1 -c 3 0x00007ffeefbff2d3
0x7ffeefbff2d3: one
(lldb) memory read  -f C -s 1 -c 3 0x00007ffeefbff2d7
0x7ffeefbff2d7: two
(lldb) memory read  -f C -s 1 -c 5 0x00007ffeefbff2db
0x7ffeefbff2db: three

writev

struct iovec {
  char* iov_base;  // Base address
  size_t iov_len;  // Number of bytes at iov_base
};

struct tag names

I've come across the following struct declarations using typedef:

typedef struct tag2 {
} T2;

Notice that tag2 is a tag name for the struct and T2 is the name of the typedef. Compare this with a plain typedef:

typedef int integer;

In this case the struct is added into the typedef declaration:

struct tag {
};
typedef struct tag T;

typedef struct tag2 {
} T2;

Both of the above add typedefs for structs. The tag is optional, but if you need to refer to the struct from within itself (for example, a node in a linked list that points to the next node), you need to give the struct a tag name.

Ringbuffer

perror

perror.c contains an example of using perror.

#include <errno.h>

The error codes can be found in /usr/include/sys/errno.h.

sys_nerr and sys_errlist are declared in stdio.h. sys_errlist is an array of character strings that provides a mapping from errno values to error messages, and sys_nerr is the number of entries in it.

$ man 3 perror

FILE

Standard I/O routines do not operate directly on file descriptors but use their own unique identifier (file pointer):

typedef FILE:

This is defined in /usr/include/_stdio.h. Standard I/O was originally written as macros. Not only FILE but all of the functions in the library were implemented as a set of macros. The style at the time, which remains common to this day, is to give macros all-caps names. As the C language progressed and standard I/O was ratified as an official part, most of the macros were reimplemented as proper functions and FILE became a typedef. But the uppercase remains. FILE is an opaque data type, so its implementation is hidden: we don't know how it is implemented and we only use it to pass into functions that accept it.

The function fdopen() converts an already open file descriptor (fd) to a stream.

fwrite

size_t fwrite (void *buf, size_t size, size_t nr, FILE *stream);
on my mac:
size_t fwrite(const void *restrict ptr, size_t size, size_t nitems, FILE *restrict stream);

Example can be found in stream.c.

restrict keyword

restrict is mainly used in pointer declarations as a type qualifier. It doesn't add any new functionality; it is only a way for the programmer to tell the compiler about an optimization it is allowed to make. When we use restrict with a pointer ptr, it tells the compiler that ptr is the only way to access the object it points to, so the compiler doesn't need to account for aliasing through other pointers.

API

The API acts only to ensure that if both pieces of software follow the API, they are source compatible; that is, that the user of the API will successfully compile against the implementation of the API.

ABI

ABIs are concerned with issues such as calling conventions, byte ordering, register use, system call invocation, linking, library behavior, and the binary object format. The calling convention, for example, defines how functions are invoked, how arguments are passed to functions, which registers are preserved and which are mangled, and how the caller retrieves the return value.

LD_LIBRARY_PATH, LIBRARY_PATH

LIBRARY_PATH is searched at link time.

LD_LIBRARY_PATH is searched when the program starts. On macOS this would instead be DYLD_LIBRARY_PATH as the dynamic linker is dyld.

$ man dyld

CMake

CMake is written in C/C++ and does not have any external dependencies, apart from the C++ compiler that is, and can be run on multiple architectures.

CMake has its own simple language, and one creates a CMakeLists.txt file which contains commands that are parsed and then run.

CMake has two phases. The first is the configure phase, where it first processes the CMakeCache.txt file, if one exists. This file stores the cache variables from the last run, which are read back in when CMake runs again.

After that it reads the top-level CMakeLists.txt, which is parsed by the CMake language parser. Additional files can be pulled in using the include or add_subdirectory commands.

So each command in CMakeLists.txt seems to be parsed into a cmCommand object. Take the project command for example:

project(cmake-example VERSION 0.1)

This would be matched with Source/cmProjectCommand.h:

#ifndef cmProjectCommand_h
#define cmProjectCommand_h

#include "cmConfigure.h" // IWYU pragma: keep

#include <string>
#include <vector>

class cmExecutionStatus;

bool cmProjectCommand(std::vector<std::string> const& args,
                      cmExecutionStatus& status);

#endif

Each of the CMake commands found in the file is executed by a command pattern object. The configure step then runs these commands.

After all of the code is executed, and all cache variable values have been computed, CMake has an in-memory representation of the project to be built. This will include all of the libraries, executables, custom commands, and all other information required to create the final build files for the selected generator. At this point, the CMakeCache.txt file is saved to disk for use in future runs of CMake.

The in-memory representation of the project is a collection of targets, which are simply things that may be built, such as libraries and executables. CMake also supports custom targets: users can define their inputs and outputs, and provide custom executables or scripts to be run at build time. CMake stores each target in a cmTarget object.

The result of each CMakeLists.txt file will be a cmMakefile object which contains multiple cmTargets:

cmMakefile
  - cmTargets[]

The next step is the generate phase where the build files are created. In my case this would be the generation of makefiles.

Building CMake with debugging symbols:

$ CXXFLAGS='-g' ./bootstrap --prefix=install_dir --parallel=8

We can then set our PATH variable and use this when running lldb.

There is a basic cmake example in cmake-example.

To configure cmake:

$ mkdir -p build
$ cd build
$ cmake ..
-- The C compiler identification is AppleClang 11.0.0.11000033
-- Check for working C compiler: /Users/danielbevenius/work/ccache/ccache
-- Check for working C compiler: /Users/danielbevenius/work/ccache/ccache -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Configuring done
-- Generating done
-- Build files have been written to: /Users/danielbevenius/work/c/learning-c/cmake-example/build

This will generate the build configuration in the build directory. This can also be done using cmake -H. -Bbuild.

We can now compile, staying in the build directory:

$ cmake --build .
Scanning dependencies of target hello
[ 50%] Building C object CMakeFiles/hello.dir/src/hello.c.o
[100%] Linking C executable hello
[100%] Built target hello

$ ./hello
cmake example...

cmake will create a Makefile by default on mac. We can list the targets using:

$ cmake --build . --target help
The following are some of the valid targets for this Makefile:
... all (the default if no target is provided)
... clean
... depend
... rebuild_cache
... edit_cache
... hello
... src/hello.o
... src/hello.i
... src/hello.s

So we can do:

$ make clean
$ make hello

or :

$ cmake --build . --target hello

Options

Options can be specified in CMakeLists.txt using:

option(CUSTOM_OPTION "A custom option to be passed using -DCUSTOM_OPTION=" FALSE)

Variables

Can be set using set:

set(SOME_VAR "bajja")

install

The CMake variable CMAKE_INSTALL_PREFIX is used to determine the root of where the files will be installed. If using cmake --install a custom installation directory can be given via --prefix argument.

ctest

Is the CMake test driver.

System calls

$ man -k . | grep '(2)'

dtruss

main only has a main function that returns 22:

$ sudo dtruss main
SYSCALL(args) 		 = return
open("/dev/dtracehelper\0", 0x2, 0xFFFFFFFFE4D91D00)		 = 3 0
ioctl(0x3, 0x80086804, 0x7FFEE4D91B10)		 = 0 0
close(0x3)		 = 0 0
access("/AppleInternal/XBS/.isChrooted\0", 0x0, 0x0)		 = -1 Err#2
bsdthread_register(0x7FFF76582400, 0x7FFF765823F0, 0x2000)		 = 1073742047 0
sysctlbyname(kern.bootargs, 0xD, 0x7FFEE4D90F60, 0x7FFEE4D90F50, 0x0)		 = 0 0
issetugid(0x0, 0x0, 0x0)		 = 0 0
ioctl(0x2, 0x4004667A, 0x7FFEE4D90764)		 = 0 0
mprotect(0x10AE73000, 0x1000, 0x0)		 = 0 0
mprotect(0x10AE7A000, 0x1000, 0x0)		 = 0 0
mprotect(0x10AE7B000, 0x1000, 0x0)		 = 0 0
mprotect(0x10AE82000, 0x1000, 0x0)		 = 0 0
mprotect(0x10AE71000, 0x90, 0x1)		 = 0 0
mprotect(0x10AE83000, 0x1000, 0x1)		 = 0 0
mprotect(0x10AE71000, 0x90, 0x3)		 = 0 0
mprotect(0x10AE71000, 0x90, 0x1)		 = 0 0
getentropy(0x7FFEE4D908B0, 0x20, 0x0)		 = 0 0
getpid(0x0, 0x0, 0x0)		 = 19135 0
stat64("/AppleInternal\0", 0x7FFEE4D913D0, 0x0)		 = -1 Err#2
csops(0x4ABF, 0x7, 0x7FFEE4D90F00)		 = -1 Err#22
proc_info(0x2, 0x4ABF, 0xD)		 = 64 0
csops(0x4ABF, 0x7, 0x7FFEE4D90740)		 = -1 Err#22

autotools

autoconf

Is responsible for translating configure.ac which is a mix of m4 and shell scripting and producing a configure shell script.

configure.ac is written in M4sh (based on both m4 and sh, hence the name). The syntax is very similar to sh but with autoconf macros thrown into the mix.

AS_IF

AS_IF([test], [true], [false])

This will be translated into:

if test; then
  true
else
  false
fi

If we have the need for if..then..elif..else:

AS_IF([test1], [true], [test2], [true2], [false])

Will be translated into:

if test1; then
  true
elif test2; then
  true2
else
  false
fi

We can also have case statements:

AS_CASE([$variable], [foo*], [run1], [bar*], [run2], [catchall])

case $variable in
  foo*) run1 ;;
  bar*) run2 ;;
  *) catchall ;;
esac

pkg-config

Is a tool that collects metadata about the libraries installed on the system and provides it to the user. Developers can ship pkg-config files with their package to make the API easier to adopt and consume. The information, which describes how to compile and link a program against the library, is stored in pkg-config files. These files have a .pc suffix and reside in specific locations known to the pkg-config tool.

As an example, when we build ngtcp2 we specify the following variable:

PKG_CONFIG_PATH=$PWD/../nghttp3/build/lib/pkgconfig

$ cat build/lib/pkgconfig/libnghttp3.pc
prefix=/Users/danielbevenius/work/nodejs/quic/nghttp3/build
exec_prefix=${prefix}
libdir=${exec_prefix}/lib
includedir=${prefix}/include

Name: libnghttp3
Description: nghttp3 library
URL: https://github.com/ngtcp2/nghttp3
Version: 0.1.0-DEV
Libs: -L${libdir} -lnghttp3
Cflags: -I${includedir}

This file is configured/generated in CMakeLists.txt:

set(prefix          "${CMAKE_INSTALL_PREFIX}")
set(exec_prefix     "${CMAKE_INSTALL_PREFIX}")
set(libdir          "${CMAKE_INSTALL_FULL_LIBDIR}")
set(includedir      "${CMAKE_INSTALL_FULL_INCLUDEDIR}")
set(VERSION         "${PACKAGE_VERSION}")
# For init scripts and systemd service file (in contrib/)
set(bindir          "${CMAKE_INSTALL_FULL_BINDIR}")
set(sbindir         "${CMAKE_INSTALL_FULL_SBINDIR}")
foreach(name
  lib/libnghttp3.pc
  lib/includes/nghttp3/version.h
)
  configure_file("${name}.in" "${name}" @ONLY)
endforeach()

$ pkg-config --libs --cflags --modversion ~/work/nodejs/quic/nghttp3/build/lib/pkgconfig/libnghttp3.pc
-I/Users/danielbevenius/work/nodejs/quic/nghttp3/build/include -L/Users/danielbevenius/work/nodejs/quic/nghttp3/build/lib -lnghttp3

It will additionally look in the colon-separated list of directories specified by the PKG_CONFIG_PATH environment variable.

User Datagram Protocol (UDP)

UDP Server
+---------+
|socket() |
+---------+
    ↓
+---------+
| bind()  |
+---------+
    ↓
+------------+
| recvfrom() |
+------------+

ssize_t recvfrom(int sockfd,
                 void* buff,
                 size_t nbytes,
                 int flags,
                 struct sockaddr* from,
                 socklen_t* addrlen);

The from struct will be filled in by the call and tells us who sent the datagram.

libev

Is an event loop where we register the events we are interested in, and libev manages the events and provides them to our program. This is done with an event loop which calls back into our program.

event-watchers are registered with libev.

Libev supports select, poll, the Linux-specific aio and epoll interfaces, and the BSD-specific kqueue. Notice that there is no Windows support. Node.js initially used libev, but then libuv was created to add Windows support and has grown since then.

Skip list

In a normal sorted linked list we don't have random access, and searching takes O(n) in the worst case. Note that with an array you do have random access and can do a binary search if it is sorted.

+---+     +---+     +---+     +---+
| 1 |---->| 2 |---->| 3 |---->| 4 |
+---+     +---+     +---+     +---+

How can we make this faster if we are only allowed to use linked lists?
Let's add another list:

+---+               +---+
| 1 |-------------->| 3 |
+---+               +---+
  ↓                   ↓
+---+     +---+     +---+     +---+
| 1 |---->| 2 |---->| 3 |---->| 4 |
+---+     +---+     +---+     +---+

So if we want to search for 3, instead of going through 1 and 2 we can go directly to 3 using the top list. Notice that the bottom list stores all the elements and that the top list only stores copies of some of them, and there are links between equal keys in the top and bottom lists (you can have more than two lists but I'm just showing two for now).

We start with the topmost list. While the next value in the current list is not greater than the key we are searching for, we keep moving to the next node in that list. When the next value would be greater than our key (or the list ends), we follow the link downward to the list below and continue from there.

The cost of a search will be about:

             bottom length
top length + -------------
              top length

A nice description on this is the subway system in New York where they have express lines and local lines. The local line stops at every stop but the express line skips some. So you can travel faster if you take the express line and switch to the local line.

warn unused return value

src/warn-unused.c contains an example of using the __warn_unused_result__ attribute. This can be controlled by defining a preprocessor macro on the command line:

$ clang -o warn warn-unused.c -D"WARN=FALSE"

Code coverage

$ cmake -DCODE_COVERAGE=ON -DCMAKE_BUILD_TYPE=Debug .
$ cmake --build . --target check
$ cd CMakeFiles
$ lcov --directory . --capture --output-file coverage.info
$ genhtml coverage.info

LLVM

Show the compiler tools used when invoking the compiler tool chain (clang):

$ clang -### -o static static.c
Apple clang version 11.0.0 (clang-1100.0.33.12)
Target: x86_64-apple-darwin19.0.0
Thread model: posix
InstalledDir: /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin

 "/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/clang" "-cc1" "-triple" "x86_64-apple-macosx10.15.0" "-Wdeprecated-objc-isa-usage" "-Werror=deprecated-objc-isa-usage" "-emit-obj" "-mrelax-all" "-disable-free" "-disable-llvm-verifier" "-discard-value-names" "-main-file-name" "static.c" "-mrelocation-model" "pic" "-pic-level" "2" "-mthread-model" "posix" "-mdisable-fp-elim" "-fno-strict-return" "-masm-verbose" "-munwind-tables" "-target-sdk-version=10.15" "-target-cpu" "penryn" "-dwarf-column-info" "-debugger-tuning=lldb" "-ggnu-pubnames" "-target-linker-version" "520" "-resource-dir" "/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/clang/11.0.0" "-isysroot" "/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk" "-I/usr/local/include" "-Wno-framework-include-private-from-public" "-Wno-atimport-in-framework-header" "-Wno-extra-semi-stmt" "-Wno-quoted-include-in-framework-header" "-fdebug-compilation-dir" "/Users/danielbevenius/work/c/learning-c" "-ferror-limit" "19" "-fmessage-length" "158" "-stack-protector" "1" "-fstack-check" "-mdarwin-stkchk-strong-link" "-fblocks" "-fencode-extended-block-signature" "-fregister-global-dtors-with-atexit" "-fobjc-runtime=macosx-10.15.0" "-fmax-type-align=16" "-fdiagnostics-show-option" "-fcolor-diagnostics" "-o" "/var/folders/8n/rmc4tl7j6nz__qg3bl0ptc680000gn/T/static-7b477e.o" "-x" "c" "static.c"

 "/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld" "-demangle" "-lto_library" "/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/libLTO.dylib" "-no_deduplicate" "-dynamic" "-arch" "x86_64" "-macosx_version_min" "10.15.0" "-syslibroot" "/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk" "-o" "static" "/var/folders/8n/rmc4tl7j6nz__qg3bl0ptc680000gn/T/static-7b477e.o" "-L/usr/local/lib" "-lSystem" "/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/lib/clang/11.0.0/lib/darwin/libclang_rt.osx.a"

Notice that clang is called using clang -cc1 followed by options. Also, note that the linker ld is called separately. This is pretty useful to have when troubleshooting linking errors, as it gives you an easy way to see the options passed to the linker.

clang

Is the official compiler frontend, but there are actually three things named clang:

the frontend (implemented in the Clang libraries)
the compiler driver (the clang command line tool)
the actual compiler, which is invoked using clang -cc1

show AST:

$ clang -cc1 -ast-dump hello.c

dump tokens:

$ clang -cc1 -dump-tokens hello.c
int 'int'	 [StartOfLine]	Loc=<hello.c:1:1>
identifier 'main'	 [LeadingSpace]	Loc=<hello.c:1:5>
l_paren '('		Loc=<hello.c:1:9>
int 'int'		Loc=<hello.c:1:10>
identifier 'argc'	 [LeadingSpace]	Loc=<hello.c:1:14>
comma ','		Loc=<hello.c:1:18>
char 'char'	 [LeadingSpace]	Loc=<hello.c:1:20>
star '*'		Loc=<hello.c:1:24>
star '*'		Loc=<hello.c:1:25>
identifier 'argv'	 [LeadingSpace]	Loc=<hello.c:1:27>
r_paren ')'		Loc=<hello.c:1:31>
l_brace '{'	 [LeadingSpace]	Loc=<hello.c:1:33>
int 'int'	 [StartOfLine] [LeadingSpace]	Loc=<hello.c:2:3>
identifier 'bajja'	 [LeadingSpace]	Loc=<hello.c:2:7>
equal '='	 [LeadingSpace]	Loc=<hello.c:2:13>
numeric_constant '10'	 [LeadingSpace]	Loc=<hello.c:2:15>
semi ';'		Loc=<hello.c:2:17>
r_brace '}'	 [StartOfLine]	Loc=<hello.c:3:1>
eof ''		Loc=<hello.c:3:2>

Intermediate Representation (IR)

cat sum.c:

int sum(int a, int b) {
  return a + b;
}

$ clang sum.c -emit-llvm -S -c -o sum.ll

sum.ll:

; ModuleID = 'sum.c'
source_filename = "sum.c"
target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-apple-macosx10.15.0"

; Function Attrs: noinline nounwind optnone ssp uwtable
define i32 @sum(i32, i32) #0 {
  %3 = alloca i32, align 4
  %4 = alloca i32, align 4
  store i32 %0, i32* %3, align 4
  store i32 %1, i32* %4, align 4
  %5 = load i32, i32* %3, align 4
  %6 = load i32, i32* %4, align 4
  %7 = add nsw i32 %5, %6
  ret i32 %7
}

attributes #0 = { noinline nounwind optnone ssp uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "darwin-stkchk-strong-link" "disable-tail-calls"="false" "less-precise-fpmad"="false" "min-legal-vector-width"="0" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "probe-stack"="___chkstk_darwin" "stack-protector-buffer-size"="8" "target-cpu"="penryn" "target-features"="+cx16,+fxsr,+mmx,+sahf,+sse,+sse2,+sse3,+sse4.1,+ssse3,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }

!llvm.module.flags = !{!0, !1, !2}
!llvm.ident = !{!3}

!0 = !{i32 2, !"SDK Version", [2 x i32] [i32 10, i32 15]}
!1 = !{i32 1, !"wchar_size", i32 4}
!2 = !{i32 7, !"PIC Level", i32 2}
!3 = !{!"Apple clang version 11.0.0 (clang-1100.0.33.12)"}

The content of an assembly or bitcode file is called an LLVM module. Each module contains a sequence of functions, and functions contain sequences of basic blocks, which contain sequences of instructions. A module can also have global variables, the target data layout etc.

define i32 @sum(i32, i32) #0 {

The #0 tag refers to the function attributes #0.

A local variable is one that starts with %:

  %3 = alloca i32, align 4
  %4 = alloca i32, align 4

So here we have a and b, which are being allocated on the stack as i32 ints, aligned to 4 bytes. The alloca instruction reserves space in the stack frame of the current function. %3 will hold a pointer to the stack slot. These are local variables, which are also called automatic variables, which is why the instruction name is alloca (allocate local variable). Recall that automatic means that they are deallocated automatically. So we have reserved space on the stack for two ints, and we have pointers to these locations. But there is nothing there yet (or just garbage).

  store i32 %0, i32* %3, align 4

The next instruction, store, stores the value of the first argument, a (%0), to the destination specified by the pointer in %3.

  %5 = load i32, i32* %3, align 4
  %6 = load i32, i32* %4, align 4
  %7 = add nsw i32 %5, %6

Next, the values are loaded from the stack into %5 and %6 in preparation for the add instruction. nsw stands for "no signed wrap", which tells LLVM it may assume the signed addition does not overflow. These extra loads and stores would not be needed if we compiled with optimizations turned on.

Each basic block has a single entry point and a single exit point. It will either return or jump to another basic block.

Static Single Assignment

Only one assignment of every variable in the program. Static because when you look at the text of the program there will only be a single assignment to each variable. So if a variable is reassigned, then after converting to SSA form there will be two variables, for example x1 and x2. If you have code that looks like this:

x = 10;
if (something) {
  x = 20;
}
add_function(x);

What value should the SSA format use for the function call?
What it does is something like this:

x3 = Φ(x1, x2)
add_function(x3);

Where Φ is phi, and is a function that merges the values of x1 and x2.

Static lib

There is an example in static-lib which is a very basic statically linked archive/library.

$ cd static-lib

First we can compile main into an object file.

$ gcc -c -o main.o main.c

Then compile the contents of the library, in this case only a single object file:

$ gcc -c -o simple.o simple.c

$ ar rc libdir/libsimple.a simple.o
$ ar t libdir/libsimple.a
simple.o

Now, if we want to link the executable we can use the following command:

$ gcc -o main main.o -L./libdir -lsimple

Notice what happens if you change the order so that the library comes before main.o:

$ gcc -L./libdir -lsimple -o main main.o
/usr/bin/ld: main.o: in function `main':
main.c:(.text+0x24): undefined reference to `doit'
collect2: error: ld returned 1 exit status

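So even though we have specified -L, the library path, and -l, the library name, we might still get this error. The linker processes its inputs from left to right and only pulls an object out of an archive if it resolves a symbol that is undefined at that point, so -lsimple has to come after main.o.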

Stack allocated

This is just to cement/clarify what actually happens when we call something stack allocated. Take a struct for example:

struct Something {
    int x;
    int y;
};

If we create an instance of this using:

struct Something s = {1, 2};

$ /usr/bin/clang -O0 -g -o stackalloc stackalloc.c
$ lldb stackalloc
(lldb) br s -n main
Breakpoint 1: where = stackalloc`main + 20 at stackalloc.c:7:22, address = 0x0000000000401124

Let's take a look at the code that is generated for main:

(lldb) disassemble --bytes
stackalloc`main:
    0x401110 <+0>:  55                       pushq  %rbp
    0x401111 <+1>:  48 89 e5                 movq   %rsp, %rbp
    0x401114 <+4>:  31 c0                    xorl   %eax, %eax
    0x401116 <+6>:  c7 45 fc 00 00 00 00     movl   $0x0, -0x4(%rbp)
    0x40111d <+13>: 89 7d f8                 movl   %edi, -0x8(%rbp)
    0x401120 <+16>: 48 89 75 f0              movq   %rsi, -0x10(%rbp)
->  0x401124 <+20>: 48 8b 0c 25 10 20 40 00  movq   0x402010, %rcx
    0x40112c <+28>: 48 89 4d e8              movq   %rcx, -0x18(%rbp)
    0x401130 <+32>: 5d                       popq   %rbp
    0x401131 <+33>: c3                       retq

First we have the function prologue followed by clearing eax (that is the xorl). Next, we store 0 in the first slot of the stack, followed by argc, and then argv (remember that 0x10 is hex, which is 16 in decimal).

(lldb) memory read -f x -c 1 -s 8 $rbp-8
0x7fffffffd2d8: 0x0000000000000001
(lldb) memory read -f p -c 1 -s 8 $rbp-16
0x7fffffffd2d0: 0x00007fffffffd3c8
(lldb) memory read -f p -c 1 -s 8 0x00007fffffffd3c8
0x7fffffffd3c8: 0x00007fffffffd744
(lldb) memory read -f s  0x00007fffffffd744
0x7fffffffd744: "/home/danielbevenius/work/c/learning-c/stackalloc"

So, that takes care of the local variables argc, and argv. Next we have:

->  0x401124 <+20>: 48 8b 0c 25 10 20 40 00  movq   0x402010, %rcx
    0x40112c <+28>: 48 89 4d e8              movq   %rcx, -0x18(%rbp)

So we are moving the 8 bytes at address 0x402010 into rcx and then storing them on the stack, which is our s variable. If we inspect this we find something interesting:

(lldb) register read rcx
     rcx = 0x0000000200000003

Notice that these are the values that our struct contains:

(lldb) memory read -f x -c 1 -s 4 $rbp-0x18
0x7fffffffd2c8: 0x00000001
(lldb) memory read -f x -c 1 -s 4 $rbp-0x14
0x7fffffffd2cc: 0x00000002

(lldb) memory read -f x -s 8 -c 1 0x402010
0x00402010: 0x0000000200000001

$ objdump -s -j .rodata stackalloc-ref

stackalloc-ref:     file format elf64-x86-64

Contents of section .rodata:
 402000 01000200 00000000 00000000 00000000  ................
 402010 01000000 02000000                    ........        
$

Notice that this memory location just contains the values 1 and 2, and that it is part of the read-only data (.rodata) in the executable. This section is mapped into the virtual address space when the executable is loaded, and the movq instructions above copy the values from there onto the stack. So these values can then be accessed using $rbp-0x18 and $rbp-0x14.

Heap allocated

If we instead allocate the same struct as in the previous example using malloc, the memory is allocated on the heap, a region of the process address space that is separate from both the stack and the .data section:

#include <stdlib.h>

struct Something {
    int x;
    int y;
};

int main(int argc, char** argv) {
    struct Something* s = malloc(sizeof(struct Something));
    s->x = 1;
    s->y = 2;
    return 0;
}

This results in the following assembly code for the main function:

(lldb) disassemble
heapalloc`main:
    0x401130 <+0>:  pushq  %rbp
    0x401131 <+1>:  movq   %rsp, %rbp
    0x401134 <+4>:  subq   $0x20, %rsp
    0x401138 <+8>:  movl   $0x0, -0x4(%rbp)
    0x40113f <+15>: movl   %edi, -0x8(%rbp)
    0x401142 <+18>: movq   %rsi, -0x10(%rbp)
->  0x401146 <+22>: movl   $0x8, %edi
    0x40114b <+27>: callq  0x401030                  ; symbol stub for: malloc
    0x401150 <+32>: xorl   %ecx, %ecx
    0x401152 <+34>: movq   %rax, -0x18(%rbp)
    0x401156 <+38>: movq   -0x18(%rbp), %rax
    0x40115a <+42>: movl   $0x1, (%rax)
    0x401160 <+48>: movq   -0x18(%rbp), %rax
    0x401164 <+52>: movl   $0x2, 0x4(%rax)
    0x40116b <+59>: movl   %ecx, %eax
    0x40116d <+61>: addq   $0x20, %rsp
    0x401171 <+65>: popq   %rbp
    0x401172 <+66>: retq

The first instructions are similar to the previous example. The first one that differs is moving 8 into edi, which is the size of the memory area to allocate. The return value from malloc will be in rax, which is then stored on the stack. Next we move the value 1 into the location pointed to by rax. Then the pointer returned from malloc is loaded from the stack into rax again, which is redundant and probably only done because this was compiled without optimisations. And then 2 is moved to offset 4 from the address in rax. Since this memory is not on the stack, as opposed to the stack allocated version where we had the values directly on the stack, the pointer stays valid after the function returns. We can pass it on to other functions, but we need to make sure we free the memory when finished with it.

One interesting question: we know that we can store an instance of a struct on the stack or in the heap. But we can also pass a reference to a stack allocated struct to a function. What does that look like?

struct Something {
    int x;
    int y;
};

void process(struct Something* s) {
    int x = s->x;
}

int main(int argc, char** argv) {
    struct Something s = {1, 2};
    process(&s);
    return 0;
}

(lldb) disassemble --bytes
stackalloc-ref`main:
    0x401130 <+0>:  55                       pushq  %rbp
    0x401131 <+1>:  48 89 e5                 movq   %rsp, %rbp
    0x401134 <+4>:  48 83 ec 20              subq   $0x20, %rsp
    0x401138 <+8>:  c7 45 fc 00 00 00 00     movl   $0x0, -0x4(%rbp)
    0x40113f <+15>: 89 7d f8                 movl   %edi, -0x8(%rbp)
    0x401142 <+18>: 48 89 75 f0              movq   %rsi, -0x10(%rbp)
->  0x401146 <+22>: 48 8b 04 25 10 20 40 00  movq   0x402010, %rax
    0x40114e <+30>: 48 89 45 e8              movq   %rax, -0x18(%rbp)
    0x401152 <+34>: 48 8d 7d e8              leaq   -0x18(%rbp), %rdi
    0x401156 <+38>: e8 b5 ff ff ff           callq  0x401110                  ; process at stackalloc-ref.c:6
    0x40115b <+43>: 31 c0                    xorl   %eax, %eax
    0x40115d <+45>: 48 83 c4 20              addq   $0x20, %rsp
    0x401161 <+49>: 5d                       popq   %rbp
    0x401162 <+50>: c3                       retq

Address Sanitizer (asan)

$ gcc -g3 -o asan -fsanitize=address asan.cc 
$ ./asan 
=================================================================
==65843==ERROR: AddressSanitizer: stack-buffer-overflow on address 0x7ffc4bd51022 at pc 0x7f5192d282ad bp 0x7ffc4bd50fd0 sp 0x7ffc4bd50778
WRITE of size 20 at 0x7ffc4bd51022 thread T0
    #0 0x7f5192d282ac  (/lib64/libasan.so.5+0x9c2ac)
    #1 0x4012d2 in main /home/danielbevenius/work/c/learning-c/asan.cc:9
    #2 0x7f5192aea1a2 in __libc_start_main (/lib64/libc.so.6+0x271a2)
    #3 0x4010dd in _start (/home/danielbevenius/work/c/learning-c/asan+0x4010dd)

Address 0x7ffc4bd51022 is located in stack of thread T0 at offset 50 in frame
    #0 0x4011a5 in main /home/danielbevenius/work/c/learning-c/asan.cc:5

  This frame has 2 object(s):
    [32, 50) 'mem' (line 7) <== Memory access at offset 50 overflows this variable
    [96, 116) 'line' (line 6)
HINT: this may be a false positive if your program uses some custom stack unwind mechanism, swapcontext or vfork
      (longjmp and C++ exceptions *are* supported)
SUMMARY: AddressSanitizer: stack-buffer-overflow (/lib64/libasan.so.5+0x9c2ac) 
Shadow bytes around the buggy address:
  0x1000097a21b0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 f1 f1
=>0x1000097a2200: f1 f1 00 00[02]f2 f2 f2 f2 f2 00 00 04 f3 f3 f3
  0x1000097a2210: f3 f3 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2220: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2230: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2240: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2250: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Shadow byte legend (one shadow byte represents 8 application bytes):
  Addressable:           00
  Partially addressable: 01 02 03 04 05 06 07 
  Heap left redzone:       fa
  Freed heap region:       fd
  Stack left redzone:      f1
  Stack mid redzone:       f2
  Stack right redzone:     f3
  Stack after return:      f5
  Stack use after scope:   f8
  Global redzone:          f9
  Global init order:       f6
  Poisoned by user:        f7
  Container overflow:      fc
  Array cookie:            ac
  Intra object redzone:    bb
  ASan internal:           fe
  Left alloca redzone:     ca
  Right alloca redzone:    cb
  Shadow gap:              cc
==65843==ABORTING

Notice that the shadow memory stores information about the memory that is available and allocated to the program. This is a different part of the memory, hence the different memory addresses (TODO: show how these are calculated and show the memory in lldb).

Shadow bytes around the buggy address:
  0x1000097a21b0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a21f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 f1 f1
=>0x1000097a2200: f1 f1 00 00[02]f2 f2 f2 f2 f2 00 00 04 f3 f3 f3
  0x1000097a2210: f3 f3 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2220: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2230: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2240: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x1000097a2250: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

Each 00 means that all 8 application bytes it covers are fully addressable.

00 = 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
f1 = 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000

When asan instruments a memory allocation it will reserve space before and after the memory being requested by the program. It uses this space (the redzones in the legend above) to detect memory corruption.

Notice the arrow and the brackets around one of the bytes:

  0x1000097a21f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 f1 f1
=>0x1000097a2200: f1 f1 00 00[02]f2 f2 f2 f2 f2 00 00 04 f3 f3 f3

If we look at the bytes before this we see that they are addressable by the application since they are 00. We have two of them, so that means 16 bytes, and [02] means that only the first two bytes of the next 8-byte group are addressable, which gives a total of 18 bytes. This matches the 'mem' object at [32, 50) in the report. But the program has written beyond that into the f2 area.

Let's take a look at how asan instruments this code:

$ objdump -Cd asan

asan:     file format elf64-x86-64
...

0000000000401196 <main>:
  401196:	55                   	push   %rbp
  401197:	48 89 e5             	mov    %rsp,%rbp
  40119a:	41 56                	push   %r14
  40119c:	41 55                	push   %r13
  40119e:	41 54                	push   %r12
  4011a0:	53                   	push   %rbx
  4011a1:	48 81 ec b0 00 00 00 	sub    $0xb0,%rsp
  4011a8:	89 bd 3c ff ff ff    	mov    %edi,-0xc4(%rbp)
  4011ae:	48 89 b5 30 ff ff ff 	mov    %rsi,-0xd0(%rbp)
  4011b5:	4c 8d a5 40 ff ff ff 	lea    -0xc0(%rbp),%r12
  4011bc:	4d 89 e6             	mov    %r12,%r14
  4011bf:	83 3d fa 2e 00 00 00 	cmpl   $0x0,0x2efa(%rip)        # 4040c0 <__asan_option_detect_stack_use_after_return>
  4011c6:	74 12                	je     4011da <main+0x44>
  4011c8:	bf a0 00 00 00       	mov    $0xa0,%edi
  4011cd:	e8 ae fe ff ff       	callq  401080 <__asan_stack_malloc_2@plt>
  4011d2:	48 85 c0             	test   %rax,%rax
  4011d5:	74 03                	je     4011da <main+0x44>
  4011d7:	49 89 c4             	mov    %rax,%r12
  4011da:	49 8d 84 24 a0 00 00 	lea    0xa0(%r12),%rax
  4011e1:	00 
  4011e2:	49 89 c5             	mov    %rax,%r13
  4011e5:	49 c7 04 24 b3 8a b5 	movq   $0x41b58ab3,(%r12)
  4011ec:	41 
  4011ed:	49 c7 44 24 08 20 20 	movq   $0x402020,0x8(%r12)
  4011f4:	40 00 
  4011f6:	49 c7 44 24 10 96 11 	movq   $0x401196,0x10(%r12)
  4011fd:	40 00 
  4011ff:	4c 89 e3             	mov    %r12,%rbx
  401202:	48 c1 eb 03          	shr    $0x3,%rbx
  401206:	c7 83 00 80 ff 7f f1 	movl   $0xf1f1f1f1,0x7fff8000(%rbx)
  40120d:	f1 f1 f1 
  401210:	c7 83 04 80 ff 7f 00 	movl   $0xf2020000,0x7fff8004(%rbx)
  401217:	00 02 f2 
  40121a:	c7 83 08 80 ff 7f f2 	movl   $0xf2f2f2f2,0x7fff8008(%rbx)
  401221:	f2 f2 f2 
  401224:	c7 83 0c 80 ff 7f 00 	movl   $0xf3040000,0x7fff800c(%rbx)
  40122b:	00 04 f3 
  40122e:	c7 83 10 80 ff 7f f3 	movl   $0xf3f3f3f3,0x7fff8010(%rbx)
  401235:	f3 f3 f3 
  401238:	49 8d 45 c0          	lea    -0x40(%r13),%rax
  40123c:	48 89 c2             	mov    %rax,%rdx
  40123f:	48 c1 ea 03          	shr    $0x3,%rdx
  401243:	48 81 c2 00 80 ff 7f 	add    $0x7fff8000,%rdx
  40124a:	0f b6 12             	movzbl (%rdx),%edx
  40124d:	84 d2                	test   %dl,%dl
  40124f:	0f 95 c1             	setne  %cl
  401252:	84 d2                	test   %dl,%dl
  401254:	0f 9e c2             	setle  %dl
  401257:	21 d1                	and    %edx,%ecx
  401259:	89 cf                	mov    %ecx,%edi
  40125b:	ba 14 00 00 00       	mov    $0x14,%edx
  401260:	48 83 ea 01          	sub    $0x1,%rdx
  401264:	48 8d 0c 10          	lea    (%rax,%rdx,1),%rcx
  401268:	48 89 ca             	mov    %rcx,%rdx
  40126b:	48 c1 ea 03          	shr    $0x3,%rdx
  40126f:	48 81 c2 00 80 ff 7f 	add    $0x7fff8000,%rdx
  401276:	0f b6 12             	movzbl (%rdx),%edx
  401279:	84 d2                	test   %dl,%dl
  40127b:	40 0f 95 c6          	setne  %sil
  40127f:	83 e1 07             	and    $0x7,%ecx
  401282:	38 d1                	cmp    %dl,%cl
  401284:	0f 9d c2             	setge  %dl
  401287:	21 f2                	and    %esi,%edx
  401289:	09 fa                	or     %edi,%edx
  40128b:	84 d2                	test   %dl,%dl
  40128d:	74 0d                	je     40129c <main+0x106>
  40128f:	be 14 00 00 00       	mov    $0x14,%esi
  401294:	48 89 c7             	mov    %rax,%rdi
  401297:	e8 04 fe ff ff       	callq  4010a0 <__asan_report_store_n@plt>
  40129c:	48 b8 53 6f 6d 65 74 	movabs $0x6e696874656d6f53,%rax
  4012a3:	68 69 6e 
  4012a6:	48 ba 67 20 73 6f 6d 	movabs $0x6874656d6f732067,%rdx
  4012ad:	65 74 68 
  4012b0:	49 89 45 c0          	mov    %rax,-0x40(%r13)
  4012b4:	49 89 55 c8          	mov    %rdx,-0x38(%r13)
  4012b8:	41 c7 45 d0 69 6e 67 	movl   $0x676e69,-0x30(%r13)
  4012bf:	00 
  4012c0:	49 8d 55 c0          	lea    -0x40(%r13),%rdx
  4012c4:	49 8d 45 80          	lea    -0x80(%r13),%rax
  4012c8:	48 89 d6             	mov    %rdx,%rsi
  4012cb:	48 89 c7             	mov    %rax,%rdi
  4012ce:	e8 7d fd ff ff       	callq  401050 <strcpy@plt>
  4012d3:	49 8d 45 80          	lea    -0x80(%r13),%rax
  4012d7:	48 89 c6             	mov    %rax,%rsi
  4012da:	bf 40 20 40 00       	mov    $0x402040,%edi
  4012df:	b8 00 00 00 00       	mov    $0x0,%eax
  4012e4:	e8 47 fd ff ff       	callq  401030 <printf@plt>
  4012e9:	b8 00 00 00 00       	mov    $0x0,%eax
  4012ee:	4d 39 e6             	cmp    %r12,%r14
  4012f1:	74 36                	je     401329 <main+0x193>
  4012f3:	49 c7 04 24 0e 36 e0 	movq   $0x45e0360e,(%r12)
  4012fa:	45 
  4012fb:	48 be f5 f5 f5 f5 f5 	movabs $0xf5f5f5f5f5f5f5f5,%rsi
  401302:	f5 f5 f5 
  401305:	48 bf f5 f5 f5 f5 f5 	movabs $0xf5f5f5f5f5f5f5f5,%rdi
  40130c:	f5 f5 f5 
  40130f:	48 89 b3 00 80 ff 7f 	mov    %rsi,0x7fff8000(%rbx)
  401316:	48 89 bb 08 80 ff 7f 	mov    %rdi,0x7fff8008(%rbx)
  40131d:	c7 83 10 80 ff 7f f5 	movl   $0xf5f5f5f5,0x7fff8010(%rbx)
  401324:	f5 f5 f5 
  401327:	eb 20                	jmp    401349 <main+0x1b3>
  401329:	48 c7 83 00 80 ff 7f 	movq   $0x0,0x7fff8000(%rbx)
  401330:	00 00 00 00 
  401334:	48 c7 83 08 80 ff 7f 	movq   $0x0,0x7fff8008(%rbx)
  40133b:	00 00 00 00 
  40133f:	c7 83 10 80 ff 7f 00 	movl   $0x0,0x7fff8010(%rbx)
  401346:	00 00 00 
  401349:	48 81 c4 b0 00 00 00 	add    $0xb0,%rsp
  401350:	5b                   	pop    %rbx
  401351:	41 5c                	pop    %r12
  401353:	41 5d                	pop    %r13
  401355:	41 5e                	pop    %r14
  401357:	5d                   	pop    %rbp
  401358:	c3                   	retq

Notice how the following code populates the shadow memory (the processor is little endian, so the bytes of each immediate are stored in reverse order):

  4011ff:	4c 89 e3             	mov    %r12,%rbx
  401202:	48 c1 eb 03          	shr    $0x3,%rbx
  401206:	c7 83 00 80 ff 7f f1 	movl   $0xf1f1f1f1,0x7fff8000(%rbx)
  40120d:	f1 f1 f1 
  401210:	c7 83 04 80 ff 7f 00 	movl   $0xf2020000,0x7fff8004(%rbx)
  401217:	00 02 f2 
  40121a:	c7 83 08 80 ff 7f f2 	movl   $0xf2f2f2f2,0x7fff8008(%rbx)
  401221:	f2 f2 f2 
  401224:	c7 83 0c 80 ff 7f 00 	movl   $0xf3040000,0x7fff800c(%rbx)
  40122b:	00 04 f3 
  40122e:	c7 83 10 80 ff 7f f3 	movl   $0xf3f3f3f3,0x7fff8010(%rbx)

Which matches the shadow memory report above:

  0x1000097a21f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 f1 f1
=>0x1000097a2200: f1 f1 00 00[02]f2 f2 f2 f2 f2 00 00 04 f3 f3 f3


HERE document with gcc

This can be nice when you don't need to keep the source file around:

gcc -o cap -lcap -xc - <<HERE
#include <sys/capability.h>
#include <stdio.h>
int main(int argc, char** argv) {
  printf("bajja\n");
  return 0;
}
HERE

Flexible Array Member (FAM)

This is a way to have a dynamically sized last member in a struct. It saves having to use a pointer member, which would require first allocating memory for the struct and then for the memory the pointer refers to. Instead only one memory allocation is needed using this method.

Example: fem.c

Stack layout

To get a clear visual of the stack, and to be able to print it out and inspect it when needed, the following example can be used:

   +-------------------------+ 0xffffffff
1GB|                         |
   |  Kernel space           |
   |                         |
   |-------------------------| 0xc0000000 [TASK_SIZE]
   |  User space             |
   |-------------------------|
   |  Stack segment          |
   |          ↓              |
   |-------------------------| esp (extended stack pointer, rsp on x86-64)
   |                         |
   |-------------------------|
   |  Memory Mapped Segment  |
   |          ↓              |
   |-------------------------|
3GB|                         |
   |                         |
   |                         |
   |                         |
   |                         | program break
   |-------------------------| [brk]
   |          ↑              |
   |  Heap segment           |
   |                         |
   |-------------------------| [start brk]
   |  BSS segment            |
   |                         |
   |-------------------------| [end data]
   |  Data segment           |
   |                         | [start data]
   |-------------------------| [end code]
   |  Text segment           | 0x08048000
   |                         |
   +-------------------------+ 0

So notice that the stack pointer rsp points to the top of the stack, that is the most recently pushed value, and that the stack grows from higher to lower addresses:

+-+
|f|                      Higher
|e|                        ↓
|d|                      Lower
|c|  <-- rsp
|b|
|a|
|9|
|8|
|7|
|6|
|5|
|4|
|3|
|2|
|1|
|0|
+-+

(lldb) memory read -f x -s 8 -c 10 -l 1 $rsp

When we issue this command we are asking for memory to be read starting at rsp, and for 10 values of 8 bytes each to be printed. So this will show memory that is on the stack and that is "reserved" by the program. Either values have been pushed onto the stack using push instructions, or rsp has been decremented to make room for new values. Note that a very simple function might not adjust rsp at all and just address its locals relative to rbp, and in that case rbp and rsp will probably point to the same address.

So we use $rsp + # to inspect values that are on the stack, that is, values that are above the stack pointer. And we use $rsp - # to see values that are below the stack pointer.

I/O Models

Normal I/O operations like the read/write system calls will block if there is no data to be read or if there is not enough space to hold the data to be written.

So normal I/O is synchronous in that it blocks on read/write. Note that file I/O is somewhat different since the kernel buffer cache is used to improve performance, meaning that a write() to disk returns as soon as the requested data has been transferred to the kernel buffer cache. Likewise a read() transfers data from the kernel buffer cache to a user buffer, and if the data is not in the cache the kernel puts the process to sleep while the disk read is done.

There are cases where we want to check if I/O is possible on a file descriptor without blocking, and also cases where we want to monitor multiple file descriptors at the same time to see if I/O is possible on any of them. Just to be clear, we are not performing any I/O operations like reading/writing using the file descriptors, just monitoring them to see if an I/O system call could be performed without blocking.

So why can't we just use nonblocking I/O, where we specify O_NONBLOCK and a system call that can't be completed immediately returns an error instead of blocking? In this case we have to poll to check if I/O is possible, and if we have many file descriptors we have to loop over them all and check each one (I guess with a system call for every one). Now, one could add a process or a thread per file descriptor to do this work, but that is expensive and one also has the issue of communication between the processes/threads.

One solution to this is I/O multiplexing, where a process can monitor multiple file descriptors at the same time. Examples of this are select() and poll().

Another option is signal driven I/O, where a process requests that the kernel send a signal when the specified file descriptor becomes available for read/write. The process is then notified with a signal when an I/O operation is possible. This performs better than select() and poll() when one has many file descriptors to monitor.

The Linux specific epoll API allows for monitoring many file descriptors, just like select() and poll(), and, like signal driven I/O, provides much better performance than those two.

Since there are many ways to implement/deal with the same things there are libraries like libevent which provide an abstract interface and the library can then choose the best solution for the operating system in use.

Level-triggered

A file descriptor is considered ready if it is possible to perform an I/O system call without blocking. select() and poll() are examples of level-triggered notification.

Edge-triggered

In this case the notification is given if there has been I/O activity, like new input to be read, on a file descriptor since it was last monitored. Signal-driven I/O is an example of edge-triggered notification.

So with level-triggered notification, if we are told that data is ready to be read but we don't actually read (consume) that data, the file descriptor is still considered ready the next time we check. With edge-triggered notification that is not the case: it will not be reported as ready again until new I/O activity occurs.

select

select() is the oldest of the system calls, so it has great portability, but it does not scale well. Example: select

poll

Like select(), poll() also has great portability on Unix systems. Example: poll

epoll

Linux specific. Example: epoll

Supports both level-triggered and edge-triggered notification.
