Skip to main content

Lecture Notes: 17 Threads

··4 mins

Virtual Memory and Fork: A Review #

  • Draw the virtual memory diagram.
  • Allocate some shared memory.
  • Fork.
  • Point out that shared memory is shared, and non-shared writable memory soon isn’t.

Introducing Threads #

// create.c
#include <stdio.h>
#include <pthread.h>
#include <assert.h>

#define NN 10

void*
thread_main(void* thread_arg)
{
    int xx = *((int*)thread_arg);
    printf("thread %d: We're in a thread.\n", xx);
    *((int*)thread_arg) += xx;
    return thread_arg;
}

int
main(int _argc, char* _argv[])
{
    int nums[NN];
    int rv;
    pthread_t threads[NN];

    printf("main: Starting %d threads.\n", NN);

    for (int ii = 0; ii < NN; ++ii) {
        nums[ii] = ii;

        rv = pthread_create(&(threads[ii]), 0, thread_main, &(nums[ii]));
        assert(rv == 0);
    }

    printf("main: Started %d threads.\n", NN);

    for (int ii = 0; ii < NN; ++ii) {
        void* ret;
        rv = pthread_join(threads[ii], &ret);

        int yy = *((int*) ret);
        printf("main: Joined thread %d, rv = %d.\n", ii, yy);
    }

    printf("main: All threads joined.\n");

    return 0;
}
  • show create.c
  • Discuss how threads change the virtual memory story.

Threads vs. Processes #

  • We can spawn multiple processes with fork()
  • We can execute multiple threads within a single process.

Key difference: With threads, all memory is shared by default.

  • Advantage: Allocating shared memory post-spawn.
  • Disadvantage: 100% data races

History: #

Early days #

  • Before multi-processor systems parallelism didn’t matter.
  • Concurrency was still useful though:
    • Running multiple programs at once.
    • Having multiple logical tasks happening within one program.
  • On Unix style systems, processes were commonly used for concurrency.
  • On early Windows / Mac systems, concurrency within a program was represented by cooperative threading:
    • One thread could run at a time.
    • To let other threads run, explicitly call yield()
    • Some systems had an implicit yield when a thread blocked on I/O.
  • By the 90’s, systems had some sort of pre-emptive threading. This still didn’t work in parallel, but it would automatically schedule work between threads without explicit yield() calls.

Multiprocessors #

  • Multiprocessor servers became widely available in the mid 90’s.
  • Windows and Solaris had decent parallel thread support.
  • Linux didn’t get fully functional threads until like 2002, so fork() was heavily optimized instead.
  • Result: Threads are much more efficient than processes on Windows.
  • Threads under Linux evolved from fork(), so the performance difference is small.
  • Multi-core desktop processors showed up around 2005, and suddenly parallelism became nessisary for performance.

Conditon Variables #

  • Stack
  • Condvar stack

Other stuff #

These things are a usually bad idea compared to just using mutexes:

  • show atomic-sum101; compare to mutex and parallel versions
  • write it with pthread_spin_lock
  • write our own spinlock with atomic_compare_exchange_strong
    • need to google the docs; too new for a manpage
    • bad ideas include sched_yield
// sequential stack

#include <pthread.h>
#include <stdio.h>
#include <assert.h>
#include <unistd.h>

int stack[5];
int stptr = 0;

void
stack_push(int xx)
{
    stack[stptr++] = xx;
}

int
stack_pop()
{
    return stack[stptr--];
}

int
main(int _ac, char* _av[])
{
    for (int ii = 0; ii < 5; ++ii) {
        stack_push(ii);
    }

    for (int ii = 0; ii < 5; ++ii) {
        int yy = stack_pop();
        printf("%d\n", yy);
    }

    return 0;
}

Parallel stack with cond vars:

#include <pthread.h>
#include <stdio.h>
#include <assert.h>
#include <unistd.h>
#include <stdlib.h>

#define STACK_SIZE 5
int stack[STACK_SIZE];
int stptr = 0;
pthread_mutex_t mutex;
pthread_cond_t  condv;

void
stack_push(int xx)
{
    pthread_mutex_lock(&mutex);
    while (stptr >= STACK_SIZE) {
        pthread_cond_wait(&condv, &mutex);
    }
    stack[++stptr] = xx;
    pthread_cond_broadcast(&condv);
    pthread_mutex_unlock(&mutex);
}

int
stack_pop()
{
    pthread_mutex_lock(&mutex);
    while (stptr <= 0) {
        pthread_cond_wait(&condv, &mutex);
    }
    int yy = stack[stptr--];
    pthread_cond_broadcast(&condv);
    pthread_mutex_unlock(&mutex);
    return yy;
}

void*
producer_thread(void* arg)
{
    int nn = *((int*) arg);
    free(arg);

    for (int ii = 0; ii < nn; ++ii) {
        stack_push(ii);
    }
}

int
main(int _ac, char* _av[])
{
    pthread_t threads[2];
    pthread_mutex_init(&mutex, 0);
    pthread_cond_init(&condv, 0);

    for (int ii = 0; ii < 2; ++ii) {
        int* nn = malloc(sizeof(int));
        *nn = 1000;
        int rv = pthread_create(&(threads[ii]), 0, producer_thread, nn);
        assert(rv == 0);
    }

    while (1) {
        int yy = stack_pop();
        printf("%d\n", yy);
        usleep(10000);
    }

    return 0;
}