Low-Level Programming University

What is it?

This page is for beginners who want to be low-level programmers.

I’m inspired by google-interview-university. I’d like to share my experience and show a roadmap to becoming a low-level programmer because I have found that these skills are not as common as they once were. In addition, many students and beginners ask me how they could become low-level programmers and Linux kernel engineers.

This page cannot include every link/book/course. For example, this page introduces Arduino but there is not detail information about Arduino and embedded system. You should go further for yourself. You get a keyword “Arduino” on which you can start. So next step probably is googling Arduino, buying kit and do something for yourself, not collecting links or free books. Please remember this page is just a roadmap.

FYI, I have over 10 years of experience as a low-level programmer:

  • 80x86 Assembly programming
  • Hardware device with Atmel chip and firmware
  • C language system programming for Unix
  • Device driver in Linux
  • Linux kernel: page allocation
  • Linux kernel: block device driver and md module

What Is the Low-Level?

I classify low-level programming as programming that is very close to the machine, using a lower level programming language like C or assembly. This is in contrast to higher-level programming, typical of user-space applications, using high level languages (e.g. Python, Java).

Yes, systems programming is a very close concept to low-level programming. This page includes the hardware design and firmware development that is not included in system programming.

Finally, this page includes topics ranging from hardware components to the Linux kernel. That is a huge range of layers. A one page document can never cover the details of all the layers, so the aim of this document is to serve as a starting point for low-level programming.

Theory

There are two background theories to low-level programming:

  • Computer Architecture
  • Operating Systems

You can find many good classes on online universities, for instance, Coursera.org and edx.org. Theory is theory. I don’t think you should get A+ in the class, just understand the big picture in the class. You’ll get better and better with experience.

Let me introduce several books that I’ve read. They are commonly used for text book in the universities. If there is no class with those books in your university, it’s worth to spend some time for those.

  • Computer Architecture
    • Computer Architecture, Fifth Edition: A Quantitative Approach
    • Computer Systems: A Programmer’s Perspective
    • Computer Organization and Design, Fourth Edition: The Hardware/Software Interface
  • Operating Systems
    • The Magic Garden Explained: The Internals of UNIX System V Release 4 an Open Systems Design
    • The Design of the UNIX Operating System
    • Operating Systems: Internals and Design Principles by William Stallings

There is infinite list of good books. I don’t want to say that you should read many books. Just read one book carefully. Whenevery you learn a theory, implement simulation code of it. Implementing one thing is better than knowing one hundread theories.

Languages

Assembly

Choose one between x86 or ARM. No need to know both. It doesn’t matter to know assembly language. The essential is knowing the internal of CPU and computer. So you don’t need to practice the assembly of the latest CPU. Select 8086 or Corex-M.

  • 8086 assembly programming with emu8086
    • basic concepts of CPU and computer architecture
    • basic concepts of C programming language
  • 64bit assembly programming(translation in progress)
    • basic concepts of modern CPU and computer architecture
    • basic concepts of disassembling and debugging of C code
    • need help for translation
  • ARM Architecture Reference Manual, 2nd Edition
    • Complete reference on ARM programming
  • Computer Organization and Design
    • MIPS Edition
    • ARM Edition
    • Academic books that explain how every component of a computer work from the ground up.
    • Explains in details the different concepts that make up computer architecture.
    • They are not for targeted to becoming proficient in a specific assembly language.
    • The MIPS and ARM edition cover the same topics but by dissecting a different architecture.
    • Both editions contain examples in the x86 world

C language

There is no short-cut. Just read the entire book and solve all the exercises.

If you want to be expert of C programming, visit https://leetcode.com/. Good luck!

Applications

Hardware && Firmware

If you want to be an embedded systems engineer, it would be best to start from a simple hardware kit, rather than starting with the latest ARM chipset.

  • Arduino Start Kit
    • There are various series of Arduino but “Arduino Start Kit” has the most simple processor(Atmega328P) and guide book
    • Atmega328P has 8bit core that is the good to start “Digital circuit design” and “Firware development”.
    • You don’t need to know how to draw schematics and layout, and assemble the chips.
    • But you need to know how to read schematics and understand how the chips are connected.
    • Firmware developers should be able to read the schematics and figure out how to send data to the target device.
    • Follow the guide book!
  • 8086 manual
    • If you’re a beginner to x86 architecture, 8086 is also very good guide for processor architecture and 80x86 assembly
  • 80386 manual
    • Best guide for protected mode and paging machanism of 80x86 processor
    • Web version: https://pdos.csail.mit.edu/6.828/2011/readings/i386/toc.htm

At this point, you should be good to start the latest ARM or x86 processor.

  • https://www.raspberrypi.org/
  • https://beagleboard.org/
  • https://www.arduino.cc/en/ArduinoCertified/IntelEdison

For example, the Raspberry Pi board has a Cortex-A53 Processor that supports a 64-bit instruction set. This allows you to experience a modern processor architecture with rPi. Yes, you can buy it… but… what are you going to do with it? If you have no target project, you would be likely to throw the board into a drawer and forget it like other gadgets you may have bought before.

So, I recommend one project for you.

I’ve made a toy kernel that supports 64bit long mode, paging and very simple context switching. Making a toy kernel is good way to understand modern computer architecture and hardware control.

In fact, you have already the latest processor and the latest hardware devices. Your laptop! Your desktop! You already have all to start! You don’t need to buy anything. The qemu emulator can emulate the latest ARM processors and Intel processors. So everything you need is already on hand. There are so many toy kernel and documents you can refer to. Just install qemu emulator and make a tiny kernel that just boots and turns on paging, and prints some messages.

Other toy kernels:

  • https://littleosbook.github.io/
  • https://tuhdo.github.io/os01/

Linux kernel and device driver

You don’t need to make a complete operating system. Join the Linux community and participate in development.

Read carefully

  • Books: Read followings in order
    • The Design of the Unix Operating System
      • The basic concepts of Unix are applied into all operating system.
      • This book is very good to get the concepts of operating system.
    • Linux Device Drivers
      • Make all examples for yourself
    • Linux Kernel Development
      • Understand the design of Linux Kernel
    • Understanding the Linux Kernel
      • Read this book and the kernel source v2.6 at the same time
      • Never start with the latest version, v2.6 is enough!
      • Use qemu and gdb to run the kernel source line by line
        • http://stackoverflow.com/questions/11408041/how-to-debug-the-linux-kernel-with-gdb-and-qemu
        • https://github.com/gurugio/linuxdeveloptip/blob/master/qemu-gdb-kdump.md
      • Use busybox to make the simplest filesystem that takes only 1-second to boot
        • https://github.com/gurugio/linuxdeveloptip/blob/master/minikernelwithbusybox.md
  • The Eudyptula Challenge
    • This is like an awesome private teacher who guide you what to do.
    • If you don’t know what to do, just start this.
  • Block layer and device driver(translation in progress)
    • start from a simple block device driver example (Ramdisk) with multi-queue mode
    • go forward to block layer
    • need help for translation
  • md driver of Linux kernel(in progress)
    • how mdadm tool works and how it calls md driver
    • how md driver works
    • need help for translation

References

Check when you need something

Other application

Yes, you might not be interested in Linux or firmware. If so, you can find other applications:

  • Windows system programming & device driver
  • Security
  • Reverse engineering

I don’t have any knowledge about those applications. Please send me any information for beginners.

Future of low-level programming

I do not know the future, but I keep my eye on RUST.

  • https://hacks.mozilla.org/2016/11/rust-and-the-future-of-systems-programming/

If I could have one week free and alone, I would learn RUST. That is because RUST is the latest language with which I can develop Linux device driver.

IoT is new trend, so it’s worth to check what OSs are for IoT. ARM, Samsung and some companies has their own realtime OS but sadly many of them are close source. But Linux Foundation also has a solution: Zephyr

  • https://www.zephyrproject.org/

Typical cloud server has so many layers, for instance, host OS, kvm driver, qemu process, guest OS and service application. So container has been developed to provide light virtualization. In near future, a new concept of OS, so-called library OS or Unikernel, would be replace the typical stack of SW for virtualization.

  • http://unikernel.org/

Translation

Please send me the pull request if you’d like to translate this page. I’ll list it here.

Origin