docs: Add section on users of PREEMPT_RT and Linux scheduling

2b-t · Nov 5, 2023 · 2566b3a · 2566b3a
1 parent bcd78d8
commit 2566b3a
Showing 1 changed file with 22 additions and 0 deletions.
diff --git a/doc/RealTimeLinux.md b/doc/RealTimeLinux.md
@@ -8,6 +8,8 @@ Author: [Tobit Flatscher](https://github.com/2b-t) (August 2021 - March 2023)
 
 Task scheduling on standard operating systems is to some extend non-deterministic, meaning one cannot give a guaranteed - mathematically provable - upper bound for the execution time. This is somewhat desired for most applications as this increases throughput but for any real-time system one wants to be able to give such an upper bound that a given task will never exceed. Rather than executing something as fast as possible the aim is to **execute tasks consistently**, in a deterministic fashion: What matters is the **worst case latency** rather than the average latency. There are different approaches for rendering (Linux) operating systems real-time capable. These will be discussed in the next section. In case you are interested in how the Linux kernel and scheduler works under the hood I recommend reading [this guide](https://wxdublin.gitbooks.io/deep-into-linux-and-beyond/content/index.html) written by someone who is much more knowledgable on this topic than I am. The topic also discusses load balancing on multi-core architectures.
 
+
+
 ## 2. Real-time Linux: Dual and single kernel approaches
 
 When talking about real-time kernels one differentiates between single kernel approaches, like [`PREEMPT_RT`](https://wiki.linuxfoundation.org/realtime/start), and [dual-kernel approaches](https://linuxgizmos.com/real-time-linux-explained/), such as [Xenomai](https://en.wikipedia.org/wiki/Xenomai). You can use real-time capable Dockers in combination with all of them to produce real-time capable systems but the approaches differ. Clearly this does not depend on the Docker itself but on the **underlying host system**, meaning you still have to properly configure the host system, likely re-compiling its kernel.
@@ -37,6 +39,26 @@ These can be combined with the feature of [control groups (`cgroups` for short)]
 
 **`PREEMPT_RT`** developed from `PREEMPT` and is a set of patches that aims at making the kernel fully preemptible, even in critical sections (`PREEMPT_RT_FULL`). For a more detailed explanation refer to [this Ubuntu in depth-guide](https://ubuntu.com/blog/real-time-kernel-technical). For this purpose e.g. [spinlocks are largely replaced by mutexes](https://wiki.linuxfoundation.org/realtime/documentation/technical_details/sleeping_spinlocks). This way there is no need for kernel space programming - instead on can use the standard C and Posix threading libraries. In mid 2021 Linux lead developer Linus Torvalds [merged 70 of the outstanding 220 patches](https://linutronix.de/news/The-PREEMPT_RT-Locking-Code-Is-Merged-For-Linux-5.15) into the Linux mainline. In the near future `PREEMPT_RT` should be available by default to the Linux community without needing to patch the system, guaranteeing also the maintenance of the patch. For a more detailed overview have a look at [this](https://bootlin.com/doc/training/preempt-rt/preempt-rt-slides.pdf) presentation as well as the Ubuntu introductory series (see [this webinar](https://ubuntu.com/engage/an-introduction-to-real-time-linux-part-i) as well as [this blog post](https://ubuntu.com/blog/real-time-kernel-technical)). One potential problem with this is that the kernel drivers are not necessarily developed with real-time constraints in mind.
 
+`PREEMPT_RT` is widely used in robotics:
+
+- [SpaceX](https://www.reddit.com/r/spacex/comments/gxb7j1/comment/ft6g3dg) runs it on their onboard computers
+- Robotic manufacturers like [Franka Emika](https://frankaemika.github.io/docs/installation_linux.html), [Universal Robots](https://github.com/UniversalRobots/Universal_Robots_ROS_Driver/blob/master/ur_robot_driver/doc/real_time.md) and [Toyota](https://robomechjournal.springeropen.com/articles/10.1186/s40648-019-0132-3) use it for the control computers of their robots. In particular quadrupeds such as the [MIT Mini Cheetah](https://dspace.mit.edu/bitstream/handle/1721.1/126619/IROS.pdf?sequence=2&isAllowed=y) are commonly using it.
+- [National Instruments (NI)](https://www.ni.com/en/shop/linux/introduction-to-ni-linux-real-time.html) uses it on a couple of their controllers
+- [LinuxCNC](https://www.linuxcnc.org/) (originally developed by NIST) uses it for numeical control of CNC machines
+- It is commonly used with the [nVidia Jetson](https://docs.nvidia.com/jetson/archives/r35.1/DeveloperGuide/text/SD/Kernel/KernelCustomization.html) platform
+
+It is important to note that installing the real-time patch only partially resolves the problem. There are [multiple **scheduling policies**](https://man7.org/linux/man-pages/man7/sched.7.html) available that will largely impact the real-time performance of the system. The scheduler is the kernel component that decides which thread should be executed by the CPU next. Each **thread** has its own **scheduling policy** as well as its **scheduling priority**. The scheduler maintains a list of all threads that should be run and their priority. First the scheduler will look at the priority and the scheduling policy will determine where this thread will be inserted in the list of threads with equal static priority and how it will be moved inside that list.
+
+The commonly available scheduling policies are:
+
+- `SCHED_OTHER`: [Completely fair scheduler](https://en.wikipedia.org/wiki/Completely_Fair_Scheduler), not real-time capable, generally the default scheduling policy
+- `SCHED_DEADLINE`: Earliest-deadline-first scheduling
+
+- **`SCHED_FIFO`**: First-in-first-out scheduling without time slicing. This is generally used for real-time applications
+- `SCHED_RR`: Enhancement of `SCHED_FIFO` with time slicing
+
+Check the section on user code to see how to set a scheduling policy from inside the code.
+
 ### 2.3 Performance comparison
 
 Which of two approaches is faster has not been completely settled and is still disputed. Most of the available literature on this topic claims that Xenomai is slightly faster. Jan Altenberg though claims that he could not replicate these studies: *"I figured out that most of the time PREEMPT_RT was poorly configured. So we brought in both a Xenomai expert and a PREEMPT_RT expert, and let them configure their own platforms.”* [Their tests](https://www.youtube.com/watch?v=BKkX9WASfpI) show that the maximum thread wake-up time is of similar magnitude while the average is slightly slower when comparing real-world scenarios in userspace.