Deterministic simulation in Linux user-space

Hi all!

I’m working on quite a large embedded project using FreeRTOS: somewhere around 150 kLOC and ~100 tasks. We are looking into setting up a “Software in the Loop” test framework for integration and regression tests. The obvious approach is to port FreeRTOS to Linux and forward the microcontroller-specific calls to virtual devices.

I saw there is a POSIX port of FreeRTOS, however, we have a few test cases where we need sub-tick-accurate simulation of peripherals, and we have other cases where tasks strictly depend on each other’s priorities. I don’t think the POSIX port is sufficiently deterministic for what we need.

This led me to wonder: has anyone written a port that runs FreeRTOS purely in user-space? I.e. no pthreads, no signals for simulating tick interrupts, but an actual x86 port?

For that matter, is anyone else interested in such a port?

The problem is that “user-space” code can’t really simulate interrupts. What you would need is something more like QEMU, which fully emulates your machine along with the I/O and interrupt system around it.

I’ve been thinking about this, and I think I have a pretty good solution.

On a real-world microcontroller, xTaskIncrementTick is normally called periodically by a timer interrupt. The timer interrupt’s frequency is either directly derived from the system clock, or, if it’s driven by a separate crystal, it’ll at least be some multiple of the system clock. This means the number of instruction cycles executed per tick is a fixed number.

The code under test could be instrumented with an “instruction callback” function. The idea is that you insert calls to this function at periodic intervals. The callback can then essentially be seen as the “CPU clock”, because on average there will be X callbacks per Y instructions. From this you can derive a simulated clock frequency and call vPortTickISR() once the appropriate number of callbacks has accumulated.
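A minimal sketch of what I have in mind. All the names and numbers here are hypothetical (SIM_CPU_HZ, CYCLES_PER_CALL, vSimInstructionCallback), and vPortTickISR() is just a stub standing in for the port’s real tick handler, which would call xTaskIncrementTick() and run the scheduler:

```c
#include <stdint.h>

/* Assumed simulation parameters -- purely illustrative values. */
#define SIM_CPU_HZ      100000000UL   /* 100 MHz simulated core clock          */
#define SIM_TICK_HZ     1000UL        /* 1 kHz RTOS tick                       */
#define CYCLES_PER_CALL 50UL          /* ~50 instructions between callbacks    */

/* Cycles that must elapse between two ticks. */
#define CYCLES_PER_TICK (SIM_CPU_HZ / SIM_TICK_HZ)

static uint64_t ulSimCycles;
static unsigned ulTicksFired;

/* Stub for the port's simulated tick interrupt. In a real port this
 * would call xTaskIncrementTick() and trigger a context switch. */
static void vPortTickISR(void)
{
    ulTicksFired++;
}

/* The instrumentation inserts a call to this at periodic points in
 * the code under test; it plays the role of the CPU clock. */
void vSimInstructionCallback(void)
{
    ulSimCycles += CYCLES_PER_CALL;
    if (ulSimCycles >= CYCLES_PER_TICK)
    {
        ulSimCycles -= CYCLES_PER_TICK;
        vPortTickISR();
    }
}
```

With these numbers, exactly 2000 callbacks elapse between ticks, every run, which is what gives you the determinism.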

Such a simulation would be 100% deterministic, which I think is very desirable for building tests on top of.

Yes, you could do that, but in my opinion it isn’t a good test of robustness. In particular, you have just made certain that your interrupts are always at chosen spots in your code, while in reality, they can occur at almost any arbitrary location (limited just by critical sections).

If you are only trying to model gross timing, and not looking to try to find race conditions, then that might be good enough.

I think for our purposes at least, it would be sufficient. We’re mostly looking to test some very specific edge cases and need the determinism to reproduce the errors we’re seeing.

I can imagine, though, that you could instrument the code in a lot of ways that would also make what you’re saying testable. E.g., if you’re interested in finding bugs related to being interrupted mid-operation, I can think of ways to manipulate the generated assembly to split single operations into multiple steps, inserting the callback in between.
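To illustrate the splitting idea: a single C statement like `ulSharedCounter++` compiles to a load/modify/store sequence, and a real interrupt can land between any of those. A rewriting pass could expose those gaps explicitly. Everything below is a hypothetical sketch (the hook `pfnSimCallback` and the rewritten function are invented names, not part of any existing port):

```c
#include <stdint.h>
#include <stddef.h>

/* Hypothetical hook: the instrumentation would point this at the
 * simulated-clock callback; a test can point it at anything. */
void (*pfnSimCallback)(void) = NULL;

volatile uint32_t ulSharedCounter;

/* Original source was a single statement: ulSharedCounter++;
 * The rewritten version splits the load/modify/store so a simulated
 * interrupt can land mid-operation, just as a real one could. */
void vIncrementSplit(void)
{
    uint32_t ulTmp = ulSharedCounter;        /* load                 */
    if (pfnSimCallback) pfnSimCallback();    /* tick may fire here   */
    ulTmp++;                                 /* modify               */
    if (pfnSimCallback) pfnSimCallback();    /* ...or here           */
    ulSharedCounter = ulTmp;                 /* store                */
}
```

A test could then install a callback that itself modifies ulSharedCounter, deterministically reproducing the classic lost-update race at an exactly chosen point, run after run.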

I haven’t really found any project doing the stuff I’m thinking about. Everyone suggests using the POSIX port.

As I mentioned, there have been a number of comments about using QEMU as a platform, which allows running your code on a PC while still getting full debug capabilities as if on your final platform.