ulTaskNotifyTake: taskENTER_CRITICAL with portYIELD_WITHIN_API

Austin · June 29, 2021, 8:35am

ulTaskNotify has bellow sequence: yield in critical section

If we have a task calling ulTaskNotifyTake, will it cause some interrupt to be masked?
Is this the expected behavior?

For a situation, we have an IRQ, and in the IRQ handler, we call xTaskGenericNotifyFromISR to wake up the task which is waiting on ulTaskNotifyTake. Then the problem is ulTaskNotifyTake will prevent the IRQ, and causes lock?

Source code for reference

	uint32_t ulTaskNotifyTake( BaseType_t xClearCountOnExit, TickType_t xTicksToWait )
	{
	uint32_t ulReturn;

		taskENTER_CRITICAL();
		{
			/* Only block if the notification count is not already non-zero. */
			if( pxCurrentTCB->ulNotifiedValue == 0UL )
			{
				/* Mark this task as waiting for a notification. */
				pxCurrentTCB->ucNotifyState = taskWAITING_NOTIFICATION;

				if( xTicksToWait > ( TickType_t ) 0 )
				{
					prvAddCurrentTaskToDelayedList( xTicksToWait, pdTRUE );
					traceTASK_NOTIFY_TAKE_BLOCK();

					/* All ports are written to allow a yield in a critical
					section (some will yield immediately, others wait until the
					critical section exits) - but it is not something that
					application code should ever do. */
					portYIELD_WITHIN_API();
				}

taskENTER_CRITICAL

#define taskENTER_CRITICAL()		portENTER_CRITICAL()
#define portENTER_CRITICAL()		vPortEnterCritical();

void vPortEnterCritical( void )
{
	/* Mask interrupts up to the max syscall interrupt priority. */
	uxPortSetInterruptMask();
}

**FreeRTOSConfig.h**
/* Interrupts that are assigned a priority above
 * configMAX_API_CALL_INTERRUPT_PRIORITY (which in the GIC means a numerical
 * value below configMAX_API_CALL_INTERRUPT_PRIORITY) cannot call any FreeRTOS
 * API functions, will nest, and will not be masked by FreeRTOS critical
 * sections
*/

#define configMAX_API_CALL_INTERRUPT_PRIORITY	18


UBaseType_t uxPortSetInterruptMask( void )
{
uint32_t ulReturn;

	/* Interrupt in the CPU must be turned off while the ICCPMR is being
	updated. */
	portDISABLE_INTERRUPTS();
	if( portICCPMR_PRIORITY_MASK_REGISTER == ( uint32_t ) ( configMAX_API_CALL_INTERRUPT_PRIORITY << portPRIORITY_SHIFT ) )
	{
		/* Interrupts were already masked. */
		ulReturn = pdTRUE;
	}
	else
	{
		ulReturn = pdFALSE;
		portICCPMR_PRIORITY_MASK_REGISTER = ( uint32_t ) ( configMAX_API_CALL_INTERRUPT_PRIORITY << portPRIORITY_SHIFT );
		__asm volatile (	"dsb sy		\n"
							"isb sy		\n" ::: "memory" );
	}
	portENABLE_INTERRUPTS();

	return ulReturn;
}

portYIELD_WITHIN_API

#ifndef portYIELD_WITHIN_API
	#define portYIELD_WITHIN_API portYIELD
#endif

#if defined( GUEST )
	#define portYIELD() __asm volatile ( "SVC 0" ::: "memory" )
#else
	#define portYIELD() __asm volatile ( "SMC 0" ::: "memory" )
#endif

hs2 · June 29, 2021, 8:47am

Just for clarification: You’re using xTaskNotifyFromISR in the ISR as documented ? Note the FromISR suffix for all FreeRTOS calls usable in ISRs. xTaskGenericNotify is an internal function and never used directly.
Critical sections in FreeRTOS code might cause minimal delays of ISR invocations but no deadlocks.
Also do you have configASSERT defined for development along with stack overflow checking ?

Austin · June 29, 2021, 8:56am

Yes. Forget to mention. xTaskGenericNotifyFromISR is used in IRQ handler.

Also from the code, for the case I mentioned above, if IRQ priority is above configMAX_API_CALL_INTERRUPT_PRIORITY (18), the interrupt may happen and the flow may work.

But if the IRQ priority is lower than configMAX_API_CALL_INTERRUPT_PRIORITY, then lockup will happen?

**FreeRTOSConfig.h**
/* Interrupts that are assigned a priority above
 * configMAX_API_CALL_INTERRUPT_PRIORITY (which in the GIC means a numerical
 * value below configMAX_API_CALL_INTERRUPT_PRIORITY) cannot call any FreeRTOS
 * API functions, will nest, and will not be masked by FreeRTOS critical
 * sections
*/

#define configMAX_API_CALL_INTERRUPT_PRIORITY	18

RAc · June 29, 2021, 9:09am

Hi Austin,

it is not permissable for a task to call a function that may suspend the task with the critical section claimed. That is necessarily a deadlock. Claiming the critical section basically means stalling all OS activity, including the scheduler.

You may fall into the fallacy here that with the critical section in effect, portYIELD_WITHIN_API() will NOT invoke the scheduler immediately because the service interrupt will not execute with interrupts disabled but will be deferred until interrupts are re-enabled (critical section left) AND no higher level priority interrupts are pending. That’s the very architecture of the scheduler.

Austin · June 29, 2021, 10:01am

Thank you. I got your point.

Look at the code sequence

taskENTER_CRITICAL: it just mask IRQ, it won’t mask SVC call (???)

portYIELD_WITHIN_API==>svc 0 ==> Triger exception IMMEDIATELY ==>FreeRTOS_SWI_Handler is called for task switch

My puzzle is from above code sequence, the scheduler is called immediately by portYIELD_WITHIN_API which is “svc 0” and will trigger exception immediately. (and interrupt disabling can’t mask “svc 0”)

Please correct me if something of my understanding is wrong. Thanks

Reference code: FreeRTOS-Kernel/portable/GCC/ARM_CA53_64_BIT/):

taskENTER_CRITICAL
  ==> uxPortSetInterruptMask
    ==> portICCPMR_PRIORITY_MASK_REGISTER = ( uint32_t ) ( configMAX_API_CALL_INTERRUPT_PRIORITY << portPRIORITY_SHIFT );

/******************************************************************************
 * FreeRTOS_SWI_Handler handler is used to perform a context switch.
 *****************************************************************************/
.align 8
.type FreeRTOS_SWI_Handler, %function
FreeRTOS_SWI_Handler:
        /* Save the context of the current task and select a new task to run. */
        portSAVE_CONTEXT
#if defined( GUEST )
        MRS             X0, ESR_EL1
#else
        MRS             X0, ESR_EL3
#endif

        LSR             X1, X0, #26

#if defined( GUEST )
        CMP             X1, #0x15       /* 0x15 = SVC instruction. */
#else
        CMP             X1, #0x17       /* 0x17 = SMC instruction. */
#endif
        B.NE    FreeRTOS_Abort
        BL              vTaskSwitchContext

        portRESTORE_CONTEXT

FreeRTOS_Abort:
        /* Full ESR is in X0, exception class code is in X1. */
        B               .

RAc · June 29, 2021, 10:02am

wrong. See my previous explanation.

hs2 · June 29, 2021, 10:26am

@Austin Do you actually have a problem in your current application or is a request for comment/clarification ?

Austin · June 29, 2021, 12:29pm

We have problem in our application and this is one suspect area, but not confirmed yet.

We got the point Rac mentioned, but not fully align with the code. Also tried to understand if “svc 0” is deferred by taskEnter_Critical, but didn’t get any document for “svc 0” behavior.

We are using the port from FreeRTOS-Kernel/portable/GCC/ARM_CA53_64_BIT/, the yield is a “svc 0” call.

RAc · June 29, 2021, 12:50pm

What do you mean “not fully align with the code?”

https://www.keil.com/support/man/docs/armasm/armasm_dom1361289909139.htm

This one reads that the SVC instruction “causes an exception.” I read this to mean that this is subject to the standard Cortex A9 exception handling which is documented in the “ARM Generic Interrupt Controller Architecture Specification” which you can download from ARM.

Note that the ability to schedule instead of unconditionally invoke an exception handler is at the very heart of each and every processor architecture. Without this, you were right; we would necessarily deadlock. If the SVC would indeed immediately and unconditionally invoke the handler with disregard to priorities and interrupt mask, we wouldn’t need an interrupt in the first place, though. We might as well call a subroutine.

BTW, what is this #ifdef GUEST all about?

hs2 · June 29, 2021, 12:52pm

Ok - there is a problem and you suspect that the FreeRTOS portable layer is the reason ?
Could you explain what exactly the problem is ?
Besides that could you add which FreeRTOS version on which hardware you’re using ?

Austin · June 29, 2021, 1:11pm

What do you mean “not fully align with the code?”

Let me clarify. We using the code from https://github.com/FreeRTOS/FreeRTOS-Kernel.
What I talked above is from folder portable/GCC/ARM_CA53_64_BIT. It is a portable part for Cortex A53 core with AARCH64

In the code:
taskENTER_CRITICAL: it just mask IRQ, it won’t affect SVC call (right?)
portYIELD_WITHIN_API==>svc 0 ==> Triger exception==>FreeRTOS_SWI_Handler is called for task switch (it is my understanding)

If the SVC would indeed immediately and unconditionally invoke the handler with disregard to priorities and interrupt mask

This is my understanding.

BTW, what is this #ifdef GUEST all about?

GEUST means it is running in EL1. For our case, GUEST is true/defined

This is a similar Can a Switch to another task occur between the call “taskENTER_CRITICAL” and “taskEXIT_CRITICAL”? with comments in this thread:

but such as on CA9, the volunteer yield operation is done through “svc” call, which would step into svc flow directely and do the swich at once, i did not see any condition would block this switch,
it seems the swith would happed and success on CA9 port., which meas voluteer yiled could be done in “taskENTER_xxxx” and “taskEXIT_xxxx” region.

Austin · June 29, 2021, 1:24pm

there is a problem and you suspect that the FreeRTOS portable layer is the reason ?

I am not sure. I firstly look at ulTaskNotifyTake and see there is yield between taskENTER_CRITICAL/taskEXIT_CRITICAL and then I dug into the code.

In task.c, it looks ulTaskNotifyTake/xTaskNotifyWait are the only case that has yield between taskENTER_CRITICAL/taskEXIT_CRITICAL

Could you explain what exactly the problem is ?

Problem is the task that has “ulTaskNotifyTake” can’t get notified, and the IRQ that has xTaskGenericNotifyFromISR is not triggered

Besides that could you add which FreeRTOS version on which hardware you’re using ?

Answered in above post

RAc · June 29, 2021, 1:38pm

If this WAS true, then you were right; there’d be an immediate deadlock. Yet I’m almost positive your understanding is wrong. If this would be an unconditional jump to the ISR, it would completly bypass the exception handling architecture. Also there wouldn’t be a point in the SVC call; it would be exactly like a bl, possibly switching processor states.

It’s fairly easy to verify this: Set a breakpoint to the SVC call in the disassembly window, then do a single step. Do this with and without the critical section active. I’m almost 100% sure that in the first scenario, it’ll very simply jump over the instruction onto the next one and enter the context switch ISR only after the critical section is left, whereas in the second scenario, you’ll be right at the ISR. If that wasn’t the case, nothing would work because the scheduler crucially depends on the deferred invocation mechanism.

hs2 · June 29, 2021, 2:21pm

Let’s get on the same page: Your ISR isn’t (never ?) triggered and you think this is root caused by the ulTaskNotifyTake internal implementation ?
That would mean that the ARM_CA53_64_BIT FreeRTOS port is fundamentally broken… to be honest I can’t imagine that this got slipped through the FreeRTOS release tests.
Is there any other possible reason why the ISR is not triggered as expected ?

rtel · June 29, 2021, 3:48pm

It is very helpful to mention which port you are using in your original post. Another tip is to say what your problem actually is, rather than something about a workaround or investigation into the problem.

The FreeRTOS code is portable across more than 40 architectures - so it won’t be a surprise it has to cope with many different context switch mechanisms. On very simple architectures it can just be a function call - but generally there are two classes: Synchronous and asynchronous.

When it is asynchronous, a context switch is requested by pending a low priority interrupt. If that is done inside a critical section then the interrupt remains pending until the critical section is exited, so context switches only ever happen when interrupts are enabled (or unmasked if interrupts are never globally disabled).

When it is synchronous, as in your case where an SVC is used, then the context switch will occur immediately whether you are inside a critical section or not. In those cases the kernel stores the critical section nesting depth as part of the task’s context. If a task switches out form within a critical section (so with interrupts masked), the next time it is switched back in interrupts will again be disabled again (when the critical nesting depth gets restored) before it starts to run - then it will exit the critical section to re-enable interrupts. The net effect is therefore the same as for the asynchronous method but the context switch occurs the other side of exiting the critical section. Likewise if a task switches out with interrupts enabled, the next time it is switched in interrupts will again be enabled even if it switched from a task that had interrupts disabled. The kernel is designed to work like this - it is not recommended for application code to yield in a critical section!

RAc · June 29, 2021, 4:03pm

In that case I stand corrected, ask for apologies for my misassesment and would like to thank Richard for his clarifying explanation!

aggarg · June 29, 2021, 8:50pm

If you are running at EL1, one thing to watch out for is the interrupt priority view of a non-secure group 1 interrupt. The following is from the GIC doc:

For Non-secure writes to a priority field of a Non-secure Group 1 interrupt, before storing the value:
• The value is right-shifted by one bit.
• Bit [7] of the value is set to 1.
This translation means the priority value for the Non-secure Group 1 interrupt is in the bottom half of the priority range.

If your hardware implements 5 priority bits (32 unique priorities) and you want to set a non-secure group 1 interrupt’s priority to 18 (it can be greater than 16 only to ensure that it is in the bottom half range), you should use the following value:

( 18 << 4 ) & 0xFF

instead of

( 18 << 3 ) & 0xFF

Thanks.

Austin · June 30, 2021, 6:37am

Richard, Thanks for the explanation.

Want to make it clear. Let’s say we use Cortex A53 port which uses “svc 0” to yield

You mentioned

it is not recommended for application code to yield in a critical section!

The call ulTaskNotifyTake from APP will have this situation, and is not recommended to use from APP? Am I right?

If it is not the case, do you think bellow usage model is the correct way?
Any IRQ priority settings should be taken care?

Task thread (bottom half of the IRQ):

IRQ_bottom_half_task()
{
  while (true) {
      ulTaskNotifyTake(pdTRUE, portMAX_DELAY) 
      /* We get the notification from ISR, and do the remaining thing here */
  }
}

IRQ_handler

my_irq_isr()
{
   /* Get an IRQ, and notify the bottom half task */
   xTaskGenericNotifyFromISR()
}

Austin · June 30, 2021, 6:53am

Thanks for the reminder. Will have a check

hs2 · June 30, 2021, 9:31am

No. The official FreeRTOS API can be used, of course.
You just shouldn’t call (undocumented) portYIELD_WITHIN_API directly in your application code.
BTW Instead of using undocumented call to xTaskGenericNotifyFromISR I’d recommend to stick to the official API like xTaskNotifyFromISR.

Topic		Replies	Views
xTaskNotifyWait not working Kernel	12	1921	April 4, 2021
Direct task notification as semaphore - get count from ISR Kernel	14	1305	March 25, 2021
Task stuck at taskNOTIFICATION_RECEIVED Kernel	14	3132	November 9, 2024
Critical sections & FreeRTOS API calls Kernel	25	4588	July 6, 2023
Wakeup a task from ISR Kernel	6	876	June 30, 2022

ulTaskNotifyTake: taskENTER_CRITICAL with portYIELD_WITHIN_API

Related topics