STM32F7508-DK + LVGL + Azure RTOS

Hi all,
Based on the code from the,

I have set up a template for the STM32F7508 board.
It can be found here:

I am seeing unexpectedly low performance (low frame rate, high CPU usage, animations that are not smooth) compared to the STM32F746 board, and I can't figure out why.
I have both boards here, both running at 200 MHz, with DMA2D disabled (it doesn't build on the STM32F750, so I disabled it on both for a fair comparison).

The STM32F7508 runs code from QSPI, and all data is placed in SDRAM. All caches (I-cache, D-cache and ART) are enabled.

On the “Monthly Target” screen, for example, it runs at about 30 fps with >90% CPU usage on the STM32F750,
and at 33 fps with 15-25% CPU usage on the STM32F746.

Any help improving the template's performance would be appreciated.

Have you enabled -O3 optimization?