With that speed on a such a small display it should be much faster.
It seems significant amount of time is spent by lv_color_mix_premultiply
. It can happen when you use recolored images. How large images are you recoloring?
BTW, what is this great profiler tool?