For anyone who knows.
Basically, it seems to me like the technology in mobile GPUs is crazier than in desktop/laptop GPUs. Desktop GPUs can obviously do more graphically, but not by enough to explain why they seem to need to be 100x bigger than a mobile GPU. And top-end mobile GPUs actually perform quite admirably when it comes to graphics and power use.
So, considering that, why are desktop GPUs so huge and power hungry in comparison to mobile GPUs?
Also, AMD APUs use your main RAM, and some systems even allow you to change the allocation, so you could allocate, say, 16GB for VRAM if you've got 32GB of RAM. There are also tools you can run to change the allocation, in case your BIOS doesn't have the option.
This means you can run even LLMs that require a large amount of VRAM, which is crazy if you think about it.
Problem is, system RAM does not have anywhere near the bandwidth that dedicated VRAM does. You can run an AI model, but the performance will be roughly 10x worse, because generating each token means reading more or less the whole model out of memory, so bandwidth is the limit.
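To put rough numbers on that, here's a back-of-envelope sketch in Python. It assumes token generation is purely bandwidth-bound (each token reads the whole model once), which is a simplification. The specific figures are my own illustrative assumptions, not measurements: dual-channel DDR5-5600 for system RAM, about 1000 GB/s for a high-end dedicated card, and a hypothetical 8 GB model. The function names are made up for this example.

```python
def ddr_bandwidth_gbps(mt_per_s: float, channels: int, bus_bytes: int = 8) -> float:
    """Theoretical peak bandwidth in GB/s: transfers/s * bytes per transfer * channels."""
    return mt_per_s * 1e6 * bus_bytes * channels / 1e9


def max_tokens_per_second(bandwidth_gbps: float, model_size_gb: float) -> float:
    """Upper bound on tokens/s if every token requires reading the whole model once."""
    return bandwidth_gbps / model_size_gb


# Assumed example figures, not measurements.
system_ram_gbps = ddr_bandwidth_gbps(5600, channels=2)  # dual-channel DDR5-5600
vram_gbps = 1000.0                                       # high-end dedicated GPU, approx.
model_gb = 8.0                                           # hypothetical model size in memory

print(f"System RAM: {system_ram_gbps:.1f} GB/s, "
      f"at most {max_tokens_per_second(system_ram_gbps, model_gb):.1f} tokens/s")
print(f"VRAM:       {vram_gbps:.1f} GB/s, "
      f"at most {max_tokens_per_second(vram_gbps, model_gb):.1f} tokens/s")
print(f"Ratio:      {vram_gbps / system_ram_gbps:.1f}x")
```

With those assumed numbers the gap comes out at about 11x, which lines up with the "roughly 10x" figure. Real-world results will be lower than these ceilings on both sides, and the ratio shifts with the exact RAM and GPU involved.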