In case others are interested in the general compute aspect, e.g. inference for self-hosted AI, here is something related I found:
- ZLUDA, for CUDA everywhere: https://github.com/vosen/ZLUDA/ Unfortunately it is in the middle of a major rewrite (see https://github.com/vosen/ZLUDA/discussions/261 ) because AMD (somehow?!) does not want its code being used in there (maybe they did some shady RE on NVIDIA's work?)
- AMD ROCm: https://github.com/ROCm/ROCm and HIP: https://github.com/ROCm/HIP
I'm using https://github.com/user234683/youtube-local and it's working without issue.