It would be awesome if support for [multimodal](https://github.com/ggerganov/llama.cpp/pull/3436) input were added. Is this planned at all?