Skip to content

Conversation

@schnkmwt
Copy link

@schnkmwt schnkmwt commented Jan 13, 2026

This PR adds the VitisAI Plugin for AMD Ryzen AI NPUs to offload Whisper encoder module.

  1. Adds a new build option WHISPER_VITISAI to build with the new plugin
  2. The plugin adds a redirecting backend for the whisper encoder to run on AMD Strix NPU using a pre-optimized model binary available from Huggingface and returns the outputs back to run on decoder backend.
  3. Requires proprietary packages from AMD

* Added VitisAI encoder module placeholder files

* VitisAI build integration

* VitisAI encoder offload functional

* Clean up vitisai integration

* Add c++17 requirement for Windows

* Enabled preemption for windows runs

* Add model cache override option

* Remove vitisai premature log message

* Add rai support through file mapping

* Fixed flatbuffer loading

* Fixed Windows file mapping issue

* Update FlexmlRT resolution

* Use Flexmlrt wheel pkg to build VitisAI plugin

* Clean up

* Remove prints

* Change flexmlrt target from Shared to Interface

* Add c++17 requirement for Windows

* Enabled preemption for windows runs

* Add rai support through file mapping

* Fixed flatbuffer loading

* Fixed Windows file mapping issue

* Update FlexmlRT resolution

* Use Flexmlrt wheel pkg to build VitisAI plugin

* Clean up

* Remove prints

* Change flexmlrt target from Shared to Interface

* Cleanup FlexmlRT integration

* format fix

* Adding AMD Licenses

* Update CMakeLists.txt

Co-authored-by: Kumawat, Sachin <sachin.kumawat@amd.com>

* Update src/CMakeLists.txt

Co-authored-by: Kumawat, Sachin <sachin.kumawat@amd.com>

* Update whisper.cpp

* Added VitisAI encoder readme section

* Remove license headers from common files to whisper.cpp

---------

Co-authored-by: Sachin Kumawat <sachink@amd.com>
Co-authored-by: Jeff Lin <jeffylin@xilinx.com>
Co-authored-by: Lin <jefflin@amd.com>
Co-authored-by: Lin, Jeff (DCG-ENG) <jeff.lin@amd.com>
Co-authored-by: Iswarya Alex <iswaryaalex96@gmail.com>
Co-authored-by: Alex, Iswarya <Iswarya.Alex@amd.com>
@schnkmwt schnkmwt changed the title Add VitisAI Plugin [AMD] Add VitisAI Plugin for AMD Ryzen AI NPU encoder offload Jan 14, 2026
@schnkmwt schnkmwt marked this pull request as ready for review January 16, 2026 22:16
iswaryaalex and others added 2 commits January 27, 2026 11:51
- RAI EULA Links
- Updated for RAI Whisper instructions
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants