feat(tts): add configurable computeUnits for Kokoro models by integrITsolutions · Pull Request #482 · FluidInference/FluidAudio

integrITsolutions · 2026-04-04T16:25:33Z

Summary

Adds a computeUnits parameter (default: .all) to TtsModels.download(), KokoroTtsManager.init(), and KokoroModelCache.init(), allowing callers to override CoreML compute units for Kokoro model loading.

Problem

iOS 26 (beta, Build 23E246) introduces ANE compiler regressions that cause Kokoro models to fail with:

Error: Cannot retrieve vector from IRValue format int32
Unable to compute the asynchronous prediction using ML Program

This is a known ecosystem-wide issue affecting CoreML models on iOS 26 (see whisper.cpp#3702, executorch#15833, Apple Developer Forums thread 799456). The root cause is a change in the ANE compiler/runtime that breaks models loaded with computeUnits: .all, which includes the Neural Engine.

Solution

Exposes the computeUnits parameter so callers can use .cpuAndGPU on iOS 26+ to bypass the ANE, matching the approach PocketTTS already uses to avoid ANE float16 precision artifacts.

Backwards compatible: The default remains .all, preserving existing behavior on iOS 17-18.
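
One way a caller might choose the value at runtime is an OS availability check. The sketch below is an illustration only: kokoroComputeUnits() is a hypothetical helper, not part of FluidAudio, and it assumes the regression applies to every iOS 26 release, which this PR only verified on a beta.

```swift
import CoreML

/// Hypothetical helper (not part of FluidAudio): picks compute units per OS version.
func kokoroComputeUnits() -> MLComputeUnits {
    // Note: on platforms other than iOS, the trailing `*` makes this check pass,
    // so this sketch also returns .cpuAndGPU there.
    if #available(iOS 26, *) {
        return .cpuAndGPU   // skip the ANE where the regression was reported
    } else {
        return .all         // default behavior on iOS 17-18
    }
}
```

The result could then be passed as KokoroTtsManager(computeUnits: kokoroComputeUnits()). Keeping the check in the app rather than in the library matches the PR's choice to leave the default at .all.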

Changes

TtsModels.swift: Added computeUnits parameter to download(), piped to DownloadUtils.loadModels()
KokoroTtsManager.swift: Added computeUnits parameter to init(), stored and passed to TtsModels.download() and KokoroModelCache
KokoroModelCache.swift: Added computeUnits parameter to init(), piped to TtsModels.download() in loadModelsIfNeeded()
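
For readers unfamiliar with how a compute-unit setting reaches CoreML, the following sketch shows the standard mechanism: the value is set on an MLModelConfiguration that is passed when the model is loaded. The helper names makeConfiguration and loadModel are hypothetical, and this is an assumption about what the loading layer does with the parameter, not a copy of DownloadUtils.loadModels().

```swift
import CoreML
import Foundation

/// Hypothetical helper: builds a configuration carrying the caller's compute units.
func makeConfiguration(computeUnits: MLComputeUnits = .all) -> MLModelConfiguration {
    let configuration = MLModelConfiguration()
    configuration.computeUnits = computeUnits
    return configuration
}

/// Hypothetical loader: `url` must point to a compiled .mlmodelc bundle.
func loadModel(at url: URL, computeUnits: MLComputeUnits = .all) throws -> MLModel {
    try MLModel(contentsOf: url, configuration: makeConfiguration(computeUnits: computeUnits))
}
```

Defaulting the parameter to .all at every layer is what keeps existing call sites source-compatible.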

Usage

// iOS 26+ workaround
let manager = KokoroTtsManager(computeUnits: .cpuAndGPU)
try await manager.initialize()

// Existing behavior unchanged (default .all)
let manager = KokoroTtsManager()
try await manager.initialize()

Testing

Verified Kokoro initialization succeeds with .cpuAndGPU on iOS 26.4 beta (iPhone 14 Pro, A16)
Default .all behavior unchanged on older iOS versions
No API breaking changes

Adds a `computeUnits` parameter (default: `.all`) to `TtsModels.download()`, `KokoroTtsManager.init()`, and `KokoroModelCache.init()`, allowing callers to override CoreML compute units for Kokoro model loading.

This is needed because iOS 26 introduces ANE compiler regressions that cause Kokoro models to fail with "Cannot retrieve vector from IRValue format int32" when loaded with `.all` (which includes the Neural Engine). Using `.cpuAndGPU` bypasses the ANE and resolves the issue, matching the approach already used by PocketTTS to avoid ANE float16 precision artifacts.

The default `.all` preserves existing behavior on iOS 17-18. Callers on iOS 26+ can pass `.cpuAndGPU` to work around the ANE regression.

Example:

```swift
let manager = KokoroTtsManager(computeUnits: .cpuAndGPU)
try await manager.initialize()
```

When KokoroTtsManager was initialized with a custom computeUnits but no directory (the common case), the modelCache default parameter was used as-is with .all compute units, silently ignoring the caller's setting. This meant on-demand model loading could still hit the ANE, defeating the iOS 26 workaround.

Make modelCache optional (nil = not user-provided) so we always create a cache with the correct computeUnits when the caller doesn't supply their own.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
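
The default-parameter pitfall described in this commit can be reproduced in isolation. The sketch below uses simplified stand-in types (ComputeUnits, SketchCache, ManagerBefore, ManagerAfter are all hypothetical and omit the directory parameter and model loading); it only illustrates why an optional modelCache lets the cache inherit the caller's setting.

```swift
// Stand-in for CoreML's MLComputeUnits so the sketch compiles without CoreML.
enum ComputeUnits { case all, cpuAndGPU }

// Hypothetical, simplified cache; the real KokoroModelCache also loads models.
final class SketchCache {
    let computeUnits: ComputeUnits
    init(computeUnits: ComputeUnits = .all) { self.computeUnits = computeUnits }
}

// Before: the default argument is built without seeing `computeUnits`.
final class ManagerBefore {
    let cache: SketchCache
    init(computeUnits: ComputeUnits = .all, modelCache: SketchCache = SketchCache()) {
        self.cache = modelCache   // default cache always uses .all
    }
}

// After: nil means "not user-provided", so the cache inherits the setting.
final class ManagerAfter {
    let cache: SketchCache
    init(computeUnits: ComputeUnits = .all, modelCache: SketchCache? = nil) {
        self.cache = modelCache ?? SketchCache(computeUnits: computeUnits)
    }
}
```

With these stand-ins, ManagerBefore(computeUnits: .cpuAndGPU) ends up with a cache configured for .all, while ManagerAfter(computeUnits: .cpuAndGPU) gets a cache configured for .cpuAndGPU. A caller who passes their own cache keeps full control in both versions.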


integrITsolutions and others added 3 commits April 4, 2026 18:54

Merge branch 'main' into feat/kokoro-configurable-compute-units

119f599

Merge branch 'main' into feat/kokoro-configurable-compute-units

db66190

Alex-Wengg merged commit 57551cd into FluidInference:main Apr 4, 2026
12 checks passed

Alex-Wengg merged 4 commits into FluidInference:main from IntegrIT-Solutions:feat/kokoro-configurable-compute-units

integrITsolutions commented Apr 4, 2026 • edited by devin-ai-integration bot
