23 APR 2026

rahulmnavneeth

platform behavior

homedocs
PlatformCameraInference EPPlugin
macOSAVFoundationCoreML, ORT CPUCLAP, AU, VST3
iOSAVCaptureSessionCoreMLAUv3 (via clap-wrappers)
WindowsMedia FoundationDirectML, ORT CPUCLAP, VST3
LinuxV4L2ORT CPU, CUDACLAP, VST3
AndroidCamera2 (JNI)NNAPI, XNNPACKN/A
WebgetUserMediaWebGPU / ORT-webN/A

macOS / iOS

Apple framework dependencies live in nix/uify.nix under stdenv.isDarwin. The build picks up AVFoundation, CoreMedia, CoreVideo, CoreML, Metal, AudioToolbox.

Windows

Media Foundation + DirectML. Tested under MSVC; MinGW is not supported.

Linux

V4L2 + ALSA for mic-adjacent cameras. nokhwa is the default abstraction.

Android / iOS

Bindings generated by UniFFI into bindings/kotlin/ and bindings/swift/. Ship as AAR / XCFramework.

Web

The core builds to wasm32-unknown-unknown. The runtime camera backend uses getUserMedia; inference uses ORT-web with WebGPU when available.