Skip to content

ncnn

Classes:

  • NCNN

    NCNN Vulkan backend.

NCNN dataclass

NCNN(
    *,
    device_id: int = 0,
    num_streams: int = 1,
    fp16: bool | None = True,
    fp16_blacklist_ops: Collection[str] | None = None,
    output_format: OutputFormat | None = None,
)

Bases: BackendAutoConvertFloat

NCNN Vulkan backend.

Classes:

Methods:

  • autoselect

    Try to select the best backend for the current system.

  • get_args

    Return backend plugin arguments derived from this configuration.

  • inference

    Run inference with this backend.

Attributes:

MIGX class-attribute instance-attribute

MIGX = MIGX

NCNN class-attribute instance-attribute

NCNN = NCNN

NCNN_VK class-attribute instance-attribute

NCNN_VK = NCNN

ORT class-attribute instance-attribute

ORT = ORT

ORT_COREML class-attribute instance-attribute

ORT_COREML = ORT_COREML

ORT_CPU class-attribute instance-attribute

ORT_CPU = ORT_CPU

ORT_CUDA class-attribute instance-attribute

ORT_CUDA = ORT_CUDA

ORT_DML class-attribute instance-attribute

ORT_DML = ORT_DML

OV class-attribute instance-attribute

OV = OV

OV_CPU class-attribute instance-attribute

OV_CPU = OV_CPU

OV_GPU class-attribute instance-attribute

OV_GPU = OV_GPU

OV_NPU class-attribute instance-attribute

OV_NPU = OV_NPU

TRT class-attribute instance-attribute

TRT = TRT

TRT_RTX class-attribute instance-attribute

TRT_RTX = TRT_RTX

device_id class-attribute instance-attribute

device_id: int = 0

Vulkan device index used by NCNN.

flexible_output_prop class-attribute

flexible_output_prop: str = 'MlrtFlexible'

fp16 class-attribute instance-attribute

fp16: bool | None = True

Enable NCNN FP16 storage/arithmetic where supported.

fp16_blacklist_ops class-attribute instance-attribute

fp16_blacklist_ops: Collection[str] | None = None

ONNX node or op names to keep in FP32 during FP16 conversion.

num_streams class-attribute instance-attribute

num_streams: int = 1

Number of parallel NCNN inference streams.

output_format class-attribute instance-attribute

output_format: OutputFormat | None = None

Requested output precision. Defaults to FP16 when fp16 is enabled, otherwise FP32.

plugin class-attribute instance-attribute

plugin = ncnn

OutputFormat

Bases: IntEnum

Output format for the backend plugin.

Attributes:

FP16 class-attribute instance-attribute

FP16 = 1

FP32 class-attribute instance-attribute

FP32 = 0

autoselect classmethod

autoselect(device_id: int = 0, **kwargs: Any) -> Backend

Try to select the best backend for the current system.

Parameters:

  • device_id

    (int, default: 0 ) –

    The GPU device id.

  • **kwargs

    (Any, default: {} ) –

    Additional arguments to pass to the backend.

Returns:

  • Backend

    The selected backend.

Source code in vsscale/mlrt/backend/base.py
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
@classmethod
def autoselect(cls, device_id: int = 0, **kwargs: Any) -> Backend:
    """
    Try to select the best backend for the current system.

    Args:
        device_id: The GPU device id.
        **kwargs: Additional arguments to pass to the backend.

    Returns:
        The selected backend.
    """

    gpu = get_gpu(device_id)
    vendor = (
        cast(str | None, gpu.vendor)
        if gpu
        else "apple"
        # macOS x86_64 is unsupported
        if platform.system().lower() == "darwin" and platform.machine() == "x86_64"
        else None
    )

    match vendor:
        # Windows & Linux
        case "nvidia":
            if hasattr(core, "trt"):
                backend = Backend.TRT
            elif hasattr(core, "trt_rtx"):
                backend = Backend.TRT_RTX
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = Backend.ORT_DML
            elif hasattr(core, "ort"):
                backend = Backend.ORT_CUDA
            elif hasattr(core, "ncnn"):
                backend = Backend.NCNN
            else:
                backend = Backend.OV_CPU
        # Windows & Linux
        case "amd":
            if platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = Backend.ORT_DML
            elif hasattr(core, "migx"):
                backend = Backend.MIGX
            elif hasattr(core, "ncnn"):
                backend = Backend.NCNN_VK
            else:
                backend = Backend.OV_CPU
        # Windows & Linux
        case "intel":
            # device-smi can't detect Intel NPUs in 0.5.6
            # https://github.com/ModelCloud/Device-SMI#roadmap
            if hasattr(core, "ov"):
                backend = Backend.OV_GPU
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = Backend.ORT_DML
            elif hasattr(core, "ncnn"):
                backend = Backend.NCNN_VK
            else:
                backend = Backend.OV_CPU
        # macOS ARM64 & x86_64
        case "apple":
            if hasattr(core, "ncnn"):
                backend = Backend.NCNN_VK
            elif hasattr(core, "ort"):
                backend = Backend.ORT_COREML
            else:
                backend = Backend.OV_CPU
        case _:
            backend = Backend.OV_CPU

    del gpu

    return backend(**kwargs)

get_args

get_args(clips: VideoNode | Sequence[VideoNode]) -> dict[str, Any]

Return backend plugin arguments derived from this configuration.

Source code in vsscale/mlrt/backend/ncnn.py
32
33
34
35
36
37
38
def get_args(self, clips: vs.VideoNode | Sequence[vs.VideoNode]) -> dict[str, Any]:
    return {
        "fp16": self.fp16,
        "output_format": int(fallback(self.output_format, self.fp16)),
        "device_id": self.device_id,
        "num_streams": self.num_streams,
    }

inference

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

Run inference with this backend.

Parameters:

  • clips

    (VideoNode | Sequence[VideoNode]) –

    Input clip or clips passed to the backend model.

  • network_path

    (str | PathLike[str]) –

    Path to the model file or backend artifact.

  • overlap

    (tuple[int, int]) –

    Horizontal and vertical tile overlap in pixels.

  • tilesize

    (tuple[int, int]) –

    Horizontal and vertical tile size in pixels.

  • flexible

    (bool, default: False ) –

    Return each flexible output plane as a separate clip.

  • **kwargs

    (Any, default: {} ) –

    Additional backend plugin arguments forwarded unchanged.

Returns:

  • VideoNode | list[VideoNode]

    A single output clip, or a list of output clips when flexible is enabled.

Source code in vsscale/mlrt/backend/base.py
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
@copy_signature(Backend.inference)
def inference(
    self,
    clips: vs.VideoNode | Sequence[vs.VideoNode],
    network_path: str | os.PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> vs.VideoNode | list[vs.VideoNode]:
    clips = [
        depth(c, 16 if self.fp16 else 32, sample_type=vs.FLOAT) if c.format.sample_type != vs.FLOAT else c
        for c in to_arr(clips)
    ]
    return super().inference(clips, network_path, overlap, tilesize, flexible=flexible, **kwargs)