ort ¶

Classes:

ORT –

Base ONNX Runtime backend configuration.
ORT_COREML –

ONNX Runtime Core ML execution provider.
ORT_CPU –

ONNX Runtime CPU execution provider.
ORT_CUDA –

ONNX Runtime CUDA execution provider for Nvidia GPUs.
ORT_DML –

ONNX Runtime DirectML execution provider for D3D12 devices.

Attributes:

logger –

logger `module-attribute` ¶

logger = getLogger(__name__)

ORT `dataclass` ¶

ORT(
    *,
    num_streams: int = 1,
    verbosity: int | None = None,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
)

Bases: BackendAutoConvertFloat

Base ONNX Runtime backend configuration.

Initialize the backend.

Parameters:

num_streams ¶
(int, default: 1 ) –

Number of parallel inference streams.
verbosity ¶
(int | None, default: None ) –

ONNX Runtime logging verbosity.
fp16 ¶
(bool, default: True ) –

Convert model execution to FP16 where supported.
fp16_blacklist_ops ¶
(Collection[str] | None, default: None ) –

ONNX node or op names to keep in FP32 during FP16 conversion.

Classes:

Verbosity –

Methods:

autoselect –

Try to select the best backend for the current system.
get_args –

Return backend plugin arguments derived from this configuration.
inference –

Run inference with this backend.

Attributes:

flexible_output_prop (str) –
fp16 (bool | None) –
fp16_blacklist_ops (Collection[str] | None) –
num_streams (int) –
plugin –
provider (str) –
verbosity (int) –

Source code in vsscale/mlrt/backend/ort.py

def __init__(
    self,
    *,
    num_streams: int = 1,
    verbosity: int | None = None,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
) -> None:
    """
    Initialize the backend.

    Args:
        num_streams: Number of parallel inference streams.
        verbosity: ONNX Runtime logging verbosity.
        fp16: Convert model execution to FP16 where supported.
        fp16_blacklist_ops: ONNX node or op names to keep in FP32 during FP16 conversion.
    """
    object.__setattr__(self, "fp16", fp16)
    object.__setattr__(self, "fp16_blacklist_ops", fp16_blacklist_ops)
    object.__setattr__(self, "num_streams", num_streams)
    object.__setattr__(
        self,
        "verbosity",
        ORT.Verbosity.from_logging(logger.getEffectiveLevel()) if verbosity is None else verbosity,
    )
    super().__init__()
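
The constructor above assigns every field through `object.__setattr__`, which is the usual way to initialize a frozen dataclass from a hand-written `__init__`. Whether `ORT` is actually declared frozen is not visible in this listing, so the following is only a minimal sketch under that assumption. `FrozenConfig` and the fixed fallback value are hypothetical stand-ins, not part of vsscale.

```python
from __future__ import annotations

from dataclasses import FrozenInstanceError, dataclass


@dataclass(frozen=True, init=False)
class FrozenConfig:
    """Hypothetical stand-in for a frozen backend configuration."""

    num_streams: int
    verbosity: int

    def __init__(self, *, num_streams: int = 1, verbosity: int | None = None) -> None:
        # Plain attribute assignment would raise FrozenInstanceError here,
        # so the generated __setattr__ is bypassed explicitly.
        object.__setattr__(self, "num_streams", num_streams)
        # None means "derive a value"; a fixed 2 stands in for the
        # logger-derived verbosity used by the real constructor.
        object.__setattr__(self, "verbosity", 2 if verbosity is None else verbosity)


cfg = FrozenConfig(num_streams=2)

try:
    cfg.num_streams = 4  # rejected after construction
    mutated = True
except FrozenInstanceError:
    mutated = False
```

With this pattern the configuration stays immutable once built, while the constructor can still compute derived defaults.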

flexible_output_prop `class-attribute` ¶

flexible_output_prop: str = 'MlrtFlexible'

fp16 `instance-attribute` ¶

fp16: bool | None

fp16_blacklist_ops `instance-attribute` ¶

fp16_blacklist_ops: Collection[str] | None

num_streams `instance-attribute` ¶

num_streams: int

plugin `class-attribute` `instance-attribute` ¶

plugin = core.lazy.ort

provider `class-attribute` ¶

provider: str

verbosity `instance-attribute` ¶

verbosity: int

Verbosity ¶

Bases: IntEnum

Methods:

from_logging –

Map a Python logging level to a Verbosity member.

Attributes:

ERROR –
FATAL –
INFO –
VERBOSE –
WARNING –

ERROR `class-attribute` `instance-attribute` ¶

ERROR = 3

FATAL `class-attribute` `instance-attribute` ¶

FATAL = 4

INFO `class-attribute` `instance-attribute` ¶

INFO = 1

VERBOSE `class-attribute` `instance-attribute` ¶

VERBOSE = 0

WARNING `class-attribute` `instance-attribute` ¶

WARNING = 2

from_logging `classmethod` ¶

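from_logging(level: int) -> Verbosity

Map a Python logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL) to the corresponding Verbosity member. Any other level maps to WARNING.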

Source code in vsscale/mlrt/backend/ort.py

@classmethod
def from_logging(cls, level: int) -> ORT.Verbosity:
    mapping = {
        DEBUG: cls.VERBOSE,
        INFO: cls.INFO,
        WARNING: cls.WARNING,
        ERROR: cls.ERROR,
        CRITICAL: cls.FATAL,
    }
    return mapping.get(level, cls.WARNING)
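
The lookup above is an exact match on the five standard logging levels, so custom or unset levels fall back to `WARNING`. The following self-contained sketch reproduces the enum and the mapping with the standard `logging` constants spelled out, so the behavior can be checked without VapourSynth installed. It mirrors the listing rather than importing it.

```python
import logging
from enum import IntEnum


class Verbosity(IntEnum):
    VERBOSE = 0
    INFO = 1
    WARNING = 2
    ERROR = 3
    FATAL = 4

    @classmethod
    def from_logging(cls, level: int) -> "Verbosity":
        mapping = {
            logging.DEBUG: cls.VERBOSE,
            logging.INFO: cls.INFO,
            logging.WARNING: cls.WARNING,
            logging.ERROR: cls.ERROR,
            logging.CRITICAL: cls.FATAL,
        }
        # Exact-match lookup: levels outside the five keys map to WARNING.
        return mapping.get(level, cls.WARNING)


debug_level = Verbosity.from_logging(logging.DEBUG)
custom_level = Verbosity.from_logging(25)  # sits between INFO (20) and WARNING (30)
unset_level = Verbosity.from_logging(logging.NOTSET)
```

Because an unconfigured Python logger inherits the root logger's default level of `WARNING`, leaving `verbosity=None` would typically resolve to `Verbosity.WARNING` unless logging has been configured otherwise.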

autoselect `staticmethod` ¶

autoselect(device_id: int = 0, **kwargs: Any) -> Backend

Try to select the best backend for the current system.

Parameters:

device_id ¶
(int, default: 0 ) –

The GPU device id.
**kwargs ¶
(Any, default: {} ) –

Additional arguments to pass to the backend.

Returns:

Backend –

The selected backend.

Source code in vsscale/mlrt/backend/base.py

@staticmethod
def autoselect(device_id: int = 0, **kwargs: Any) -> Backend:
    """
    Try to select the best backend for the current system.

    Args:
        device_id: The GPU device id.
        **kwargs: Additional arguments to pass to the backend.

    Returns:
        The selected backend.
    """

    gpu = get_gpu(device_id)
    vendor = None if not gpu else str(gpu.vendor).strip()

    match vendor:
        # Windows & Linux
        case "NVIDIA Corporation":
            if hasattr(core, "trt"):
                backend = UserBackend.TRT
            elif hasattr(core, "trt_rtx"):
                backend = UserBackend.TRT_RTX
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_CUDA
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Advanced Micro Devices, Inc.":
            if platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "migx"):
                backend = UserBackend.MIGX
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Intel(R) Corporation":
            if hasattr(core, "ov"):
                backend = UserBackend.OV_GPU
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # macOS ARM64 & x86_64
        case "Apple":
            if hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_COREML
            else:
                backend = UserBackend.OV_CPU
        case _:
            backend = UserBackend.OV_CPU

    del gpu

    if hasattr(backend, "device_id"):
        kwargs["device_id"] = device_id

    return backend(**kwargs)
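
The selection order in the listing above can be restated as a small rule table, which makes the priorities easier to read. This is only a sketch: `RULES` and `pick_backend` are hypothetical names, backends are represented by plain strings, and the set of loaded plugin namespaces is passed in instead of being probed with `hasattr(core, ...)`.

```python
# Ordered rules per GPU vendor: (plugin namespace, Windows only?, backend name).
RULES = {
    "NVIDIA Corporation": [
        ("trt", False, "TRT"),
        ("trt_rtx", False, "TRT_RTX"),
        ("ort", True, "ORT_DML"),
        ("ort", False, "ORT_CUDA"),
        ("ncnn", False, "NCNN"),
    ],
    "Advanced Micro Devices, Inc.": [
        ("ort", True, "ORT_DML"),
        ("migx", False, "MIGX"),
        ("ncnn", False, "NCNN_VK"),
    ],
    "Intel(R) Corporation": [
        ("ov", False, "OV_GPU"),
        ("ort", True, "ORT_DML"),
        ("ncnn", False, "NCNN_VK"),
    ],
    "Apple": [
        ("ncnn", False, "NCNN_VK"),
        ("ort", False, "ORT_COREML"),
    ],
}


def pick_backend(vendor, system, plugins):
    """Return the first backend whose plugin is loaded and whose OS constraint holds."""
    on_windows = system.lower() == "windows"
    for plugin, windows_only, backend in RULES.get(vendor, []):
        if plugin in plugins and (on_windows or not windows_only):
            return backend
    # Unknown vendor, no GPU, or no matching plugin.
    return "OV_CPU"


linux_nvidia = pick_backend("NVIDIA Corporation", "Linux", {"ort"})
windows_nvidia = pick_backend("NVIDIA Corporation", "Windows", {"ort"})
apple_both = pick_backend("Apple", "Darwin", {"ort", "ncnn"})
no_gpu = pick_backend(None, "Linux", {"ort"})
```

Two consequences follow from the listing: `ORT_CPU` is never chosen automatically, and the final fallback is `OV_CPU` regardless of which plugins are loaded.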

get_args ¶

get_args(clips: VideoNode | Sequence[VideoNode]) -> dict[str, Any]

Return backend plugin arguments derived from this configuration.

Source code in vsscale/mlrt/backend/ort.py

def get_args(self, clips: vs.VideoNode | Sequence[vs.VideoNode]) -> dict[str, Any]:
    return super().get_args(clips) | {
        "fp16": self.fp16,
        "provider": self.provider,
        "num_streams": self.num_streams,
        "verbosity": self.verbosity,
        "fp16_blacklist_ops": self.fp16_blacklist_ops,
    }

inference ¶

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[False] = ...,
    **kwargs: Any,
) -> VideoNode

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[True],
    **kwargs: Any,
) -> list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = ...,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

Run inference with this backend.

Parameters:

clips ¶
(VideoNode | Sequence[VideoNode]) –

Input clip or clips passed to the backend model.
network_path ¶
(str | PathLike[str]) –

Path to the model file or backend artifact.
overlap ¶
(tuple[int, int]) –

Horizontal and vertical tile overlap in pixels.
tilesize ¶
(tuple[int, int]) –

Horizontal and vertical tile size in pixels.
flexible ¶
(bool, default: False ) –

Return each flexible output plane as a separate clip.
**kwargs ¶
(Any, default: {} ) –

Additional backend plugin arguments forwarded unchanged.

Returns:

VideoNode | list[VideoNode] –

A single output clip, or a list of output clips when flexible is enabled.

Source code in vsscale/mlrt/backend/base.py

def inference(
    self,
    clips: vs.VideoNode | Sequence[vs.VideoNode],
    network_path: str | os.PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> vs.VideoNode | list[vs.VideoNode]:
    """
    Run inference with this backend.

    Args:
        clips: Input clip or clips passed to the backend model.
        network_path: Path to the model file or backend artifact.
        overlap: Horizontal and vertical tile overlap in pixels.
        tilesize: Horizontal and vertical tile size in pixels.
        flexible: Return each flexible output plane as a separate clip.
        **kwargs: Additional backend plugin arguments forwarded unchanged.

    Returns:
        A single output clip, or a list of output clips when `flexible` is enabled.
    """
    UnsupportedSampleTypeError.check(clips, vs.FLOAT, self.__class__)

    args = self.get_args(clips)

    if flexible:
        args = args.copy()
        args["flexible_output_prop"] = self.flexible_output_prop

    logger.info("Calling %s.Model", self.plugin.namespace)
    logger.info("Clips: %r", clips)
    logger.info("Network Path: %s", network_path)
    logger.info("overlap=%s, tilesize=%s, %s", overlap, tilesize, args | kwargs)
    output = self.plugin.Model(clips, network_path, overlap, tilesize, **args | kwargs)

    if flexible:
        clip = output["clip"]
        num_planes = output["num_planes"]

        output = [clip.std.PropToClip(prop=f"{self.flexible_output_prop}{i}") for i in range(num_planes)]

    return output
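
Two details of the method above are easy to miss: `**args | kwargs` unpacks the merged dictionary, so caller-supplied `kwargs` override the values from `get_args`, and the flexible outputs are read from frame properties named by appending the plane index to `flexible_output_prop`. The sketch below isolates just that dictionary and naming logic. `merge_call_args` and `flexible_prop_names` are hypothetical helper names, and no VapourSynth plugin is called.

```python
def merge_call_args(args, kwargs, flexible, prop="MlrtFlexible"):
    """Build the keyword arguments that would be passed to the plugin."""
    if flexible:
        # Copy first so the dictionary returned by get_args is not mutated.
        args = args.copy()
        args["flexible_output_prop"] = prop
    # Right-hand operand wins on key collisions, so kwargs take precedence.
    return args | kwargs


def flexible_prop_names(num_planes, prop="MlrtFlexible"):
    """Frame property names from which each output plane is extracted."""
    return [f"{prop}{i}" for i in range(num_planes)]


backend_args = {"fp16": True, "num_streams": 1}
merged = merge_call_args(backend_args, {"num_streams": 4}, flexible=True)
names = flexible_prop_names(3)
```

Since the copy is made before `flexible_output_prop` is added, `backend_args` is left unchanged by the call.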

ORT_COREML `dataclass` ¶

ORT_COREML(
    *,
    ml_program: int = Provider.NEURAL_NETWORK,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
    num_streams: int = 1,
    verbosity: int | None = None,
)

Bases: ORT

ONNX Runtime Core ML execution provider.

Initialize the backend.

Parameters:

ml_program ¶
(int, default: Provider.NEURAL_NETWORK ) –

Core ML provider mode.
num_streams ¶
(int, default: 1 ) –

Number of parallel inference streams.
verbosity ¶
(int | None, default: None ) –

ONNX Runtime logging verbosity.
fp16 ¶
(bool, default: True ) –

Convert model execution to FP16 where supported.
fp16_blacklist_ops ¶
(Collection[str] | None, default: None ) –

ONNX node or op names to keep in FP32 during FP16 conversion.

Classes:

Provider –
Verbosity –

Methods:

autoselect –

Try to select the best backend for the current system.
get_args –

Return backend plugin arguments derived from this configuration.
inference –

Run inference with this backend.

Attributes:

flexible_output_prop (str) –
fp16 (bool | None) –
fp16_blacklist_ops (Collection[str] | None) –
ml_program (Provider) –

Core ML provider mode.
num_streams (int) –
plugin –
provider –
verbosity (int) –

Source code in vsscale/mlrt/backend/ort.py

def __init__(
    self,
    *,
    ml_program: int = Provider.NEURAL_NETWORK,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
    num_streams: int = 1,
    verbosity: int | None = None,
) -> None:
    object.__setattr__(self, "ml_program", ORT_COREML.Provider(ml_program))
    super().__init__(num_streams=num_streams, verbosity=verbosity, fp16=fp16, fp16_blacklist_ops=fp16_blacklist_ops)
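
The constructor above passes `ml_program` through `ORT_COREML.Provider(...)`, so both plain integers and enum members are accepted, and values outside the enum are rejected. The sketch below re-declares the two-member enum from this page to show that standard `IntEnum` behavior in isolation.

```python
from enum import IntEnum


class Provider(IntEnum):
    NEURAL_NETWORK = 0
    ML_PROGRAM = 1


# A plain int is coerced to the matching member.
from_int = Provider(1)
# Passing a member returns the same member.
from_member = Provider(Provider.NEURAL_NETWORK)

try:
    Provider(2)  # no member has this value
    rejected = False
except ValueError:
    rejected = True
```

An invalid mode would therefore fail at construction time rather than when the plugin is invoked.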

flexible_output_prop `class-attribute` ¶

flexible_output_prop: str = 'MlrtFlexible'

fp16 `instance-attribute` ¶

fp16: bool | None

fp16_blacklist_ops `instance-attribute` ¶

fp16_blacklist_ops: Collection[str] | None

ml_program `instance-attribute` ¶

ml_program: Provider

Core ML provider mode.

num_streams `instance-attribute` ¶

num_streams: int

plugin `class-attribute` `instance-attribute` ¶

plugin = core.lazy.ort

provider `class-attribute` `instance-attribute` ¶

provider = 'COREML'

verbosity `instance-attribute` ¶

verbosity: int

Provider ¶

Bases: IntEnum

Attributes:

ML_PROGRAM –
NEURAL_NETWORK –

ML_PROGRAM `class-attribute` `instance-attribute` ¶

ML_PROGRAM = 1

NEURAL_NETWORK `class-attribute` `instance-attribute` ¶

NEURAL_NETWORK = 0

Verbosity ¶

Bases: IntEnum

Methods:

from_logging –

Map a Python logging level to a Verbosity member.

Attributes:

ERROR –
FATAL –
INFO –
VERBOSE –
WARNING –

ERROR `class-attribute` `instance-attribute` ¶

ERROR = 3

FATAL `class-attribute` `instance-attribute` ¶

FATAL = 4

INFO `class-attribute` `instance-attribute` ¶

INFO = 1

VERBOSE `class-attribute` `instance-attribute` ¶

VERBOSE = 0

WARNING `class-attribute` `instance-attribute` ¶

WARNING = 2

from_logging `classmethod` ¶

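from_logging(level: int) -> Verbosity

Map a Python logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL) to the corresponding Verbosity member. Any other level maps to WARNING.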

Source code in vsscale/mlrt/backend/ort.py

@classmethod
def from_logging(cls, level: int) -> ORT.Verbosity:
    mapping = {
        DEBUG: cls.VERBOSE,
        INFO: cls.INFO,
        WARNING: cls.WARNING,
        ERROR: cls.ERROR,
        CRITICAL: cls.FATAL,
    }
    return mapping.get(level, cls.WARNING)

autoselect `staticmethod` ¶

autoselect(device_id: int = 0, **kwargs: Any) -> Backend

Try to select the best backend for the current system.

Parameters:

device_id ¶
(int, default: 0 ) –

The GPU device id.
**kwargs ¶
(Any, default: {} ) –

Additional arguments to pass to the backend.

Returns:

Backend –

The selected backend.

Source code in vsscale/mlrt/backend/base.py

@staticmethod
def autoselect(device_id: int = 0, **kwargs: Any) -> Backend:
    """
    Try to select the best backend for the current system.

    Args:
        device_id: The GPU device id.
        **kwargs: Additional arguments to pass to the backend.

    Returns:
        The selected backend.
    """

    gpu = get_gpu(device_id)
    vendor = None if not gpu else str(gpu.vendor).strip()

    match vendor:
        # Windows & Linux
        case "NVIDIA Corporation":
            if hasattr(core, "trt"):
                backend = UserBackend.TRT
            elif hasattr(core, "trt_rtx"):
                backend = UserBackend.TRT_RTX
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_CUDA
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Advanced Micro Devices, Inc.":
            if platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "migx"):
                backend = UserBackend.MIGX
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Intel(R) Corporation":
            if hasattr(core, "ov"):
                backend = UserBackend.OV_GPU
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # macOS ARM64 & x86_64
        case "Apple":
            if hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_COREML
            else:
                backend = UserBackend.OV_CPU
        case _:
            backend = UserBackend.OV_CPU

    del gpu

    if hasattr(backend, "device_id"):
        kwargs["device_id"] = device_id

    return backend(**kwargs)

get_args ¶

get_args(clips: VideoNode | Sequence[VideoNode]) -> dict[str, Any]

Return backend plugin arguments derived from this configuration.

Source code in vsscale/mlrt/backend/ort.py

def get_args(self, clips: vs.VideoNode | Sequence[vs.VideoNode]) -> dict[str, Any]:
    return super().get_args(clips) | {"ml_program": self.ml_program}
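
Each level of the class hierarchy extends the dictionary returned by its parent using the `|` operator, so the Core ML backend ends up with the common ORT arguments plus `ml_program`. The sketch below illustrates that chaining with simplified stand-in classes. `Base`, `Ort`, and `OrtCoreML` are hypothetical, the `clips` parameter is omitted, and the base class is assumed to return an empty dictionary, which may not match the real implementation.

```python
from typing import Any


class Base:
    def get_args(self) -> dict[str, Any]:
        # Assumption: the real base class derives arguments from the clips.
        return {}


class Ort(Base):
    provider = "CPU"

    def __init__(self, fp16: bool = True, num_streams: int = 1) -> None:
        self.fp16 = fp16
        self.num_streams = num_streams

    def get_args(self) -> dict[str, Any]:
        return super().get_args() | {
            "fp16": self.fp16,
            "provider": self.provider,
            "num_streams": self.num_streams,
        }


class OrtCoreML(Ort):
    provider = "COREML"

    def __init__(self, ml_program: int = 0, **kwargs: Any) -> None:
        super().__init__(**kwargs)
        self.ml_program = ml_program

    def get_args(self) -> dict[str, Any]:
        # Only the provider-specific key is added at this level.
        return super().get_args() | {"ml_program": self.ml_program}


coreml_args = OrtCoreML(ml_program=1, num_streams=2).get_args()
```

The class-level `provider` attribute is resolved on the instance, which is why the subclass reports `"COREML"` without overriding the parent's dictionary construction.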

inference ¶

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[False] = ...,
    **kwargs: Any,
) -> VideoNode

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[True],
    **kwargs: Any,
) -> list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = ...,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

Run inference with this backend.

Parameters:

clips ¶
(VideoNode | Sequence[VideoNode]) –

Input clip or clips passed to the backend model.
network_path ¶
(str | PathLike[str]) –

Path to the model file or backend artifact.
overlap ¶
(tuple[int, int]) –

Horizontal and vertical tile overlap in pixels.
tilesize ¶
(tuple[int, int]) –

Horizontal and vertical tile size in pixels.
flexible ¶
(bool, default: False ) –

Return each flexible output plane as a separate clip.
**kwargs ¶
(Any, default: {} ) –

Additional backend plugin arguments forwarded unchanged.

Returns:

VideoNode | list[VideoNode] –

A single output clip, or a list of output clips when flexible is enabled.

Source code in vsscale/mlrt/backend/base.py

def inference(
    self,
    clips: vs.VideoNode | Sequence[vs.VideoNode],
    network_path: str | os.PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> vs.VideoNode | list[vs.VideoNode]:
    """
    Run inference with this backend.

    Args:
        clips: Input clip or clips passed to the backend model.
        network_path: Path to the model file or backend artifact.
        overlap: Horizontal and vertical tile overlap in pixels.
        tilesize: Horizontal and vertical tile size in pixels.
        flexible: Return each flexible output plane as a separate clip.
        **kwargs: Additional backend plugin arguments forwarded unchanged.

    Returns:
        A single output clip, or a list of output clips when `flexible` is enabled.
    """
    UnsupportedSampleTypeError.check(clips, vs.FLOAT, self.__class__)

    args = self.get_args(clips)

    if flexible:
        args = args.copy()
        args["flexible_output_prop"] = self.flexible_output_prop

    logger.info("Calling %s.Model", self.plugin.namespace)
    logger.info("Clips: %r", clips)
    logger.info("Network Path: %s", network_path)
    logger.info("overlap=%s, tilesize=%s, %s", overlap, tilesize, args | kwargs)
    output = self.plugin.Model(clips, network_path, overlap, tilesize, **args | kwargs)

    if flexible:
        clip = output["clip"]
        num_planes = output["num_planes"]

        output = [clip.std.PropToClip(prop=f"{self.flexible_output_prop}{i}") for i in range(num_planes)]

    return output

ORT_CPU `dataclass` ¶

ORT_CPU()

Bases: ORT

ONNX Runtime CPU execution provider.

Classes:

Verbosity –

Methods:

autoselect –

Try to select the best backend for the current system.
get_args –

Return backend plugin arguments derived from this configuration.
inference –

Run inference with this backend.

Attributes:

flexible_output_prop (str) –
fp16 (bool | None) –
fp16_blacklist_ops (Collection[str] | None) –
num_streams (int) –
plugin –
provider –
verbosity (int) –

flexible_output_prop `class-attribute` ¶

flexible_output_prop: str = 'MlrtFlexible'

fp16 `instance-attribute` ¶

fp16: bool | None

fp16_blacklist_ops `instance-attribute` ¶

fp16_blacklist_ops: Collection[str] | None

num_streams `instance-attribute` ¶

num_streams: int

plugin `class-attribute` `instance-attribute` ¶

plugin = core.lazy.ort

provider `class-attribute` `instance-attribute` ¶

provider = 'CPU'

verbosity `instance-attribute` ¶

verbosity: int

Verbosity ¶

Bases: IntEnum

Methods:

from_logging –

Map a Python logging level to a Verbosity member.

Attributes:

ERROR –
FATAL –
INFO –
VERBOSE –
WARNING –

ERROR `class-attribute` `instance-attribute` ¶

ERROR = 3

FATAL `class-attribute` `instance-attribute` ¶

FATAL = 4

INFO `class-attribute` `instance-attribute` ¶

INFO = 1

VERBOSE `class-attribute` `instance-attribute` ¶

VERBOSE = 0

WARNING `class-attribute` `instance-attribute` ¶

WARNING = 2

from_logging `classmethod` ¶

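from_logging(level: int) -> Verbosity

Map a Python logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL) to the corresponding Verbosity member. Any other level maps to WARNING.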

Source code in vsscale/mlrt/backend/ort.py

@classmethod
def from_logging(cls, level: int) -> ORT.Verbosity:
    mapping = {
        DEBUG: cls.VERBOSE,
        INFO: cls.INFO,
        WARNING: cls.WARNING,
        ERROR: cls.ERROR,
        CRITICAL: cls.FATAL,
    }
    return mapping.get(level, cls.WARNING)

autoselect `staticmethod` ¶

autoselect(device_id: int = 0, **kwargs: Any) -> Backend

Try to select the best backend for the current system.

Parameters:

device_id ¶
(int, default: 0 ) –

The GPU device id.
**kwargs ¶
(Any, default: {} ) –

Additional arguments to pass to the backend.

Returns:

Backend –

The selected backend.

Source code in vsscale/mlrt/backend/base.py

@staticmethod
def autoselect(device_id: int = 0, **kwargs: Any) -> Backend:
    """
    Try to select the best backend for the current system.

    Args:
        device_id: The GPU device id.
        **kwargs: Additional arguments to pass to the backend.

    Returns:
        The selected backend.
    """

    gpu = get_gpu(device_id)
    vendor = None if not gpu else str(gpu.vendor).strip()

    match vendor:
        # Windows & Linux
        case "NVIDIA Corporation":
            if hasattr(core, "trt"):
                backend = UserBackend.TRT
            elif hasattr(core, "trt_rtx"):
                backend = UserBackend.TRT_RTX
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_CUDA
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Advanced Micro Devices, Inc.":
            if platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "migx"):
                backend = UserBackend.MIGX
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Intel(R) Corporation":
            if hasattr(core, "ov"):
                backend = UserBackend.OV_GPU
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # macOS ARM64 & x86_64
        case "Apple":
            if hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_COREML
            else:
                backend = UserBackend.OV_CPU
        case _:
            backend = UserBackend.OV_CPU

    del gpu

    if hasattr(backend, "device_id"):
        kwargs["device_id"] = device_id

    return backend(**kwargs)

get_args ¶

get_args(clips: VideoNode | Sequence[VideoNode]) -> dict[str, Any]

Return backend plugin arguments derived from this configuration.

Source code in vsscale/mlrt/backend/ort.py

def get_args(self, clips: vs.VideoNode | Sequence[vs.VideoNode]) -> dict[str, Any]:
    return super().get_args(clips) | {
        "fp16": self.fp16,
        "provider": self.provider,
        "num_streams": self.num_streams,
        "verbosity": self.verbosity,
        "fp16_blacklist_ops": self.fp16_blacklist_ops,
    }

inference ¶

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[False] = ...,
    **kwargs: Any,
) -> VideoNode

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[True],
    **kwargs: Any,
) -> list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = ...,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

Run inference with this backend.

Parameters:

clips ¶
(VideoNode | Sequence[VideoNode]) –

Input clip or clips passed to the backend model.
network_path ¶
(str | PathLike[str]) –

Path to the model file or backend artifact.
overlap ¶
(tuple[int, int]) –

Horizontal and vertical tile overlap in pixels.
tilesize ¶
(tuple[int, int]) –

Horizontal and vertical tile size in pixels.
flexible ¶
(bool, default: False ) –

Return each flexible output plane as a separate clip.
**kwargs ¶
(Any, default: {} ) –

Additional backend plugin arguments forwarded unchanged.

Returns:

VideoNode | list[VideoNode] –

A single output clip, or a list of output clips when flexible is enabled.

Source code in vsscale/mlrt/backend/base.py

def inference(
    self,
    clips: vs.VideoNode | Sequence[vs.VideoNode],
    network_path: str | os.PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> vs.VideoNode | list[vs.VideoNode]:
    """
    Run inference with this backend.

    Args:
        clips: Input clip or clips passed to the backend model.
        network_path: Path to the model file or backend artifact.
        overlap: Horizontal and vertical tile overlap in pixels.
        tilesize: Horizontal and vertical tile size in pixels.
        flexible: Return each flexible output plane as a separate clip.
        **kwargs: Additional backend plugin arguments forwarded unchanged.

    Returns:
        A single output clip, or a list of output clips when `flexible` is enabled.
    """
    UnsupportedSampleTypeError.check(clips, vs.FLOAT, self.__class__)

    args = self.get_args(clips)

    if flexible:
        args = args.copy()
        args["flexible_output_prop"] = self.flexible_output_prop

    logger.info("Calling %s.Model", self.plugin.namespace)
    logger.info("Clips: %r", clips)
    logger.info("Network Path: %s", network_path)
    logger.info("overlap=%s, tilesize=%s, %s", overlap, tilesize, args | kwargs)
    output = self.plugin.Model(clips, network_path, overlap, tilesize, **args | kwargs)

    if flexible:
        clip = output["clip"]
        num_planes = output["num_planes"]

        output = [clip.std.PropToClip(prop=f"{self.flexible_output_prop}{i}") for i in range(num_planes)]

    return output

ORT_CUDA `dataclass` ¶

ORT_CUDA(
    *,
    num_streams: int = 1,
    verbosity: int | None = None,
    device_id: int = 0,
    cudnn_benchmark: bool = True,
    use_cuda_graph: bool = False,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
    tf32: bool = False,
    prefer_nhwc: bool = False,
)

Bases: ORT

ONNX Runtime CUDA execution provider for Nvidia GPUs.

Initialize the backend.

Parameters:

num_streams ¶
(int, default: 1 ) –

Number of parallel inference streams.
verbosity ¶
(int | None, default: None ) –

ONNX Runtime logging verbosity.
device_id ¶
(int, default: 0 ) –

CUDA device index.
cudnn_benchmark ¶
(bool, default: True ) –

Let cuDNN search for faster convolution algorithms.
use_cuda_graph ¶
(bool, default: False ) –

Enable CUDA graph capture to improve performance and reduce CPU overhead for compatible models.
fp16 ¶
(bool, default: True ) –

Convert model execution to FP16 where supported.
fp16_blacklist_ops ¶
(Collection[str] | None, default: None ) –

ONNX node or op names to keep in FP32 during FP16 conversion.
tf32 ¶
(bool, default: False ) –

Allow TensorFloat-32 math on supported Nvidia GPUs.
prefer_nhwc ¶
(bool, default: False ) –

Prefer NHWC layout where ONNX Runtime supports it.

Classes:

Verbosity –

Methods:

autoselect –

Try to select the best backend for the current system.
get_args –

Return backend plugin arguments derived from this configuration.
inference –

Run inference with this backend.

Attributes:

cudnn_benchmark (bool) –
device_id (int) –
flexible_output_prop (str) –
fp16 (bool | None) –
fp16_blacklist_ops (Collection[str] | None) –
num_streams (int) –
plugin –
prefer_nhwc (bool) –
provider –
tf32 (bool) –
use_cuda_graph (bool) –
verbosity (int) –

Source code in vsscale/mlrt/backend/ort.py

def __init__(
    self,
    *,
    num_streams: int = 1,
    verbosity: int | None = None,
    device_id: int = 0,
    cudnn_benchmark: bool = True,
    use_cuda_graph: bool = False,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
    tf32: bool = False,
    prefer_nhwc: bool = False,
) -> None:
    """
    Initialize the backend.

    Args:
        num_streams: Number of parallel inference streams.
        verbosity: ONNX Runtime logging verbosity.
        device_id: CUDA device index.
        cudnn_benchmark: Let cuDNN search for faster convolution algorithms.
        use_cuda_graph: Enable CUDA graph capture to improve performance and reduce CPU overhead
            for compatible models.
        fp16: Convert model execution to FP16 where supported.
        fp16_blacklist_ops: ONNX node or op names to keep in FP32 during FP16 conversion.
        tf32: Allow TensorFloat-32 math on supported Nvidia GPUs.
        prefer_nhwc: Prefer NHWC layout where ONNX Runtime supports it.
    """
    object.__setattr__(self, "device_id", device_id)
    object.__setattr__(self, "cudnn_benchmark", cudnn_benchmark)
    object.__setattr__(self, "use_cuda_graph", use_cuda_graph)
    object.__setattr__(self, "prefer_nhwc", prefer_nhwc)
    object.__setattr__(self, "tf32", tf32)
    super().__init__(num_streams=num_streams, verbosity=verbosity, fp16=fp16, fp16_blacklist_ops=fp16_blacklist_ops)

cudnn_benchmark `instance-attribute` ¶

cudnn_benchmark: bool

device_id `instance-attribute` ¶

device_id: int

flexible_output_prop `class-attribute` ¶

flexible_output_prop: str = 'MlrtFlexible'

fp16 `instance-attribute` ¶

fp16: bool | None

fp16_blacklist_ops `instance-attribute` ¶

fp16_blacklist_ops: Collection[str] | None

num_streams `instance-attribute` ¶

num_streams: int

plugin `class-attribute` `instance-attribute` ¶

plugin = core.lazy.ort

prefer_nhwc `instance-attribute` ¶

prefer_nhwc: bool

provider `class-attribute` `instance-attribute` ¶

provider = 'CUDA'

tf32 `instance-attribute` ¶

tf32: bool

use_cuda_graph `instance-attribute` ¶

use_cuda_graph: bool

verbosity `instance-attribute` ¶

verbosity: int

Verbosity ¶

Bases: IntEnum

Methods:

from_logging –

Map a Python logging level to a Verbosity member.

Attributes:

ERROR –
FATAL –
INFO –
VERBOSE –
WARNING –

ERROR `class-attribute` `instance-attribute` ¶

ERROR = 3

FATAL `class-attribute` `instance-attribute` ¶

FATAL = 4

INFO `class-attribute` `instance-attribute` ¶

INFO = 1

VERBOSE `class-attribute` `instance-attribute` ¶

VERBOSE = 0

WARNING `class-attribute` `instance-attribute` ¶

WARNING = 2

from_logging `classmethod` ¶

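from_logging(level: int) -> Verbosity

Map a Python logging level to the corresponding ONNX Runtime verbosity. Levels without an exact match fall back to WARNING.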

Source code in vsscale/mlrt/backend/ort.py

@classmethod
def from_logging(cls, level: int) -> ORT.Verbosity:
    mapping = {
        DEBUG: cls.VERBOSE,
        INFO: cls.INFO,
        WARNING: cls.WARNING,
        ERROR: cls.ERROR,
        CRITICAL: cls.FATAL,
    }
    return mapping.get(level, cls.WARNING)
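
The mapping above can be tried in isolation. The following is a minimal sketch, assuming `DEBUG`, `INFO`, `WARNING`, `ERROR` and `CRITICAL` are the constants from the standard `logging` module; the `Verbosity` class here is a stand-in that copies the values listed above, not the class shipped in `vsscale`.

```python
import logging
from enum import IntEnum


class Verbosity(IntEnum):
    """Stand-in for ORT.Verbosity, using the values listed above."""

    VERBOSE = 0
    INFO = 1
    WARNING = 2
    ERROR = 3
    FATAL = 4

    @classmethod
    def from_logging(cls, level: int) -> "Verbosity":
        # Assumption: the source's DEBUG/INFO/... names come from `logging`.
        mapping = {
            logging.DEBUG: cls.VERBOSE,
            logging.INFO: cls.INFO,
            logging.WARNING: cls.WARNING,
            logging.ERROR: cls.ERROR,
            logging.CRITICAL: cls.FATAL,
        }
        # Only exact matches are mapped; custom levels and NOTSET become WARNING.
        return mapping.get(level, cls.WARNING)


debug_level = Verbosity.from_logging(logging.DEBUG)
custom_level = Verbosity.from_logging(25)  # sits between INFO (20) and WARNING (30)
```

Note that the lookup is exact rather than range-based, so a custom level such as 25 is not rounded to its nearest neighbour.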

autoselect `staticmethod` ¶

autoselect(device_id: int = 0, **kwargs: Any) -> Backend

Try to select the best backend for the current system.

Parameters:

device_id ¶
(int, default: 0 ) –

The GPU device id.
**kwargs ¶
(Any, default: {} ) –

Additional arguments to pass to the backend.

Returns:

Backend –

The selected backend.

Source code in vsscale/mlrt/backend/base.py

@staticmethod
def autoselect(device_id: int = 0, **kwargs: Any) -> Backend:
    """
    Try to select the best backend for the current system.

    Args:
        device_id: The GPU device id.
        **kwargs: Additional arguments to pass to the backend.

    Returns:
        The selected backend.
    """

    gpu = get_gpu(device_id)
    vendor = None if not gpu else str(gpu.vendor).strip()

    match vendor:
        # Windows & Linux
        case "NVIDIA Corporation":
            if hasattr(core, "trt"):
                backend = UserBackend.TRT
            elif hasattr(core, "trt_rtx"):
                backend = UserBackend.TRT_RTX
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_CUDA
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Advanced Micro Devices, Inc.":
            if platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "migx"):
                backend = UserBackend.MIGX
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Intel(R) Corporation":
            if hasattr(core, "ov"):
                backend = UserBackend.OV_GPU
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # macOS ARM64 & x86_64
        case "Apple":
            if hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_COREML
            else:
                backend = UserBackend.OV_CPU
        case _:
            backend = UserBackend.OV_CPU

    del gpu

    if hasattr(backend, "device_id"):
        kwargs["device_id"] = device_id

    return backend(**kwargs)
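
The priority order encoded in the `match` statement above can be restated as a lookup table, which makes it easier to check what a given system would resolve to. This is only a sketch: `PRIORITY` and `pick_backend_name` are hypothetical names, the function takes the vendor string, OS name and available plugin namespaces as plain arguments instead of querying `get_gpu`, `platform` and `core`, and it returns the backend name rather than an instance.

```python
from typing import Optional

# (plugin namespace, backend name, Windows only?) in priority order,
# copied from the branches of autoselect shown above.
PRIORITY = {
    "NVIDIA Corporation": [
        ("trt", "TRT", False),
        ("trt_rtx", "TRT_RTX", False),
        ("ort", "ORT_DML", True),
        ("ort", "ORT_CUDA", False),
        ("ncnn", "NCNN", False),
    ],
    "Advanced Micro Devices, Inc.": [
        ("ort", "ORT_DML", True),
        ("migx", "MIGX", False),
        ("ncnn", "NCNN_VK", False),
    ],
    "Intel(R) Corporation": [
        ("ov", "OV_GPU", False),
        ("ort", "ORT_DML", True),
        ("ncnn", "NCNN_VK", False),
    ],
    "Apple": [
        ("ncnn", "NCNN_VK", False),
        ("ort", "ORT_COREML", False),
    ],
}


def pick_backend_name(vendor: Optional[str], system: str, plugins: set) -> str:
    """Hypothetical helper: return the name autoselect would pick."""
    on_windows = system.lower() == "windows"
    for plugin, backend, windows_only in PRIORITY.get(vendor, []):
        if plugin in plugins and (on_windows or not windows_only):
            return backend
    # Unknown vendor, no GPU, or no matching plugin.
    return "OV_CPU"


linux_choice = pick_backend_name("NVIDIA Corporation", "Linux", {"ort"})
windows_choice = pick_backend_name("NVIDIA Corporation", "Windows", {"ort"})
```

With only the `ort` plugin available, an NVIDIA GPU resolves to `ORT_CUDA` on Linux but to `ORT_DML` on Windows, because the DirectML check comes first in the source.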

get_args ¶

get_args(clips: VideoNode | Sequence[VideoNode]) -> dict[str, Any]

Return backend plugin arguments derived from this configuration.

Source code in vsscale/mlrt/backend/ort.py

def get_args(self, clips: vs.VideoNode | Sequence[vs.VideoNode]) -> dict[str, Any]:
    return super().get_args(clips) | {
        "device_id": self.device_id,
        "cudnn_benchmark": self.cudnn_benchmark,
        "use_cuda_graph": self.use_cuda_graph,
        "prefer_nhwc": self.prefer_nhwc,
        "tf32": self.tf32,
    }
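
`get_args` layers the CUDA-specific options on top of whatever the parent class returns. The sketch below illustrates that pattern with hypothetical stand-in classes; the keys returned by `BaseConfig.get_args` are placeholders, since the base implementation is not shown in this section, and the `clips` parameter is omitted for brevity.

```python
from typing import Any


class BaseConfig:
    """Hypothetical stand-in for the parent backend class."""

    def get_args(self) -> dict[str, Any]:
        # Placeholder keys; the real base arguments are not shown here.
        return {"num_streams": 1, "verbosity": 2}


class CudaConfig(BaseConfig):
    """Hypothetical stand-in that adds device options the way ORT_CUDA does."""

    def __init__(self, device_id: int = 0, tf32: bool = False) -> None:
        self.device_id = device_id
        self.tf32 = tf32

    def get_args(self) -> dict[str, Any]:
        # dict | dict (Python 3.9+) returns a new dict and leaves both operands untouched.
        return super().get_args() | {"device_id": self.device_id, "tf32": self.tf32}


cuda_args = CudaConfig(device_id=1).get_args()
```

Each subclass only needs to name its own fields, and the parent's arguments come along unchanged.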

inference ¶

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[False] = ...,
    **kwargs: Any,
) -> VideoNode

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[True],
    **kwargs: Any,
) -> list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = ...,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

Run inference with this backend.

Parameters:

clips ¶
(VideoNode | Sequence[VideoNode]) –

Input clip or clips passed to the backend model.
network_path ¶
(str | PathLike[str]) –

Path to the model file or backend artifact.
overlap ¶
(tuple[int, int]) –

Horizontal and vertical tile overlap in pixels.
tilesize ¶
(tuple[int, int]) –

Horizontal and vertical tile size in pixels.
flexible ¶
(bool, default: False ) –

Return each flexible output plane as a separate clip.
**kwargs ¶
(Any, default: {} ) –

Additional backend plugin arguments forwarded unchanged.

Returns:

VideoNode | list[VideoNode] –

A single output clip, or a list of output clips when flexible is enabled.

Source code in vsscale/mlrt/backend/base.py

def inference(
    self,
    clips: vs.VideoNode | Sequence[vs.VideoNode],
    network_path: str | os.PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> vs.VideoNode | list[vs.VideoNode]:
    """
    Run inference with this backend.

    Args:
        clips: Input clip or clips passed to the backend model.
        network_path: Path to the model file or backend artifact.
        overlap: Horizontal and vertical tile overlap in pixels.
        tilesize: Horizontal and vertical tile size in pixels.
        flexible: Return each flexible output plane as a separate clip.
        **kwargs: Additional backend plugin arguments forwarded unchanged.

    Returns:
        A single output clip, or a list of output clips when `flexible` is enabled.
    """
    UnsupportedSampleTypeError.check(clips, vs.FLOAT, self.__class__)

    args = self.get_args(clips)

    if flexible:
        args = args.copy()
        args["flexible_output_prop"] = self.flexible_output_prop

    logger.info("Calling %s.Model", self.plugin.namespace)
    logger.info("Clips: %r", clips)
    logger.info("Network Path: %s", network_path)
    logger.info("overlap=%s, tilesize=%s, %s", overlap, tilesize, args | kwargs)
    output = self.plugin.Model(clips, network_path, overlap, tilesize, **args | kwargs)

    if flexible:
        clip = output["clip"]
        num_planes = output["num_planes"]

        output = [clip.std.PropToClip(prop=f"{self.flexible_output_prop}{i}") for i in range(num_planes)]

    return output
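
Two details of `inference` are easy to miss: caller-supplied `**kwargs` override the arguments produced by `get_args`, and with `flexible` enabled each output plane is read back from a frame property named after `flexible_output_prop` plus the plane index. The sketch below reproduces just that dictionary and naming logic without VapourSynth; `build_call_args` and `flexible_prop_names` are hypothetical helpers, not part of the library.

```python
from typing import Any

FLEXIBLE_OUTPUT_PROP = "MlrtFlexible"  # value of flexible_output_prop listed above


def build_call_args(args: dict[str, Any], kwargs: dict[str, Any], flexible: bool) -> dict[str, Any]:
    """Hypothetical helper mirroring how inference assembles keyword arguments."""
    if flexible:
        # Copy first so the dict returned by get_args is not mutated.
        args = args.copy()
        args["flexible_output_prop"] = FLEXIBLE_OUTPUT_PROP
    # `**args | kwargs` in the call unpacks this merged dict; right-hand keys win.
    return args | kwargs


def flexible_prop_names(num_planes: int) -> list[str]:
    """Property names passed to PropToClip, one per output plane."""
    return [f"{FLEXIBLE_OUTPUT_PROP}{i}" for i in range(num_planes)]


backend_args = {"device_id": 0, "num_streams": 1}
merged = build_call_args(backend_args, {"num_streams": 2}, flexible=True)
names = flexible_prop_names(3)
```

Here `num_streams=2` from the caller replaces the backend's value, and the original `backend_args` dict stays as it was.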

ORT_DML `dataclass` ¶

ORT_DML(
    *,
    device_id: int = 0,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
    num_streams: int = 1,
    verbosity: int | None = None,
)

Bases: ORT

ONNX Runtime DirectML execution provider for D3D12 devices.

Initialize the backend.

Parameters:

device_id ¶
(int, default: 0 ) –

DirectML adapter index.
num_streams ¶
(int, default: 1 ) –

Number of parallel inference streams.
verbosity ¶
(int | None, default: None ) –

ONNX Runtime logging verbosity.
fp16 ¶
(bool, default: True ) –

Convert model execution to FP16 where supported.
fp16_blacklist_ops ¶
(Collection[str] | None, default: None ) –

ONNX node or op names to keep in FP32 during FP16 conversion.

Classes:

Verbosity –

Methods:

autoselect –

Try to select the best backend for the current system.
get_args –

Return backend plugin arguments derived from this configuration.
inference –

Run inference with this backend.

Attributes:

device_id (int) –
flexible_output_prop (str) –
fp16 (bool | None) –
fp16_blacklist_ops (Collection[str] | None) –
num_streams (int) –
plugin –
provider –
verbosity (int) –

Source code in vsscale/mlrt/backend/ort.py

def __init__(
    self,
    *,
    device_id: int = 0,
    fp16: bool = True,
    fp16_blacklist_ops: Collection[str] | None = None,
    num_streams: int = 1,
    verbosity: int | None = None,
) -> None:
    """
    Initialize the backend.

    Args:
        device_id: DirectML adapter index.
        num_streams: Number of parallel inference streams.
        verbosity: ONNX Runtime logging verbosity.
        fp16: Convert model execution to FP16 where supported.
        fp16_blacklist_ops: ONNX node or op names to keep in FP32 during FP16 conversion.
    """
    object.__setattr__(self, "device_id", device_id)
    super().__init__(num_streams=num_streams, verbosity=verbosity, fp16=fp16, fp16_blacklist_ops=fp16_blacklist_ops)
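
The `object.__setattr__` calls in `__init__` are the usual way to populate fields of a frozen dataclass from a hand-written initializer. The sketch below assumes the backends are declared as frozen dataclasses, which this page does not state explicitly; `DmlConfig` is a hypothetical stand-in, not the real `ORT_DML`.

```python
from dataclasses import FrozenInstanceError, dataclass


@dataclass(frozen=True, init=False)
class DmlConfig:
    """Hypothetical stand-in; assumes the real backends are frozen dataclasses."""

    device_id: int
    num_streams: int

    def __init__(self, *, device_id: int = 0, num_streams: int = 1) -> None:
        # frozen=True blocks normal assignment even inside __init__,
        # so fields are written through object.__setattr__.
        object.__setattr__(self, "device_id", device_id)
        object.__setattr__(self, "num_streams", num_streams)


config = DmlConfig(device_id=1)

try:
    config.device_id = 2  # rejected by the frozen dataclass
except FrozenInstanceError:
    reassignment_blocked = True
else:
    reassignment_blocked = False
```

Under that assumption, a configured backend cannot be modified after construction; a different `device_id` means creating a new instance.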

device_id `instance-attribute` ¶

device_id: int

flexible_output_prop `class-attribute` ¶

flexible_output_prop: str = 'MlrtFlexible'

fp16 `instance-attribute` ¶

fp16: bool | None

fp16_blacklist_ops `instance-attribute` ¶

fp16_blacklist_ops: Collection[str] | None

num_streams `instance-attribute` ¶

num_streams: int

plugin `class-attribute` `instance-attribute` ¶

plugin = core.lazy.ort

provider `class-attribute` `instance-attribute` ¶

provider = 'DML'

verbosity `instance-attribute` ¶

verbosity: int

Verbosity ¶

Bases: IntEnum

Methods:

from_logging –

Attributes:

ERROR –
FATAL –
INFO –
VERBOSE –
WARNING –

ERROR `class-attribute` `instance-attribute` ¶

ERROR = 3

FATAL `class-attribute` `instance-attribute` ¶

FATAL = 4

INFO `class-attribute` `instance-attribute` ¶

INFO = 1

VERBOSE `class-attribute` `instance-attribute` ¶

VERBOSE = 0

WARNING `class-attribute` `instance-attribute` ¶

WARNING = 2

from_logging `classmethod` ¶

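from_logging(level: int) -> Verbosity

Map a Python logging level to the corresponding ONNX Runtime verbosity. Levels without an exact match fall back to WARNING.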

Source code in vsscale/mlrt/backend/ort.py

@classmethod
def from_logging(cls, level: int) -> ORT.Verbosity:
    mapping = {
        DEBUG: cls.VERBOSE,
        INFO: cls.INFO,
        WARNING: cls.WARNING,
        ERROR: cls.ERROR,
        CRITICAL: cls.FATAL,
    }
    return mapping.get(level, cls.WARNING)

autoselect `staticmethod` ¶

autoselect(device_id: int = 0, **kwargs: Any) -> Backend

Try to select the best backend for the current system.

Parameters:

device_id ¶
(int, default: 0 ) –

The GPU device id.
**kwargs ¶
(Any, default: {} ) –

Additional arguments to pass to the backend.

Returns:

Backend –

The selected backend.

Source code in vsscale/mlrt/backend/base.py

@staticmethod
def autoselect(device_id: int = 0, **kwargs: Any) -> Backend:
    """
    Try to select the best backend for the current system.

    Args:
        device_id: The GPU device id.
        **kwargs: Additional arguments to pass to the backend.

    Returns:
        The selected backend.
    """

    gpu = get_gpu(device_id)
    vendor = None if not gpu else str(gpu.vendor).strip()

    match vendor:
        # Windows & Linux
        case "NVIDIA Corporation":
            if hasattr(core, "trt"):
                backend = UserBackend.TRT
            elif hasattr(core, "trt_rtx"):
                backend = UserBackend.TRT_RTX
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_CUDA
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Advanced Micro Devices, Inc.":
            if platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "migx"):
                backend = UserBackend.MIGX
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # Windows & Linux
        case "Intel(R) Corporation":
            if hasattr(core, "ov"):
                backend = UserBackend.OV_GPU
            elif platform.system().lower() == "windows" and hasattr(core, "ort"):
                backend = UserBackend.ORT_DML
            elif hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            else:
                backend = UserBackend.OV_CPU
        # macOS ARM64 & x86_64
        case "Apple":
            if hasattr(core, "ncnn"):
                backend = UserBackend.NCNN_VK
            elif hasattr(core, "ort"):
                backend = UserBackend.ORT_COREML
            else:
                backend = UserBackend.OV_CPU
        case _:
            backend = UserBackend.OV_CPU

    del gpu

    if hasattr(backend, "device_id"):
        kwargs["device_id"] = device_id

    return backend(**kwargs)

get_args ¶

get_args(clips: VideoNode | Sequence[VideoNode]) -> dict[str, Any]

Return backend plugin arguments derived from this configuration.

Source code in vsscale/mlrt/backend/ort.py

def get_args(self, clips: vs.VideoNode | Sequence[vs.VideoNode]) -> dict[str, Any]:
    return super().get_args(clips) | {"device_id": self.device_id}

inference ¶

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[False] = ...,
    **kwargs: Any,
) -> VideoNode

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: Literal[True],
    **kwargs: Any,
) -> list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = ...,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

inference(
    clips: VideoNode | Sequence[VideoNode],
    network_path: str | PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> VideoNode | list[VideoNode]

Run inference with this backend.

Parameters:

clips ¶
(VideoNode | Sequence[VideoNode]) –

Input clip or clips passed to the backend model.
network_path ¶
(str | PathLike[str]) –

Path to the model file or backend artifact.
overlap ¶
(tuple[int, int]) –

Horizontal and vertical tile overlap in pixels.
tilesize ¶
(tuple[int, int]) –

Horizontal and vertical tile size in pixels.
flexible ¶
(bool, default: False ) –

Return each flexible output plane as a separate clip.
**kwargs ¶
(Any, default: {} ) –

Additional backend plugin arguments forwarded unchanged.

Returns:

VideoNode | list[VideoNode] –

A single output clip, or a list of output clips when flexible is enabled.

Source code in vsscale/mlrt/backend/base.py

def inference(
    self,
    clips: vs.VideoNode | Sequence[vs.VideoNode],
    network_path: str | os.PathLike[str],
    /,
    overlap: tuple[int, int],
    tilesize: tuple[int, int],
    *,
    flexible: bool = False,
    **kwargs: Any,
) -> vs.VideoNode | list[vs.VideoNode]:
    """
    Run inference with this backend.

    Args:
        clips: Input clip or clips passed to the backend model.
        network_path: Path to the model file or backend artifact.
        overlap: Horizontal and vertical tile overlap in pixels.
        tilesize: Horizontal and vertical tile size in pixels.
        flexible: Return each flexible output plane as a separate clip.
        **kwargs: Additional backend plugin arguments forwarded unchanged.

    Returns:
        A single output clip, or a list of output clips when `flexible` is enabled.
    """
    UnsupportedSampleTypeError.check(clips, vs.FLOAT, self.__class__)

    args = self.get_args(clips)

    if flexible:
        args = args.copy()
        args["flexible_output_prop"] = self.flexible_output_prop

    logger.info("Calling %s.Model", self.plugin.namespace)
    logger.info("Clips: %r", clips)
    logger.info("Network Path: %s", network_path)
    logger.info("overlap=%s, tilesize=%s, %s", overlap, tilesize, args | kwargs)
    output = self.plugin.Model(clips, network_path, overlap, tilesize, **args | kwargs)

    if flexible:
        clip = output["clip"]
        num_planes = output["num_planes"]

        output = [clip.std.PropToClip(prop=f"{self.flexible_output_prop}{i}") for i in range(num_planes)]

    return output
