mloda API

mlodaAPI

The mlodaAPI class serves as the primary interface for interacting with the core system, streamlining the setup and execution of computational workflows

Major components (and steps)

Configuration

Initialize the mloda with features, compute frameworks, and other settings to configure your environment.
Engine Setup

Create an execution plan by setting up the engine, which orchestrates the computation based on the defined features and configurations.
Runner Setup

Prepare the runner that will execute the plan, ensuring that all components are ready for computation.
Run Engine Computation

Apply the runner to execute the computations based on the prepared execution plan, managing the lifecycle of the computational process.

This means, depending on your needs, you can run them all at once (batch run) or split them up, e.g. for realtime needs (inference).

Configuration for mlodaAPI

requested_features: Specify the features to process (as names, Feature objects, or a Features container).
compute_frameworks (optional): Limit the compute frameworks using framework types or names.
links (optional): Define dataset merging links with Link objects.
data_access_collection (optional): Provide data sources for feature identification.

Runner & Execution Configuration

function_extender (optional): Add function extenders to customize computations.
parallelization_modes (optional): Choose between sync, threading, or multiprocessing modes. (Default: sync)
flight_server (optional): Specify a flight server for multiprocessing only.
column_ordering (optional): Control the ordering of result columns. Accepts "alphabetical" (sort columns A-Z) or "request_order" (preserve the order features were requested). Default: None (no guaranteed order).

from mloda.user import mloda

# Alphabetical ordering
result = mloda.run_all(
    ["FeatureC", "FeatureA", "FeatureB"],
    column_ordering="alphabetical"  # Result columns: FeatureA, FeatureB, FeatureC
)

# Preserve request order
result = mloda.run_all(
    ["FeatureC", "FeatureA", "FeatureB"],
    column_ordering="request_order"  # Result columns: FeatureC, FeatureA, FeatureB
)

Plugin Discovery

mloda provides functions to discover and inspect available plugins. Import them from mloda.steward:

from mloda.steward import (
    get_feature_group_docs,
    get_compute_framework_docs,
    get_extender_docs,
    resolve_feature,
)

resolve_feature

Resolve a feature name to its matching FeatureGroup class. This is useful for debugging feature resolution or understanding which FeatureGroup handles a specific feature.

from mloda.steward import resolve_feature

# Successful resolution
result = resolve_feature("my_feature_name")
if result.feature_group:
    print(f"Resolved to: {result.feature_group.__name__}")
else:
    print(f"Error: {result.error}")

# Access all matching candidates (before subclass filtering)
print(f"Candidates: {[fg.__name__ for fg in result.candidates]}")

Parameters:

feature_name (str): The name of the feature to resolve.

Returns: ResolvedFeature dataclass with fields:

feature_name (str): The input feature name.
feature_group (Type[FeatureGroup] | None): The resolved FeatureGroup class, or None if resolution failed.
candidates (List[Type[FeatureGroup]]): All FeatureGroups that matched before subclass filtering.
error (str | None): Error message if resolution failed (no match or multiple conflicts).

get_feature_group_docs

Get documentation for feature groups with optional filtering.

from mloda.steward import get_feature_group_docs

# Get all feature groups
all_fgs = get_feature_group_docs()

# Filter by name
fgs = get_feature_group_docs(name="timestamp")

# Filter by compute framework
fgs = get_feature_group_docs(compute_framework="PandasDataframe")

Parameters:

name (str, optional): Filter by name (case-insensitive partial match).
search (str, optional): Search in description (case-insensitive partial match).
compute_framework (str | Type[ComputeFramework], optional): Filter by compute framework.
version_contains (str, optional): Filter by version substring.

Returns: List[FeatureGroupInfo] sorted by name.

get_compute_framework_docs

Get documentation for compute frameworks with optional filtering.

from mloda.steward import get_compute_framework_docs

# Get all available frameworks
frameworks = get_compute_framework_docs()

# Include unavailable frameworks
all_frameworks = get_compute_framework_docs(available_only=False)

Parameters:

name (str, optional): Filter by name (case-insensitive partial match).
search (str, optional): Search in description (case-insensitive partial match).
available_only (bool, default True): Only return available frameworks.

Returns: List[ComputeFrameworkInfo] sorted by name.

get_extender_docs

Get documentation for extenders with optional filtering.

from mloda.steward import get_extender_docs

# Get all extenders
extenders = get_extender_docs()

# Filter by wrapped function type
extenders = get_extender_docs(wraps="formula")

Parameters:

name (str, optional): Filter by name (case-insensitive partial match).
search (str, optional): Search in description (case-insensitive partial match).
wraps (str, optional): Filter by wrapped function type (case-insensitive exact match).

Returns: List[ExtenderInfo] sorted by name.