Configuration Options

OpenKiwi supports an extensive range of options through its command line interface. (or see note below). All commands are prepended by a `<pipeline>` command. For the available pipelines see: CLI

Note: Args that start with ‘--’ (eg. --save-config) can also be set in a config file (specified via --config). The config file uses YAML syntax and must represent a YAML ‘mapping’ (for details, see http://learn.getgrav.org/advanced/yaml). If an arg is specified in more than one place, then command line values override config file values which override defaults.

For pipeline specific options see:

General Options

These options are pipeline independent and are available in all different pipelines. They are divided into three different categories, general, IO and save/load.

usage: kiwi <pipeline> [-h] [--seed SEED] [--gpu-id GPU_ID]

random

--seed

Random seed

Default: 42

gpu

--gpu-id Use CUDA on the listed devices
usage: kiwi <pipeline> [-h] [--save-config SAVE_CONFIG] [-d] [-q]

I/O

--save-config Save parsed configuration and arguments to the specified file
-d, --debug

Output additional messages.

Default: False

-q, --quiet

Only output warning and error messages.

Default: False

usage: kiwi <pipeline> [-h] [--load-model LOAD_MODEL] [--save-data SAVE_DATA]
                       [--load-data LOAD_DATA] [--load-vocab LOAD_VOCAB]

save-load

--load-model Directory containing a model.torch file to be loaded
--save-data Output dir for saving the preprocessed data files.
--load-data Input dir for loading the preprocessed data files.
--load-vocab Directory containing a vocab.torch file to be loaded
usage: kiwi <pipeline> [-h] [--log-interval LOG_INTERVAL]
                       [--mlflow-tracking-uri MLFLOW_TRACKING_URI]
                       [--experiment-name EXPERIMENT_NAME]
                       [--run-name RUN_NAME] [--run-uuid RUN_UUID]
                       [--output-dir OUTPUT_DIR]
                       [--mlflow-always-log-artifacts [MLFLOW_ALWAYS_LOG_ARTIFACTS]]

Logging

--log-interval

Log every k batches.

Default: 100

--mlflow-tracking-uri
 

If using MLflow, logs model parameters, training metrics, and artifacts (files) to this MLflow server. Uses the localhost by default.

Default: “mlruns/”

--experiment-name
 If using MLflow, it will log this run under this experiment name, which appears as a separate sectionin the UI. It will also be used in some messages and files.
--run-name If using MLflow, it will log this run under this run name, which appears as a separate item in the experiment.
--run-uuid If specified, MLflow/Default Logger will log metrics and params under this ID. If it exists, the run status will change to running. This ID is also used for creating this run’s output directory. (Run ID must be a 32-character hex string)
--output-dir Output several files for this run under this directory. If not specified, a directory under “runs” is created or reused based on the Run UUID. Files might also be sent to MLflow depending on the –mlflow-always-log-artifacts option.
--mlflow-always-log-artifacts
 

If using MLFlow, always log (send) artifacts (files) to MLflow artifacts URI. By default (false), artifacts are only logged ifMLflow is a remote server (as specified by –mlflow-tracking-uri option). All generated files are always saved in –output-dir, so it might be considered redundant to copy them to a local MLflow server. If this is not the case, set this option to true.

Default: False