Choosing an Argo Workflows Executor

How to choose an Argo Workflows Executor

Old Version

This page is about Kubeflow Pipelines V1, please see the V2 documentation for the latest information.

Note, while the V2 backend is able to run pipelines submitted by the V1 SDK, we strongly recommend migrating to the V2 SDK. For reference, the final release of the V1 SDK was kfp==1.8.22, and its reference documentation is available here.

An Argo workflow executor is a process that conforms to a specific interface that allows Argo to perform certain actions like monitoring pod logs, collecting artifacts, managing container lifecycles, etc.

Kubeflow Pipelines runs on Argo Workflows as the workflow engine, so Kubeflow Pipelines users need to choose a workflow executor.

Choosing the Workflow Executor

Emissary executor has been Kubeflow Pipelines’ default executor since Feburay 2022 when KFP 1.8 went GA. We recommend Emissary executor unless you have known compatibility issues with Emissary, in which case please submit your feedback in the Emissary Executor feedback Github issue.
Docker executor is available as a legacy choice. In case you do have compatibilty issues with Emissary executor, and your cluster is running on an older version of Kubernetes (<1.20), you can configure to use Docker executor.

Note that Argo Workflows support other workflow executors, but the Kubeflow Pipelines team only recommend choosing between emissary executor and docker executor.

Emissary Executor

Emissary executor is the default workflow executor for Kubeflow Pipelines v1.8+. It was first released in Argo Workflows v3.1 (June 2021). The Kubeflow Pipelines team believe that its architectural and portability improvements can make it the default executor that most people should use going forward.

Container Runtime: any
Reliability: not yet well-tested and not yet popular, but the Kubeflow Pipelines team supports it.
Security: more secure
- No privileged access.
- Cannot escape the privileges of the pod’s service account.
Migration: command must be specified in Kubeflow Pipelines component specification.
Note, the same migration requirement is required by Kubeflow Pipelines v2 compatible mode, refer to known caveats & breaking changes.

Migrate to Emissary Executor

Prerequisite: emissary executor is only available in Kubeflow Pipelines backend version 1.7+. To upgrade, refer to upgrading Kubeflow Pipelines.

Configure an existing Kubeflow Pipelines cluster to use emissary executor

Install kubectl.
Connect to your cluster via kubectl.
Switch to the namespace you installed Kubeflow Pipelines:
```
kubectl config set-context --current --namespace <your-kfp-namespace>
```
Note, usually it’s kubeflow or default.

Confirm current workflow executor:

kubectl describe configmap workflow-controller-configmap | grep -A 2 containerRuntimeExecutor

You’ll see output like the following when using docker executor:

containerRuntimeExecutor:
----
docker

Configure workflow executor to emissary:

kubectl patch configmap workflow-controller-configmap --patch '{"data":{"containerRuntimeExecutor":"emissary"}}'

Confirm workflow executor is changed successfully:

kubectl describe configmap workflow-controller-configmap | grep -A 2 containerRuntimeExecutor

You’ll see output like the following:

containerRuntimeExecutor:
----
emissary

Deploy a new Kubeflow Pipelines cluster with emissary executor

For AI Platform Pipelines, check the “Use emissary executor” checkbox during installation.

For Kubeflow Pipelines Standalone, install env/platform-agnostic-emissary:

kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/env/platform-agnostic-emissary?ref=$PIPELINE_VERSION"

When in doubt, you can always deploy your Kubeflow Pipelines cluster first and configure workflow executor after installation using the instructions for existing clusters.

Migrate pipeline components to run on emissary executor

Some pipeline components require manual updates to run on emissary executor. For Kubeflow Pipelines component specification YAML, the command field must be specified.

Step by step component migration tutorial:

There is a hello world component:

name: hello-world
implementation:
  container:
    image: hello-world

We can run the container without command/args:

$ docker run hello-world
Hello from Docker!
...

Find out what the default ENTRYPOINT and CMD is in the image:
```
$ docker image inspect -f '{{.Config.Entrypoint}} {{.Config.Cmd}}' hello-world
[] [/hello]
```
So ENTRYPOINT is not specified, and CMD is ["/hello"]. Note, ENTRYPOINT roughly means command and CMD roughly means arguments. command and arguments are concatenated as the user command.

Update the component YAML:

name: hello-world
implementation:
  container:
    image: hello-world
    command: ["/hello"]

The updated component can run on emissary executor now.

Note: Kubeflow Pipelines SDK compiler always specifies a command for python function based components. Therefore, these components will continue to work on emissary executor without modifications.

Docker Executor

Docker executor used to be the default workflow executor before Kubeflow Pipelines v1.8.

Warning

Docker executor depends on docker container runtime, which is deprecated on Kubernetes 1.20+.

Container Runtime: docker only. However, Kubernetes is deprecating Docker as a container runtime after v1.20. On Google Kubernetes Engine (GKE) 1.19+, container runtime already defaults to containerd.
Reliability: most well-tested and most popular argo workflows executor
Security: least secure
- It requires privileged access to docker.sock of the host to be mounted which. Often rejected by Open Policy Agent (OPA) or your Pod Security Policy (PSP). GKE autopilot mode also rejects it, because No privileged Pods.
- It can escape the privileges of the pod’s service account.

Prepare a GKE cluster for Docker Executor

For GKE, the node image decides which container runtime is used. To use docker container runtime, you need to specify a node image with Docker.

You must use one of the following node images:

Container-Optimized OS with Docker (cos)
Ubuntu with Docker (ubuntu)

If your nodes are not using docker as container runtime, when you run pipelines you will always find error messages like:

This step is in Error state with this message: failed to save outputs: Error response from daemon: No such container: XXXXXX

References

Argo Workflow Executors documentation
KFP docker executor doesn’t support Kubernetes 1.19 or above kubeflow/pipelines#5714
Feature request - default to emissary executor kubeflow/pipelines#5718

Feedback

Was this page helpful?

Thank you for your feedback!

We're sorry this page wasn't helpful. If you have a moment, please share your feedback so we can improve.

Last modified July 31, 2024: Fix broken links in Pipelines (#3807) (17e27bf)