kubeflow-lite: Failed to deploy kfp-viz
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Invalid
|
Undecided
|
Unassigned | ||
Snap Store Server |
New
|
Undecided
|
Unassigned |
Bug Description
I am trying to install kubeflow-lite on microk8s on my local machine following the instructions at https:/
I am getting ImagePullBackOff for kfp-viz after running:
$ juju deploy kubeflow-lite --trust
I have tried it many times and I am always getting the same error for the same charm, sometimes I got ImagePullBackOff for kfp-profile-
I get an error related to wrong size:
$ microk8s.kubectl describe pod kfp-viz-
Name: kfp-viz-
Namespace: kubeflow
Priority: 0
Node: nikopc/192.168.2.4
Start Time: Mon, 11 Jul 2022 17:01:45 +0300
Labels: app.kubernetes.
Annotations: apparmor.
Status: Pending
IP: 10.1.184.63
IPs:
IP: 10.1.184.63
Controlled By: ReplicaSet/
Init Containers:
juju-pod-init:
Container ID: containerd:
Image: jujusolutions/
Image ID: docker.
Port: <none>
Host Port: <none>
Command:
/bin/sh
Args:
-c
export JUJU_DATA_
export JUJU_TOOLS_
mkdir -p $JUJU_TOOLS_DIR
cp /opt/jujud $JUJU_TOOLS_
initCmd=
if test -n "$initCmd"; then
$
else
exit 0
fi
State: Terminated
Reason: Completed
Exit Code: 0
Started: Mon, 11 Jul 2022 17:01:47 +0300
Finished: Mon, 11 Jul 2022 17:01:52 +0300
Ready: True
Restart Count: 0
Environment: <none>
Mounts:
/var/lib/juju from juju-data-dir (rw)
/
Containers:
ml-pipeline-
Container ID:
Image: registry.
Image ID:
Port: 8888/TCP
Host Port: 0/TCP
State: Waiting
Reason: ImagePullBackOff
Ready: False
Restart Count: 0
Liveness: exec [wget -q -S -O - http://
Readiness: exec [wget -q -S -O - http://
Environment: <none>
Mounts:
/
/var/lib/juju from juju-data-dir (rw)
/
Conditions:
Type Status
Initialized True
Ready False
ContainersReady False
PodScheduled True
Volumes:
juju-data-dir:
Type: EmptyDir (a temporary directory that shares a pod's lifetime)
Medium:
SizeLimit: <unset>
kube-
Type: Projected (a volume that contains injected data from multiple sources)
TokenExpira
ConfigMapName: kube-root-ca.crt
ConfigMapOp
DownwardAPI: true
QoS Class: BestEffort
Node-Selectors: kubernetes.
Tolerations: node.kubernetes
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 18m default-scheduler Successfully assigned kubeflow/
Normal Pulled 18m kubelet Container image "jujusolutions/
Normal Created 18m kubelet Created container juju-pod-init
Normal Started 18m kubelet Started container juju-pod-init
Warning Failed 14m kubelet Failed to pull image "registry.
Warning Failed 11m kubelet Failed to pull image "registry.
Normal BackOff 10m (x2 over 14m) kubelet Back-off pulling image "registry.
Warning Failed 10m (x2 over 14m) kubelet Error: ImagePullBackOff
Normal Pulling 10m (x3 over 18m) kubelet Pulling image "registry.
Warning Failed 8m18s kubelet Failed to pull image "registry.
Warning Failed 8m18s (x3 over 14m) kubelet Error: ErrImagePull
Warning DNSConfigForming 3m28s (x20 over 18m) kubelet Search Line limits were exceeded, some search paths have been omitted, the applied search line is: kubeflow.
I am running juju v2.9.32
Please let me know if I can provide more info to help you reproduce this.
To add to this, the error is not simply that the image the charm needs never existed. Others can successfully deploy this charm. There is something happening that is either intermittent or user-specific here that's causing the ImagePullBackoff, although I have no idea what