Skip to content

Commit 6ad68f9

Browse files
committed
add link to troubleshooting page
1 parent 22b29d3 commit 6ad68f9

File tree

2 files changed

+4
-2
lines changed

2 files changed

+4
-2
lines changed

docs/vendor/embedded-troubleshooting.mdx

+1-1
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ When troubleshooting, it can be useful to list the cluster and view logs using t
5959

6060
This section provides troubleshooting advice for common errors.
6161

62-
### Installation failure when NVIDIA GPU Operator is included as Helm extension
62+
### Installation failure when NVIDIA GPU Operator is included as Helm extension {#nvidia}
6363

6464
#### Symptom
6565

docs/vendor/embedded-using.mdx

+3-1
Original file line numberDiff line numberDiff line change
@@ -220,5 +220,7 @@ Using the NVIDIA GPU Operator with Embedded Cluster requires configuring the con
220220
When the containerd options are configured as shown above, the NVIDIA GPU Operator automatically creates the required configurations in the `/etc/k0s/containerd.d/nvidia.toml` file. It is not necessary to create this file manually, or modify any other configuration on the hosts.
221221

222222
:::note
223-
If you include the NVIDIA GPU Operator as a Helm extension, remove any existing containerd services that are running on the host (such as those deployed by Docker) before attempting to install the release with Embedded Cluster. If there are any containerd services on the host, the NVIDIA GPU Operator will generate an invalid containerd config, causing the installation to fail.
223+
If you include the NVIDIA GPU Operator as a Helm extension, remove any existing containerd services that are running on the host (such as those deployed by Docker) before attempting to install the release with Embedded Cluster. If there are any containerd services on the host, the NVIDIA GPU Operator will generate an invalid containerd config, causing the installation to fail. For more information, see [Installation failure when NVIDIA GPU Operator is included as Helm extension](#nvidia) in _Troubleshooting Embedded Cluster_.
224+
225+
This is the result of a known issue with v24.9.x of the NVIDIA GPU Operator. For more information about the known issue, see [container-toolkit does not modify the containerd config correctly when there are multiple instances of the containerd binary](https://github.com/NVIDIA/nvidia-container-toolkit/issues/982) in the nvidia-container-toolkit repository in GitHub.
224226
:::

0 commit comments

Comments
 (0)