You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It is highly recommended to use the NVIDIA PyTorch or NeMo container if having issues installing Apex or any other dependencies.
241
+
242
+
While installing Apex, it may raise an error if the CUDA version on your system does not match the CUDA version torch was compiled with.
243
+
This raise can be avoided by commenting it here: https://github.com/NVIDIA/apex/blob/master/setup.py#L32
244
+
245
+
cuda-nvprof is needed to install Apex. The version should match the CUDA version that you are using:
246
+
247
+
.. code-block:: bash
248
+
249
+
conda install -c nvidia cuda-nvprof=11.8
250
+
251
+
packaging is also needed:
252
+
253
+
.. code-block:: bash
254
+
255
+
pip install -y packaging
256
+
257
+
246
258
Transformer Engine
247
259
~~~~~~~~~~~~~~~~~~
248
260
NeMo Megatron GPT has been integrated with `NVIDIA Transformer Engine <https://github.com/NVIDIA/TransformerEngine>`_
249
261
Transformer Engine enables FP8 training on NVIDIA Hopper GPUs.
250
262
`Install <https://docs.nvidia.com/deeplearning/transformer-engine/user-guide/installation.html>`_ it manually if not using the NVIDIA PyTorch container.
0 commit comments