You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+3-2Lines changed: 3 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -26,12 +26,12 @@ For example, the figure above presents the processing time of a single mini-batc
26
26
-[CuPy](https://cupy.chainer.org/)
27
27
-[pynvrtc](https://github.com/NVIDIA/pynvrtc)
28
28
29
-
Install requirements via `pip install -r requirements.txt`. CuPy and pynvrtc needed to compile the CUDA code into a callable function at runtime.
29
+
Install requirements via `pip install -r requirements.txt`. CuPy and pynvrtc needed to compile the CUDA code into a callable function at runtime. Only single GPU training is supported.
30
30
31
31
<br>
32
32
33
33
## Examples
34
-
The usage of SRU is similar to `nn.LSTM`.
34
+
The usage of SRU is similar to `nn.LSTM`. SRU likely requires more stacking layers than LSTM. We recommend starting by 2 layers and use more if necessary (see our report for more experimental details).
0 commit comments