Skip to content

Commit 25cb199

Browse files
authored
Update README.md
1 parent cd89c96 commit 25cb199

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

README.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,12 +26,12 @@ For example, the figure above presents the processing time of a single mini-batc
2626
- [CuPy](https://cupy.chainer.org/)
2727
- [pynvrtc](https://github.com/NVIDIA/pynvrtc)
2828

29-
Install requirements via `pip install -r requirements.txt`. CuPy and pynvrtc needed to compile the CUDA code into a callable function at runtime.
29+
Install requirements via `pip install -r requirements.txt`. CuPy and pynvrtc needed to compile the CUDA code into a callable function at runtime. Only single GPU training is supported.
3030

3131
<br>
3232

3333
## Examples
34-
The usage of SRU is similar to `nn.LSTM`.
34+
The usage of SRU is similar to `nn.LSTM`. SRU likely requires more stacking layers than LSTM. We recommend starting by 2 layers and use more if necessary (see our report for more experimental details).
3535
```python
3636
import torch
3737
from torch.autograd import Variable
@@ -81,5 +81,6 @@ export PYTHONPATH=path_to_repo/sru
8181

8282
## To-do
8383
- [ ] ReLU activation
84+
- [ ] support multi-GPU (context change)
8485
- [ ] Layer normalization + residual to compare with highway connection (current version)
8586

0 commit comments

Comments
 (0)