Commit 5fb9bc1

minor
1 parent f85700f commit 5fb9bc1

1 file changed: +2 -2 lines changed

bert_pytorch/dataset/vocab.py

Lines changed: 2 additions & 2 deletions
@@ -191,9 +191,9 @@ def build():
     print("get corpus")
     texts = []
     for index, corpus in tqdm(enumerate(os.listdir(args.corpus_path))):
-        print("getting {}".format(corpus))
+        print("getting{}".format(corpus))
         with open(os.path.join(args.corpus_path,corpus), "r", encoding=args.encoding) as f:
-            texts += f.readlines()
+            texts += f
             # print(type(f))
     vocab = WordVocab(texts, max_size=args.vocab_size, min_freq=args.min_freq)
     pass
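
Note on the second change: a minimal sketch, assuming a plain UTF-8 text file, of why replacing `texts += f.readlines()` with `texts += f` should leave the collected lines unchanged. A list's `+=` accepts any iterable, and a text-mode file object yields its lines one at a time. The helper names `read_lines_eager`, `read_lines_iter`, and `demo` are hypothetical and not part of `vocab.py`.

```python
import os
import tempfile


def read_lines_eager(path, encoding="utf-8"):
    """Collect lines with f.readlines(), as the code did before this commit."""
    texts = []
    with open(path, "r", encoding=encoding) as f:
        texts += f.readlines()  # builds a full list first, then extends texts
    return texts


def read_lines_iter(path, encoding="utf-8"):
    """Collect lines by extending the list with the file object itself."""
    texts = []
    with open(path, "r", encoding=encoding) as f:
        texts += f  # list += iterable consumes the file line by line
    return texts


def demo():
    """Write a small throwaway corpus file and read it back both ways."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "corpus.txt")
        with open(path, "w", encoding="utf-8") as f:
            f.write("hello world\nsecond line\n")
        return read_lines_eager(path), read_lines_iter(path)


if __name__ == "__main__":
    eager, lazy = demo()
    print(eager == lazy)
```

Iterating the file avoids the intermediate list that `readlines()` returns, though `texts` still ends up holding every line in memory.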

0 commit comments
