
Commit df9a5ef

limit 1:3000
1 parent ab9ff16 commit df9a5ef

1 file changed: +2 -2 lines changed

poc/seq2seq1.py

Lines changed: 2 additions & 2 deletions
@@ -221,8 +221,8 @@ def collate_fn(batch):
 
 def train_model():
     # Load and preprocess data
-    english_data = open("../synthetic_data/news-commentary-v12.zh-en.en").readlines()
-    chinese_data = open("../synthetic_data/news-commentary-v12.zh-en.zh").readlines()
+    english_data = open("../synthetic_data/news-commentary-v12.zh-en.en").readlines()[1:3000]
+    chinese_data = open("../synthetic_data/news-commentary-v12.zh-en.zh").readlines()[1:3000]
 
     # Tokenize data
     english_tokenized = [sentence.strip().split() for sentence in english_data]
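
Note on the slice in this change: `readlines()[1:3000]` still reads the whole file into memory, then keeps the lines at indices 1 through 2999, which is 2,999 lines with the first line skipped. Whether skipping the first line is intended is not stated in the commit. The sketch below is one possible alternative, not the author's method. It reads only the requested window and closes the file afterwards. The helper names `read_line_window` and `load_parallel` are hypothetical, and the explicit `utf-8` encoding is an assumption (the original relies on the platform default).

```python
from itertools import islice


def read_line_window(path, limit, skip=0, encoding="utf-8"):
    """Return up to `limit` lines from `path` after skipping `skip` lines.

    Hypothetical helper; the utf-8 default is an assumption.
    """
    with open(path, encoding=encoding) as handle:
        # islice stops iterating once the window is filled, so lines
        # beyond skip + limit are never read.
        return list(islice(handle, skip, skip + limit))


def load_parallel(en_path, zh_path, limit, skip=0):
    """Load the same line window from both sides of a parallel corpus."""
    english = read_line_window(en_path, limit, skip)
    chinese = read_line_window(zh_path, limit, skip)
    # Sentence pairs are matched by position, so the counts must agree.
    if len(english) != len(chinese):
        raise ValueError("parallel files yielded different line counts")
    return english, chinese
```

Calling `load_parallel(en_path, zh_path, limit=2999, skip=1)` would select the same lines as the slice in this commit, while `limit=3000, skip=0` would take the first 3,000 lines instead.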
