Commit bbf30f7

add dataset
1 parent 47286f9 commit bbf30f7

1 file changed, 4 additions and 0 deletions

README.md

Lines changed: 4 additions & 0 deletions
@@ -309,6 +309,10 @@ BLEU is a modified form of "precision", that used in machine translation evaluat
* [17 Timelines](http://www.l3s.de/~gtran/timeline/)
* [Legal Case Reports Data Set](http://archive.ics.uci.edu/ml/datasets/Legal+Case+Reports)
* [Annotated English Gigaword](https://catalog.ldc.upenn.edu/LDC2012T21)
+ * [CNN/Daily Mail dataset](https://cs.nyu.edu/~kcho/DMQA/)
+   * It was originally a Q&A dataset, but its `stories` are commonly used as a summarization dataset (you can download a preprocessed version from [here](https://github.com/abisee/cnn-dailymail)).
+ * [CORNELL NEWSROOM](https://summari.es/)
+   * A very large summarization dataset: it contains 1.3 million articles and summaries written by authors and editors in the newsrooms of 38 major publications.
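The CNN/Daily Mail entry notes that its `stories` are commonly used for summarization. As a rough illustration of what that involves, the sketch below splits one story into article text and reference summary sentences. It assumes the raw story layout in which the article body comes first and each summary sentence follows a line containing only `@highlight`. The function name `split_story` and the sample text are hypothetical and are not part of the dataset or its tooling.

```python
def split_story(text):
    """Split raw story text into (article, highlights).

    Assumption: the article body comes first, and every summary
    sentence is introduced by a line containing only '@highlight'.
    """
    article_lines = []
    highlights = []
    in_highlights = False
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank separator lines
        if line == "@highlight":
            in_highlights = True  # everything after the first marker is summary
        elif in_highlights:
            highlights.append(line)
        else:
            article_lines.append(line)
    return " ".join(article_lines), highlights


# Hypothetical sample in the assumed layout (not taken from the dataset).
sample = """The city council met on Monday.

It approved the new budget.

@highlight

Council approves budget

@highlight

Meeting held on Monday
"""

article, highlights = split_story(sample)
print(article)
print(highlights)
```

Joining the highlights into one string gives a multi-sentence reference summary, which is how such pairs are typically fed to a summarization model.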

### Libraries
