This is a project to collect and share datasets for GNN training and Graph Computation.
| Graph | Category | Size (V, E) | Features | Labels | Average Degree | Download Link |
|---|---|---|---|---|---|---|
| Cora | Citation Network | 2708,5429 | 1433 | 7 | 2.0 | https://linqs-data.soe.ucsc.edu/public/lbc/cora.tgz |
| Citeseer | Citation Network | 3327,4732 | 3703 | 6 | 1.4 | http://www.cs.umd.edu/~sen/lbc-proj/data/citeseer.tgz |
| Pubmed | Citation Network | 19717,44338 | 500 | 3 | 2.2 | https://linqs-data.soe.ucsc.edu/public/Pubmed-Diabetes.tgz |
| Reddit-small | Forum | 232965,114615892 | 602 | 41 | 491.9 | http://snap.stanford.edu/graphsage/ |
| Reddit-large | Forum | |||||
| ogbn-arxiv | Node classification | 169343,1166243 | 128 | 40 | 6.9 | |
| ogbn-products | Node classification | 2449029,61859140 | 100 | 47 | 25.3 | |
| PPI |
Some commonly used data sets can be found from the following links:
(1) The KONECT Project http://konect.cc/networks/
(2) Stanford Large Network Dataset Collection http://memetracker.org/data/index.html
(3) 9th DIMACS Implementation Challenge - Shortest Paths http://www.diag.uniroma1.it//~challenge9/download.shtml
(4) A large Scale Graph Generater https://github.com/farkhor/PaRMAT ...