Large Network Dataset Collection. Social networks Networks with ground-truth communities Communication networks Citation networks Collaboration networks Web graphs.
Publicly available large data sets for database research. Most database research papers use synthetic data sets.
That is, they use random-number generators to create their data on the fly. A popular generator is dbgen from the Transaction Processing Performance Council (TPC). World Population Prospects, the 2010 Revision.