The Wayback Machine - https://web.archive.org/web/20200909141204/https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf