Dataset Metrics

Overview

Methodology

Number of objects (submissions): 1,363,924
Start Date: Sat Jul 06 03:19:53 +0000 2019
End Date: Sat Jul 06 03:25:27 +0000 2019
Format: csv

This dataset contains tweet ids for a five minute time span starting at the beginning of the 7.1 magnitude Ridgecrest earthquake.

This data was compiled using the methodology detailed here.

The tweet ids can be rehydrated using Twitter's "statuses lookup" endpoint. Twitter's terms of service prevent including the actual tweets in this dataset. However, please feel free to contact the author of this publication if you need further assistance or access to the original data (for academic research purposes only).

When collecting the data, the following sequence numbers were used: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. The machine ids in rotation during this time span were: 341, 322, 327, 332, 334, 335, 336, 326, 321, 382, 366, 378, 364, 374, 373, 361, 333, 363, 377, 379.
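The sequence numbers and machine ids listed above are fields packed inside each tweet id. As a minimal sketch, assuming the commonly documented Snowflake layout (a millisecond timestamp above bit 22, a 10-bit machine id, and a 12-bit sequence number), the following Python decodes an id and checks it against the values reported for this dataset. The function names are hypothetical and are not part of the author's tooling.

```python
from datetime import datetime, timedelta, timezone

# Assumption: standard Snowflake layout and Twitter's epoch offset (ms).
TWITTER_EPOCH_MS = 1288834974657
UNIX_EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)

# Values reported in this dataset description.
MACHINE_IDS = {341, 322, 327, 332, 334, 335, 336, 326, 321, 382,
               366, 378, 364, 374, 373, 361, 333, 363, 377, 379}
SEQUENCE_NUMBERS = set(range(10))


def decode_snowflake(tweet_id: int) -> dict:
    """Split a tweet id into creation time, machine id and sequence number."""
    timestamp_ms = (tweet_id >> 22) + TWITTER_EPOCH_MS
    return {
        "created_at": UNIX_EPOCH + timedelta(milliseconds=timestamp_ms),
        "machine_id": (tweet_id >> 12) & 0x3FF,  # 10 bits
        "sequence": tweet_id & 0xFFF,            # 12 bits
    }


def encode_snowflake(timestamp_ms: int, machine_id: int, sequence: int) -> int:
    """Inverse of decode_snowflake; handy for building test ids."""
    return ((timestamp_ms - TWITTER_EPOCH_MS) << 22) | (machine_id << 12) | sequence


def in_reported_rotation(tweet_id: int) -> bool:
    """True if the id uses a machine id and sequence number listed above."""
    fields = decode_snowflake(tweet_id)
    return (fields["machine_id"] in MACHINE_IDS
            and fields["sequence"] in SEQUENCE_NUMBERS)
```

Applying `decode_snowflake` to each id in the csv gives a creation time for every tweet without rehydrating it, which can be useful for binning the ids by second across the time span.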

This represents a 99% (±0.5%) sample rate of the full population of publicly available tweets during the time span for this data set.
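For readers who want a rough size for the full population, the sample rate can be inverted. This is only a sketch: it assumes the ±0.5% is an absolute margin on the sample rate (that is, a rate between 98.5% and 99.5%), which the description does not state explicitly, and the function name is hypothetical.

```python
SAMPLE_SIZE = 1_363_924   # number of tweet ids in this dataset
SAMPLE_RATE = 0.99        # reported sample rate
RATE_MARGIN = 0.005       # assumption: +/-0.5 percentage points


def estimate_population(sample_size: int, rate: float, margin: float):
    """Return (low, point, high) estimates of the full population size."""
    point = sample_size / rate
    low = sample_size / (rate + margin)   # higher rate -> smaller population
    high = sample_size / (rate - margin)  # lower rate -> larger population
    return low, point, high


low, point, high = estimate_population(SAMPLE_SIZE, SAMPLE_RATE, RATE_MARGIN)
print(f"Estimated public tweets in the span: {point:,.0f} "
      f"(range {low:,.0f} to {high:,.0f})")
```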

Contact

If you have any questions about the data or require more details on the methodology, you are welcome to contact the author.