This page contains brief descriptions and links to download existing crisis-related collections.
BlackLivesMatterU/T1
Users tweeting about #BlackLivesMatter, labeled by type, race, gender and age.
Data Sources: Twitter
Sampling: hashtag-based query
CrisisLexT26
Tweets from 26 crises, labeled by informativeness, information type and source.
Data Sources: Twitter
Sampling: keyword-based queries
CrisisLexT6
Tweets from 6 crises, labeled by relatedness to the coresponding crisis.
Data Sources: Twitter
Sampling: keyword and geo-based queries
ClimateCovE350
Climate change related events, labeled by relevance, triggers, actions, and news values.
Data Sources: Twitter, GDELT (news)
Sampling: keyword-based queries
SoSItalyT4
Tweets from 4 crises, labeled by the type of information they convey.
Data Sources: Twitter
Sampling: keyword-based queries
ChileEarthquakeT1
Tweets from the Chilean earthquake of 2010, labeled by relatedness.
Data Sources: Twitter
Sampling: keyword-based queries
EnvironmentalPetitionTweets
Petition URLs and tweets containing them.
Data Sources: Twitter
Sampling: url-based queries
SandyHurricaneGeoT1
Geo-tagged tweets from the Sandy Hurricane.
Data Sources: Twitter
Sampling: geo-based queries
BlackLivesMatterU/T1
Users mentioning #BlackLivesMatter, labeled by type, race, age, gender. March 2016This collection includes tweets containing the #BlackLivesMatter hashtag and that were posted from April 2012 to May 2015. It also includes a collection of about 6000 users annotated according to type (organizations vs. individuals), and 3 demographic factors (race, age, gender), which have used the hashtag during this period.
Hashtag | #Tweets | #Users | #Labeled Users |
---|---|---|---|
#BlackLivesMatter | 3.54 million | 0.88 million | 6000 |
If you use the BlackLivesMatterU/T1 collections, please cite:
Annotated data available upon request BlackLivesMatterT1-v1.0.zip (29.1 MB)
ClimateCovE350
Climate events, labeled by relevance, triggers, actions, and news values April 2015This collection includes about 350 events that received medium to high coverage in Twitter, mainstream media, or both, covering a period of 17 months in 2013 and 2014, and are labeled by relevance to climate-chance, triggers, actions, and 6 news values (i.e. extraordinary, unpredictable, high magnitude, negative, conflictive, related to elite persons).
Types (or triggers) | Description | Example |
---|---|---|
Disaster | Disruption of the functioning of a community that involves widespread human, material, or environmental losses | Typhoon, Tornadoes |
Government (all branches) and intergovernmental agencies | Any institution belonging to any government branch (executive, legislative, judicial), or any inter-governmental agency, or any government employee acting in official capacity | Law enforcement agencies, United Nations, Presidency, Ministry |
Groups, NGOs, and universities | Any non-profit, nongovernmental group, formally established or not. We include in this category educational and research institutions | GreenPeace, Stanford, WWF |
For-profit (excl. media, universities) | Any for-profit organization, including business and corporations but excluding media and universities, which appear in the other categories | Google, Shell |
Media | Any media organization | CNN, New York Times, The Guardian, Associated Press |
Individuals | Any individual that is not acting as a representative of any of the organization types listed above | Actors, Neil deGrasse Tyson |
Sub-types (or actions) | Description | Example |
Natural Hazards | Extreme weather and climate events that occur naturally | Typhoon, Drought |
Human-Induced Hazards | Hazards having an element of human intent, negligence, error, or involving a failure of a human-made system | Deforestation, Oil Spill |
Legal actions | Any action that is legally binding, including new executive orders and new laws, plus any action brought to a court of law, such as lawsuits | New legislation, lawsuits |
Publications | Any release of a document to the public, including reports, studies, memoranda, infographics and cartoons | IPCC Reports, Polar bear cartoon |
Meetings | Any meeting, conference, convention, etc | IPCC meeting, UN meetings |
Other | Other types of actions not belonging to the categories above, in our data this corresponded mostly to campaigns and brief public statements | Campaigns, statements, projects |
If you use the ClimateCovE350 collection, please cite:
Browse on GitHub ClimateCovE350-v1.0.zip (48 KB)
CrisisLexT26
Tweets from 26 crises, labeled by informativeness, information type and source Nov 2014This collection includes tweets collected during 26 large crisis events in 2012 and 2013, with about 1,000 tweets labeled per crisis for informativeness (i.e. “informative," or "not informative"), information type, and source.
Crisis | Country | Start / Duration | #Tweets | Category | Sub-Category | Type | Development | Spread |
---|---|---|---|---|---|---|---|---|
2012 Italy earthquakes | Italy | May / 32 days | 7,351 | Natural | Geophysical | Earthquake | Diffused | Instantaneous |
2012 Colorado wildfires | US | Jun / 31 days | 4,172 | Natural | Climatological | Wildfire | Diffused | Progressive |
2012 Philipinnes floods | Philipinnes | Aug / 13 days | 2,950 | Natural | Hydrological | Floods | Diffused | Progressive |
2012 Venezuela refinery explosion | Venezuela | Aug / 12 days | 2,736 | Human-induced | Accidental | Explosion | Focalized | Instantaneous |
2012 Costa Rica earthquake | Costa Rica | Sep / 13 days | 2,193 | Natural | Geophysical | Earthquake | Diffused | Instantaneous |
2012 Guatemala earthquake | Guatemala | Nov / 20 days | 3,261 | Natural | Geophysical | Earthquake | Diffused | Instantaneous |
2012 Typhoon Pablo | Phillipines | Nov / 21 days | 1,944 | Natural | Meteorological | Typhoon | Diffused | Progressive |
2013 Brazil nightclub fire | Brazil | Jan / 16 days | 4,786 | Human-induced | Accidental | Fire | Focalized | Instantaneous |
2013 Queensland floods | Australia | Jan / 19 days | 1,223 | Natural | Hydrological | Floods | Diffused | Progressive |
2013 Russian meteor | Russia | Feb / 19 days | 8,365 | Natural | Others | Meteorite | Focalized | Instantaneous |
2013 Boston bombings | US | Apr / 60 days | 157,454 | Human-induced | Intentional | Bombings | Focalized | Instantaneous |
2013 Savar building collapse | Bangladesh | Apr / 36 days | 4,070 | Human-induced | Accidental | Collapse | Focalized | Instantaneous |
2013 West Texas explosion | US | Apr / 29 days | 14,505 | Human-induced | Accidental | Explosion | Focalized | Instantaneous |
2013 Alberta floods | Canada | Jun / 25 days | 5,887 | Natural | Hydrological | Floods | Diffused | Progressive |
2013 Singapore haze | Singapore | Jun / 19 days | 3,639 | Mixed | Others | Haze | Diffused | Progressive |
2013 Lac-Megantic train crash | Canada | Jul / 14 days | 2,342 | Human-induced | Accidental | Derailment | Focalized | Instantaneous |
2013 Spain train crash | Spain | Jul / 15 days | 3,681 | Human-induced | Accidental | Derailment | Focalized | Instantaneous |
2013 Manila floods | Phillipines | Aug / 11 days | 2,032 | Natural | Hydrological | Floods | Diffused | Progressive |
2013 Colorado floods | US | Sep / 21 days | 1,778 | Natural | Hydrological | Floods | Diffused | Progressive |
2013 Australia wildfires | Australia | Oct / 21 days | 1,982 | Natural | Climatological | Wildfire | Diffused | Progressive |
2013 Bohol earthquake | Phillipines | Oct / 12 days | 2,214 | Natural | Geophysical | Earthquake | Diffused | Instantaneous |
2013 Glasgow helicopter crash | UK | Nov / 30 days | 2,558 | Human-induced | Accidental | Crash | Focalized | Instantaneous |
2013 LA Airport shootings | US | Nov / 12 days | 2,730 | Human-induced | Intentional | Shootings | Focalized | Instantaneous |
2013 NYC train crash | US | Nov / 8 days | 1,066 | Human-induced | Accidental | Derailment | Focalized | Instantaneous |
2013 Sardinia floods | Italy | Nov / 13 days | 1,143 | Natural | Hydrological | Floods | Diffused | Progressive |
2013 Typhoon Yolanda | Phillipines | Nov / 58 days | 38,951 | Natural | Meteorological | Typhoon | Diffused | Progressive |
If you use the CrisisLexT26 collection, please cite:
Browse on GitHub CrisisLexT26-v1.0.zip (4.6 MB)
CrisisLexT6
Tweets from 6 crises, labeled by relatedness June 2014This collection includes English tweets across 6 large events in 2012 and 2013, with about 10,000 tweets labeled by relatedness (as "on-topic", or "off-topic") with each event.
Crisis | Start / Duration | Keyword-based sampling (keywords) | #Tweets | Geo-based sampling (regions or coordinates) | #Tweets |
---|---|---|---|---|---|
2012 Sandy Hurricane | 2012-10-28 / 3 days | 4: hurricane, hurricane sandy, frankenstorm, #sandy | 2,775,812 | NY City; Bergen, Ocean, Union, Atlantic, Essex, Cape May, Hudson, Middlesex; Monmouth County, NJ, US | 279,454 |
2013 Boston Bombings | 2013-04-15 / 5 days | 17: boston explosion, BostonMarathon, boston blast, boston terrorist, boston bomb, boston tragedy, PrayForBoston, boston attack, boston tragic | 3,375,076 | Suffolk and Norfolk Counties, Massachusetts, US | 88,931 |
2013 Oklahoma Tornado | 2013-05-20 / 11 days | 36: oklahoma tornado, oklahoma storm, oklahoma relief, oklahoma volunteer, oklahoma disaster, #moore, moore relief, moore storm, #ok, #okc | 2,742,588 | long. in [-98.25, -96.75] and lat. in [34.5, 35.75] | 62,237 |
2013 West Texas Explosion | 2013-04-17 / 11 days | 9: #westexplosion, #westtx, west explosion, waco explosion, texas explosion, tx explosion, texas fertilizer, #prayfortexas, #prayforwest | 508,333 | long. in [-97.5, -96.5] and lat. in [31.5, 32] | 16,033 |
2013 Alberta Floods | 2013-06-21 / 11 days | 13: alberta flood, #abflood, canada flood, alberta flooding, alberta floods, canada flooding, canada floods, #yycflood, #yycfloods, #yycflooding | 370,762 | Alberta, Canada | 166,012 |
2013 Queensland Floods | 2013-01-27 / 6 days | 4: #qldflood, #bigwet, queensland flood, australia flood | 5,393 | Queensland, Australia | 27,000 |
If you use the CrisisLexT6 collection, please cite:
Browse on GitHub CrisisLexT6-v1.0.zip (3.1 MB)
We would like to host and/or provide links to other crisis-related collections. Please contact us to include other collections in this list.
ChileEarthquakeT1
Tweets from the 2010 Chilean earthquake, labeled by relatedness. June 2015This collection includes about 2000 tweets in Spanish posted after the Chilean earthquake of 2010, all labeled by relatedness (relevant or not relevant).
Crisis | Year | #Tweets |
---|---|---|
Chile Earthquake | 2010 | 2187 |
If you use the ChileEarthquakeT1 collection, please cite:
coboetal2015_twitter.tar.gz (0.2 MB)
SoSItalyT4
Tweets from 4 crises in Italy, labeled by relatedness and type. June 2015This collection includes tweets across 4 different natural disasters that occurred in Italy between 2009 and 2014, with between ~400 to ~3100 tweets labeled by the type of information they convey (as "damage", "no damage", or "not relevant").
Crisis | Year | #Tweets |
---|---|---|
Sardegna Flood | 2013 | 976 |
L'Aquila Earthquake | 2009 | 1,062 |
Emilia Earthquake | 2012 | 3,170 |
Genova Floods | 2014 | 434 |
If you use the SoSItalyT4 collection, please cite:
Browse on Dataset Website Cresci-SWDM15-CSV.zip (0.3 MB)
SandyHurricaneGeoT1
Geo-Located tweets from the 2012 Sandy Hurricane. June 2015This collection includes 6,556,328 geotagged tweets that represent all geotagged tweets from the time and regions impacted by Hurricane Sandy, the largest Atlantic hurricane on record.
Crisis | Year | #Tweets |
---|---|---|
Sandy Hurricane | 2012 | 6,556,328 |
If you use the SandyHurricaneGeoT1 collection, please cite:
Browse on GitHub release.tgz (56.6 MB)
EnvironmentalPetitionTweets
Petition URLs, the tweets containing them and basic stats. May 2016This collection includes tweets containing URLs coresponding to various environmental campaigns from Jan 2015 to April 2015. The dataset also contains basic stats about the collected signatures and the petition signature goal.
Number | Time-interval | #Tweets |
---|---|---|
~200 | Jan 1, 2015 to April 14, 2015 | 37700 |
If you use the EnvironmentalPetitionTweets collection, please cite: