Crowdsourcing Gender and Age Annotator Rationales in Multiple Languages


Rationales - targeted annotator feedback regarding why and how they chose a particular annotation collected via crowdsourcing.
We collected rationale annotations from a set of Twitter profiles with known to us (but not known to the annotators) gender, age and political preference labels and used these "gold-standard" labels as a quality control check during the rationale annotation.
By using this data, models or code, you agree to be bound by the terms of its license. Read the license.

English Gender Rationales

English Age Rationales

English Political Preference Rationales

Spanish Gender Rationales

Spanish Age Rationales

Male and female top rationales collected using crowdsourcing for a sample of 400 Twitter profiles.
Teen (14 - 19 y.o.) and young (20 - 34 y.o.) top rationales collected using crowdsourcing for a sample of 300 Twitter profiles.
Republican and Democratic top rationales collected using crowdsourcing for a sample of 380 Twitter user profiles from the active dataset.