- Data used for the paper Tracking COVID-19 using online search
- Twitter (UK) word embeddings trained using word2vec (skip-gram)
- Twitter (UK) word embeddings trained using word2vec (CBOW), as used in the paper Enhancing feature selection using word embeddings: The case of flu surveillance
- Data used for the paper Predicting judicial decisions of the European Court of Human Rights: a Natural Language Processing perspective
- Data used for the paper Inferring the Socioeconomic Status of Social Media Users based on Behaviour and Language
- Data used for the paper Studying User Income through Language, Behaviour and Affect in Social Media
- Data used for the paper Extracting Socioeconomic Patterns from the News: Modelling Text and Outlet Importance Jointly
- Data used for the paper An analysis of the user occupational class through Twitter content (see also the README)