A dataset for research on domain generation algorithms (DGAs) and machine learning. It contains more than 90 million domains from more than 100 DGA families.


Access: Public

Link: Zenodo

We provide a large-scale dataset of the messages exchanged publicly by streamers and viewers during the live broadcasts of users identified as adult content producers on the LiveMe platform, a major Social Live Streaming Service (SLSS). The dataset comprises 39,382,838 chat messages exchanged by 1,428,284 users across 293,271 live broadcasts over a period of approximately two years, from July 2016 to June 2018. The analysis of this dataset can be found in our paper "Large-scale analysis of grooming in modern social networks" (arXiv:2004.08205 [cs.SI]).


Access: Only researchers and LEAs upon request

Link: Zenodo

Copyright © 2020 LOCARD. All rights reserved. This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant Agreement nº 832735. This project reflects only the author’s view and the Commission is not responsible for any use that may be made of the information it contains.