Copy the page URI to the clipboard
Khare, Prashant; Burel, Gregoire and Alani, Harith
(2019).
DOI: https://doi.org/10.1109/MIS.2019.2917443
Abstract
Social media plays a vital role in information sharing during disasters. Unfortunately, the overwhelming volume and variety of data generated on social media makes it challenging to sieve through such content manually and determine its relevancy. Most automated approaches to classify crisis data for relevancy are based on classic statistical features. However, such approaches do not adapt well to situations when applied on a new crisis event, or to a new language that the model was not trained on. In crisis situations, training a new model for particular crises or languages is not a viable approach. In this paper, we introduce a hybrid semantic-statistical approach for classifying data with regards to relevancy to a given crisis. We demonstrate how this approach outperforms the baselines in scenarios where the model is trained on one type of crisis and language, and tested on new crisis types and additional languages.
Viewing alternatives
Download history
Metrics
Public Attention
Altmetrics from AltmetricNumber of Citations
Citations from DimensionsItem Actions
Export
About
- Item ORO ID
- 61477
- Item Type
- Journal Item
- ISSN
- 1541-1672
- Project Funding Details
-
Funded Project Name Project ID Funding Body COMRADES Not Set EC (European Commission): FP(inc.Horizon2020, H2020, ERC) - Keywords
- semantics; cross-lingual; multilingual; crisis informatics; tweet classification
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM) - Copyright Holders
- © 2019 IEEE
- Depositing User
- Prashant Khare