Deep Learning Approaches for Multi-Label Incidents Classification from Twitter Textual Information

Authors

  • Sherly Rosa Anggraeni, Department of Informatics, Institut Teknologi Sepuluh Nopember, Indonesia
  • Narandha Arya Ranggianto, Department of Informatics, Institut Teknologi Sepuluh Nopember, Indonesia
  • Imam Ghozali, Department of Informatics, Institut Teknologi Sepuluh Nopember, Indonesia
  • Chastine Fatichah, Department of Informatics, Institut Teknologi Sepuluh Nopember, Indonesia
  • Diana Purwitasari
    diana@if.its.ac.id
    Department of Informatics, Institut Teknologi Sepuluh Nopember, Indonesia

Background: Twitter is one of the most widely used social media platforms, with 310 million monthly active users and 500 million tweets per day. It is used not only to discuss trending topics but also to share information about incidents such as accidents, fires, and traffic jams. People often find these updates useful for minimizing the impact of such incidents.

Objective: This study compares the effectiveness of three deep learning methods (CNN, RCNN, CLSTM), each combined with NeuroNER, for multi-label classification of incidents.

Methods: NeuroNER is applied to the tweets to produce entity labels, and its output is paired with each of three deep learning classification methods (CNN, RCNN, CLSTM).
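
As a minimal sketch of what "multi-label" means at the output of such a classifier, the snippet below scores each label independently with a sigmoid and keeps every label that reaches a threshold, so a single tweet can receive several incident labels at once. The label names, the 0.5 threshold, the logit values, and the `predict_labels` helper are illustrative assumptions; they are not taken from the paper or from NeuroNER.

```python
import math

# Hypothetical label set, loosely based on the incident types named in the Background.
LABELS = ["accident", "fire", "traffic_jam"]


def sigmoid(x: float) -> float:
    """Map a raw score (logit) to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def predict_labels(logits, threshold=0.5):
    """Return every label whose independent probability reaches the threshold.

    Unlike single-label (softmax) classification, the labels do not compete:
    zero, one, or several labels may be returned for the same input.
    """
    if len(logits) != len(LABELS):
        raise ValueError("expected one logit per label")
    return [label for label, z in zip(LABELS, logits) if sigmoid(z) >= threshold]


# Made-up logits standing in for the final layer of a classifier (assumption).
print(predict_labels([2.0, -1.5, 0.3]))  # ['accident', 'traffic_jam']
```

Any of the three classifiers compared here could, in principle, feed a decision rule of this kind; the abstract does not state which output layer or threshold the authors used.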

Results: CNN paired with NeuroNER yields the best results for multi-label classification, outperforming both CLSTM and RCNN.

Conclusion: CNN proved the most effective of the three methods for multi-label incident classification, with an average precision of 88.54%. This is because the data used for classification came from NER and took the form of entity labels, and CNN immediately distinguishes the important information, namely the NER labels. CLSTM produced the worst results because it is better suited to sequential data. Future research would benefit from varying the classification parameters and testing scenarios with different numbers of labels and more diverse data.
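
The abstract does not define how the average precision value is computed. One common reading, assumed here purely for illustration, is the mean of per-label precision (macro-averaged precision) over binary label vectors. The `macro_precision` helper and the toy data below are hypothetical and do not reproduce the reported 88.54%.

```python
def macro_precision(y_true, y_pred):
    """Mean of per-label precision over binary indicator rows.

    y_true and y_pred are lists of equal-length 0/1 lists, one row per tweet
    and one column per label. A label with no positive predictions
    contributes 0.0 (a convention chosen for this sketch).
    """
    n_labels = len(y_true[0])
    per_label = []
    for j in range(n_labels):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 1)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 0 and p[j] == 1)
        per_label.append(tp / (tp + fp) if (tp + fp) else 0.0)
    return sum(per_label) / n_labels


# Toy data (assumption): 3 tweets, 3 labels.
y_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0]]
y_pred = [[1, 0, 0], [0, 1, 1], [1, 0, 0]]
score = macro_precision(y_true, y_pred)  # per-label precision: 1.0, 1.0, 0.0
```

Other definitions, such as micro-averaged precision or ranking-based average precision, would give different numbers on the same predictions.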

Keywords: CLSTM, CNN, Incident Classification, Multi-label Classification, RCNN