Document Type : Research Paper
Authors
1 Department of IT Management, Qeshm Branch, Islamic Azad University, Qeshm, Iran.
2 Department of Computer Science, Kashan Branch, Islamic Azad University, Kashan, Iran.
3 Department of Management, Central Tehran Branch, Islamic Azad University, Tehran, Iran.
Abstract
With the increase of news on social networks, a way to identify fake news has become an essential matter. Classification is a fundamental task in natural language processing (NLP). Convolutional neural network (CNN), as a popular deep learning model, has shown remarkable success in the task of fake news classification. In this paper, new baseline models were studied for fake news classification using CNN. In these models, documents are fed to the network as a 3-dimensional tensor representation to provide sentence-level analysis. Applying such a method enables the models to take advantage of the positional information of the sentences in the texts. Besides, analyzing adjacent sentences allows extracting additional features. The proposed models were compared with the state-of-the-art models using a collection of real and fake news extracted from Twitter about covid-19, and the fusion layer was used as the decision layer in selecting the best feature. The results showed that the proposed models had better performance, particularly in these documents, and the results were obtained with 97.33% accuracy for classification on Covid-19 after reviewing the evaluation criteria of the proposed decision system model.
Keywords
- Fake news
- Text classification
- Decision-making
- Deep learning
- Convolutional neural network
- Natural language processing
Main Subjects
- Zhou, X., & Zafarani, R. (2018). Fake news: a survey of research, detection methods, and opportunities. Available at arXiv:1812.00315
- Shu, K., Sliva, A., Wang, S., Tang, J., & Liu, H. (2017). Fake news detection on social media: a data mining perspective. ACM SIGKDD explorations newsletter, 19(1), 22-36. https://doi.org/10.1145/3137597.3137600
- Sharma, K., Qian, F., Jiang, H., Ruchansky, N., Zhang, M., & Liu, Y. (2019). Combating fake news: a survey on identification and mitigation techniques. ACM transactions on intelligent systems and technology (TIST), 10(3), 1-42.
- Tacchini, E., Ballarin, G., Della Vedova, M. L., Moret, S., & de Alfaro, L. (2017). Some like it hoax: automated fake news detection in social networks. Available at arXiv:1704.07506
- Rashkin, H., Choi, E., Jang, J. Y., Volkova, S., & Choi, Y. (2017, September). Truth of varying shades: analyzing language in fake news and political fact-checking. Proceedings of the 2017 conference on empirical methods in natural language processing(pp. 2931-2937). Association for Computational Linguistics. https://aclanthology.org/D17-1317
- Agag, G. M., & El-Masry, A. A. (2017). Why do consumers trust online travel websites? drivers and outcomes of consumer trust toward online travel websites. Journal of travel research, 56(3), 347-369.
- Pang, B., Lee, L., & Vaithyanathan, S. (2002). Thumbs up? sentiment classification using machine learning techniques. Available at cs/0205070
- Jindal, N., & Liu, B. (2007, May). Review spam detection. Proceedings of the 16th international conference on world wide web(pp. 1189-1190). Association for Computing Machinery, New York, NY, United States. https://doi.org/10.1145/1242572.1242759
- Blei, D. M. (2012). Probabilistic topic models. Communications of the ACM, 55(4), 77-84. https://doi.org/10.1145/2133806.2133826
- Russel, S., & Norvig, P. (2013). Artificial intelligence: a modern approach. Pearson.
- Wang, S. I., & Manning, C. D. (2012, July). Baselines and bigrams: simple, good sentiment and topic classification. Proceedings of the 50th annual meeting of the association for computational linguistics (Volume 2: Short Papers)(pp. 90-94). Association for Computational Linguistics.
- LeCun, Y., Boser, B., Denker, J. S., Henderson, D., Howard, R. E., Hubbard, W., & Jackel, L. D. (1989). Backpropagation applied to handwritten zip code recognition. Neural computation, 1(4), 541-551. DOI: 1162/neco.1989.1.4.541
- Lipton, Z. C., Berkowitz, J., & Elkan, C. (2015). A critical review of recurrent neural networks for sequence learning. Available at arXiv:1506.00019
- Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural computation, 9(8), 1735-1780. DOI:1162/neco.1997.9.8.1735
- Zhang, X., Zhao, J., & LeCun, Y. (2015). Character-level convolutional networks for text classification. https://proceedings.neurips.cc/paper/2015/hash
/250cf8b51c773f3f8dc8b4be867a9a02-Abstract.html - Feng, G., Li, S., Sun, T., & Zhang, B. (2018). A probabilistic model derived term weighting scheme for text classification. Pattern recognition letters, 110, 23-29. https://doi.org/10.1016/j.patrec.2018.03.003
- Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Available at arXiv:1310.4546
- Pennington, J., Socher, R., & Manning, C. D. (2014, October). Glove: global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP)(pp. 1532-1543). Association for Computational Linguistics.
- Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. Availabla at arXiv:1406.1078
- Yogatama, D., Dyer, C., Ling, W., & Blunsom, P. (2017). Generative and discriminative text classification with recurrent neural networks. Availabla at arXiv:1703.01898
- LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278-2324. DOI:1109/5.726791
- Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., & Kuksa, P. (2011). Natural language processing (almost) from scratch. Journal of machine learning research, 12(ARTICLE), 2493-2537.
- Kim, Y. (2019). Convolutional neural networks for sentence classification. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 1746–1751). Association for Computational Linguistics.
- Conneau, A., Schwenk, H., Barrault, L., & Lecun, Y. (2016). Very deep convolutional networks for text classification. Availabla at arXiv:1606.01781
- He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition(pp. 770-778). IEEE.
- Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Available at arXiv:1706.03762
- Lin, Z., Feng, M., Santos, C. N. D., Yu, M., Xiang, B., Zhou, B., & Bengio, Y. (2017). A structured self-attentive sentence embedding. Available at arXiv:1703.03130
- Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., & Hovy, E. (2016, June). Hierarchical attention networks for document classification. Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies(pp. 1480-1489). Association for Computational Linguistics.
- Wang, S., Huang, M., & Deng, Z. (2018, July). Densely connected CNN with multi-scale feature attention for text classification. Proceedings of the 27th international joint conference on artificial intelligence (pp. 4468-4474). AAAI Press.
- Castillo, C., Mendoza, M., & Poblete, B. (2011, March). Information credibility on twitter. Proceedings of the 20th international conference on World wide web(pp. 675-684). Association for Computing, Machinery, New York, NY, United States. https://doi.org/10.1145/1963405.1963500
- Zhang, H., Fan, Z., Zheng, J., & Liu, Q. (2012). An improving deception detection method in computer-mediated communication. Journal of networks, 7(11), 1811-1907.
- Zhou, L., Twitchell, D. P., Qin, T., Burgoon, J. K., & Nunamaker, J. F. (2003, January). An exploratory study into deception detection in text-based computer-mediated communication. Proceedings of the 36th annual Hawaii international conference on system sciences. IEEE. DOI: 1109/HICSS.2003.1173793
- Chang, C., Zhang, Y., Szabo, C., & Sheng, Q. Z. (2016, December). Extreme user and political rumor detection on twitter. International conference on advanced data mining and applications(pp. 751-763). Springer, Cham. https://doi.org/10.1007/978-3-319-49586-6_54
- Aker, A., Derczynski, L., & Bontcheva, K. (2017). Simple open stance classification for rumour analysis. Available at arXiv:1708.05286
- Ruchansky, N., Seo, S., & Liu, Y. (2017, November). Csi: a hybrid deep model for fake news detection. Proceedings of the 2017 ACM on conference on information and knowledge management(pp. 797-806). Association for Computing, Machinery, New York, NY, United States. https://doi.org/10.1145/3132847.3132877
- Giasemidis, G., Singleton, C., Agrafiotis, I., Nurse, J. R., Pilgrim, A., Willis, C., & Greetham, D. V. (2016, November). Determining the veracity of rumours on Twitter. International conference on social informatics(pp. 185-205). Springer, Cham. https://doi.org/10.1007/978-3-319-47880-7_12
- Vosoughi, S. (2015). Automatic detection and verification of rumors on Twitter(Doctoral dissertation, Massachusetts Institute of Technology). Retrieved from https://lsm.media.mit.edu/papers/Soroush_Vosoughi_PHD_thesis.pdf
- Otter, D. W., Medina, J. R., & Kalita, J. K. (2020). A survey of the usages of deep learning for natural language processing. IEEE transactions on neural networks and learning systems, 32(2), 604-624. DOI:1109/TNNLS.2020.2979670
- Zhang, Y., Meng, J. E., Venkatesan, R., Wang, N., & Pratama, M. (2016, July). Sentiment classification using comprehensive attention recurrent models. 2016 international joint conference on neural networks (IJCNN)(pp. 1562-1569). IEEE. DOI: 1109/IJCNN.2016.7727384
- Rojas‐Barahona, L. M. (2016). Deep learning for sentiment analysis. Language and linguistics compass, 10(12), 701-719. https://doi.org/10.1111/lnc3.12228
- Deng, L., & Liu, Y. (Eds.). (2017). Deep learning in natural language processing. Springer, Singapore.
- LeCun, Y., Haffner, P., Bottou, L., & Bengio, Y. (1999). Object recognition with gradient-based learning. In Shape, contour and grouping in computer vision(pp. 319-345). Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46805-6_19
- Le, Q. V., Zou, W. Y., Yeung, S. Y., & Ng, A. Y. (2011, June). Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. CVPR 2011(pp. 3361-3368). IEEE. DOI: 1109/CVPR.2011.5995496
- Tompson, J. J., Jain, A., LeCun, Y., & Bregler, C. (2014). Joint training of a convolutional network and a graphical model for human pose estimation. Proceedings of the 27th international conference on neural information processing systems (pp. 1799-1807). MIT Press.
- Chen, H., Xie, L., Leung, C. C., Lu, X., Ma, B., & Li, H. (2016). Modeling latent topics and temporal distance for story segmentation of broadcast news. IEEE/ACM transactions on audio, speech, and language processing, 25(1), 112-123.
- Zeiler, M. D., & Fergus, R. (2013). Stochastic pooling for regularization of deep convolutional neural networks. Available at arXiv:1301.3557
- He, K., Zhang, X., Ren, S., & Sun, J. (2015). Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence, 37(9), 1904-1916. DOI:1109/TPAMI.2015.2389824
- Ouyang, W., Luo, P., Zeng, X., Qiu, S., Tian, Y., Li, H., ... & Tang, X. (2014). Deepid-net: multi-stage and deformable deep convolutional neural networks for object detection. Available at arXiv:1409.3505
- Guo, Y., Liu, Y., Oerlemans, A., Lao, S., Wu, S., & Lew, M. S. (2016). Deep learning for visual understanding: a review. Neurocomputing, 187, 27-48. https://doi.org/10.1016/j.neucom.2015.09.116
- Yang, Y., Zheng, L., Zhang, J., Cui, Q., Li, Z., & Yu, P. S. (2018). TI-CNN: convolutional neural networks for fake news detection. Available at arXiv:1806.00749
- Ajao, O., Bhowmik, D., & Zargari, S. (2018, July). Fake news identification on twitter with hybrid cnn and rnn models. Proceedings of the 9th international conference on social media and society(pp. 226-230). Association for Computing Machinery, New York, NY, United States. https://doi.org/10.1145/3217804.3217917
- Dragoni, M., & Petrucci, G. (2017). A neural word embeddings approach for multi-domain sentiment analysis. IEEE transactions on affective computing, 8(4), 457-470. DOI:1109/TAFFC.2017.2717879
- Bahdanau, D., Cho, K., & Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. Available at arXiv:1409.0473
- Long, Y. (2017). Fake news detection through multi-perspective speaker profiles. The eighth international joint conference on natural language processing (Volume 2: Short Papers). Asian Federation of Natural Language Processing, Taipei, Taiwan.
- Karimi, H., & Tang, J. (2019). Learning hierarchical discourse-level structure for fake news detection. Available at arXiv:1903.07389
- Chauhan, A., Babu, M., Kandru, N., & Lokegaonkar, S. (2018). Empirical Study on convergence of capsule networks with various hyperparameters. Retrieved from https://people.cs.vt.edu/~bhuang/courses/opt18/projects/capsule.pdf
- Sabour, S., Frosst, N., & Hinton, G. E. (2017). Dynamic routing between capsules.https://proceedings.neurips.cc/paper/2017/hash/2cad8fa47bbef282badbb8de5374b894-Abstract.html
- Hinton, G. E., Sabour, S., & Frosst, N. (2018, February). Matrix capsules with EM routing. Paper presented at the metting of International conference on learning representations, Vancouver Convention Center, Vancouver CANADA.
- Deng, F., Pu, S., Chen, X., Shi, Y., Yuan, T., & Pu, S. (2018). Hyperspectral image classification with capsule network using limited training samples. Sensors, 18(9), 3153. https://doi.org/10.3390/s18093153
- Iesmantas, T., & Alzbutas, R. (2018, June). Convolutional capsule network for classification of breast cancer histology images. International conference image analysis and recognition(pp. 853-860). Springer, Cham. https://doi.org/10.1007/978-3-319-93000-8_97
- De La Escalera, A., Moreno, L. E., Salichs, M. A., & Armingol, J. M. (1997). Road traffic sign detection and classification. IEEE transactions on industrial electronics, 44(6), 848-859. DOI:1109/41.649946
- Paoletti, M. E., Haut, J. M., Fernandez-Beltran, R., Plaza, J., Plaza, A., Li, J., & Pla, F. (2018). Capsule networks for hyperspectral image classification. IEEE transactions on geoscience and remote sensing, 57(4), 2145-2160. DOI: 1109/TGRS.2018.2871782
- Goli, A., Zare, H. K., Tavakkoli-Moghaddam, R., & Sadeghieh, A. (2019). Application of robust optimization for a product portfolio problem using an invasive weed optimization algorithm. Numerical algebra, control & optimization, 9(2), 187-209. DOI: 3934/naco.2019014
- Goli, A., Tirkolaee, E. B., & Aydın, N. S. (2021). Fuzzy integrated cell formation and production scheduling considering automated guided vehicles and human factors. IEEE transactions on fuzzy systems, 29(12), 3686-3695. DOI:1109/TFUZZ.2021.3053838
- Goli, A., & Malmir, B. (2020). A covering tour approach for disaster relief locating and routing with fuzzy demand. International journal of intelligent transportation systems research, 18(1), 140-152. https://doi.org/10.1007/s13177-019-00185-2
- Goli, A., Zare, H. K., Moghaddam, R., & Sadeghieh, A. (2018). A comprehensive model of demand prediction based on hybrid artificial intelligence and metaheuristic algorithms: a case study in dairy industry. Journal of industrial and systems engineering, 11, 190-203.
- Goli, A., Khademi-Zare, H., Tavakkoli-Moghaddam, R., Sadeghieh, A., Sasanian, M., & Malekalipour Kordestanizadeh, R. (2021). An integrated approach based on artificial intelligence and novel meta-heuristic algorithms to predict demand for dairy products: a case study. Network: computation in neural systems, 32(1), 1-35. https://doi.org/10.1080/0954898X.2020.1849841
- Lotfi, R., Mardani, N., & Weber, G. W. (2021). Robust bi‐level programming for renewable energy location. International journal of energy research, 45(5), 7521-7534. https://doi.org/10.1002/er.6332
- Lotfi, R., Yadegari, Z., Hosseini, S. H., Khameneh, A. H., Tirkolaee, E. B., & Weber, G. W. (2020). A robust time-cost-quality-energy-environment trade-off with resource-constrained in project management: a case study for a bridge construction project. Journal of industrial & management optimization. 13(5), 1-22. DOI: 3934/jimo.2020158
- Lotfi, R., Nayeri, M., Sajadifar, S., & Mardani, N. (2017). Determination of start times and ordering plans for two-period projects with interdependent demand in project-oriented organizations: a case study on molding industry. Journal of project management, 2(4), 119-142. DOI: 5267/j.jpm.2017.9.001
- Lotfi, R., Mehrjerdi, Y. Z., Pishvaee, M. S., Sadeghieh, A., & Weber, G. W. (2021). A robust optimization model for sustainable and resilient closed-loop supply chain network design considering conditional value at risk. Numerical algebra, control & optimization, 11(2), 221-253.
- Kingma, D. P., & Ba, J. (2014). Adam: a method for stochastic optimization. Available at arXiv:1412.6980
- Joulin, A., Grave, E., Bojanowski, P., & Mikolov, T. (2016). Bag of tricks for efficient text classification. Available at arXiv:1607.01759