Abstract

Understanding the semantic themes of short texts is challenging because they carry limited word co-occurrence information. Utilising pre-trained word embeddings or incorporating contextual information from external sources is likely to introduce noise and distort the thematic representation of the short texts, which degrades classification performance. To classify short texts more accurately, we propose a knowledge graph-enhanced topic model called the Graph Convolutional Embedded Topic Model (GCETM), which jointly learns the graph network and the topic model. GCETM employs a Graph Convolutional Network (GCN) to infuse prior human knowledge about the current short texts into the topic embedding space. For model fitting, we propose a data-driven regularisation for amortised variational inference. In addition to GCETM topic inference, we use corpus statistics to build a semantically enriched vector representation of each short text for classification. In experiments with a linear Support Vector Machine (SVM) classifier, the proposed approach outperforms several state-of-the-art baselines, achieving 97.15% accuracy on the AgNews data set, 97.75% on the SearchSnippets data set, 98.73% on the Movie Review (MR) data set, 98.7% on the TMNews data set, 96.5% on the Twitter data set and 98.44% on the R8 data set.
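To make the two building blocks named above more concrete, the following is a minimal NumPy sketch of a standard graph convolution layer and an embedded-topic-model style topic-word distribution. It is not the GCETM implementation: the abstract does not say how the two parts are combined, so feeding GCN outputs into the topic softmax is an assumption, and the toy graph, the dimensions and all function names (`normalize_adjacency`, `gcn_layer`, `topic_word_distribution`) are hypothetical.

```python
import numpy as np


def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}, the symmetric normalisation common in GCNs."""
    a_hat = adj + np.eye(adj.shape[0])      # add self-loops
    deg = a_hat.sum(axis=1)                 # degrees are >= 1 because of the self-loops
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(a_norm, features, weights):
    """One graph convolution: ReLU(A_norm @ X @ W)."""
    return np.maximum(a_norm @ features @ weights, 0.0)


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def topic_word_distribution(word_emb, topic_emb):
    """ETM-style topic-word matrix: softmax over the vocabulary of inner products."""
    return softmax(topic_emb @ word_emb.T, axis=1)


rng = np.random.default_rng(0)

# Hypothetical toy knowledge graph over a 5-word vocabulary (undirected edges).
adj = np.zeros((5, 5))
for i, j in [(0, 1), (1, 2), (3, 4)]:
    adj[i, j] = adj[j, i] = 1.0

a_norm = normalize_adjacency(adj)
word_emb = rng.normal(size=(5, 4))          # initial word embeddings (assumed size)
weights = rng.normal(size=(4, 3))           # GCN weight matrix (random, untrained)
graph_emb = gcn_layer(a_norm, word_emb, weights)

topic_emb = rng.normal(size=(2, 3))         # two topics in the same embedding space
beta = topic_word_distribution(graph_emb, topic_emb)
print(beta.shape)                           # one distribution over the vocabulary per topic
```

Each row of `beta` is a probability distribution over the vocabulary. In a trained model, the weights and topic embeddings would be learned by variational inference rather than drawn at random.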

Keywords

Embedded topic model; knowledge fusion; knowledge graph; short text; text classification

