Online knowledge communities (OKCs), such as question-and-answer sites, have become increasingly popular venues for knowledge sharing. Accordingly, it is necessary for researchers and practitioners to develop effective and efficient text analysis tools to understand the massive amount of user-generated content (UGC) on OKCs. Unsupervised topic modeling has been widely adopted to extract human-interpretable latent topics embedded in texts. These identified topics can be further used in subsequent analysis and managerial practices. However, existing generic topic models that assume documents are independent are inappropriate for analyzing OKCs where structural relationships exist between questions and answers. Thus, a new method is needed to fill this research gap. In this study, we propose a new topic model specifically designed for the text in OKCs. We make three primary contributions to the research on topic modeling in this context. First, we build a general and flexible Bayesian framework to explicitly model structural and temporal dependencies among texts. Second, we statistically demonstrate the approximate model inference using mean-field and coordinate ascent algorithms. Third, we showcase the practical value and relative merit of our method via a specific downstream task (i.e., user profiling). The proposed model is illustrated using two real-world datasets from well-known OKCs (i.e., Stack Exchange and Quora), and extensive experiments demonstrate its superiority over several cutting-edge benchmarks.
TM-OKC: An Unsupervised Topic Model for Text in Online Knowledge Communities
In stock
SKU
48.3.04
Publication History
Received: November 23, 2021
Revised: September 19, 2022; April 14, 2023; September 19, 2023
Accepted: November 10, 2023
Published Accepted Author Version: June 5, 2024
Published Online in Issue: September 1, 2024
https://doi.org/10.25300/MISQ/2023/17885
Abstract
Additional Details
Author | Dongcheng Zhang, Kunpeng Zhang, Yi Yang, and David A. Schweidel |
Year | 2024 |
Volume | 48 |
Issue | 3 |
Keywords | Unsupervised topic model, document dependency, variational inference, practical value, question-and-answer |
Page Numbers | 931-978 |