Adapting GNNs for document understanding: A flexible framework with multiview global graphs
Article
| Article Title | Adapting GNNs for document understanding: A flexible framework with multiview global graphs |
|---|---|
| ERA Journal ID | 212762 |
| Article Category | Article |
| Authors | Wu, Zhuojia; Zhang, Qi; Miao, Duoqian; Zhao, Xuerong; Shi, Kaize |
| Journal Title | IEEE Transactions on Computational Social Systems |
| Journal Citation | 12 (2), pp. 608-621 |
| Number of Pages | 14 |
| Year | 2024 |
| Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
| Place of Publication | United States |
| ISSN | 2329-924X |
| Digital Object Identifier (DOI) | https://doi.org/10.1109/TCSS.2024.3468890 |
| Web Address (URL) | https://ieeexplore.ieee.org/abstract/document/10726642 |
| Abstract | Graph neural networks (GNNs) have recently gained attention for capturing complex relations, prompting researchers to explore their potential in document classification. Existing studies serving this purpose fall into two directions: inductive learning focusing on personalized context relations within documents and transductive learning targeting the global distribution relations among documents in a corpus. Both directions extract distinct types of beneficial structural information and yield encouraging outcomes. However, due to the incompatibility of underlying graph structures and learning settings, developing an enhanced model that effectively integrates local and global relational learning within existing frameworks is challenging. To address this issue, we propose a new GNN-based document representation learning framework that incorporates multiview global graphs at both the word and document levels, focusing on learning the diverse global distribution information of texts at different granularities. Additionally, a contextual encoder derives the initial representations of document nodes from the updated representations of word nodes, integrating personalized context relations into document representations during this process. Finally, we tailor a node representation learning strategy for the multiview global graphs, called the multiview graph sampling and updating module, which allows our framework to operate efficiently during training without being constrained by the scale of the global graph. Experiments indicate that our framework generally enhances performance by integrating both global and local relational learning. When combined with large-scale language models, our framework achieves state-of-the-art results for GNN-based models across multiple datasets. |
| Keywords | Document classification; graph neural network (GNN); inductive learning; multiview global graph; representation learning; transductive learning |
| Contains Sensitive Content | Does not contain sensitive content |
| ANZSRC Field of Research 2020 | 4602. Artificial intelligence |
| Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
| Byline Affiliations | Tongji University, China; Shanghai Normal University, China; University of Technology Sydney |
https://research.usq.edu.au/item/100985/adapting-gnns-for-document-understanding-a-flexible-framework-with-multiview-global-graphs