Homogeneous-listing-augmented Self-supervised Multimodal Product Title Refinement

Paper


Deng, Jiaqi, Shi, Kaize, Huo, Huan, Wang, Dingxian and Xu, Guandong. 2024. "Homogeneous-listing-augmented Self-supervised Multimodal Product Title Refinement." 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24). Washington DC, United States 14 - 18 Jul 2024 United States. Association for Computing Machinery (ACM). https://doi.org/10.1145/3626772.3661347
Paper/Presentation Title

Homogeneous-listing-augmented Self-supervised Multimodal Product Title Refinement

Presentation TypePaper
AuthorsDeng, Jiaqi, Shi, Kaize, Huo, Huan, Wang, Dingxian and Xu, Guandong
Journal or Proceedings TitleProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24)
Journal Citationpp. 2870-2874
Number of Pages5
Year2024
PublisherAssociation for Computing Machinery (ACM)
Place of PublicationUnited States
ISBN9798400704314
Digital Object Identifier (DOI)https://doi.org/10.1145/3626772.3661347
Web Address (URL) of Paperhttps://dl.acm.org/doi/abs/10.1145/3626772.3661347
Web Address (URL) of Conference Proceedingshttps://dl.acm.org/doi/proceedings/10.1145/3626772
Conference/Event47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24)
Event Details
47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24)
Parent
ACM International Conference on Research and Development in Information Retrieval
Delivery
In person
Event Date
14 to end of 18 Jul 2024
Event Location
Washington DC, United States
Abstract

Product titles on e-commerce marketplaces often suffer from verbosity and inaccuracy, hindering effective communication of essential product details to customers. Refining titles to be more concise and informative is crucial for better user experience and product promotion. Recent solutions to product title refinement follow the standard text extractive and generative methods. Some also leverage multimodal information, e.g. using product images to supplement original titles with visual knowledge. However, these generative methods often produce additional terms not endorsed by sellers. Thus, it remains challenging to incorporate visual information missing from original titles into refined titles without excessively introducing novel terms. Additionally, most existing methods require human-labeled datasets, which are laborious to construct. In response to the two challenges, we present a self-supervised multimodal framework (HLATR) for title refinement that comprises two key modules: (1) a perturbated sample generator that constructs training data by systematically mining homogeneous listing information and (2) a title refinement network that effectively harnesses visual information to refine the original titles. To explicitly balance the extraction from original titles and the generation of supplementary novel terms, we adapt the copy mechanism that is guided by a focused refinement loss. Extensive experiments demonstrate that our proposed framework consistently outperforms others in generating refined titles that contain essential multimodal semantics with minimal deviation from the original ones.

KeywordsProduct title refinement; Multimodal generative mod; Self-supervised learning
Contains Sensitive ContentDoes not contain sensitive content
ANZSRC Field of Research 20204602. Artificial intelligence
Byline AffiliationsUniversity of Technology Sydney
Etsy.com, United States
Permalink -

https://research.usq.edu.au/item/100973/homogeneous-listing-augmented-self-supervised-multimodal-product-title-refinement

Download files


Published Version
3626772.3661347.pdf
License: CC BY 4.0
File access level: Anyone

  • 80
    total views
  • 4
    total downloads
  • 74
    views this month
  • 2
    downloads this month

Export as

Related outputs

Deep Graph Clustering With Triple Fusion Mechanism for Community Detection
Ma, Yuanchi, Shi, Kaize, Peng, Xueping, He, Hui, Zhang, Peng, Liu, Jinyan, Lei, Zhongxiang and Niu, Zhendong. 2025. "Deep Graph Clustering With Triple Fusion Mechanism for Community Detection." IEEE Transactions on Computational Social Systems. 12 (4), pp. 1743-1758. https://doi.org/10.1109/TCSS.2024.3478351
Recommending Learning Objects through Attentive Heterogeneous Graph Convolution and Operation- Aware Neural Network (Extended Abstract)
Zhu, Yifan, Lin, Qika, Lu, Hao, Shi, Kaize, Liu, Donglei, Chambua, James, Wang, Shanshan and Niu, Zhendong. 2024. "Recommending Learning Objects through Attentive Heterogeneous Graph Convolution and Operation- Aware Neural Network (Extended Abstract)." 2024 IEEE 40th International Conference on Data Engineering (ICDE). Utrecht, Netherlands 13 - 14 May 2024 United States. IEEE (Institute of Electrical and Electronics Engineers). https://doi.org/10.1109/ICDE60146.2024.00505
Deep coupling network for multivariate time series forecasting
Yi, Kun, Zhang, Qi, He, Hui, Hui He, Hu, Liang, An, Ning and Niu, Zhendong. 2024. "Deep coupling network for multivariate time series forecasting." ACM Transactions on Information Systems. 42 (5), pp. 1-28. https://doi.org/10.1145/3653447
Adapting GNNs for document understanding: A flexible framework with multiview global graphs
Wu, Zhuojia, Zhang, Qi, Miao, Duoqian, Zhao, Xuerong and Shi, Kaize. 2024. "Adapting GNNs for document understanding: A flexible framework with multiview global graphs." IEEE Transactions on Computational Social Systems. 12 (2), pp. 608-621. https://doi.org/10.1109/TCSS.2024.3468890
Enhancing Academic Title Drafting Through Abstractive Summarization
Wu, Taoyu Wu and Shi, Kaize. 2024. "Enhancing Academic Title Drafting Through Abstractive Summarization." 2024 IEEE International Conference on Behavioural and Social Computing (BESC-2024). Harbin, China 16 - 18 Aug 2024 China. IEEE (Institute of Electrical and Electronics Engineers). https://doi.org/10.1109/BESC64747.2024.10780612
FetchEEG: a hybrid approach combining feature extraction and temporal-channel joint attention for EEG-based emotion classification
Liang, Yu, Zhang, Chenlong, An, Shan, Wang, Zaitian, Shi, Kaize, Peng, Tianhao, Ma, Yuqing, Xie, Xiaoyang, He, Jian and Zheng, Kun. 2024. "FetchEEG: a hybrid approach combining feature extraction and temporal-channel joint attention for EEG-based emotion classification." Journal of Neural Engineering. 21 (3). https://doi.org/10.1088/1741-2552/ad4743
A topic‐controllable keywords‐to‐text generator with knowledge base network
He, Li, Shi, Kaize, Wang, Dingxian, Wang, Xianzhi and Xu, Guandong. 2024. "A topic‐controllable keywords‐to‐text generator with knowledge base network." CAAI Transactions on Intelligence Technology. 9 (3), pp. 585-594. https://doi.org/10.1049/cit2.12280
Distributional drift adaptation with temporal conditional variational autoencoder for multivariate time series forecasting
He, Hui, Zhang, Qi, Yi, Kun, Shi, Kaize, Niu, Zhendong and Cao, Longbing. 2024. "Distributional drift adaptation with temporal conditional variational autoencoder for multivariate time series forecasting." IEEE Transactions on Neural Networks and Learning Systems. 36 (4), pp. 7287-7301. https://doi.org/10.1109/TNNLS.2024.3384842
CrimeAlarm: Towards Intensive Intent Dynamics in Fine-Grained Crime Prediction
Hu, Kaixi, Li, Lin, Xie, Qing, Tao, Xiaohui and Xu, Guandong. 2024. "CrimeAlarm: Towards Intensive Intent Dynamics in Fine-Grained Crime Prediction." H.V., Onizuka M.Lee J.-G.Tong Y.Xiao C.Ishikawa Y.Lu K.Amer-Yahia S.Jagadish (ed.) 29th International Conference on Database Systems for Advanced Applications (DASFAA 2024). Gifu, Japan 02 - 05 Jul 2024 Singapore . Springer. https://doi.org/10.1007/978-981-97-5575-2_7
Decoupled Progressive Distillation for Sequential Prediction with Interaction Dynamics
Hu, Kaixi, Li, Lin, Xie, Qing, Liu, Jianquan, Tao, Xiaohui and Xu, Guandong. 2024. "Decoupled Progressive Distillation for Sequential Prediction with Interaction Dynamics." ACM Transactions on Information Systems. 42 (3), pp. 1-35. https://doi.org/10.1145/3632403
Recommending Learning Objects Through Attentive Heterogeneous Graph Convolution and Operation-Aware Neural Network
Zhu, Yifan, Lin, Qika, Lu, Hao, Shi, Kaize, Liu, Donglei, Chambua, James, Wang, Shanshan and Niu, Zhendong. 2023. "Recommending Learning Objects Through Attentive Heterogeneous Graph Convolution and Operation-Aware Neural Network ." IEEE Transactions on Knowledge and Data Engineering. 35 (4), pp. 4178-4189. https://doi.org/10.1109/TKDE.2021.3125424
AMR-TST: Abstract Meaning Representation-based Text Style Transfer
Shi, Kaize, Sun, Xueyao, He, Li, Wang, Dingxian, Li, Qing and Xu, Guandong. 2023. "AMR-TST: Abstract Meaning Representation-based Text Style Transfer." Findings of the Association for Computational Linguistics: ACL 2023. Toronto, Canada 09 - 14 Jul 2023 Canada.
Multiple knowledge-enhanced meteorological social briefing generation
Shi, Kaize, Peng, Xueping, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2023. "Multiple knowledge-enhanced meteorological social briefing generation." IEEE Transactions on Computational Social Systems. 11 (2), pp. 2002-2013. https://doi.org/10.1109/TCSS.2023.3298252
MTSTI: A multi-task learning framework for spatiotemporal imputation
Chen, Yakun, Shi, Kaize, Wang, Xianzhi and Xu, Guandong. 2023. "MTSTI: A multi-task learning framework for spatiotemporal imputation." 19th International Conference on Advanced Data Mining and Applications (ADMA'23). Shenyang, China 21 - 23 Aug 2023 Switzerland. Springer. https://doi.org/10.1007/978-3-031-46677-9_13
Application of social sensors in natural disasters emergency management: A review
Shi, Kaize, Peng, Xueping, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2022. "Application of social sensors in natural disasters emergency management: A review." IEEE Transactions on Computational Social Systems. 10 (6), pp. 3143-3158. https://doi.org/10.1109/TCSS.2022.3211552
Recommending scientific paper via heterogeneous knowledge embedding based attentive recurrent neural networks
Zhu, Yifan, Lin, Qika, Lu, Hao, Shi, Kaize, Qiu, Ping and Niu, Zhendong. 2021. "Recommending scientific paper via heterogeneous knowledge embedding based attentive recurrent neural networks." Knowledge-Based Systems. 215. https://doi.org/10.1016/j.knosys.2021.106744
Position-aware stepwise tagging method for triples extraction of entity-relationship
Wang, Yuan, Shi, Kaize and Niu, Zhendong. 2021. "Position-aware stepwise tagging method for triples extraction of entity-relationship." Data Analysis and Knowledge Discovery. 5 (10), pp. 71-80. https://doi.org/10.11925/infotech.2096-3467.2021.0302
EKGTF: A knowledge-enhanced model for optimizing social network-based meteorological briefings
Shi, Kaize, Wang, Yusen, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2021. "EKGTF: A knowledge-enhanced model for optimizing social network-based meteorological briefings." Information Processing and Management. 58 (4). https://doi.org/10.1016/j.ipm.2021.102564
Social signal-driven knowledge automation: A focus on social transportation
Lu, Hao, Zhu, Yifan, Yuan, Yong, Gong, Weichao, Li, Juanjuan, Shi, Kaize, Lv, Yisheng, Niu, Zhendong and Wang, Fei-Yue. 2021. "Social signal-driven knowledge automation: A focus on social transportation." IEEE Transactions on Computational Social Systems. 8 (3), pp. 737-753. https://doi.org/10.1109/TCSS.2021.3057332
Improving university faculty evaluations via multi-view knowledge graph
Lin, Qika, Zhu, Yifan, Lu, Hao, Shi, Kaize and Niu, Zhendong. 2021. "Improving university faculty evaluations via multi-view knowledge graph." Future Generation Computer Systems: the international journal of grid computing: theory, methods and applications. 117, pp. 181-192. https://doi.org/10.1016/j.future.2020.11.021
Welcome message BESC 2018
Wang, Leon S. L., Hong, Tzung-Pei, Xu, Guandong, Ting, I-Hsien and Tao, Xiaohui. 2018. "Welcome message BESC 2018." 5th International Conference on Behavioral, Economic, and Socio-Cultural Computing (BESC 2018). Kaohsiung, Taiwan 12 - 14 Nov 2018 Los Alamitos, CA, United States. https://doi.org/10.1109/BESC.2018.8697887
Social weather: A review of crowdsourcing‐assisted meteorological knowledge services through social cyberspace
Zhu, Yifan, Zhang, Sifan, Li, Yinan, Lu, Hao, Shi, Kaize and Niu, Zhendong. 2020. "Social weather: A review of crowdsourcing‐assisted meteorological knowledge services through social cyberspace." Geoscience Data Journal. 7 (1), pp. 61-79. https://doi.org/10.1002/gdj3.85
Heterogeneous teaching evaluation network based offline course recommendation with graph learning and tensor factorization
Zhu, Yifan, Lu, Hao, Qiu, Ping, Shi, Kaize, Chambua, James and Niu, Zhendong. 2020. "Heterogeneous teaching evaluation network based offline course recommendation with graph learning and tensor factorization." Neurocomputing. 415, pp. 84-95. https://doi.org/10.1016/j.neucom.2020.07.064
A Session-based Job Recommendation System Combining Area Knowledge and Interest Graph Neural Networks
Wang, Yusen, Shi, Kaize and Niu, Zhendong. 2020. "A Session-based Job Recommendation System Combining Area Knowledge and Interest Graph Neural Networks." 32nd International Conference on Software Engineering and Knowledge Engineering (SEKE 2020). Pittsburgh, United States 09 - 11 Jul 2020 United States. Knowledge Systems Institute.
Automatic generation of meteorological briefing by event knowledge guided summarization model
Shi, Kaize, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2020. "Automatic generation of meteorological briefing by event knowledge guided summarization model." Knowledge-Based Systems. 192. https://doi.org/10.1016/j.knosys.2019.105379
Wide-grained capsule network with sentence-level feature to detect meteorological event in social network
Shi, Kaize, Gong, Changjin, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2020. "Wide-grained capsule network with sentence-level feature to detect meteorological event in social network." Future Generation Computer Systems: the international journal of grid computing: theory, methods and applications. 102, pp. 323-332. https://doi.org/10.1016/j.future.2019.08.013