Homogeneous-listing-augmented Self-supervised Multimodal Product Title Refinement

Paper

Deng, Jiaqi, Shi, Kaize, Huo, Huan, Wang, Dingxian and Xu, Guandong. 2024. "Homogeneous-listing-augmented Self-supervised Multimodal Product Title Refinement." 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24). Washington DC, United States 14 - 18 Jul 2024 United States. Association for Computing Machinery (ACM). https://doi.org/10.1145/3626772.3661347

Paper/Presentation Title	Homogeneous-listing-augmented Self-supervised Multimodal Product Title Refinement
Presentation Type	Paper
Authors	Deng, Jiaqi, Shi, Kaize, Huo, Huan, Wang, Dingxian and Xu, Guandong
Journal or Proceedings Title	Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24)
Journal Citation	pp. 2870-2874
Number of Pages	5
Year	2024
Publisher	Association for Computing Machinery (ACM)
Place of Publication	United States
ISBN	9798400704314
Digital Object Identifier (DOI)	https://doi.org/10.1145/3626772.3661347
Web Address (URL) of Paper	https://dl.acm.org/doi/abs/10.1145/3626772.3661347
Web Address (URL) of Conference Proceedings	https://dl.acm.org/doi/proceedings/10.1145/3626772
Conference/Event	47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24)
Event Details	47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24) Parent ACM International Conference on Research and Development in Information Retrieval Delivery In person Event Date 14 to end of 18 Jul 2024 Event Location Washington DC, United States
Abstract	Product titles on e-commerce marketplaces often suffer from verbosity and inaccuracy, hindering effective communication of essential product details to customers. Refining titles to be more concise and informative is crucial for better user experience and product promotion. Recent solutions to product title refinement follow the standard text extractive and generative methods. Some also leverage multimodal information, e.g. using product images to supplement original titles with visual knowledge. However, these generative methods often produce additional terms not endorsed by sellers. Thus, it remains challenging to incorporate visual information missing from original titles into refined titles without excessively introducing novel terms. Additionally, most existing methods require human-labeled datasets, which are laborious to construct. In response to the two challenges, we present a self-supervised multimodal framework (HLATR) for title refinement that comprises two key modules: (1) a perturbated sample generator that constructs training data by systematically mining homogeneous listing information and (2) a title refinement network that effectively harnesses visual information to refine the original titles. To explicitly balance the extraction from original titles and the generation of supplementary novel terms, we adapt the copy mechanism that is guided by a focused refinement loss. Extensive experiments demonstrate that our proposed framework consistently outperforms others in generating refined titles that contain essential multimodal semantics with minimal deviation from the original ones.
Keywords	Product title refinement; Multimodal generative mod; Self-supervised learning
Contains Sensitive Content	Does not contain sensitive content
ANZSRC Field of Research 2020	4602. Artificial intelligence
Byline Affiliations	University of Technology Sydney
	Etsy.com, United States

Permalink -

https://research.usq.edu.au/item/100973/homogeneous-listing-augmented-self-supervised-multimodal-product-title-refinement

Download files

Published Version

	3626772.3661347.pdf
License: CC BY 4.0
File access level: Anyone

83
total views
6
total downloads
0
views this month
0
downloads this month

Export as

Related outputs

Deep Graph Clustering With Triple Fusion Mechanism for Community Detection

Ma, Yuanchi, Shi, Kaize, Peng, Xueping, He, Hui, Zhang, Peng, Liu, Jinyan, Lei, Zhongxiang and Niu, Zhendong. 2025. "Deep Graph Clustering With Triple Fusion Mechanism for Community Detection." IEEE Transactions on Computational Social Systems. 12 (4), pp. 1743-1758. https://doi.org/10.1109/TCSS.2024.3478351

Recommending Learning Objects through Attentive Heterogeneous Graph Convolution and Operation- Aware Neural Network (Extended Abstract)

Zhu, Yifan, Lin, Qika, Lu, Hao, Shi, Kaize, Liu, Donglei, Chambua, James, Wang, Shanshan and Niu, Zhendong. 2024. "Recommending Learning Objects through Attentive Heterogeneous Graph Convolution and Operation- Aware Neural Network (Extended Abstract)." 2024 IEEE 40th International Conference on Data Engineering (ICDE). Utrecht, Netherlands 13 - 14 May 2024 United States. IEEE (Institute of Electrical and Electronics Engineers). https://doi.org/10.1109/ICDE60146.2024.00505

Deep coupling network for multivariate time series forecasting

Yi, Kun, Zhang, Qi, He, Hui, Hui He, Hu, Liang, An, Ning and Niu, Zhendong. 2024. "Deep coupling network for multivariate time series forecasting." ACM Transactions on Information Systems. 42 (5), pp. 1-28. https://doi.org/10.1145/3653447

Adapting GNNs for document understanding: A flexible framework with multiview global graphs

Wu, Zhuojia, Zhang, Qi, Miao, Duoqian, Zhao, Xuerong and Shi, Kaize. 2024. "Adapting GNNs for document understanding: A flexible framework with multiview global graphs." IEEE Transactions on Computational Social Systems. 12 (2), pp. 608-621. https://doi.org/10.1109/TCSS.2024.3468890

Enhancing Academic Title Drafting Through Abstractive Summarization

Wu, Taoyu Wu and Shi, Kaize. 2024. "Enhancing Academic Title Drafting Through Abstractive Summarization." 2024 IEEE International Conference on Behavioural and Social Computing (BESC-2024). Harbin, China 16 - 18 Aug 2024 China. IEEE (Institute of Electrical and Electronics Engineers). https://doi.org/10.1109/BESC64747.2024.10780612

FetchEEG: a hybrid approach combining feature extraction and temporal-channel joint attention for EEG-based emotion classification

Liang, Yu, Zhang, Chenlong, An, Shan, Wang, Zaitian, Shi, Kaize, Peng, Tianhao, Ma, Yuqing, Xie, Xiaoyang, He, Jian and Zheng, Kun. 2024. "FetchEEG: a hybrid approach combining feature extraction and temporal-channel joint attention for EEG-based emotion classification." Journal of Neural Engineering. 21 (3). https://doi.org/10.1088/1741-2552/ad4743

A topic‐controllable keywords‐to‐text generator with knowledge base network

He, Li, Shi, Kaize, Wang, Dingxian, Wang, Xianzhi and Xu, Guandong. 2024. "A topic‐controllable keywords‐to‐text generator with knowledge base network." CAAI Transactions on Intelligence Technology. 9 (3), pp. 585-594. https://doi.org/10.1049/cit2.12280

Distributional drift adaptation with temporal conditional variational autoencoder for multivariate time series forecasting

He, Hui, Zhang, Qi, Yi, Kun, Shi, Kaize, Niu, Zhendong and Cao, Longbing. 2024. "Distributional drift adaptation with temporal conditional variational autoencoder for multivariate time series forecasting." IEEE Transactions on Neural Networks and Learning Systems. 36 (4), pp. 7287-7301. https://doi.org/10.1109/TNNLS.2024.3384842

CrimeAlarm: Towards Intensive Intent Dynamics in Fine-Grained Crime Prediction

Hu, Kaixi, Li, Lin, Xie, Qing, Tao, Xiaohui and Xu, Guandong. 2024. "CrimeAlarm: Towards Intensive Intent Dynamics in Fine-Grained Crime Prediction." H.V., Onizuka M.Lee J.-G.Tong Y.Xiao C.Ishikawa Y.Lu K.Amer-Yahia S.Jagadish (ed.) 29th International Conference on Database Systems for Advanced Applications (DASFAA 2024). Gifu, Japan 02 - 05 Jul 2024 Singapore . Springer. https://doi.org/10.1007/978-981-97-5575-2_7

Decoupled Progressive Distillation for Sequential Prediction with Interaction Dynamics

Hu, Kaixi, Li, Lin, Xie, Qing, Liu, Jianquan, Tao, Xiaohui and Xu, Guandong. 2024. "Decoupled Progressive Distillation for Sequential Prediction with Interaction Dynamics." ACM Transactions on Information Systems. 42 (3), pp. 1-35. https://doi.org/10.1145/3632403

Recommending Learning Objects Through Attentive Heterogeneous Graph Convolution and Operation-Aware Neural Network

Zhu, Yifan, Lin, Qika, Lu, Hao, Shi, Kaize, Liu, Donglei, Chambua, James, Wang, Shanshan and Niu, Zhendong. 2023. "Recommending Learning Objects Through Attentive Heterogeneous Graph Convolution and Operation-Aware Neural Network ." IEEE Transactions on Knowledge and Data Engineering. 35 (4), pp. 4178-4189. https://doi.org/10.1109/TKDE.2021.3125424

AMR-TST: Abstract Meaning Representation-based Text Style Transfer

Shi, Kaize, Sun, Xueyao, He, Li, Wang, Dingxian, Li, Qing and Xu, Guandong. 2023. "AMR-TST: Abstract Meaning Representation-based Text Style Transfer." Findings of the Association for Computational Linguistics: ACL 2023. Toronto, Canada 09 - 14 Jul 2023 Canada.

Multiple knowledge-enhanced meteorological social briefing generation

Shi, Kaize, Peng, Xueping, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2023. "Multiple knowledge-enhanced meteorological social briefing generation." IEEE Transactions on Computational Social Systems. 11 (2), pp. 2002-2013. https://doi.org/10.1109/TCSS.2023.3298252

MTSTI: A multi-task learning framework for spatiotemporal imputation

Chen, Yakun, Shi, Kaize, Wang, Xianzhi and Xu, Guandong. 2023. "MTSTI: A multi-task learning framework for spatiotemporal imputation." 19th International Conference on Advanced Data Mining and Applications (ADMA'23). Shenyang, China 21 - 23 Aug 2023 Switzerland. Springer. https://doi.org/10.1007/978-3-031-46677-9_13

Application of social sensors in natural disasters emergency management: A review

Shi, Kaize, Peng, Xueping, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2022. "Application of social sensors in natural disasters emergency management: A review." IEEE Transactions on Computational Social Systems. 10 (6), pp. 3143-3158. https://doi.org/10.1109/TCSS.2022.3211552

Recommending scientific paper via heterogeneous knowledge embedding based attentive recurrent neural networks

Zhu, Yifan, Lin, Qika, Lu, Hao, Shi, Kaize, Qiu, Ping and Niu, Zhendong. 2021. "Recommending scientific paper via heterogeneous knowledge embedding based attentive recurrent neural networks." Knowledge-Based Systems. 215. https://doi.org/10.1016/j.knosys.2021.106744

Position-aware stepwise tagging method for triples extraction of entity-relationship

Wang, Yuan, Shi, Kaize and Niu, Zhendong. 2021. "Position-aware stepwise tagging method for triples extraction of entity-relationship." Data Analysis and Knowledge Discovery. 5 (10), pp. 71-80. https://doi.org/10.11925/infotech.2096-3467.2021.0302

EKGTF: A knowledge-enhanced model for optimizing social network-based meteorological briefings

Shi, Kaize, Wang, Yusen, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2021. "EKGTF: A knowledge-enhanced model for optimizing social network-based meteorological briefings." Information Processing and Management. 58 (4). https://doi.org/10.1016/j.ipm.2021.102564

Social signal-driven knowledge automation: A focus on social transportation

Lu, Hao, Zhu, Yifan, Yuan, Yong, Gong, Weichao, Li, Juanjuan, Shi, Kaize, Lv, Yisheng, Niu, Zhendong and Wang, Fei-Yue. 2021. "Social signal-driven knowledge automation: A focus on social transportation." IEEE Transactions on Computational Social Systems. 8 (3), pp. 737-753. https://doi.org/10.1109/TCSS.2021.3057332

Improving university faculty evaluations via multi-view knowledge graph

Lin, Qika, Zhu, Yifan, Lu, Hao, Shi, Kaize and Niu, Zhendong. 2021. "Improving university faculty evaluations via multi-view knowledge graph." Future Generation Computer Systems: the international journal of grid computing: theory, methods and applications. 117, pp. 181-192. https://doi.org/10.1016/j.future.2020.11.021

Welcome message BESC 2018

Wang, Leon S. L., Hong, Tzung-Pei, Xu, Guandong, Ting, I-Hsien and Tao, Xiaohui. 2018. "Welcome message BESC 2018." 5th International Conference on Behavioral, Economic, and Socio-Cultural Computing (BESC 2018). Kaohsiung, Taiwan 12 - 14 Nov 2018 Los Alamitos, CA, United States. https://doi.org/10.1109/BESC.2018.8697887

Social weather: A review of crowdsourcing‐assisted meteorological knowledge services through social cyberspace

Zhu, Yifan, Zhang, Sifan, Li, Yinan, Lu, Hao, Shi, Kaize and Niu, Zhendong. 2020. "Social weather: A review of crowdsourcing‐assisted meteorological knowledge services through social cyberspace." Geoscience Data Journal. 7 (1), pp. 61-79. https://doi.org/10.1002/gdj3.85

Heterogeneous teaching evaluation network based offline course recommendation with graph learning and tensor factorization

Zhu, Yifan, Lu, Hao, Qiu, Ping, Shi, Kaize, Chambua, James and Niu, Zhendong. 2020. "Heterogeneous teaching evaluation network based offline course recommendation with graph learning and tensor factorization." Neurocomputing. 415, pp. 84-95. https://doi.org/10.1016/j.neucom.2020.07.064

A Session-based Job Recommendation System Combining Area Knowledge and Interest Graph Neural Networks

Wang, Yusen, Shi, Kaize and Niu, Zhendong. 2020. "A Session-based Job Recommendation System Combining Area Knowledge and Interest Graph Neural Networks." 32nd International Conference on Software Engineering and Knowledge Engineering (SEKE 2020). Pittsburgh, United States 09 - 11 Jul 2020 United States. Knowledge Systems Institute.

Automatic generation of meteorological briefing by event knowledge guided summarization model

Shi, Kaize, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2020. "Automatic generation of meteorological briefing by event knowledge guided summarization model." Knowledge-Based Systems. 192. https://doi.org/10.1016/j.knosys.2019.105379

Wide-grained capsule network with sentence-level feature to detect meteorological event in social network

Shi, Kaize, Gong, Changjin, Lu, Hao, Zhu, Yifan and Niu, Zhendong. 2020. "Wide-grained capsule network with sentence-level feature to detect meteorological event in social network." Future Generation Computer Systems: the international journal of grid computing: theory, methods and applications. 102, pp. 323-332. https://doi.org/10.1016/j.future.2019.08.013