Response Ranking with Multi-types of Deep Interactive Representations in Retrieval-based Dialogues

Ruijian Xu; Chongyang Tao; Jiazhan Feng; Wei Wu; Rui Yan; Dongyan Zhao

doi:10.1145/3462207

Publisher: Association for Computing Machinery
Copyright: © 2021 Copyright held by the owner/author(s). Publication rights licensed to ACM.
ISSN: 1046-8188
eISSN: 1558-2868
DOI: 10.1145/3462207

Abstract

Building an intelligent dialogue system that can select a proper response for a multi-turn context is challenging in three respects: (1) the meaning of a context–response pair is built upon language units of multiple granularities (e.g., words, phrases, and sub-sentences); (2) both local dependencies (e.g., within a small window around a word) and long-range dependencies (e.g., between words across the context and the response) may exist in dialogue data; and (3) in some real cases, the relationship between the context and the response candidate lies in multiple relevant semantic clues or in relatively implicit ones. However, existing approaches usually encode the dialogue with a single type of representation and perform the interaction between the context and the response candidate in a rather shallow manner, which may lead to an inadequate understanding of the dialogue content and hinder recognition of the semantic relevance between the context and the response. To tackle these challenges, we propose a representation[K]-interaction[L]-matching framework that explores multiple types of deep interactive representations to build context–response matching models for response selection. In particular, we construct different types of representations for utterance–response pairs and deepen them via alternating encoding and interaction. In this way, the model can capture relations between neighboring elements, phrasal patterns, and long-range dependencies during representation, and make more accurate predictions through multiple layers of interaction between the context and the response. Experimental results on three public benchmarks indicate that the proposed model significantly outperforms previous conventional context–response matching models and achieves slightly better results than the BERT model for multi-turn response selection in retrieval-based dialogue systems.
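
The idea of alternating encoding and interaction over several representation types can be illustrated with a toy sketch. This is not the authors' model: it has no learned parameters, and it assumes that [K] counts representation types (here K = 3) and [L] counts interaction layers (here L = 2). All names (`ENCODERS`, `local_window`, `attend`, `match_score`, `rank_candidates`) are hypothetical, and the cosine-based aggregation is chosen only to keep the example short.

```python
import math
from typing import Callable, Dict, List

Matrix = List[List[float]]  # one row per token, one column per feature


def transpose(a: Matrix) -> Matrix:
    return [list(col) for col in zip(*a)]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def add(a: Matrix, b: Matrix) -> Matrix:
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def softmax_rows(a: Matrix) -> Matrix:
    out = []
    for row in a:
        peak = max(row)  # subtract the max for numerical stability
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def attend(query: Matrix, memory: Matrix) -> Matrix:
    """Scaled dot-product attention: each query row becomes a mix of memory rows."""
    scale = math.sqrt(len(query[0]))
    scores = [[v / scale for v in row] for row in matmul(query, transpose(memory))]
    return matmul(softmax_rows(scores), memory)


def local_window(x: Matrix, width: int = 1) -> Matrix:
    """Local view: average each token with its neighbors within `width` positions."""
    out = []
    for i in range(len(x)):
        window = x[max(0, i - width): i + width + 1]
        out.append([sum(col) / len(window) for col in zip(*window)])
    return out


# Assumption: three illustrative representation types (K = 3).
ENCODERS: Dict[str, Callable[[Matrix], Matrix]] = {
    "word": lambda x: x,                # token level, unchanged
    "local": local_window,              # neighboring elements / phrasal patterns
    "global": lambda x: attend(x, x),   # long-range dependencies via self-attention
}


def cosine(a: List[float], b: List[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / (norm + 1e-9)


def match_score(utterance: Matrix, response: Matrix, num_layers: int = 2) -> float:
    """Toy matching score in (0, 1) for one utterance-response pair."""
    features = []
    for encode in ENCODERS.values():
        u, r = utterance, response
        for _ in range(num_layers):
            u, r = encode(u), encode(r)  # encoding step
            # Interaction step: cross-attention in both directions, with a residual sum.
            u, r = add(u, attend(u, r)), add(r, attend(r, u))
        # Aggregate: best-matching response token for each utterance token.
        best = [max(cosine(ut, rt) for rt in r) for ut in u]
        features.append(sum(best) / len(best))
    logit = sum(features) / len(features)
    return 1.0 / (1.0 + math.exp(-logit))


def rank_candidates(context: List[Matrix], candidates: List[Matrix]) -> List[int]:
    """Return candidate indices, highest average score over context utterances first."""
    scores = [sum(match_score(u, c) for u in context) / len(context) for c in candidates]
    return sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
```

Calling `rank_candidates(context, candidates)` with lists of token-embedding matrices returns an ordering of the candidate responses. A trained model would replace the fixed encoders and the cosine aggregation with learned layers.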

Journal

ACM Transactions on Information Systems (TOIS) – Association for Computing Machinery

Published: Aug 17, 2021

Keywords: Retrieval-based dialogue systems