Building an intelligent dialogue system that can select a proper response for a multi-turn context is challenging in three respects: (1) the meaning of a context–response pair is built upon language units of multiple granularities (e.g., words, phrases, and sub-sentences); (2) both local (e.g., a small window around a word) and long-range (e.g., words across the context and the response) dependencies may exist in dialogue data; and (3) the relationship between the context and the response candidate may rest on multiple relevant semantic clues or, in some real cases, on relatively implicit ones. However, existing approaches usually encode the dialogue with a mono-type representation and carry out the interaction between the context and the response candidate in a rather shallow manner, which may lead to an inadequate understanding of dialogue content and hinder the recognition of the semantic relevance between the context and the response. To tackle these challenges, we propose a representation[K]-interaction[L]-matching framework that explores multiple types of deep interactive representations to build context–response matching models for response selection. In particular, we construct different types of representations for utterance–response pairs and deepen them via alternating encoding and interaction. In this way, the model can handle relations between neighboring elements, phrasal patterns, and long-range dependencies during representation, and it can make more accurate predictions through multiple layers of interaction between the context and the response. Experimental results on three public benchmarks indicate that the proposed model significantly outperforms previous conventional context–response matching models and achieves slightly better results than the BERT model for multi-turn response selection in retrieval-based dialogue systems.
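The idea of deepening an utterance–response pair through alternating encoding and interaction, followed by matching, can be illustrated with a toy sketch. This is not the authors' model: the neighbor-averaging encoder, the dot-product cross-attention, the mean fusion, and the max-cosine matching score are all simplifying assumptions, and the names `encode_local`, `attend`, `fuse`, and `match_score` are hypothetical.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def encode_local(seq, window=1):
    """Assumed stand-in encoder: average each token with its neighbors."""
    out = []
    for i in range(len(seq)):
        span = seq[max(0, i - window):min(len(seq), i + window + 1)]
        out.append([sum(col) / len(span) for col in zip(*span)])
    return out


def attend(queries, keys):
    """Cross-attention: rewrite each query token as a weighted mix of key tokens."""
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        weights = softmax([dot(q, k) / scale for k in keys])
        out.append([sum(w * k[d] for w, k in zip(weights, keys))
                    for d in range(len(q))])
    return out


def fuse(own, attended):
    """Assumed fusion: element-wise mean of a token and its attended view."""
    return [[(x + y) / 2 for x, y in zip(ra, rb)]
            for ra, rb in zip(own, attended)]


def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0


def match_score(utterance, response, num_layers=2):
    """Alternate encoding and interaction num_layers times, then match."""
    u, r = utterance, response
    for _ in range(num_layers):
        u, r = encode_local(u), encode_local(r)            # encoding step
        u, r = fuse(u, attend(u, r)), fuse(r, attend(r, u))  # interaction step
    # Matching: best cosine similarity per response token, averaged.
    return sum(max(cosine(rt, ut) for ut in u) for rt in r) / len(r)


# Toy inputs: random 4-dimensional token embeddings (illustrative only).
random.seed(0)
context_utterance = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(5)]
candidate = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(3)]
score = match_score(context_utterance, candidate)
```

In this sketch, `num_layers` controls how many encode-then-interact rounds are stacked before matching; a candidate identical to the utterance scores close to 1 because both sides stay identical through every layer. A full model along the lines the abstract describes would use several representation types and learned parameters rather than these fixed operations.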
ACM Transactions on Information Systems (TOIS) – Association for Computing Machinery
Published: Aug 17, 2021
Keywords: Retrieval-based dialogue systems