
SG01 at the NTCIR-13 STC-2 task | Haizhou Zhao, Yi Du, Hangyu Li, Qiao Qian, Hao Zhou, Minlie Huang, Jingfang Xu | PowerPoint PPT Presentation



  1. NTCIR-13, December 2017, Tokyo, Japan
     SG01 at the NTCIR-13 STC-2 task
     Haizhou Zhao, Yi Du, Hangyu Li, Qiao Qian, Hao Zhou, Minlie Huang, Jingfang Xu
     Sogou Inc. | Beijing, China
     Tsinghua University | Beijing, China

  2. Introduction
      Team name: SG01
      A joint team from Sogou Inc. and Tsinghua University
      Subtask: Chinese subtask
      Retrieval-based method: 3 submissions
      Generation-based method: 5 submissions
      Top performance in both methods
      Next: Retrieval-based Method, Generation-based Method, Conclusions, Q & A

  3. Overview of Retrieval-based Method
     [Pipeline diagram] query → Retrieve (500 pairs from the repository) → Ranking Stage I (50 pairs) → Ranking Stage II (learning to rank over features, 10 pairs)

  4. Retrieve Stage | Retrieval-based Method
      Data pre-processing: remove frequent, advertising, and very short post-comment pairs
      Put the repository into a light-weight search engine
        Treat post-comment pairs as web pages
        Retrieve 500 pairs for a given query (i.e. a new post)
      Keep the features calculated during the search for later use
        BM25
        MRF for term dependency [D. Metzler 2005]
        Proximity [T. Tao 2007]
        …
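
To make the retrieve stage concrete, here is a minimal, self-contained sketch of BM25 retrieval over indexed post-comment pairs. The tokenizer, the toy corpus, and the parameter values (k1, b) are illustrative assumptions; the actual system used a light-weight search engine with additional features (MRF term dependency, proximity).

```python
# Sketch: index post-comment pairs as "web pages" and retrieve the top 500 for a new post.
import math
from collections import Counter

def tokenize(text):
    # Placeholder tokenizer; the real system would use Chinese word segmentation.
    return text.split()

class BM25Index:
    def __init__(self, docs, k1=1.2, b=0.75):
        self.docs = [tokenize(d) for d in docs]
        self.k1, self.b = k1, b
        self.N = len(self.docs)
        self.avgdl = sum(len(d) for d in self.docs) / max(self.N, 1)
        self.df = Counter()                     # document frequency of each term
        for d in self.docs:
            self.df.update(set(d))

    def score(self, query_tokens, doc_tokens):
        tf = Counter(doc_tokens)
        dl = len(doc_tokens)
        s = 0.0
        for t in query_tokens:
            if t not in tf:
                continue
            idf = math.log(1 + (self.N - self.df[t] + 0.5) / (self.df[t] + 0.5))
            denom = tf[t] + self.k1 * (1 - self.b + self.b * dl / self.avgdl)
            s += idf * tf[t] * (self.k1 + 1) / denom
        return s

    def retrieve(self, query, top_k=500):
        q = tokenize(query)
        scored = [(self.score(q, d), i) for i, d in enumerate(self.docs)]
        return sorted(scored, reverse=True)[:top_k]

# Each post-comment pair is indexed as one document (post and comment concatenated).
pairs = [("post text 1", "comment text 1"), ("post text 2", "comment text 2")]
index = BM25Index([p + " " + c for p, c in pairs])
top_pairs = index.retrieve("new post text", top_k=500)
```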

  5. Ranking Stage I | Retrieval-based Method
      Employ features that are more intuitive for the STC task
        Cosine similarity of TF-IDF vectors between …
        Negative Word Mover's Distance [M. J. Kusner 2015] between …
          query ↔ post
          query ↔ comment
          query ↔ post + comment
        Translation-based language model [Z. Ji 2014]: Score_trans
      Ranking
        Treat each feature as a ranker
        Simply add up the rank positions to obtain the final rank
        Keep the top 50 pairs
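
A small sketch of the Stage-I fusion rule described above: each feature is treated as an independent ranker and the rank positions are summed. The feature values below are placeholders standing in for the TF-IDF cosine, negative Word Mover's Distance, and Score_trans features.

```python
# Rank-sum fusion over per-feature rankings, keeping the top candidates.
def rank_positions(values, higher_is_better=True):
    # Map each candidate index to its rank position (0 = best) under one feature.
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=higher_is_better)
    return {idx: pos for pos, idx in enumerate(order)}

def fuse_by_rank_sum(feature_matrix, keep=50):
    # feature_matrix[f][i] is the value of feature f for candidate i.
    n = len(feature_matrix[0])
    totals = [0] * n
    for values in feature_matrix:
        pos = rank_positions(values)
        for i in range(n):
            totals[i] += pos[i]
    # A lower summed rank is better; keep the `keep` best candidates.
    return sorted(range(n), key=lambda i: totals[i])[:keep]

# Toy example with three features over four candidates.
features = [
    [0.8, 0.3, 0.5, 0.9],       # cosine similarity of TF-IDF vectors
    [-1.2, -0.4, -0.9, -0.2],   # negative WMD (less negative is better)
    [0.1, 0.7, 0.2, 0.6],       # translation-based LM score
]
print(fuse_by_rank_sum(features, keep=2))
```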

  6. Ranking Stage II: new features | Retrieval-based Method
      Employ more neural-network features that capture richer structure in STC
        Score_embd
        Score_BiLSTM+CNN [R. Yan 2016]
          Trained with a ranking objective, L = max(0, 1 − s(x, y⁺) + s(x, y⁻)),
          using the given repository plus 12 million extra crawled post-comment pairs, denoted Repo_extn
        Score_S2S-p2c (defined later in the Generation-based Method)
        Score_S2S-c2p
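
A minimal PyTorch sketch of the ranking objective L = max(0, 1 − s(x, y⁺) + s(x, y⁻)) used to train the matching features. The scorer below is a placeholder bag-of-embeddings model, not the BiLSTM+CNN architecture of [R. Yan 2016]; only the loss is the point of the example.

```python
# Margin ranking (hinge) loss over positive and negative post-comment pairs.
import torch
import torch.nn as nn

class MatchScorer(nn.Module):
    def __init__(self, vocab_size, dim=128):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, dim)
        self.bilinear = nn.Bilinear(dim, dim, 1)

    def forward(self, post_ids, comment_ids):
        x = self.emb(post_ids).mean(dim=1)        # (batch, dim) post representation
        y = self.emb(comment_ids).mean(dim=1)     # (batch, dim) comment representation
        return self.bilinear(x, y).squeeze(-1)    # (batch,) matching score s(x, y)

scorer = MatchScorer(vocab_size=10000)
post = torch.randint(0, 10000, (32, 20))
pos_comment = torch.randint(0, 10000, (32, 20))
neg_comment = torch.randint(0, 10000, (32, 20))

s_pos = scorer(post, pos_comment)
s_neg = scorer(post, neg_comment)
loss = torch.clamp(1.0 - s_pos + s_neg, min=0.0).mean()  # L = max(0, 1 - s(x,y+) + s(x,y-))
loss.backward()
```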

  7. Ranking Stage II: learning to rank | Retrieval-based Method
      Use all of the features listed above
      Training data: the given 11 thousand labeled pairs plus 30 thousand additional labeled pairs
      LambdaMART
      The top 10 pairs form the final result
      Score_trans and Score_BiLSTM+CNN turn out to be a little more important than the other features
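
The slides name LambdaMART but not a specific toolkit; the sketch below shows one common way to run it with LightGBM's LGBMRanker (objective="lambdarank"), purely as an illustration. Features, labels, and group sizes are random placeholders.

```python
# LambdaMART-style learning to rank over the Stage-II features.
import numpy as np
import lightgbm as lgb

n_queries, cands_per_query, n_features = 100, 50, 12
X = np.random.rand(n_queries * cands_per_query, n_features)    # one row per (query, candidate)
y = np.random.randint(0, 3, size=n_queries * cands_per_query)  # graded relevance labels
group = [cands_per_query] * n_queries                          # candidates per query

ranker = lgb.LGBMRanker(objective="lambdarank", n_estimators=200)
ranker.fit(X, y, group=group)

# At test time, rank the candidates of one query and keep the top 10.
scores = ranker.predict(X[:cands_per_query])
top10 = np.argsort(-scores)[:10]
```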

  8. Experiments | Retrieval-based Method
     Submission | Learning to rank w.r.t. (measure on training data) | nG@1   | P+     | nERR@10
     SG01-C-R1  | nG@1                                               | 0.5355 | 0.6084 | 0.6579
     SG01-C-R2  | nERR@10                                            | 0.5168 | 0.5944 | 0.6461
     SG01-C-R3  | P+                                                 | 0.5048 | 0.6200 | 0.6663

  9. Overview of Generation-based Method
     [Pipeline diagram] query → Generative Models (S2SAttn, S2SAttn-addmem, VAEAttn, VAEAttn-addmem) → segment-beam-search decoding → candidates → Scoring & Ranking → 10 pairs

  10. Generative Models | Generation-based Method
       S2SAttn: seq2seq [I. Sutskever 2014] with an attention mechanism
       S2SAttn-addmem: adds dynamic memory to the attention
       VAEAttn: uses a Variational Auto-Encoder
       VAEAttn-addmem
       Training data: Repo_extn, after data pre-processing
       Decoding uses segment-beam-search
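
For reference, a minimal sketch of plain beam-search decoding over a next-token scoring function. The segment-beam-search variant used in the submissions is not detailed on the slides, so only the standard procedure it builds on is shown; step_fn and the BOS/EOS token ids are assumptions.

```python
# Plain beam search: keep the beam_size best partial hypotheses by accumulated log-prob.
import math

def beam_search(step_fn, bos_id, eos_id, beam_size=5, max_len=30):
    beams = [(0.0, [bos_id])]          # each hypothesis is (log-prob, token sequence)
    finished = []
    for _ in range(max_len):
        candidates = []
        for logp, seq in beams:
            if seq[-1] == eos_id:
                finished.append((logp, seq))   # completed hypotheses are set aside
                continue
            for token, token_logp in step_fn(seq, top_k=beam_size):
                candidates.append((logp + token_logp, seq + [token]))
        if not candidates:                     # every hypothesis has ended
            break
        beams = sorted(candidates, key=lambda h: h[0], reverse=True)[:beam_size]
    else:
        finished.extend(beams)                 # length limit hit: keep the open beams too
    return sorted(finished, key=lambda h: h[0], reverse=True)

# Toy step function with a fixed next-token distribution, just to make the sketch runnable.
def step_fn(seq, top_k):
    probs = {1: 0.6, 2: 0.3, 0: 0.1}           # token 0 plays the role of EOS
    return sorted(((t, math.log(p)) for t, p in probs.items()),
                  key=lambda x: x[1], reverse=True)[:top_k]

print(beam_search(step_fn, bos_id=3, eos_id=0)[:2])
```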

  11. Candidates Ranking: scores | Generation-based Method
       Scoring features
         Likelihood: log P(Y′|X), for post X and generated comment Y′
           The score from a single model is denoted Score_S2S-p2c
           Scores from the different models (except the VAE models) and implementations are added up as Li
         Posterior: log P(X|Y′), denoted Score_S2S-c2p, used as Po
         Calculated by our well-trained models
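
A small sketch of how the two features could be computed, assuming every trained model exposes a sentence-level log-probability; DummyModel and its log_prob method are hypothetical stand-ins for the real model interface.

```python
# Li sums log P(Y'|X) over the forward (post-to-comment) models; Po is log P(X|Y') from a reverse model.
class DummyModel:
    def __init__(self, bias):
        self.bias = bias
    def log_prob(self, target, given):
        # Stand-in for log P(target | given) from a trained seq2seq model.
        return self.bias - 0.1 * len(target)

def likelihood_score(post, candidate, p2c_models):
    # Li: log P(Y'|X) summed over the (non-VAE) models and implementations.
    return sum(m.log_prob(candidate, given=post) for m in p2c_models)

def posterior_score(post, candidate, c2p_model):
    # Po: log P(X|Y') from a reverse (comment-to-post) model.
    return c2p_model.log_prob(post, given=candidate)

post, candidate = "今天天气不错", "是啊，适合出去走走"
li = likelihood_score(post, candidate, [DummyModel(-1.0), DummyModel(-1.5)])
po = posterior_score(post, candidate, DummyModel(-2.0))
```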

  12. Candidates Ranking: rank & output | Generation-based Method
       Ranking
         score = (λ · Li + (1 − λ) · Po) / lp(Y′)
         Discount factor lp(Y′) = (c + |Y′|)^α / (c + 1)^α [Y. Wu 2016]
       Before the final output, process candidates by rules
         Abandon candidates containing blacklisted keywords
         De-duplicate consecutively repeated segments
         Truncate consecutively repeated punctuation
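
A sketch of the final ranking and rule-based clean-up, following the formulas above. The constant c, the weights λ and α, and the blacklist contents are illustrative assumptions, and the de-duplication regexes are one possible reading of the rules.

```python
# Length-discounted score combination and rule-based post-processing of candidates.
import re

def length_penalty(length, c=5.0, alpha=0.6):
    # lp(Y') = (c + |Y'|)^alpha / (c + 1)^alpha, following [Y. Wu 2016]; c and alpha are assumed values.
    return ((c + length) ** alpha) / ((c + 1) ** alpha)

def final_score(li, po, length, lam=0.5):
    # score = (lambda * Li + (1 - lambda) * Po) / lp(Y')
    return (lam * li + (1 - lam) * po) / length_penalty(length)

BLACKLIST = {"spam_word"}  # placeholder keywords

def clean(candidate):
    if any(word in candidate for word in BLACKLIST):
        return None                                                # abandon blacklisted candidates
    candidate = re.sub(r"(.{2,}?)\1+", r"\1", candidate)           # de-duplicate repeated segments
    candidate = re.sub(r"([!?。！？，,.])\1+", r"\1", candidate)    # truncate repeated punctuation
    return candidate

# Rank (candidate, Li, Po) triples, then apply the output rules.
ranked = sorted(
    [("是啊是啊！！！", -3.0, -4.0), ("是啊，适合出去走走", -2.5, -3.5)],
    key=lambda t: final_score(t[1], t[2], len(t[0])),
    reverse=True,
)
outputs = []
for cand, li, po in ranked:
    cleaned = clean(cand)
    if cleaned is not None:
        outputs.append(cleaned)
print(outputs)
```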

  13. Experiments | Generation-based Method
      Submission | Fusion of candidates from*  | Scoring by** | nG@1   | P+     | nERR@10
      SG01-C-G5  | VAEAttn, VAEAttn-addmem     | Li           | 0.3820 | 0.5068 | 0.5596
      SG01-C-G4  | S2SAttn, S2SAttn-addmem     | Li           | 0.4483 | 0.5545 | 0.6129
      SG01-C-G3  | S2SAttn, S2SAttn-addmem     | Li & Po      | 0.5633 | 0.6567 | 0.6947
      SG01-C-G2  | VAEAttn, VAEAttn-addmem     | Li & Po      | 0.5483 | 0.6335 | 0.6783
      SG01-C-G1  | All 4 kinds of models       | Li & Po      | 0.5867 | 0.6670 | 0.7095
      *: there can be multiple implementations of one model, using different subsets of the corpus and different hyper-parameters
      **: all scores are discounted by lp

  14. Analysis | Generation-based Method
       The feature Po brings a statistically significant advantage over the runs without Po, by ranking more informative candidates higher
       VAE does worse than traditional seq2seq, but it can bring in interesting candidates
       Fusing results from several models does better than relying on a single model, because the ranking brings preferable candidates into the top 10

  15. Conclusions
      Comparison between the two methods
        The generation-based method does better; however, it still tends to generate "safe" responses
        The retrieval-based method tends to return context-dependent or incoherent comments
      The size of the training data matters

  16. References
      Z. Ji, Z. Lu, and H. Li. An information retrieval approach to short text conversation. CoRR, abs/1408.6988, 2014.
      M. J. Kusner, Y. Sun, N. I. Kolkin, and K. Q. Weinberger. From word embeddings to document distances. In Proceedings of the 32nd International Conference on Machine Learning, Volume 37, ICML '15, pages 957–966. JMLR.org, 2015.
      D. Metzler and W. B. Croft. A Markov random field model for term dependencies. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '05, pages 472–479, New York, NY, USA, 2005. ACM.
      I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In Z. Ghahramani, M. Welling, C. Cortes, N. D. Lawrence, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 27, pages 3104–3112. Curran Associates, Inc., 2014.
      T. Tao and C. Zhai. An exploration of proximity measures in information retrieval. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '07, pages 295–302, New York, NY, USA, 2007. ACM.
      Y. Wu, M. Schuster, Z. Chen, Q. V. Le, M. Norouzi, W. Macherey, M. Krikun, Y. Cao, Q. Gao, K. Macherey, J. Klingner, A. Shah, M. Johnson, X. Liu, L. Kaiser, S. Gouws, Y. Kato, T. Kudo, H. Kazawa, K. Stevens, G. Kurian, N. Patil, W. Wang, C. Young, J. Smith, J. Riesa, A. Rudnick, O. Vinyals, G. Corrado, M. Hughes, and J. Dean. Google's neural machine translation system: Bridging the gap between human and machine translation. CoRR, abs/1609.08144, 2016.
      R. Yan, Y. Song, X. Zhou, and H. Wu. "Shall I Be Your Chat Companion?": Towards an online human-computer conversation system. In Proceedings of the 25th ACM International Conference on Information and Knowledge Management, CIKM '16, pages 649–658, New York, NY, USA, 2016. ACM.

  17. zhaohaizhou@sogou-inc.com
