BRNIR at the NTCIR-14 FinNum task: Scalable feature extraction technique for numeral classification

Jun 14, 2023

Alan Spark, Team Lead at AUTO1 GROUP

Agenda
• Motivation
• Features types
• Extraction pipeline
• Experiment design
• Results
Motivation
• Focus on feature extraction in an unsupervised fashion
• Experiments on different feature concatenations
• Suggest a feature extraction pipeline
Features types
• Topic distribution: a vector with the topic distribution of a tweet
• Tickers: multi-label encoding of tickers presented in a tweet
• Tags: multi-label encoding of tags presented in a tweet
• Number properties: a vector encoding number properties such as value, position & type and other
• Token context: "Bag-of-words"-like encoding of tokens neighboring a number
• Character context: "Bag-of-words"-like encoding of characters neighboring a number
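As a rough illustration of three of the sparse feature types listed here (token context, character context, and tickers), the sketch below computes them for a single tweet with the Python standard library only. The window sizes, the whitespace tokenizer, the cashtag pattern, the ticker vocabulary, and all function names are assumptions made for this example; the slides do not specify how the features were actually implemented.

```python
import re
from collections import Counter

NUMBER_RE = re.compile(r"\d+(?:\.\d+)?")
# Assumed cashtag shape: "$" optionally followed by a space, then 1-6 capitals.
TICKER_RE = re.compile(r"\$\s?([A-Z]{1,6})\b")


def token_context(tokens, index, window=2):
    """Bag of lower-cased tokens within `window` positions of tokens[index]."""
    left = tokens[max(0, index - window):index]
    right = tokens[index + 1:index + 1 + window]
    return Counter(t.lower() for t in left + right)


def char_context(text, start, end, window=3):
    """Bag of characters within `window` characters of text[start:end]."""
    return Counter(text[max(0, start - window):start] + text[end:end + window])


def ticker_encoding(text, vocabulary):
    """Multi-hot vector marking which known tickers occur in the text."""
    found = set(TICKER_RE.findall(text))
    return [1 if ticker in found else 0 for ticker in vocabulary]


tweet = "$ FNKO $ 10 is a no-brainer."
tokens = tweet.split()                 # naive whitespace tokenizer (assumption)
number_index = tokens.index("10")
match = NUMBER_RE.search(tweet)        # first number in the tweet

token_bag = token_context(tokens, number_index)
char_bag = char_context(tweet, match.start(), match.end())
ticker_vec = ticker_encoding(tweet, ["AAPL", "FNKO", "TSLA"])
```

Each bag can then be mapped to a fixed-length vector over a vocabulary and concatenated with the other feature groups, which is the kind of concatenation the experiments compare.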
Extraction pipeline
Experiment design
Results
Summary and Future work
• Unsupervised approaches for feature extraction applied to the FinNum task
• Methods are parallelizable and meant to be run at scale
• Utilize data discovered at the preprocessing step
• Address natural imbalance
• Embeddings for all "sparse" features
• Experiment with classification models
Thank you
Q&A
AUTO1 Group GmbH
c/o Alan Spark
Bergmannstraße 72
10961 Berlin
alan.spark@auto1.com
mail@alanspark.net
Additional plots
Preprocessing highlight
Tweet: $ FNKO $ 10 is a no-brainer. Should trade back to IPO price $ 12. Remember, initial range on IPO was $ 16 on high end. Quiet period expiry soon.
target num: ["10", "12."]
discovered numbers: [10, 12, 16]
The approach detects extra numbers in more than 32% of tweets in the given corpus.
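The number discovery in this example can be approximated with a simple regular expression. This is a minimal sketch under stated assumptions: the pattern, the `discover_numbers` and `normalize` helpers, and the trailing-punctuation normalization are hypothetical, since the slides do not describe how the preprocessing step finds numbers.

```python
import re

# Assumed pattern: an integer with an optional decimal part.
NUMBER_RE = re.compile(r"\d+(?:\.\d+)?")


def discover_numbers(text):
    """Return every numeral found in the text, in order of appearance."""
    return NUMBER_RE.findall(text)


def normalize(number):
    """Strip trailing punctuation so that "12." and "12" compare equal."""
    return number.rstrip(".,")


tweet = (
    "$ FNKO $ 10 is a no-brainer. Should trade back to IPO price $ 12. "
    "Remember, initial range on IPO was $ 16 on high end. "
    "Quiet period expiry soon."
)
target_nums = ["10", "12."]            # annotated targets from the slide

discovered = discover_numbers(tweet)
known = {normalize(t) for t in target_nums}
extra = [n for n in discovered if n not in known]
```

Here `extra` holds the numerals that appear in the tweet but are not annotated as targets, which corresponds to the "extra numbers" the slide reports for more than 32% of tweets.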
Number of "target numbers" per tweet (left); number of unique categories/subcategories per tweet (right)
