The View from AI2
Oren Etzioni, CEO, Allen Institute for AI (AI2)
Mission: contribute to the world through high-impact AI research and engineering, with emphasis on reasoning, learning, and reading capabilities.
Outline:
1. Overview of AI2 (rapid)
2. Observations about knowledge (simple)
3. Information Extraction (visual)
4. Reasoning in Aristo (hard)
AI2 Chronology and "Geography"
Timeline:
- Jan. 2014: AI2 launched
- Fall 2014: Team of 30 + 12 interns
- Dec. 2015: Team of 50
Summer of 2014 Interns
Scientific Advisory Board (SAB)
- Adam Cheyer: Co-founder and VP Engineering at Siri, Inc.
- Eric Horvitz: Director of Microsoft Research (Redmond), fellow of AAAI and AAAS, AAAI President (2007-09)
- Tom Mitchell: Chair of Machine Learning Department, Carnegie Mellon, fellow of AAAI and AAAS, AAAI Distinguished Service Award
- Dan Roth: Professor at University of Illinois at Urbana-Champaign, fellow of ACM, AAAI, and ACL, Associate Editor in Chief of JAIR
- Dan Weld: Professor at University of Washington, fellow of ACM and AAAI
Research Scientists
- Peter Clark (leader), UT Austin
- Santosh Divvala, CMU
- Tony Fader, UW
- Vu Ha, University of Wisconsin
- Mark Hopkins, UCLA
- Kevin Humphreys, University of Edinburgh
- Tushar Khot, University of Wisconsin
- Jayant Krishnamurthy, CMU
- Ashish Sabharwal, UW
- Oyvind Tafjord, Princeton
- Peter Turney, University of Toronto
- Ali Farhadi (leader), UIUC
Common Themes in AI2 Projects
- Ambitious, long-term goals
- Measurable results in 1-3 years
- Standardized, unseen test questions ("Beyond the Turing Test")
- Open & collaborative (papers, ADI)
- Leveraging NLP, ML, and vision for: Knowledge, Reasoning, Explanation
Core Projects
- Aristo: Science (4th grade, NDMC), 66%
- Da Vinci: AKBC over Science corpus
- Plato: AKBC from images & diagrams
- Euclid: arithmetic (77.7%, EMNLP '14) & geometry (AAAI '14)
High-level observations about knowledge & reasoning
Do we need a body to acquire intelligence? (too philosophical for us)
Do we need a body to acquire common-sense knowledge? (a bit vague)
Do we need a body to pass the 4th grade science test? (we can answer this one!)
Factual Knowledge for 4th Grade Science
- Taxonomy: "Squirrels are animals"; "A rock is considered a nonliving thing"
- Properties: "Water freezes at 32F"; "This book has a mass and a volume"
- Actions + States: "Brushing our teeth removes the food and helps keep them strong"; "As an organism moves into an adult stage of life they continue to grow"
- Processes: "Photosynthesis is a process by which plants make their own food and give off oxygen and water that they are not using."
- Behavior: "Animals need air, water, and food to live and survive"; "Some animals grow thicker fur in winter to stay warm"
- Part/whole Structure: "The lungs are an organ in the body"; "Plants have roots"
- Qualitative Relations: "Increased water flow widens a river bed"
- Language: paraphrases; active/passive transformations; appositives; coreference; idioms; ...
- Etc.: geometry, diagrams, ...
2014 Knowledge Tour
[Figure: knowledge bases surveyed, including Google]
These KBs are fact-rich but knowledge-poor!
Machine Reading
Source: DARPA Machine Reading initiative
NELL: lexical
Open Information Extraction (Banko et al., 2007)
Question: can we leverage regularities in language to extract information in a relation-independent way?
Relations typically:
- anchored in verbs
- exhibit simple syntactic form
Virtues:
- No hand-labeled data
- "No sentence left behind"
- Exploits the redundancy of the Web
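To make the verb-anchored pattern concrete, here is a minimal, self-contained sketch of relation-independent triple extraction over a POS-tagged sentence. This is illustrative Python in the spirit of Open IE systems such as ReVerb, not AI2's code; the Penn Treebank tag conventions and the particle-absorption rule are assumptions.

```python
# A minimal sketch of verb-anchored, relation-independent triple
# extraction in the spirit of Open IE / ReVerb; not AI2's code. Input is
# a POS-tagged sentence with (assumed) Penn Treebank tags.

def extract_triple(tagged):
    """Return (arg1, relation, arg2) or None.
    Relation = span from first to last verb, optionally absorbing a
    trailing preposition/particle; arguments = noun material on either side."""
    verbs = [i for i, (_, tag) in enumerate(tagged) if tag.startswith("VB")]
    if not verbs:
        return None
    start, end = verbs[0], verbs[-1]
    if end + 1 < len(tagged) and tagged[end + 1][1] in ("IN", "RP", "TO"):
        end += 1                              # e.g., "consists of", "gave up"
    noun_like = ("NN", "JJ", "DT")
    arg1 = " ".join(w for w, t in tagged[:start] if t.startswith(noun_like))
    rel = " ".join(w for w, _ in tagged[start:end + 1])
    arg2 = " ".join(w for w, t in tagged[end + 1:] if t.startswith(noun_like))
    return (arg1, rel, arg2) if arg1 and arg2 else None

# "Squirrels are animals" -> ("Squirrels", "are", "animals")
print(extract_triple([("Squirrels", "NNS"), ("are", "VBP"), ("animals", "NNS")]))
```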
IE over Web sentences suffers from Attention Deficit Disorder!
Common-Sense Knowledge from Images
Which animals lay eggs?
Obtaining Visual Knowledge
1. Detect objects (nouns)
2. Reason about actions (verbs)
Key challenges:
- Supervision (bounding boxes, spatial relations)
- Large scale (~10^5 objects, ~10^3 actions)
Example query: Do bears catch salmon?
VisIE: Visual Information Extraction (Sadeghi, Divvala, Farhadi, submitted)
Example queries: "Do dogs eat ice cream?" and "Do snakes lay eggs?", represented as triples, e.g., (dog, eating, ice cream) and (snake, laying, eggs), and compared across OpenIE, ConceptNet, and VisIE.
- Builds object detectors based on Google images
- Utilizes a joint model over detectors to assess triples
- Mean Average Precision = 0.54
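As a rough illustration of "a joint model over detectors", the sketch below scores a triple from hypothetical per-image detector confidences. The min-pooling rule, the score names, and the 0.5 threshold are invented for the example and are not the published VisIE model.

```python
# A rough illustration of scoring a visual triple (subject, verb, object)
# from per-image detector confidences. Min-pooling, score names, and the
# threshold are illustrative assumptions, not the published VisIE model.

def assess_triple(detector_scores, threshold=0.5):
    """detector_scores: one dict per image retrieved for the phrase
    (e.g., "bear catching salmon"), holding detector confidences for the
    subject, the object, and their interaction region."""
    per_image = [min(s["subject"], s["object"], s["interaction"])
                 for s in detector_scores]   # every part must be detected
    mean_score = sum(per_image) / len(per_image)
    return mean_score >= threshold, mean_score

# Three hypothetical images retrieved for (bear, catching, salmon):
images = [
    {"subject": 0.9, "object": 0.8, "interaction": 0.7},
    {"subject": 0.6, "object": 0.9, "interaction": 0.5},
    {"subject": 0.8, "object": 0.4, "interaction": 0.6},
]
print(assess_triple(images))  # (True, 0.533...)
```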
Facts are necessary, but not sufficient
A Theory also includes:
- Rules
- Reasoning
- Explanation
A Theory is Greater than the Sum of its Facts
Aristo Demo
1. General rules from Barron's Study Guide
2. Background facts stated in the question
3. Multiple choice
Reasoning Method
Deductive reasoning is too restrictive:
- "fall down" does not match "fall down to the ground"; "Most animals have legs" does not strictly entail "dogs have legs..."
Shallow text alignment is too permissive:
- {turn, a, liquid, into, a, solid} aligns perfectly with {turn, a, solid, into, a, liquid}
Probabilistic reasoning is challenging:
- Text-to-MLN mapping is unsolved
- A naive encoding of the single sentence "People breathe air." yields a ~10^10-node Markov Logic Network (MLN)
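A two-line check makes the permissiveness point: the two phrases above contain exactly the same words.

```python
# Why shallow alignment is too permissive: these opposite statements
# (freezing vs. melting) are identical as bags of words.
freeze = "turn a liquid into a solid"
melt = "turn a solid into a liquid"
print(set(freeze.split()) == set(melt.split()))  # True
```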
MLN Scaling for Rules Extracted from Text
A short study-guide example: "Some animals grow thick fur in winter to stay warm."
First-order representation using 6 variables, 6 non-Isa predicates, 2 existentials:
a, g, f, w: Isa(a, "Some animals"), Isa(g, "grow"), Isa(f, "thicker fur"), Isa(w, "the winter"), Agent(g, a), Object(g, f), In(g, w)
s, m: Isa(s, "stays"), Isa(m, "warm"), Enables(g, s), Agent(s, a), Object(s, m)
MLN encoding of k science rules yields ~(D*k)^V non-CNF ground network rules (D = domain size, V = variables per rule):
- ~10 variables per rule for extracted rules vs. ~3 in typical hand-coded rules
- No symmetry or exchangeability to exploit
[Figure: non-CNF ground MLN rules (log scale, 10^6 to 10^18) vs. number of science rules (0 to 10), with D = 10, V = 10]
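To make the blow-up concrete, a quick back-of-the-envelope computation using the slide's ~(D*k)^V estimate; the exponential trend, not the constants, is the point.

```python
# Back-of-the-envelope check of the grounding blow-up, using the slide's
# ~(D*k)^V estimate (D = domain size, k = science rules, V = variables
# per rule).

D = 10   # domain size (per the slide)
V = 10   # first-order variables per extracted rule

for k in range(1, 11):
    print(f"k={k:2d} rules -> ~{(D * k) ** V:.1e} ground rules")

# k= 1 -> ~1.0e+10, k=10 -> ~1.0e+20: far beyond tractable exact inference,
# consistent with the "Still slow and inaccurate" verdict on the next slide.
```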
Enhancements for Tractability
1. Add semantic constraints, e.g., Cause(x, y) => Effect(y, x); events have unique agents; ...
2. Use hard constraints to simplify & reduce soft constraints: SAT solver for unit propagation + backbone/fixed-variable detection (sketched below)
3. Use refined types to reduce domain size: consider only lexically similar entities/events
4. Use constants in place of first-order variables, where possible
Still slow and inaccurate!
- 3 min per question (with just 1 extracted rule)
- 47% accuracy (4-way multiple choice)
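For readers unfamiliar with technique 2, here is a minimal, self-contained sketch of unit propagation. It is illustrative Python; the talk does not show AI2's actual SAT machinery.

```python
# A minimal sketch of unit propagation (technique 2 above): clauses are
# lists of nonzero ints, positive = variable, negative = its negation.

def unit_propagate(clauses):
    """Repeatedly fix unit clauses and simplify the rest.
    Returns (simplified_clauses, assignment), or (None, None) on conflict."""
    assignment = {}
    while True:
        units = [c[0] for c in clauses if len(c) == 1]
        if not units:
            return clauses, assignment
        unit_set = set(units)
        for lit in units:
            var, val = abs(lit), lit > 0
            if assignment.get(var, val) != val:
                return None, None            # both polarities forced: conflict
            assignment[var] = val
        simplified = []
        for c in clauses:
            if any(l in unit_set for l in c):
                continue                     # clause already satisfied
            reduced = [l for l in c if -l not in unit_set]
            if not reduced:
                return None, None            # clause falsified
            simplified.append(reduced)
        clauses = simplified

# (x1) & (~x1 | x2) & (~x2 | x3 | x4)  ->  x1 = x2 = True, leaving (x3 | x4)
print(unit_propagate([[1], [-1, 2], [-2, 3, 4]]))  # ([[3, 4]], {1: True, 2: True})
```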
Motivation for New Approach
- Can treat all mentioned entities/events as constants
- Inference requires "fuzzy" matching between extracted terms: thicker fur ≈ thicker fur in winter ≈ heavier coat
- We formulate matching as a probabilistic inference problem
Probabilistic Alignment over Graphs
Treat extracted rules as graphs: vertices = entities/events; edges = relations; partitioned into antecedent/consequent.
Sibling inference tasks: AlignmentMLN + InferenceMLN
Directional inference with extracted rules:
- Multi-path version of reasoning in "the demo"
- Directionality: thick fur => warm, but warm ≠> thick fur
Lexical reasoning:
- Structured alignment beyond BOW: word similarity + graph structure
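A toy sketch of what "word similarity + graph structure" alignment can look like; the Jaccard similarity, the exhaustive search, and the structural bonus weight are all illustrative assumptions, not the AlignmentMLN itself.

```python
# A toy sketch of structured alignment: node (lexical) similarity plus a
# structural bonus for matching edges. Jaccard similarity, exhaustive
# search, and w_struct are illustrative assumptions.
from itertools import permutations

def lex_sim(a, b):
    """Crude lexical similarity: Jaccard overlap of word sets."""
    ta, tb = set(a.split()), set(b.split())
    return len(ta & tb) / len(ta | tb)

def align(nodes1, edges1, nodes2, edges2, w_struct=0.5):
    """Exhaustively score one-to-one maps of nodes1 into nodes2.
    Edges are (i, j, label) triples indexing into the node lists."""
    best_score, best_map = -1.0, None
    for perm in permutations(range(len(nodes2)), len(nodes1)):
        score = sum(lex_sim(nodes1[i], nodes2[perm[i]])
                    for i in range(len(nodes1)))
        for i, j, label in edges1:          # structural agreement bonus
            if (perm[i], perm[j], label) in edges2:
                score += w_struct
        if score > best_score:
            best_score, best_map = score, perm
    return best_score, best_map

# "grow --Object--> thicker fur" vs. "grow --Object--> thicker fur in winter"
n1, e1 = ["grow", "thicker fur"], [(0, 1, "Object")]
n2, e2 = ["grow", "thicker fur in winter"], [(0, 1, "Object")]
print(align(n1, e1, n2, e2))  # (2.0, (0, 1)): like nodes map to like nodes
```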
ProbAligner Method: Inference (work in progress)
Example question: Is it true that a decomposer is an organism that recycles nutrients?
Example rules (antecedent => consequent):
1. Decomposers are living things that break down and recycle [consumers]
2. Decomposers are living things that recycle their nutrients into the soil
[Figure: alignment graphs for the Question, Rule 1, and Rule 2]
ProbAligner Results (work in progress)
Faster:
- Few variables per rule (independent of extracted rule length)
- No existentially quantified variables => better scaling
More robust
[Figure: runtime in seconds (0 to 200) vs. number of extracted rules (0 to 7), comparing the original approach with ProbAligner]
Conclusion
AI2 is one year old. We are hard at work on:
- Sophisticated IE (rules, processes)
- Probabilistic reasoning over extracted rules
- Question understanding
We utilize standardized tests to assess progress.
Early results on arithmetic & geometry (EMNLP & AAAI).
Data and publications: www.allenai.org
Join Us!