CAPTO Gennaio 2010 1 The problem to solve Nowadays information - PowerPoint PPT Presentation

CAPTO Gennaio 2010 1

The problem to solve • Nowadays information published on internet is not manageable any more; the consequence is that any internet search is not precise. • Due to the overwhelming amount of information and the inherent nature of internet (polling protocol), manual internet retrieval can be a human exhaustive activity; • The relevant information is only a fraction of the available one; • All these problems, that lead to a loss of information (hence power), pertain to the information created by a company as well; CAPTO Gennaio 2010 2

The Goal To have a way to retrieve an information: On time => When needed Precise => Noise reduction Fruitful => Structured and harmonized Complete => Extracted from any media CAPTO Gennaio 2010 3

The solution Capto is the complete solution to create information acquiring and indexing media from multiple sources CAPTO Gennaio 2010 4

Characteristics • Focus on relevant information; • A unique portal to retrieve all the information you need; • Users can subscribe to ‘information channels’, being notified when new pertinent information is created; • A complete information management workflow; CAPTO Gennaio 2010 5

Technical Characteristics • Enhanced crawling capabilities (authentication, javascript processing, WEB 2.0); • Distributed and scalable acquisition from internet sources; • Enhanced Text Indexing (stemming, ranking (BM25), probabilistic search,…); • An highly configurable CMS portal (Jsr-168 compatible portlets, can be registered in any legacy CMS); • Can scale up to millions of indexed documents; CAPTO Gennaio 2010 6

Application domains • Data Monitoring: • Finance, stock markets… • Information monitoring and analysis (document repositories, news, web press, news feeds, blogs, mails,…) • Brand analysis (brand monitoring, sentiment analysis,…) • Massive text indexing and retrieval • …by and large any domain where the retrieval and analysis of information creates new (and more useful) information; CAPTO Gennaio 2010 7

The architecture Domain dependent Domain independent www External File System, DBMS,… CAPTO Gennaio 2010 8

PA Case history:Edison The problem : monitoring of Italian laws and regulations on the environmental impact related with the production of Energy The solution : • Automatic acquisition from several national, regional, federal and local web portals; • A complete validation workflow; • Information precision: before (manual acquisition) <50%, after ~100% CAPTO Gennaio 2010 9

Other products on the market Text indexing and ranking : • Apache Lucene (http://lucene.apache.org) • ClusterClick (www.clusterclick.com) • Amberfish (http://www.etymon.com/tr.html) • Terrier (http://ir.dcs.gla.ac.uk/terrier/) Document Management : • OpenText (www.opentext.com) • SearchExpress (www.searchexpress.com) • IndexData (www.indexdata.com) • AutonomyVirage (www.virage.com) Internet Information Retrieval: • HtDig (www.htdig.org) CAPTO Gennaio 2010 10

Conclusions • Can be used to monitor the acquisition of multimedia from internet sources; • Can be used to index and retrieve textual information from any archived media; • Can be used to shorten the time-to-information; • Can be used to provide a more precise information (and to map the information you have); • Can be easily adopted (low cost of software adoption) • Domain agnostic and multi-language CAPTO Gennaio 2010 11

CAPTO Gennaio 2010 1 The problem to solve Nowadays information - PowerPoint PPT Presentation

CAPTO Gennaio 2010 1 The problem to solve Nowadays information published on internet is not manageable any more; the consequence is that any internet search is not precise. Due to the overwhelming amount of information and the

Solve a Security Problem Instead By Ivan Ristic 1 / 35 Stop complaining and solve a security

Eye and Brain Eye and Brain Central visual pathways 1 2/22/2010 2 2/22/2010 3 2/22/2010 4

Representing Knowledge Given a problem to solve, how do you solve it? What is a solution to

Representing Knowledge Given a problem to solve, how do you solve it? What is a solution to

M. Seri marco.seri@unibo.it Bologna 24 Gennaio 2009 Definitions Definitions Gene Gene :

16 Gennaio 2017 NECSTLab Me Federico Izzo federico.izzo42@gmail.com github.com/Nimayer A

NUOVI FARMACI E TRAPIANTO Udine 21-22 gennaio 2016 Integrazione dei nuovi farmaci nel programma

Problem Definition Problem Definition Problem Definition Problem Definition Problem Definition

Approximation Algorithms Q. Suppose I need to solve an NP-hard problem. What should I do? A.

Texture Synthesis Presented by James Hays Problem Statement 1 Problem Statement Problem

Module 5 19/05/2015 2 Agenda 1. When and how to solve a problem 2. Praise, criticism and

The Four Steps 1 Solve the problem. 2 Write the app. 3 Compile the app. 4 Run the app. CSE 1020

I.M. Skaugen SE 3Q 2010 presentation IMS Innovative Maritime Solutions 15 October 2010 1

Financial Results for 4/2010- -9/2010 9/2010 Financial Results for 4/2010 and and Financial

Persuasive Communication Sweet Spot Relate to the audience Solve a problem Tell a story

2010 Interim Results 2010 Interim Results 12 August 2010 2010 Interim Results 2010 Interim

NIHE Asset Management Strategy Elma Newberry Housing Executive The Regional Housing Authority

Lessons from the Shared Air / Shared Action: Community Empowerment through Low Cost Air Pollution

underwater noise indicators (MSFD D.11, Ind. 11.1.1 and 11.1.2) Jukka Pajala HELCOM MONAS

Measuring Immediate Adaptation Performance for Neural Machine Translation Patrick Simianer , Joern

Unleashing the Power of GPUs over the Web Vishal Vaidyanathan Royal Caliber LLC GPUs are

People helping pets, pets helping people A1 Petline A1 Petline was started in 1995 by a

T he T histle a nd the Ma ple L e a f: Inte rna tiona l Colla bora tion to e nha nc e CPD

Well-Being in Resource Communi2es: Reflec2ons from the

Sambuz

Useful Links

Newsletter

Mail Us