what questions do you have
play

What questions do you have? Analytics Accelerator 1 1 My - PowerPoint PPT Presentation

What questions do you have? Analytics Accelerator 1 1 My background Math, Stats, Data Science, Solution Architect, CS / ML Data Engineer Presales Entity Resolution and Data Quality Problem Traditional Methods Dont Scale with Number of


  1. What questions do you have? Analytics Accelerator 1 1

  2. My background Math, Stats, Data Science, Solution Architect, CS / ML Data Engineer Presales

  3. Entity Resolution and Data Quality Problem

  4. Traditional Methods Don’t Scale with Number of Sources Marginal Cost Opportunity For Strategic Data Asset Number of Siloed Data Sources

  5. Why I get excited about Enterprise: Scale, Scale, Scale Problem Solution Results 8 Divisions 500+ Sales Revenue Suppliers Marketing Impact ERP Systems 100K+ People Customers Logistics Total Suppliers Landed Cost $300M+ 10M+ Parts 0.5% of Direct Spend Customers Confidential

  6. Dirty Little Secret: Data Variety in Enterprise What most people think enterprise What enterprise data is really like - “random data salad” data looks like Prone to constant change/entropy “Data M&A Hoarding” Politics Leadership Dynamic Schema Restructuring Legacy Changes DBs - Mongo et al Burden 6 CONFIDENTIAL

  7. What Tamr Does Tamr solves the enterprise data variety problem to power transformative analytic and operational outcomes. 10X Reduction $500M+ Savings Customer Insights 5000+ Studies In New Data Set Integration Unified clinical study data From Sourcing Analytics Unified buyer profiles across From 6 Months to 2 Weeks to empower researchers Across Businesses siloed dealer systems in 30+ geos Video Case Study Video Case Study Video Case Study Case Study CONFIDENTIAL

  8. Reality for Global Corporate IT as Data Broker Most data is untreated + unprepared for expensive analytics tools Sales HR Finance Divisions Marketing Manufacturing Engineering

  9. Some Options Option #1 - Deny Variety - use information that is easiest/closest Option #2 - Manage Variety incrementally - using traditional approaches: ● Standardization ● Aggregation ● Master Data Management ● Rationalize Systems ● Throw Bodies at it ● Improve Individual Productivity Option #3 - Embrace Variety using probabilistic/model based approach - Tamr

  10. Option #1: “Deny” Variety Use only the information that is closest, most familiar, easiest to obtain

  11. Option #2: “Manage” Variety Using Traditional Approaches Traditional Data Management Approaches: Necessary but not sufficient ● Standardization ● Aggregation ● Master Data Management ● Rationalize Systems ● Throw Bodies at it One Schema to Rule them All ● Improve Individual Productivity

  12. Option #2: “Manage” Variety Using Traditional Approaches Traditional Data Management Approaches: Necessary but not sufficient ● Standardization ● Aggregation ● Master Data Management ● Rationalize Systems ● Throw Bodies at it ● Improve Individual Productivity

  13. Option #2: “Manage” Variety Using Traditional Approaches Traditional Data Management Approaches: Necessary but not sufficient ● Standardization ● Aggregation ● Master Data Management ● Rationalize Systems ● Throw Bodies at it ● Improve Individual Productivity

  14. Option #2: “Manage” Variety Using Traditional Approaches Traditional Data Management Approaches: Necessary but not sufficient ● Standardization ● Aggregation ● Master Data Management ● Rationalize Systems ● Throw Bodies at it ● Improve Individual Productivity

  15. Option #2: “Manage” Variety Using Traditional Approaches Traditional Data Management Approaches: Necessary but not sufficient ● Standardization ● Aggregation ● Master Data Management ● Rationalize Systems ● Throw Bodies at it ● Improve Individual Productivity

  16. Option #2: “Manage” Variety Using Traditional Approaches Traditional Data Management Approaches: Necessary but not sufficient ● Standardization ● Aggregation ● Master Data Management ● Rationalize Systems ● Throw Bodies at it ● Improve Individual Productivity

  17. Logical Evolution to Probabilistic/Model-Based Approach Probabilistic (Tamr) Complements , NOT Replaces, Deterministic (MDM) Today Future Probabilistic Probabilistic Deterministic Deterministic

  18. Option #3: “Embrace” Variety -- Tamr’s NextGen Approach Managing enterprise information as an asset requires a new, bottom-up design pattern Combine Consolidate Classify ALL your metadata and Entities and attributes to Organize your data into an map it to logical entities remove information silos analytics-ready hierarchy

  19. The Two Second Rule. �A�ythi�g that takes a hu�a� lo�ger tha� two seconds is probably unlikely for ML to auto�atically lear�.� - Andrew Ng, Chief Scientist, Baidu 19 CONFIDENTIAL

Recommend


More recommend