with tableau
play

with Tableau Avirup Chakraborty(MDS201908) Debangshu - PowerPoint PPT Presentation

Big Data Visualization with Tableau Avirup Chakraborty(MDS201908) Debangshu Bhattacharya(MDS201910) Ipsita Ghosh(MDS201913) Swaraj Bose(MDS201936) Sreya K.K.(MDS201804) What is big data? Extremely large data sets that may be analyzed


  1. Big Data Visualization with Tableau Avirup Chakraborty(MDS201908) Debangshu Bhattacharya(MDS201910) Ipsita Ghosh(MDS201913) Swaraj Bose(MDS201936) Sreya K.K.(MDS201804)

  2. What is big data? Extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behaviour and interactions.  Velocity  Variety  Volume  Veracity

  3. Why data visualization is important?  communicates relationships of the data with images  allows trends and patterns to be more easily seen  give meaning to complicated datasets so that their message is clear and concise  outlier detection becomes easier  results from complex algorithms are much easier to understand in a visual format  summary of data

  4. Challenges in big data visualization (4 V’s yet again!)  Traditional visualization tools are not capable of handling large datasets. Eg: MS Excel, Minitab  Providing low latency in visualization  Parallelization is required  Dimensions of the data has to be carefully chosen  Most current visualization tools have low performance w.r.t scalability, functionality and response time

  5. Steps for big data visualization Parsing Mining Data and hidden acquisition filtering patterns Data Refinement visualization

  6. What is Tableau?  a powerful and fast growing data visualization tool used in the Business Intelligence Industry.  connects easily to nearly any data source.  allows for instantaneous insight on data by transforming it into interactive visualizations called dashboards.

  7. Why is Tableau helpful?  Handle large volume of data  No scripts or code required, provides user interface  Filter multiple datasets simultaneously  Creates interactive and shareable dashboards depicting trends and variations  Incorporate other programming languages to do complex calculations  And many more….

  8. Trivia  Founded: January 2003, California  Founders: Christian Chabot, Chris Stolte (Stanford University) , Pat Hanrahan  Headquarters: Seattle, California  Website: https://www.tableau.com/  Built using C++  Latest version: 2020.1

  9. Tableau Desktop: A data visualization tool designed to create data visualization, report and dashboard in a fast and intelligent way.  Users can connect to multiple data sources, carry out multi-dimensional data analysis, create dashboards or report, modify metadata and publish a complete workbook to Tableau server if needed.  Adapt your content performance for any size and any device (i.e. Desktop, laptop, tablet or even a smartphone!).

  10. Personal Edition Tableau Desktop Professional Edition

  11. Personal Edition Professional Edition Connects to limited data sources as: Connects to a wider variety of data sources: Microsoft Access, Amazon Redshift , Microsoft Excel, Google Analytics , Microsoft Azure, Google BigQuery , Tableau Data Extract, Hortonworks Hadoop , Text files (CSVs). OLAP databases , Salesforce . Cannot connect to Tableau Server but allows Enables connection to Tableau Server and users to create package files for Tableau creating package files for Tableau Reader. Reader. Costs $999 per user. Costs $1999 per user.

  12. Tableau Server: Tableau server is essentially an online hosting platform to hold all your tableau workbooks, data sources and more. It works like any other server, you can store things here and they will safe from fires and pesky hackers. So, what are the advantages of Tableau Server??

  13. 1. Firstly…. COLLABORATION!  Being a Tableau product, Tableau Server lets us to use the functionality of Tableau, without needing to always be downloading and opening workbooks.  Users need not to install Tableau Desktop on their machine, and they can still interact with dashboards shared with them.

  14. 2. CLOUD SUPPORT Tableau server can be deployed on-premises as well as in public clouds like Azure, AWS, IBM Cloud, Google Cloud Platform etc. It also enables an administrator to track and manage the content, licenses, performance, and permissions for data sources with ease. 3. COMPATIBILITY Tableau Server supports variety of Android apps, iPhone apps and Web browsers like Internet Explorer, Mozilla Firefox, Google Chrome and Safari.

  15. 4. LIMITED ACCESS DESIGN On Tableau Server, we can set permissions to different bits of work, to allow us as an organization to determine who can access and interact with what. Let us illustrate this using a really simple example >>

  16. Consider this ‘imaginary’ company consisting: and Tony Stark Dr. Bruce Banner Nick Fury ❖ Tony Stark has access on server to upload and edit work in a project containing test documents. ❖ Dr. Banner can interact with only the production quality documents. ❖ And…Nick Fury can access but not edit the final presentation documents. ❖ Of course… Loki cannot even have a look at the documents! (at least we can hope so)

  17. Tableau Public: Tableau Public is a FREE tool that anyone can use to connect to data, create interactive data visualizations and publish them on the web.  Once these visualizations are in Tableau public one can share to social medias or even can embed on webpages.  Since everyone has access to published data, user should be careful not to put the proprietary data on Tableau Public.

  18. Limitations to Tableau Public:  Row limitation: Limited to 15,000,000 rows of data per workbook.  Limited storage: Limited to ten gigabytes (10 GB) of storage space for your workbooks.  No workbook privacy: Tableau Public does not allow to save workbooks locally. One has to save them publicly which means that everyone can see the data since it’s saved on the cloud.  No security: As visualizations are public so anyone can access the data and make change by downloading the workbook.

  19. Tableau Online: Tableau Online is a hosted version of Tableau Server. It is the business analytics platform where people can share dashboards, interact with report and gain insights. It is hosted in the cloud so that there is no hardware, no set-up time needed. “Want the sharing and collaboration of Server, but without having to actually manage a server? Then you want Tableau Online. Secure. Scalable. And Look Ma — No hardware to maintain!” - https://www.tableau.com/products Roughly, Tableau Online can also be thought as a private version (and paid, obviously) of Tableau Public.

  20. Key Features:  Fully hosted in the cloud. Servers are managed by Tableau Team.  Supports live data connections to Amazon Redshift, Google BigQuery, as well as to SQL-based sources hosted on cloud platforms.  Ideal for small number of users who need to be able to interact with the data and visualizations in a secure way.

  21. Key Features (Contd.):  Easily accessible from a browser or Tableau Mobile App.  Authenticate users through TableauID (email address and password). No guest access allowed.  Subscription rate is $500 per user for one year (half the price of individual Tableau Server Licenses)

  22. Tableau Reader  Tableau Reader is a FREE desktop application  Allows interaction with data visualizations, created with Tableau Desktop.  Users can filter, drill-down and view the details of the data as long as the author allows.

  23. Tableau Start Page

  24. Canvas- Displays Left pane- Displays the information about connected data source how the data and other details about source is set up and your data. options for combining the data. Data grid - Displays first 1,000 rows of the data contained in the Tableau data source. Metadata grid- Displays the fields in your data source as rows.

  25. Tableau Worksheet

  26. The Dashboard Workspace

  27. DEMO

  28. Philosophy of Tableau working with Big Data • Democratisation of Data : Knowledge workers of all skill levels should be able to access and analyze data wherever it resides. • Partnerships within the Big Data Ecosystem

  29. Overview of how Tableau works with big data

  30. Data access and connectivity To enable analysis of data of any size and format, Tableau supports broad access to data wherever it lives. o SQL and NoSQL based connections — Tableau uses SQL to interface with Hadoop, NoSQL databases and Spark. o Open Database Connectivity(ODBC) — By using ODBC, one can access any data source that supports the SQL standard and implements the ODBC API. For Hadoop, this includes interfaces such as Hive Query Language (HiveQL), Impala SQL, BigSQL and Spark SQL. o Web Data Connector — With the Tableau Web Data Connector SDK, users can build connections to data that lives outside of the existing connectors which is any data accessible over HTTP , including internal web services, JSON data, and REST API.

  31. Fast Interaction with all data at scale 1. Hyper data engine • Hyper is a high-performance in-memory data engine technology that helps customers analyze large or complex data sets faster. • They use dynamic code generation and cutting-edge parallelism techniques to achieve high query speed. • Hyper can also augment and accelerate slower data sources by creating an extract of the data and bringing it in-memory.

Recommend


More recommend