Introduction to Business Statistics Introduction to Business - PowerPoint PPT Presentation

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics Introduction to Business Statistics QM 120 Ch Chapter 3 t 3 Spring 2008 Dr. Mohammad Zainal

Measures of central tendency for ungrouped data 2 2 � Graphs are very helpful to describe the basic shape of a data distribution; “a picture is worth a thousand words.” There are distribution; a picture is worth a thousand words. There are limitations, however to the use of graphs. � One way to overcome graph problems is to use numerical � One way to overcome graph problems is to use numerical measures, which can be calculated for either a sample or a population. population. � Numerical descriptive measures associated with a population of measurements are called parameters ; those computed from of measurements are called parameters ; those computed from sample measurements are called statistics . � A measure of central tendency gives the center of a histogram � A measure of central tendency gives the center of a histogram or frequency distribution curve. � The measures are: mean , median , and mode . Th di d d QM-120, M. Zainal

Measures of central tendency for ungrouped data 3 3 � Data that give information on each member of the population or sample individually are called ungrouped data, whereas or sample individually are called ungrouped data, whereas grouped data are presented in the form of a frequency distribution table. Mean � The mean (average) is the most frequently used measure of ( g ) q y central tendency. � The mean for ungrouped data is obtained by dividing the sum e ea o u g ouped data is obtai ed by di idi g t e su of all values by the number if values in the data set. Thus, ∑ ∑ x x μ = Mean for population : N ∑ x x = = Mean Mean for for sample sample : : x n QM-120, M. Zainal

Measures of central tendency for ungrouped data 4 4 Example: The following table gives the 2002 total payrolls of five MLB teams. 2002 total payroll MLB team (millions of dollars) Anaheim Angels 62 Atlanta Braves 93 New York Yankees New York Yankees 126 126 St. Louis Cardinals 75 Tampa Bay Devil Rays 34 The Mean is a balancing point 34 62 75 93 126 QM-120, M. Zainal

Measures of central tendency for ungrouped data 5 5 � Sometimes a data set may contain a few very small or a few very large values. Such values are called outliers or extreme very large values. Such values are called outliers or extreme values. � We should be very cautious when using the mean It may not � We should be very cautious when using the mean. It may not always be the best measure of central tendency. Example: The following table lists the 2000 populations (in Example: The following table lists the 2000 populations (in thousands) of five Pacific states. Excluding California Population + + + State 5894 3421 627 1212 = = (thousands) Mean 2788 . 5 4 Washington 5894 Oregon 3421 Including California Alaska 627 + + + + An 5894 3421 627 1212 33,872 = = Mean 9005 . 2 Hawaii 1212 outlier 5 5 33,872 California QM-120, M. Zainal

Measures of central tendency for ungrouped data 6 6 Weighted Mean � Sometimes we may assign weight (importance) to each � Sometimes we may assign weight (importance) to each observation before we calculate the mean. � A mean computed in this manner is refereed to as a weighted mean and it is given by g y ∑ W x = i i x ∑ ∑ W i i where x i is the value of observation i and W i is weight for obser ation i observation i. QM-120, M. Zainal

Measures of central tendency for ungrouped data 7 7 Example: Consider the following sample of four purchases of one stock in the KSE Find the average cost of the stock one stock in the KSE. Find the average cost of the stock. Purchase Price Quantity 1 1 .300 300 5 000 5,000 2 .325 15,000 3 .350 10,000 4 .295 20,000 QM-120, M. Zainal

Measures of central tendency for ungrouped data 8 8 Median � The median is the value of the middle term in a data set has been ranked in increasing order. th + ⎛ ⎞ n 1 = ⎜ ⎟ Median Value of the term in a ranked data set ⎝ ⎝ ⎠ ⎠ 2 2 � The value .5(n + 1) indicates the position in the ordered data set. � If n is even, we choose a value halfway between the two middle observations Example: Find the median for the following two sets of measurements 2, 9, 11, 5, 6 2, 9, 11, 5, 6, 27 QM-120, M. Zainal

Measures of central tendency for ungrouped data 9 9 Mode � The mode is the value that occurs with the highest frequency g q y in a data set. � A data with each value occurring only once has no mode. � A data with each value occurring only once has no mode. � A data set with only one value occurring with highest frequency has only one mode, it is said to be unimodal . frequency has only one mode it is said to be unimodal � A data set with two values that occurs with the same (highest) frequency has two modes it is said to be bimodal frequency has two modes, it is said to be bimodal . � If more than two values in a data set occur with the same (hi h (highest) frequency, it is said to be multimodal . t) f it i id t b lti d l QM-120, M. Zainal

Measures of central tendency for ungrouped data 10 10 Example: You are given 8 measurements: 3, 5, 4, 6, 12, 5, 6, 7. Find a) The mean. b) The median. c) The mode. Relationships among the mean, median, and Mode p g � Symmetric histograms when Mean = Median = Mode Mean Median Mode � Right skewed histograms when Mean > Median > Mode Mean > Median > Mode � Left skewed histograms when Mean < Median < Mode d d QM-120, M. Zainal

Measures of dispersion for ungrouped data 11 11 Range � Data sets may have same center but look different because of y the way the numbers are spread out from center. Example: Company 1: 47 38 35 40 36 45 39 Company 2: 70 33 18 52 27 � Measure of variability can help us to create a mental picture of the spread of the data. � The range for ungrouped data Range = Largest value – Smallest value � The range, like the mean, is highly influenced by outliers. � The range is based on two values only. g y QM-120, M. Zainal

Measures of dispersion for ungrouped data 12 12 Variance and standard deviation � The standard deviation is the most used measure of dispersion. p It tells us how closely the values of a data set are clustered around the mean. � In general, larger values of standard deviation indicate that values of that data set are spread over a relatively larger range p y g g around the mean and vice versa. ( ( ) ) ( ( ) ) ∑ ∑ ∑ ∑ 2 2 x x x x ∑ ∑ ∑ ∑ − − 2 2 2 2 x x N n σ = = 2 2 Population : , and sample : s − N n 1 σ = σ s = 2 2 and s � Standard deviation is always non ‐ negative � Standard deviation is always non negative QM-120, M. Zainal

Measures of dispersion for ungrouped data 13 13 Example: Find the standard deviation of the data set in table. 2002 payroll MLB team (millions of dollars) Anaheim Angels A h i A l 62 62 Atlanta Braves 93 New York Yankees 126 St. Louis Cardinals 75 Tampa Bay Devil Rays 34 QM-120, M. Zainal

Measures of dispersion for ungrouped data 14 14 � In some situations we may be interested in a descriptive statistics that indicates how large is the standard deviation statistics that indicates how large is the standard deviation compared to the mean. � It is very useful when comparing two different samples with � It is very useful when comparing two different samples with different means and standard deviations. � It is given by: � It is given by: ⎛ σ ⎛ σ ⎞ ⎞ = ⎜ × ⎟ CV 100 ⎟ % ⎜ μ ⎝ ⎠ QM-120, M. Zainal

Mean, variance, and standard deviation for grouped data 15 15 Mean for grouped data � Once we group the data, we no longer know the values of g p , g individual observations. � Thus, we find an approximation for the sum of these values. � Thus, we find an approximation for the sum of these values. ∑ ∑ mf f μ = Mean M f for population l ti : N ∑ mf = Mean Mean for for sample sample : : x x n Where m is the midpoint and f is the frequency of a class. QM-120, M. Zainal

Mean, variance, and standard deviation for grouped data 16 16 Variance and standard deviation for grouped data ( ) − ∑ 2 mf ∑ ∑ 2 m f f N σ = 2 Population : N ( ( ) ) ∑ ∑ 2 mf mf ∑ ∑ − 2 2 m f n = 2 s Sample : − n 1 Where m is the midpoint and f is the frequency of a class. σ = σ s = 2 2 and s QM-120, M. Zainal

Mean, variance, and standard deviation for grouped data 17 17 � Example: The table below gives the frequency distribution of the daily commuting times (in minutes) from home to CBA for all 25 students in QMIS 120. Calculate the mean and the standard deviation of the daily commuting times. Daily commuting time (min) f 0 to less than 10 4 10 to less than 20 10 to less than 20 9 9 20 to less than 30 6 30 to less than 40 4 40 to less than 50 2 Total 25 QM-120, M. Zainal

Introduction to Business Statistics Introduction to Business - PowerPoint PPT Presentation

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics Introduction to Business Statistics QM 120 Ch Chapter 3 t 3 Spring 2008 Dr. Mohammad Zainal Measures of central tendency for ungrouped data

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Areal statistics Barry Rowlingson Research Fellow DataCamp Spatial Statistics in R Borders

1 Practical Information 2 Introduction to Statistics Per Bruun Brockhoff 3 Descriptive Statistics:

Statistics for Social Sciences I: Introduction to Statistics Introduction to Statistics

Introduction to Business Statistics Professor Jarad Niemi STAT 226 - Iowa State University

The Pulse monitors: Statistics Smartpods PULSE 1 - Improve Facility Efficiencies 2 - Increase

Quality Assurance in Official Statistics Directorate of Economics & Statistics, Planning

UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics

The Statistics Network The Statistics Network Statistics network Compute servers Desktop PCs

Business and Business Environment Business and Business Environment Introduction Business is

Introduction to Business Statistics Introduction to Business Statistics QM 120 Ch Chapter 4 t

Global assessment of linking trade statistics and the business register Nancy Snyder United

Order Statistics and Applications Rosemary Smith Introduction to Order Statistics Unordered

Business statistics and Globalisation UN Committee of Experts on Business Statistics First

Order Statistics and Pitman Closeness Katherine F. Davies Department of Statistics University of

Advanced Statistics Janette Walde janette.walde@uibk.ac.at Department of Statistics University

TheAmbiguityofDisrup3on NatalieKlym nklym@cfp.mit.edu Overview

Prediction Markets Thomas Steinke & David Rezza Baqaee October 17, 2010 Introduction

Agenda 1) Recap of Process- Where are We? 2) Research with other MLPs- Results 3) Review of

Mark Frerker MLB: $7.7 billion revenue in 2012 Statistics driven Moneyball

Delving further into privacy policies Engineering & Public Policy Lorrie Cranor October

Diagnosis (05) Qualitative Reasoning Alban Grastien alban.grastien@rsise.anu.edu.au Issue

Reflexive Modernization Key Concepts Tension between flexibility & control designing

Cultural heritage and the ancient world: authenticity, ethics, collecting, and 'ownership'

Sambuz

Useful Links

Newsletter

Mail Us

Introduction to Business Statistics Introduction to Business - PowerPoint PPT Presentation

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics Introduction to Business Statistics QM 120 Ch Chapter 3 t 3 Spring 2008 Dr. Mohammad Zainal Measures of central tendency for ungrouped data

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Areal statistics Barry Rowlingson Research Fellow DataCamp Spatial Statistics in R Borders

1 Practical Information 2 Introduction to Statistics Per Bruun Brockhoff 3 Descriptive Statistics:

Statistics for Social Sciences I: Introduction to Statistics Introduction to Statistics

Introduction to Business Statistics Professor Jarad Niemi STAT 226 - Iowa State University

The Pulse monitors: Statistics Smartpods PULSE 1 - Improve Facility Efficiencies 2 - Increase

Quality Assurance in Official Statistics Directorate of Economics &amp; Statistics, Planning

UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics

The Statistics Network The Statistics Network Statistics network Compute servers Desktop PCs

Business and Business Environment Business and Business Environment Introduction Business is

Introduction to Business Statistics Introduction to Business Statistics QM 120 Ch Chapter 4 t

Global assessment of linking trade statistics and the business register Nancy Snyder United

Order Statistics and Applications Rosemary Smith Introduction to Order Statistics Unordered

Business statistics and Globalisation UN Committee of Experts on Business Statistics First

Order Statistics and Pitman Closeness Katherine F. Davies Department of Statistics University of

Advanced Statistics Janette Walde janette.walde@uibk.ac.at Department of Statistics University

TheAmbiguityofDisrup3on NatalieKlym nklym@cfp.mit.edu Overview

Prediction Markets Thomas Steinke &amp; David Rezza Baqaee October 17, 2010 Introduction

Agenda 1) Recap of Process- Where are We? 2) Research with other MLPs- Results 3) Review of

Mark Frerker MLB: $7.7 billion revenue in 2012 Statistics driven Moneyball

Delving further into privacy policies Engineering &amp; Public Policy Lorrie Cranor October

Diagnosis (05) Qualitative Reasoning Alban Grastien alban.grastien@rsise.anu.edu.au Issue

Reflexive Modernization Key Concepts Tension between flexibility &amp; control designing

Cultural heritage and the ancient world: authenticity, ethics, collecting, and 'ownership'

Sambuz

Useful Links

Newsletter

Mail Us

Quality Assurance in Official Statistics Directorate of Economics & Statistics, Planning

Prediction Markets Thomas Steinke & David Rezza Baqaee October 17, 2010 Introduction

Delving further into privacy policies Engineering & Public Policy Lorrie Cranor October

Reflexive Modernization Key Concepts Tension between flexibility & control designing