ECON 214 Elements of Statistics for Economists Session 2 – Presentation of Data: Graphical Methods Lecturer: Dr. Bernardin Senadza , Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education School of Continuing and Distance Education 2014/2015 – 2016/2017
Session Overview • The aim of descriptive statistical methods is to present information in clear, concise and accurate manner. This session will illustrate the methods for summarizing data in a very informative way, using tables and graphs. • At the end of the session, the student will – Be able to organize data into frequency distribution – Be able to portray a frequency distribution in histogram, frequency polygon, and cumulative frequency polygon – Be able to present data using bar charts, line graphs and pie charts Slide 2
Session Outline The key topics to be covered in the session are as follows: • Frequency distributions • Graphic presentation of frequency distributions • Other graphic forms of presenting data Slide 3
Reading List • Michael Barrow, “Statistics for Economics, Accounting and Business Studies”, 4 th Edition, Pearson • R.D. Mason , D.A. Lind, and W.G. Marchal , “Statistical Techniques in Business and Economics”, 10 th Edition, McGraw- Hill Slide 4
Topic One FREQUENCY DISTRIBUTIONS Slide 5
FREQUENCY DISTRIBUTIONS • Raw data in itself is meaningless unless it can be presented in an informative way. • Descriptive statistics summarizes raw information in a comprehensible way. • We may use tables, graphs and/or numeric values. • A frequency distribution is grouping of data into categories showing the number of observations in each mutually exclusive category. Slide 6
FREQUENCY DISTRIBUTIONS • The purpose of constructing a frequency distribution is to condense raw data into a table that is more readily comprehended. • Although we lose the identification of the specific value of each measurement, the advantage of a frequency distribution is that such a table makes it easier to interpret the reported values. Slide 7
FREQUENCY DISTRIBUTIONS • Some definitions are worth knowing: – Class limits : They are the smallest and largest observations (values) in each class of a frequency distribution. Each class has two limits; we have the lower class limit and the upper class limit. – Class frequency : The number of observed values in each class. – Class boundaries : They denote specific points along a measurement scale separating adjoining classes. The lower class boundary is obtained by subtracting 0.5 from the lower class limit, while the upper class boundary is obtained by adding 0.5 to the upper class limit. Slide 8
FREQUENCY DISTRIBUTIONS – Class interval or width (or size) : The number of measurement units or range of values included in each class. It is obtained as upper boundary minus lower boundary. It is also obtained by subtracting the lower limit of a class from the lower limit of the next class. – Class mark or midpoint : The value that divides a class into two equal parts. This is the simple average of the upper and lower class limits. • Note that in some frequency distributions, class boundaries class limits overlap. Slide 9
FREQUENCY DISTRIBUTIONS • Illustration – The Dean of the School of Continuing and Distance Education wishes to determine the hours of study Distance Education students do. He selects a random sample of 30 students and determines the number of hours each student studies per week as follows: – 15.0, 23.7, 19.7, 15.4, 18.3, 23.0, 14.2, 20.8, 13.5, 20.7, 17.4, 18.6, 12.9, 20.3, 13.7, 21.4, 18.3, 29.8, 17.1, 18.9, 9.1, 26.1, 15.7, 14.0, 17.8, 36.3, 23.2, 12.9, 27.1, 16.6. • Organize the data into a frequency distribution. Slide 10
FREQUENCY DISTRIBUTIONS CONT’D • Assume the number of classes is pre-determined to be 6. • The class intervals used in the frequency distribution should be equal. • Determine the class interval by using the formula: i = (highest value-lowest value)/number of classes i = 36.3- 9.1/6 = 4.53 ≈ 5. • Starting the lower limit of the first class at 8, we can have the following classes: 8-12; 13-17; 18-22; 23-27; 28-32; 33-37. • Count the number of values in each class and indicate the number against each class. Slide 11
FREQUENCY DISTRIBUTIONS CONT’D The class interval is 5 (i.e. 13 minus 8). Hours of Frequency, f studying 8-12 1 13-17 12 18-22 10 23-27 5 28-32 1 33-37 1 Total 30 Slide 12
FREQUENCY DISTRIBUTIONS CONT’D • The relative frequency of a class is obtained by dividing the class frequency by the total frequency. Hours of Frequency, f Relative frequency, studying rf 1 / 30 8-12 1 12 / 30 13-17 12 10 / 30 18-22 10 5 / 10 23-27 5 1 / 30 28-32 1 1 / 30 33-37 1 Total 30 1 Slide 13
Topic Two GRAPHIC PRESENTATION OF A FREQUENCY DISTRIBUTION Slide 14
Graphic Presentation of a Frequency Distribution • The three commonly used graphic forms are histograms, frequency polygons, and a cumulative frequency curve (ogive). Slide 15
Histogram • Histogram: A graph in which the classes are marked on the horizontal axis and the class frequencies on the vertical axis. The class frequencies are represented by the heights of the bars and the bars are drawn adjacent to each other and without spaces/gaps among the bars. 14 12 Frequency 10 8 6 4 2 0 10 15 20 25 30 35 Hours spent studying Slide 16
Frequency Polygon • A frequency polygon consists of line segments connecting the points formed by the class midpoint and the class frequency. 14 12 10 Frequency 8 6 4 2 0 10 15 20 25 30 35 Hours spent studying Slide 17
Cumulative Frequency Distribution (Ogive) • A cumulative frequency curve (ogive) is used to determine how many or what proportion of the data values are below a certain value (or the upper boundary of each class). • The cumulative frequency for a given class is obtained by adding the frequency of that class to the cumulative frequency of the preceding class. • The cumulative frequency for the last class always equals the total frequency. Hours of studying less Cumulative than frequency 7.5 0 12.5 1 17.5 13 22.5 23 27.5 28 32.5 29 37.5 30 Slide 18
• The Ogive is obtained as line segments connecting the points formed by the class midpoint and the cumulative frequency. – Note: alternatively the upper class boundary is used. 35 30 25 20 Frequency 15 10 5 0 10 15 20 25 30 35 Hours Spent Studying Slide 19
• Topic Three OTHER GRAPHIC FORMS OF PRESENTING DATA Slide 20
The Bar Chart • A bar chart depicts frequencies for different categories (of data) by a series of bars (separated by spaces/gaps). – Consider the data on the education level and employment status data of the labour force of a country (measured in ’000s). – Note that the data is already summarised (cross-tabulated). – We can graphically represent the data by bar charts. Higher A levels Other No Total education qualification qualification In work 8,224 5,654 11,167 2,583 27,628 Unemployed 217 231 693 303 1,444 Inactive 956 1,354 3,107 2,549 7,966 Total 9,397 7,239 14,967 5,435 37,038 Slide 21
Bar chart by education qualification of people who work 12000 Number of people (000s) 10000 8000 6000 4000 2000 0 Higher Advanced level Other No education qualifications qualifications • The bar chart is for education levels of only people who work. • The height of each bar is determined by the associated frequency. • The first bar is 8224 units high, the second is 5654, and so on. The ordering of the bars could be reversed (‘no qualifications’ becoming the first category) without altering the message . Slide 22
Multiple bar chart: Educational qualifications by employment category 12000 Number of people (000s) 10000 8000 In work 6000 Unemployed Inactive 4000 2000 0 Higher Advanced Other No education level qualifications qualifications • Note that the bars for unemployed and inactive are constructed in the same way as for those in work: the height of the bar is determined by the frequency or number of persons in each category. Slide 23
Stacked bar chart: Educational qualifications and employment status 16000 14000 Number of people (000s) 12000 10000 Inactive 8000 Unemployed In work 6000 4000 2000 0 Higher education Advanced level Other No qualifications qualifications • Note: The overall height of each bar is determined by the sum of the frequencies of the category, given in the final raw of the Table containing the data. Slide 24
Stacked bar chart: Educational qualifications and employment status (percentages) 100% 80% Inactive 60% Unemployed 40% In work 20% 0% Higher Advanced Other No education level qualifications qualifications • Percentages in each employment category, by educational qualification. – That is, number of persons in each employment status in each educational qualification category has been converted into a percentage of the total persons in each educational qualification category. • Instead of bars, we could have used lines in each of the 4 charts to obtain line charts. Slide 25
Recommend
More recommend