The General Social S u r v e y IN FE R E N C E FOR C ATE G OR IC - PowerPoint PPT Presentation

The General Social S u r v e y IN FE R E N C E FOR C ATE G OR IC AL DATA IN R Andre w Bra y Assistant Professor of Statistics at Reed College

INFERENCE FOR CATEGORICAL DATA IN R

E x ploring GSS library(dplyr) glimpse(gss) Observations: 3,300 Variables: 25 $ id <dbl> 518, 1092, 2094, 229, 979, 554, 491, 319, 3143, 1... $ year <dbl> 1982, 1982, 1982, 1982, 1982, 1982, 1982, 1982, 1... $ age <fct> 49, 22, 26, 75, 71, 33, 56, 33, 69, 40, 44, 42, 5... $ class <fct> WORKING CLASS, WORKING CLASS, WORKING CLASS, LOWE... $ degree <fct> HIGH SCHOOL, HIGH SCHOOL, HIGH SCHOOL, LT HIGH SC... $ sex <fct> MALE, MALE, MALE, MALE, FEMALE, FEMALE, MALE, FEM... $ happy <fct> HAPPY, HAPPY, HAPPY, HAPPY, HAPPY, HAPPY, HAPPY, ... INFERENCE FOR CATEGORICAL DATA IN R

E x ploring GSS gss2016 <- filter(gss, year == 2016) ggplot(gss2016, aes(x = happy)) + geom_bar() INFERENCE FOR CATEGORICAL DATA IN R

E x ploring GSS p_hat <- gss2016 %>% summarize(prop_happy = mean(happy == "HAPPY")) %>% pull() p_hat 0.7733333 INFERENCE FOR CATEGORICAL DATA IN R

General 95% confidence inter v al ( ^ − 2 × SE , ^ + 2 × SE ) p p Sample proportion pl u s or min u s t w o standard errors INFERENCE FOR CATEGORICAL DATA IN R

Bootstrap INFERENCE FOR CATEGORICAL DATA IN R

Bootstrap Confidence Inter v al library(infer) boot boot <- gss2016 %>% specify(response = happy, Response: happy (factor) success = “HAPPY”) %>% # A tibble: 500 x 2 generate(reps = 500, replicate stat type = "bootstrap") %>% <int> <dbl> calculate(stat = "prop") 1 1 0.827 2 2 0.740 3 3 0.780 4 4 0.773 5 5 0.747 6 6 0.753 INFERENCE FOR CATEGORICAL DATA IN R

Bootstrap Confidence Inter v al ggplot(boot, aes(x = stat)) + geom_density() INFERENCE FOR CATEGORICAL DATA IN R

Bootstrap Confidence Inter v al SE <- boot %>% summarize(sd(stat)) %>% pull() SE 0.03482251 ( ^ − 2 × SE , ^ + 2 × SE ) p p c(p_hat - 2 * SE, p_hat + 2 * SE) 0.7051883 0.8412784 INFERENCE FOR CATEGORICAL DATA IN R

Let ' s practice ! IN FE R E N C E FOR C ATE G OR IC AL DATA IN R

Interpreting a Confidence Inter v al IN FE R E N C E FOR C ATE G OR IC AL DATA IN R Andre w Bra y Assistant Professor of Statistics at Reed College

Confidence inter v als Concl u sion : the tr u e proportion of Americans that are happ y is bet w een 0.705 and 0.841. What do w e mean b y con � dent ? INFERENCE FOR CATEGORICAL DATA IN R

Dataset 1 ds1 <- filter(gss, year == 2016) p_hat <- ds1 %>% summarize(mean(happy == "HAPPY")) %>% pull() SE <- ds1 %>% specify(response = happy, success = "HAPPY") %>% generate(reps = 500, type = "bootstrap") %>% calculate(stat = "prop") %>% summarize(sd(stat)) %>% pull() c(p_hat - 2 * SE, p_hat + 2 * SE) 0.7073114 0.8393553 INFERENCE FOR CATEGORICAL DATA IN R

Confidence Inter v als Interpretation : “ We ’ re 95% con � dent that the tr u e proportion of Americans that are happ y is bet w een 0.705 and 0.841.” Width of the inter v al a � ected b y n con � dence le v el p INFERENCE FOR CATEGORICAL DATA IN R

Let ' s practice ! IN FE R E N C E FOR C ATE G OR IC AL DATA IN R

The appro x imation shortc u t IN FE R E N C E FOR C ATE G OR IC AL DATA IN R Andre w Bra y Assistant Professor of Statistics at Reed College

Confidence Inter v als SE Standard errors increase w hen n is small 0.009998905 p is close to 0.5 SE_small_n 0.03809731 SE_low_p 0.00547912 INFERENCE FOR CATEGORICAL DATA IN R

The normal distrib u tion A . K . A the " bell c u r v e ". If obser v ations are independent n is large Then ^ p follo w s a normal distrib u tion INFERENCE FOR CATEGORICAL DATA IN R

Standard de v iation √ ^ × (1 − ^ ) p p n INFERENCE FOR CATEGORICAL DATA IN R

Assessing model ass u mptions Ho w do I check " obser v ations are independent "? This depends u pon the data collection method . What does " n is large " mean ? n × ^ > 10 p n × (1 − ^ ) > 10 p INFERENCE FOR CATEGORICAL DATA IN R

Calc u lating standard error : appro x imation p_hat <- gss2016 %>% summarize(mean(happy == "HAPPY")) %>% pull() n <- nrow(gss2016) c(n * p_hat, n * (1 - p_hat)) 116 35 SE_approx <- sqrt(p_hat * (1 - p_hat) / n) SE_approx 0.03418468 INFERENCE FOR CATEGORICAL DATA IN R

The General Social S u r v e y IN FE R E N C E FOR C ATE G OR IC - PowerPoint PPT Presentation

The General Social S u r v e y IN FE R E N C E FOR C ATE G OR IC AL DATA IN R Andre w Bra y Assistant Professor of Statistics at Reed College INFERENCE FOR CATEGORICAL DATA IN R INFERENCE FOR CATEGORICAL DATA IN R INFERENCE FOR CATEGORICAL

Wednesday, November 30, 2016 3:41 PM General Page 1 General Page 2 General Page 3 General Page

European Social Network Social services in Europe Christian Fillet Chair, European Social

social-emotional functioning Dr Dawn Watling Department of Psychology Social Withdrawal

Outline Social Contagion Social Contagion Social Contagion Social Contagion Models Models

SOCI 210: Sociological Perspectives Nov. 3 1. Social Change 2. Collective behavior 3. Social

SOCIAL PROGRESS INDEX SOCIAL SOCIAL PROGRESS PROGRESS IMPERATIVE IMPERATIVE Social Progress

Social Entrepreneurship Caravan Social entrepreneurship aims at solving social problems by

Presentation 1 What is social media? Get Media Smart social media 2 What is social media?

Getting Social What is social media? Why does social media matter? What social media

Social Media for Business July 28, 2009 What is it? Social media marketing also known as social

Social Media donts What is social media Social media is nothing new Just an extension

Social Structure & Society Chapter 5 Section 1 SOCIAL STRUCTURE & STATUS Social

Social Marketing Social Marketing PPA March 2 0 0 7 PPA March 2 0 0 7 Objectives

Studying Social Policy at Salford What is Social Policy? What is social policy? Society

Social Security: An Overview For the National Academy of Social Insurance Demystifying Social

Social Media Legal Issues Brian C. England Deputy City Attorney Garland, Texas March 7, 2018

Limits of simple regression E XP L OR ATOR Y DATA AN ALYSIS IN P YTH ON Allen Do w ne y

GSS-Proxy: Better privilege separation Simo Sorce Principal Software Engineer, Red Hat February

REST-GSS Authentication for HTTP Apps, Browser and Otherwise Nicolas Williams

Create Distributions Empirically using Excel V0E 10/11/2014 0E 2014 Schield Creating

Erik Shifflett Granite State Solar, LLC 57 Ryan Rd. Bow, NH 03304 603.369.4318

Population Estimates Program: Census 2020 Preparations Working to ensure accurate and

Green Software Services From requirements to Business Models Schahram Dustdar TU Wien Austria

Gas in the Ring Geometry Eugene Zaremba Queens University, Kingston, Ontario, Canada

The General Social S u r v e y IN FE R E N C E FOR C ATE G OR IC - PowerPoint PPT Presentation

The General Social S u r v e y IN FE R E N C E FOR C ATE G OR IC AL DATA IN R Andre w Bra y Assistant Professor of Statistics at Reed College INFERENCE FOR CATEGORICAL DATA IN R INFERENCE FOR CATEGORICAL DATA IN R INFERENCE FOR CATEGORICAL

Wednesday, November 30, 2016 3:41 PM General Page 1 General Page 2 General Page 3 General Page

European Social Network Social services in Europe Christian Fillet Chair, European Social

social-emotional functioning Dr Dawn Watling Department of Psychology Social Withdrawal

Outline Social Contagion Social Contagion Social Contagion Social Contagion Models Models

SOCI 210: Sociological Perspectives Nov. 3 1. Social Change 2. Collective behavior 3. Social

SOCIAL PROGRESS INDEX SOCIAL SOCIAL PROGRESS PROGRESS IMPERATIVE IMPERATIVE Social Progress

Social Entrepreneurship Caravan Social entrepreneurship aims at solving social problems by

Presentation 1 What is social media? Get Media Smart social media 2 What is social media?

Getting Social What is social media? Why does social media matter? What social media

Social Media for Business July 28, 2009 What is it? Social media marketing also known as social

Social Media donts What is social media Social media is nothing new Just an extension

Social Structure &amp; Society Chapter 5 Section 1 SOCIAL STRUCTURE &amp; STATUS Social

Social Marketing Social Marketing PPA March 2 0 0 7 PPA March 2 0 0 7 Objectives

Studying Social Policy at Salford What is Social Policy? What is social policy? Society

Social Security: An Overview For the National Academy of Social Insurance Demystifying Social

Social Media Legal Issues Brian C. England Deputy City Attorney Garland, Texas March 7, 2018

Limits of simple regression E XP L OR ATOR Y DATA AN ALYSIS IN P YTH ON Allen Do w ne y

GSS-Proxy: Better privilege separation Simo Sorce Principal Software Engineer, Red Hat February

REST-GSS Authentication for HTTP Apps, Browser and Otherwise Nicolas Williams

Create Distributions Empirically using Excel V0E 10/11/2014 0E 2014 Schield Creating

Erik Shifflett Granite State Solar, LLC 57 Ryan Rd. Bow, NH 03304 603.369.4318

Population Estimates Program: Census 2020 Preparations Working to ensure accurate and

Green Software Services From requirements to Business Models Schahram Dustdar TU Wien Austria

Gas in the Ring Geometry Eugene Zaremba Queens University, Kingston, Ontario, Canada

Social Structure & Society Chapter 5 Section 1 SOCIAL STRUCTURE & STATUS Social