Accessing Statistics Canada Data and Resources
Hugh McCague, Valerie Preston, Walter Giesbrecht, Sara Tumpane
Outline • Survey Terminology • Research Data Centre (RDC) • RDC versus Public Use Microdata Files (PUMF) • Accessing the RDC • Statistics Canada Surveys and Data • Statistical Software • Statistical Consulting Service • Resources
Some Survey Terminology • Population • Elements • Sample: Simple Random Sample, Probability Sample • Response Rate • Weights: Simple Weights
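To make the terms response rate and simple weights concrete, here is a minimal Python sketch. The function names and the toy numbers are illustrative assumptions, not part of any Statistics Canada tool, and the response rate shown is the simplest possible definition (completed interviews divided by eligible sampled elements).

```python
def response_rate(completed, eligible):
    """Simplest response rate: completed interviews / eligible sampled elements."""
    if eligible <= 0:
        raise ValueError("eligible must be positive")
    return completed / eligible


def weighted_mean(values, weights):
    """Weighted mean: each respondent counts for `weight` population elements."""
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(v * w for v, w in zip(values, weights)) / total_weight


# Hypothetical example: 750 completed interviews out of 1000 eligible elements.
rate = response_rate(750, 1000)

# Two hypothetical respondents; the second represents three times as many people.
mean = weighted_mean([10.0, 20.0], [1.0, 3.0])
```

In this toy case the unweighted mean would be 15, while the weighted mean is pulled toward the respondent who represents more of the population.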
Some Survey Terminology • Demographics • Strata • Clusters (primary sampling units, PSUs) • Complex Sample • Complex Weights, Bootstrap and Jackknife Replicate Weights
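The idea behind bootstrap replicate weights can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the official procedure for any particular survey: the estimate is recomputed once per set of replicate weights, and the variance is taken as the mean squared deviation of the replicate estimates around the full-sample estimate (one common convention; each survey's user guide states the exact formula to use). All names and numbers below are hypothetical.

```python
import math


def weighted_mean(values, weights):
    """Weighted mean of `values` using survey weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def bootstrap_variance(values, full_weights, replicate_weights):
    """Return (estimate, variance) using replicate weights.

    Assumed convention: variance = mean squared deviation of the replicate
    estimates around the full-sample estimate.
    """
    theta = weighted_mean(values, full_weights)
    replicates = [weighted_mean(values, w) for w in replicate_weights]
    variance = sum((r - theta) ** 2 for r in replicates) / len(replicates)
    return theta, variance


# Toy data: two respondents and two sets of replicate weights.
theta, variance = bootstrap_variance(
    [1.0, 3.0],
    [1.0, 1.0],
    [[2.0, 0.0], [0.0, 2.0]],
)
standard_error = math.sqrt(variance)
```

Real master files typically ship hundreds of replicate weight columns, and SAS, Stata, and R have survey procedures that apply them directly; the sketch only shows what those procedures compute.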
Some Survey Terminology • Cross-sectional data • Longitudinal data: periods, waves, cycles, trajectory, life course • Attrition: attrition rate • Helpful reference: Ornstein, Michael. A Companion to Survey Research. London; Thousand Oaks, CA: SAGE, 2013.
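As a small worked example of attrition in longitudinal data, the sketch below computes the cumulative attrition rate at each wave relative to the first wave. The function name and the respondent counts are hypothetical, and other definitions (for example, wave-to-wave attrition) are also in use.

```python
def cumulative_attrition(wave_counts):
    """Share of wave-1 respondents lost by each wave (wave 1 itself is 0.0)."""
    baseline = wave_counts[0]
    if baseline <= 0:
        raise ValueError("first wave must have at least one respondent")
    return [(baseline - n) / baseline for n in wave_counts]


# Hypothetical panel: 1000 respondents in wave 1, 900 in wave 2, 750 in wave 3.
rates = cumulative_attrition([1000, 900, 750])
```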
Research Data Centre (RDC) • Access to Statistics Canada data and statistical software • Microdata & administrative data • For York students and faculty, access is free • A “secure” environment • Researchers are “deemed employees” of Statistics Canada • Must work in RDC • CRDCN Network
The CRDCN Network
York RDC • 282 York Lanes • Staffed by: Analyst Sara Tumpane (yorkrdc2@yorku.ca) • Assistant Theresa Kim (yorkrdc3@yorku.ca) • 8 workstations • Open 3 days/week • http://www.isr.yorku.ca/rdc/
Before you apply to the RDC… • Consider your options • Is what you need available in a more readily accessible source (either a PUMF or an aggregate file)?
RDC or PUMF?
• Confidential Microdata in Research Data Centres
  Characteristics:
  o Contains most of the original information collected during the survey
  o Continuous variables are accessible
  o Longitudinal identifiers provided
  o Contains bootstrap weights used for calculating exact variance
  Access is appropriate when:
  o Sensitive variables not provided in PUMF
  o A PUMF does not exist
  o Longitudinal data is necessary
  o Analytical work is complex in nature
• Public Use Microdata Files accessed online
  Characteristics:
  o Manipulated by aggregating, capping, or deleting variables that could be “identifiers”; survey respondents cannot be identified
  o Many continuous variables transformed into categorical variables
  o Longitudinal identifiers stripped
  Access is appropriate when:
  o Immediate data access is required
  o Analysis is for a course paper or equivalent
  o Data exploration
Labour Force Survey: Demographic variables
• PUMF
  o Geography
  o Age
  o Sex
  o Marital status
• Master file
  o Geography
  o Age
  o Sex
  o Marital status
  o Country of birth
  o Country completed highest post-secondary degree/certificate/diploma
  o Landed immigrant status
  o Detailed Aboriginal status
CCHS 2012 Example 1: Sources of personal income
• PUMF (1381 variables)
  o Employment inc.
  o EI/Worker's comp
  o Senior benefits
  o Other
• Master File (1815 variables)
  o wages and salaries
  o income from self-employment
  o dividends and interest
  o employment insurance
  o worker's compensation
  o CPP or QPP
  o job related retirement pensions
  o RRSP/RRIF
  o OAS and GIS
  o social assistance/welfare
  o child tax benefits
  o child support
  o alimony
  o other
  o none
CCHS 2012 Example 2 PUMF Master File • Geography • Geography o Province of residence of respondent o Province of residence of respondent-(G) o Postal code - (D) o Health Region - (G) o Health region of residence of respondent - (D) o B.C. Health Authority (BCHA) - (D) o Sub-health region (Québec only) - (D) o Nova Scotia district health authority o British Columbia local health authority - (D) o Regional health authority (RHA) - Alberta - (D) o British Columbia health authority - (D) o Local health integrated networks - Ontario - (D) o 2006 census dissemination area o Federal electoral district - (D) o Census subdivision - (D) o Census division - (D) o Statistical area classification type - (D) o 2006 Census metropolitan area (CMA) o Health region peer group o Urban and rural areas o Urban and rural areas - 2 levels - (D) o Subzones for Alberta o Manitoba health authority - (D)
Accessing PUMFs & master file metadata • Statistics Canada Nesstar data portal o metadata only, for PUMFs and master files o http://www62.statcan.ca/webview/ • YUL: Data & Statistics library guide o http://researchguides.library.yorku.ca/data • <odesi> (OCUL) o http://www.library.yorku.ca/e/resolver/id/1165738
http://www.andertoons.com/data/cartoon/6543/things-good-stuff-ok-i-reiterate-request-for-specific-data
How to apply to an RDC and available datasets • RDC Application Pages • Data available in the RDCs • SSHRC Website
Accessing the RDC (Action, Timeline, Notes)
• Apply through the SSHRC website
  o Timeline: 1-2 hours
  o Notes: Provide list of academic contributions, project proposal
• Evaluation of the proposal
  o Timeline: 2-4 weeks
  o Notes: Approval based on relevance of methods and data, and demonstrated need for microdata
• Security screening process
  o Timeline: 1-3 weeks for approval
• Sign Microdata Research Contract
  o Timeline: 1-3 weeks for approval
Project Proposal • The project proposal includes the following elements: o Title of the Project o Rationale and objectives of the study o Proposed data analysis and software requirements o Data requirements o Expected project start and end dates o Expected products o References
Data at the RDC • Labour Force Survey (LFS): 1976 - 2014 o Monthly estimates of employment & unemployment o Rotating 6 month panel, N= ~ 16,500 • Paper: Seasonal Adjustment, Demography, and GDP Growth , Dunbar, G.R. (2013), Canadian Journal of Economics • Survey of Labour and Income Dynamics (SLID): 1993 – 2011 o Changes in well-being over time o Overlapping 6 year panels, N= ~ 17,000 • Paper: An Empirical Model of Tax Convexity and Self-Employment, Wen, J-F. & Gordon, D. (2014), The Review of Economics and Statistics • Workplace and Employee Survey (WES): 1999 – 2006 o Employer: competitiveness, innovation, technology use: N= ~ 6,300 o Employee: training, job stability, earnings: N= ~24,000 • Paper: Organizational Redesign, Information Technologies and Workplace Productivity , Dostie, B. & Jayaraman, R. (2012), The B.E. Journal of Economic Analysis and Policy
Data (continued) • Survey of Household Spending (SHS): 1986 - 2012 o Spending, investments, and savings: household and person o Cross-sectional: N= ~17,000 (households) • Paper: Does One Size Fit All? The CPI and Canadian Seniors , Brzozowski, M. (2006), Canadian Public Policy • Survey of Financial Security (SFS): 1999 – 2012 o Net worth (wealth) of Canadian families: assets, debt, employment, income, education o Cross-sectional: N= ~20,000 (households) • Paper: New Evidence on Taxes and Portfolio Choice , Atalay, K. et al. (2009), Social and Economic Dimensions of an Aging Population (SEDAP) Research Papers • Census & National Household Survey (NHS): 1911 – 2011 o Demographic, social, and economic characteristics o Cross-sectional (mandatory): 20% sample, N= ~6,000,000 • Paper: Quality of Life, Firm Productivity, and the Value of Amenities Across Canadian Cities , Albouy, D. Leibovici, F. & Warman, C. (2013), Canadian Journal of Economics
Data by Themes • Health and Health Care • National Population Health Survey (NPHS) • Participation and Activity Limitation Survey (PALS) • Canadian Tobacco, Alcohol and Drugs Survey (CTADS) • Occupations and Organizations • Workplace and Employee Survey (WES) • Survey of Labour and Income Dynamics (SLID) • Census • Education • Youth in Transition Survey (YITS) • National Graduates Survey (NGS) • Race and Ethnicity • Aboriginal Peoples Survey (APS) • Longitudinal Survey of Immigrants to Canada (LSIC) • Ethnic Diversity Survey (EDS)
Pilot Data • Canadian Cancer Registry (CCR) • Vital Statistics • Uniform Crime Reporting • Homicide Survey • Hate Crime Data • Ministry of Community and Social Services (MCSS) • Citizenship and Immigration Canada (CIC)
Which Statistical Software to Use at the York RDC? Features to Consider • SPSS 23 • SAS 9.4 • Stata 13 • R 3.0.3 Statistical Software Resources: Institute for Digital Research and Education (IDRE), UCLA http://www.ats.ucla.edu/stat/
Statistical Consulting Service (SCS) • Statistical consulting provided by a group of York faculty and graduate students with staff at the Institute for Social Research (ISR) • Usually, no fee for York faculty and student researchers • Online appointment scheduler
http://truthfacts.com/truthfacts/2014/04/09
Statistical Consulting Service (SCS) • ISR/SCS Short Courses and Spring Seminar Series on data analysis, qualitative research methods, survey methods, and related software • More details: http://www.isr.yorku.ca/centres/scs/
Contact Information and Resources • http://www.isr.yorku.ca/econ