Descripti v e statistics P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON Conor De w e y Data Scientist , Sq u arespace
What are descripti v e statistics ? 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Meas u res of centralit y Mean Median Mode PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Meas u res of centralit y 1.6 mode 1.4 median 1.2 mean 1.0 0.8 σ = 0.25 0.6 0.4 σ = 1 0.2 0.0 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 2.2 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Meas u res of v ariabilit y Variance Standard de v iation Range PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Meas u res of v ariabilit y PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Modalit y 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Ske w ness 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
S u mmar y De � ning descripti v e statistics Mean , median , and mode Standard de v iation and v ariance Modalit y and ske w ness PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Let ' s prepare for the inter v ie w! P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Categorical data P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON Conor De w e y Data Scientist , Sq u arespace
T y pes of v ariables PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Encoding categorical data 1 What is One Hot Encoding and Ho w to Do It PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
E x ample : laptop models PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
E x ample : laptop models company_count = df['Company'].value_counts() sns.barplot(company_count.index, company_count.values) PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Bo x plots lower upper upper lower median whisker whisker quartile quartile outliers 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
E x ample : laptop models df.boxplot('Price', 'Company', rot = 30, figsize=(12,8), vert=False) PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
S u mmar y T y pes of v ariables Encoding techniq u es Sample e x plorator y data anal y sis PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Let ' s prepare for the inter v ie w! P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
T w o or more v ariables P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON Conor De w e y Data Scientist , Sq u arespace
T y pes of relationships 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
What is correlation ? Statistical relationship bet w een v ariables Stronger correlation = more information 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Co v ariance PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Pearson ' s correlation PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Pearson ' s correlation 1 0.8 0.4 0 -0.4 -0.8 -1 1 1 1 -1 -1 -1 0 0 0 0 0 0 0 1 Wikimedia PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Correlation v s . ca u sation 1 x kcd PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Correlation v s . ca u sation 1 Correlation does not mean Ca u sation PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
S u mmar y T y pes of relationships Re v ie w of correlation Co v ariance Pearson ' s correlation Correlation v s . ca u sation PRACTICING STATISTICS INTERVIEW QUESTIONS IN PYTHON
Let ' s prepare for the inter v ie w! P R AC TIC IN G STATISTIC S IN TE R VIE W QU E STION S IN P YTH ON
Recommend
More recommend