digital medicine i introduction to programming
play

Digital Medicine I: Introduction to Programming Pandas Autumn 2019 - PowerPoint PPT Presentation

Hans-Joachim Bckenhauer and Dennis Komm Digital Medicine I: Introduction to Programming Pandas Autumn 2019 December 19, 2019 So far. . . numpy and matplotlib The Modules numpy and matplotlib numpy Calculations with vectors and matrices


  1. Hans-Joachim Böckenhauer and Dennis Komm Digital Medicine I: Introduction to Programming Pandas Autumn 2019 – December 19, 2019

  2. So far. . . numpy and matplotlib

  3. The Modules numpy and matplotlib numpy Calculations with vectors and matrices Numerical methods Documentation: https://numpy.org/doc/ Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 1 / 12

  4. The Modules numpy and matplotlib numpy Calculations with vectors and matrices Numerical methods Documentation: https://numpy.org/doc/ matplotlib Data visualization (Plots) Documentation: https://matplotlib.org/contents.html Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 1 / 12

  5. Now. . . pandas

  6. The Module pandas pandas Processing of large sets of data Allows a functionality similar to Excel Documentation: https://pandas.pydata.org/pandas-docs/stable/ Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 2 / 12

  7. The Module pandas pandas Processing of large sets of data Allows a functionality similar to Excel Documentation: https://pandas.pydata.org/pandas-docs/stable/ Project 3: Reading in and processing CSV file “manually” pandas contains data structures and functions for this Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 2 / 12

  8. The Module pandas Import pandas analogously to numpy and matplotlib import pandas as pd Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 3 / 12

  9. The Module pandas Import pandas analogously to numpy and matplotlib import pandas as pd Read in CSV file and store it in a special data type pandas dataframe (instead of Python list or numpy array) data = pd.read_csv("daten.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 3 / 12

  10. The Module pandas Import pandas analogously to numpy and matplotlib import pandas as pd Read in CSV file and store it in a special data type pandas dataframe (instead of Python list or numpy array) data = pd.read_csv("daten.csv") Files in Excel format can be read in analogously data = pd.read_excel("daten.xlsx") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 3 / 12

  11. Air Measurements using pandas

  12. Exercise – Air Measurements Air measurements Copy the data file from project 3 ugz_luftqualitaetsmessung_seit-2012.csv Read in the CSV file and output its content To this end, use read_csv() and print() Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 4 / 12

  13. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  14. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  15. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  16. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) Output line 5 print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  17. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) Output lines 0 to 9 print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  18. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) print(data.iloc[0:10]) print(data.head(3)) Output lines 0 to 2 print(data.iloc[8, 0]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  19. Air Measurements import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") print(data) Accessing individually cells using data.iloc Same functionality as lists print(data.iloc[5]) print(data.iloc[0:10]) print(data.head(3)) print(data.iloc[8, 0]) Output line 8, column 0 Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 5 / 12

  20. Reading in and Processing CSV Files Extract data Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  21. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 We are only interested in the first 3 columns We want to change column names Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  22. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 We are only interested in the first 3 columns We want to change column names import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") newdata = data.iloc[5:, 0:3] newdata = newdata.rename(columns={"Zürich Stampfenbachstrasse": "SO2", \ "Zürich Stampfenbachstrasse.1": "CO"}) newdata.to_csv("messungen.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  23. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 Selection from line 5 We are only interested in the first 3 columns and columns 0 to 2 We want to change column names import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") newdata = data.iloc[5:, 0:3] newdata = newdata.rename(columns={"Zürich Stampfenbachstrasse": "SO2", \ "Zürich Stampfenbachstrasse.1": "CO"}) newdata.to_csv("messungen.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  24. Reading in and Processing CSV Files Extract data Numerical data starts from line 5 We are only interested in the first 3 columns Rename columns We want to change column names import pandas as pd data = pd.read_csv("ugz_luftqualitaetsmessung_seit-2012.csv") newdata = data.iloc[5:, 0:3] newdata = newdata.rename(columns={"Zürich Stampfenbachstrasse": "SO2", \ "Zürich Stampfenbachstrasse.1": "CO"}) newdata.to_csv("messungen.csv") Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 6 / 12

  25. Reading in and Processing CSV Files Accessing data using the column names Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  26. Reading in and Processing CSV Files Accessing data using the column names Output all column names as list print(data.columns) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  27. Reading in and Processing CSV Files Accessing data using the column names Output all column names as list print(data.columns) Output column “Datum” print(data["Datum"]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  28. Reading in and Processing CSV Files Accessing data using the column names Output all column names as list print(data.columns) Output column “Datum” print(data["Datum"]) Output column “Zürich Stampfenbachstrasse – Kohlenmonoxid” print(data["Zürich Stampfenbachstrasse.1"]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 7 / 12

  29. Reading in and Processing CSV Files Filtering data Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 8 / 12

  30. Reading in and Processing CSV Files Filtering data Use loc instead of iloc in order to specify conditions print(data.loc[data["Datum"] == "2014-12-19"]) Digital Medicine I: Introduction to Programming – Pandas Autumn 2019 Böckenhauer, Komm 8 / 12

Recommend


More recommend