TEXT MPA 635: Data Visualization November 13, 2018
P L A N F O R T O D A Y Surveys and qualitative data Digital humanities Visualizing text with R
S U R V E Y S A N D Q U A L I TAT I V E DATA
S I N G L E R E S P O N S E Q U E S T I O N S
M U L T I P L E R E S P O N S E Q U E S T I O N S
C O O C C U R R E N C E A N A LY S I S
F R E E R E S P O N S E S
I S T H I S O K A Y ?
W O R D C L O U D S F O R G R O W N U P S Counting words, but in fancier ways
D I G I TA L H U M A N I T I E S
C R A S H C O U R S E I N C O M P U T A T I O N A L L I N G U I S T I C S Tokens, lemmas, and parts of speech Sentiment analysis tf-idf Topics and LDA Fingerprinting
T I D Y T E X T
T O K E N S Element of the text Word n-gram Sentence Verse Line Paragraph
T O K E N F R E Q U E N C Y
N - G R A M F R E Q U E N C Y
P A R T S O F S P E E C H
P A R T O F S P E E C H F R E Q U E N C Y
A R T S Y S T U F F
S E N T I M E N T A N A LY S I S How positive or negative a text is
S E N T I M E N T A N A LY S I S
T F - I D F Term frequency-inverse document frequency How important a term is compared to the rest of the documents
T O P I C M O D E L I N G
L A T E N T D I R I C H L E T A L L O C A T I O N
C L U S T E R S O F R E L A T E D W O R D S
T R A C K T O P I C S O V E R T I M E
F I N G E R P R I N T I N G Analyze richness or uniqueness of document Punctuation patterns, vocabulary choices, sentence length Hapax legomenon
Sentence length
Hapax legomena
Verse length
V I S U A L I Z I N G T E X T W I T H R
Recommend
More recommend