Skip to content

Instantly share code, notes, and snippets.

@pabloav
Created March 13, 2016 01:56
Show Gist options
  • Save pabloav/1ae2053a9551b6ff9c5d to your computer and use it in GitHub Desktop.
Save pabloav/1ae2053a9551b6ff9c5d to your computer and use it in GitHub Desktop.
Output for various speech analyses
Using speeches/clinton01.txt
Unique words: 1365
Unique words that are simple: 410
Unique words that are simple (%): 30 %
---
Total words: 4710
Total words that are simple: 3159
Total words that are simple (%): 67 %
---
Top simple words by frequency:
- and (186)
- the (175)
- to (157)
- of (86)
- a (84)
---
Top non-simple words by frequency:
- americans (23)
- america (23)
- country (18)
- president (14)
- families (12)
Using speeches/sanders01.txt
Unique words: 720
Unique words that are simple: 284
Unique words that are simple (%): 39 %
---
Total words: 2202
Total words that are simple: 1534
Total words that are simple (%): 69 %
---
Top simple words by frequency:
- the (116)
- and (93)
- to (64)
- we (55)
- of (51)
---
Top non-simple words by frequency:
- country (15)
- going (13)
- campaign (13)
- political (11)
- hampshire (7)
Using speeches/trump01.txt
Unique words: 873
Unique words that are simple: 356
Unique words that are simple (%): 40 %
---
Total words: 3865
Total words that are simple: 2931
Total words that are simple (%): 75 %
---
Top simple words by frequency:
- and (156)
- the (137)
- i (95)
- to (93)
- have (83)
---
Top non-simple words by frequency:
- going (29)
- country (16)
- doing (11)
- coming (11)
- president (9)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment