Skip to content

Instantly share code, notes, and snippets.

@trickyBytes
Last active October 22, 2020 09:53
Show Gist options
  • Save trickyBytes/d9fc2d8ba319e032294b5610fbf3f1d8 to your computer and use it in GitHub Desktop.
Save trickyBytes/d9fc2d8ba319e032294b5610fbf3f1d8 to your computer and use it in GitHub Desktop.
Facts and definitions
Here are a few facts and definition about the text above:
- Everything is lowercase.
- There are only letters, full stops (`.`), and single whitespace characters.
- A _word_ is defined as a sequence of letters delimited by either a whitespace or a full stop `.` character.
- A full stop character is not considered a word. A full stop is never preceded or followed by whitespace.
- Any two words are separated either by a single whitespace character (`dolor sit`), or by a full stop with no spaces (`elit.sed`)
- A _sentence_ is defined as a sequence of words delimited by a full stop `.` character.
### Code answers to the following questions about the text above
- How many words are there in the text?
- How many sentences are there in the text?
- What is the length of the longest word?
- Which six words occur the most in the text?
- What percentage of the words are unique?
- What is the average number of words per sentence?
- Which three two-word phrases occur the most in the text?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment