This page gives a list of the most frequently used words in a given text, and creates a chart of that distribution. The chart should hypothetically follow Zipf’s law:
Zipf's law states that given some corpus of natural language utterances, the frequency of any word is inversely proportional to its rank in the frequency table. Thus the most frequent word will occur approximately twice as often as the second most frequent word, three times as often as the third most frequent word, etc.
If you divide the chart into 4 sections, patterns begin to emerge:
Here's a link to the .txt of Moby Dick you could use, or perhaps Cicero's Orations or Treatment of the diseases of the eye, by means of prussic acid vapour, and other medical agents.
use text sample