@dzhulk
Created May 3, 2014 18:47
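(require '[clojure.string :as str])

;; Set of common English stop words; the source file is one comma-separated line.
(def common-words
  (-> (slurp "http://www.textfixer.com/resources/common-english-words-with-contractions.txt")
      (str/split #",")
      set))

;; Full text to analyze.
(def text (slurp "http://www.clearwhitelight.org/hitch/hhgttg.txt"))

;; Word frequencies with stop words removed, most frequent first.
(->> text
     (re-seq #"[\w']+")      ; word characters plus apostrophes, so contractions stay whole
     (map str/lower-case)    ; normalize case before filtering and counting
     (remove common-words)   ; a set acts as a membership predicate
     frequencies
     (sort-by val)
     reverse)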