A fun little exercise I’ve been working on is a statistical and language analysis tool for Urban Dictionary. The idea for the project came about when it was pointed out that my own name was on UD, and I realised that many of the definitions were either of a sexual nature or offered praise to the holder of the name. I suspect that people are adding definitions of their own names, or those of their partners or relatives. I thought it would be fun to programmatically analyse the various definitions and group them by content, maybe also ranking the most popular keywords or producing other interesting statistics.
The finished product (though I’ll add to it over time) is available here: https://www.acarrick.com/urban_stats
Continue reading for some technical details…