You are looking at content from Sapping Attention, which was my primary blog from 2010 to 2015; I am republishing all items from there on this page, but for the foreseeable future you should be able to read them in their original form at For current posts, see here.

Isms and ists

Nov 15 2010

Hank asked for a couple of charts in the comments, so I thought I’d oblige. Since I’m starting to feel they’re better at tracking the permeation of concepts, we’ll use appearances per 1000 books as the y axis:

And Darwin after the break.

In both these cases, we have a lead word that captures most of the movement, and then one or two follower words below that check in decade later and then fairly well track the main word’s movement. I’d have to plot them normalized to be sure, but I think my similarity functions would notice the similar curves here. The only worry I have is that “Darwin” has a baseline of about 50, which might make the function think that it’s very different from the derived adjectives. (It is different, of course, but short of running a script to get the data for “Charles Darwin”, which I remain loathe to do, there’s no way to exclude the references in there to Erasmus, the Australian city, etc.).


Good stuff, Ben. I want “Darwinist,” for…

Hank - Nov 5, 2010

Good stuff, Ben. I want “Darwinist,” for the record, on the last one. Any chance? Also, any way to tell whether things like “Darwinism” are getting mention more in natural-sciences works or elsewhere? Maybe by finding linked words (before you get your “genre” thing up and running)?

Been meaning to comment–“darwinist” is …

Ben - Dec 1, 2010

Been meaning to comment–“darwinist” is very rare, and didn’t make my initial cut. There are 3922 occurrences of ‘darwinism’ in the data set, and just 84 of ‘darwinist’ and ‘darwinists’ combined. So whatever the numbers are, they’re going to be tiny compared to the others. (I’d put them up, but I have to fix a few things in the database before it works again.)