You are looking at content from Sapping Attention, which was my primary blog from 2010 to 2015; I am republishing all items from there on this page, but for the foreseeable future you should be able to read them in their original form at sappingattention.blogspot.com. For current posts, see here.

Top ten authors

Nov 28 2010

Most intensive text analysis is done on heavily maintained sources. Im using a mess, by contrast, but a much larger one. Partly, Im doing this tendentiouslyI think its important to realize that we can accept all the errors due to poor optical character recognition, occasional duplicate copies of works, and so on, and still get workable materials.

Using worse sources is something of a necessity for digital history. The text recognition and the metadata for a lot of the sources we use oftengoogle books, jstor, proquestis full of errors under the surface, and its OK for us to work with such data in the open. The historical profession doesnt have any small-ish corpuses we would be interested in analyzing again and again. This isnt true of English departments, who seem to be well ahead of historians in computer-assisted text analysis, and have the luxury of emerging curated text sources like the one Martin Mueller describes here.

But the side effect of that is that we need to be careful about understanding what were working with. So Im running periodic checks on the data in my corpus of books by major American publishers (described more earlier) to see whats in there. I thought Id post the list of the top twenty authors, because I found it surprising, though not in a bad way. Well do from no. 20 ranking on up, because thats how they do it on sports blogs. (What I really should do is a slideshow to increase pageviews). Ill identify the less famous names.

William Dean Howells (54)

Albert Harkness (54) classicist, wrote textbooks

Walter Scott (55)

John Burroughs (56) naturalist, essayist

James Russell Lowell (57) poet, critic, Lowell.

Francis Parkman (57) historian, though I guess everyone knows that

Bayard Taylor (64) poet, critic

Mark Twain (65)

Bret Harte (66) Western novelist

Nathaniel Hawthorne (68)

Charles Dickens (69)

Oliver Wendell Holmes (73)

John Fiske (73) philosopher/polymath

James Fenimore Cooper (74)

Robert Louis Stevenson (74)

Thackeray, William Makepeace, 1811-1863 (86)

Henry Wadsworth Longfellow (91)

William Shakespeare (92)

Herbert Spencer (116) British philosopher/polymath

Washington Irving (170)

Spencer and Fiske in the top 10 is what I like. There are quite a few Brits being published in the US, but thats fine. And for the most part, the rest of these are old American studies chestnuts that no one reads much anymore. Instead of Melville, Thoreau, Emerson, and the James brothers, we have Fenimore Cooper, Burroughs, Fiske, and Spencer and Howells. Twain is really the only American on the list to have prospered in the last hundred years. That sounds right if we want to look at what people were reading, although its a little far from what weve decided was important.

Comments:

Muellers post was great. Do you have any thou

Jamie - Nov 2, 2010

Muellers post was great. Do you have any thoughts on digital-based assignments for history undergrads? He also ended with a comment similar to what youve said before, that there are parallels between the ninteenth-century editorial projects and todays digital work. It seems like hed agree that it opens up new ways to read with the grain, in addition to against it.