Posts with tag Bookworm


← Back to all posts
Apr 19 2022

Its not very hard to get individual texts in digital form. But working with grad students in the humanities looking for large sets of texts to do analysis across, I find that larger corpora are so hodgepodge as to be almost completely unusable. For humanists and ordinary people to work with large textual collections, they need to be distributed in ways that are actually accessible, not just open access.

Apr 28 2021

I mentioned earlier that Ive been doing some work on the old Bookworm project as I see that theres nothing else that occupies quite the same spot in the world of public- facing, nonconsumptive text tools.

Mar 07 2021

I used to blog everything that I did about a project like Bookworm, but have got out of the habit. There are some useful changes coming through through the pipeline, so I thought Id try to keep track of them, partly to update on some of the more widely used installations and partly

Feb 26 2020

As I often do, Im going to pull away from various forms of Internet reading/engagement through Lent. This year, this brings to mind one of my favorite stray observations about digital libraries that Ive never posted anywhere.

Feb 06 2015

Just some quick FAQs on my professor evaluations visualization: adding new ones to the front, so start with 1 if you want the important ones.

Dec 12 2014

I promised Matt Jockers Id put together a slightly longer explanation of the weird constraints Ive imposed on myself for topic models in the Bookworm system, like those I used to look at the breakdown of typical TV show episode structures. So here they are.

Sep 23 2014

Ive been seeing how deeply we could integrate topic models into the underlying Bookworm architecture a bit lately.

Aug 29 2014

I thought it would be worth documenting the difficulty (or lack of) in building a Bookworm on a small corpus: Ive been reading too much lately about the Simpsons thanks to the FX marathon, so figured Id spend a couple hours making it possible to check for changing language in the longest running TV show of all time.