Quantity or quality?

This might seem like an odd question, especially given the vast (vast) quantity of effort that goes into digitisation, rights checking, caption authoring and so on. But I’m also a fan of taking a step back at least every so often and asking odd, obvious and possibly stupid questions. The question is in part prompted by …

hoard.it : bootstrapping the NAW

What seems like a looong time ago I came up with an idea for “bootstrapping” the Non API Web (NAW), particularly around extracting unstructured content from (museum) collections pages. The idea of scraping pages when there’s no data access API isn’t new: Dapper launched a couple of years ago with a model for …