Posts Tagged ‘visualization’

mining for big history: uncouth things i want to do with archives

Wednesday, June 24th, 2009

woohoo THATcampers!  i’m so psyched to hang out with you.  actually, i need to learn from your enormous brains…

a major theme of my graduate course in digital history at the u of c was the opportunities lying around unprecedented scale of access and manipulability.

historians, for instance, typically train to write 20- to 40-year studies, at most 100-year histories; they frequently teach by the century, at most the five-hundred-year time period.  proposal: digital archives, as a revolution in access, radically open the horizons for legitimate big history of long-term trends.

ideas for sessions:

* how would you text mine a 500-yr history?  how bout a 5000-yr history?  many of the tools for text-mining (cf philologic) look narrower and narrower within a peculiar text; how could these tools be used to crunch many texts across large time periods (off the top of my head: graph for me, computer, the top verbs used around the word “eye” in medical texts since Greece …  )?  how can timelines more usefully render the results visual (and interactive!)?

* how bout images.  here we’re talking about 200 years   what can you do with 1 billion photographs?  what happens when you automagically photosynth (livelabs.com/photosynth/) the entire nineteenth- to twentieth-century city of London?  what about “averaging” photos: www.faceresearch.org/demos/average ?  what does the average house look like, decade by decade?  what does an average coal miner look like?

* how bout maps.  doug knox (hi doug!) and i have been talking with the newberry map librarians about how you’d collate atlases of place names, travelers’ diaries, and maps to annotate an interactive atlas of chicago where any given block could be peeled back, year by year.  how would you make a 300-year thick map of the american west?

Visualizing time

Wednesday, June 24th, 2009

For the last two years, I have been very interested in visualizing data that emerges within my particular field: literature. This interest emerged as I read Moretti’s Graphs, Maps, Trees at the same time that I was experimenting with using GIS tools like Google Earth as a portion of the analysis in the last chapter of my dissertation. In my last year as a graduate student, a fellowship in the Emory Center for Interactive Teaching gave me additional time to begin experimenting with timelines. Timelines in literary studies were nothing new, but I wondered if it would be possible to have a class collaboratively build one in a manner similar to writing a wiki. The result was–in turn–a collaboration with Jason Jones (@jbj) where I coded a timeline, he designed an assignment, and his students created the data for a timeline of the Victorian Age. I’ve since had the chance to play with the tool in my own classes.

Jason and I both thought that timelines would be a fruitful subject for conversation THATCampers. And as many others have done, I thought I would share my original THATCamp proposal and then propose some ideas about where a discussion might go:

I would like discuss the different web-based tools and software that can be used to produce interactive and collaborative timelines. The presentation would involve demonstrating the different tools, showing the strengths and the weaknesses of each one, and producing a finished product. The tools would range from CHNM’s Timeline Builder to xtimeline and from Bee Docs Timeline 3D to the Timeline and Exhibit widgets that were developed in MIT’s Simile project. Having already spent some time with these tools, I think that the tools from Simile might be the most interesting to THATCamp participants due to their flexibility in representing data in multiple ways, including color coding events, sorting events, and with GIS data, as well as the ability to grab data from sources as diverse as a Google Docs spreadsheet or Twitter. Perhaps the best demonstration of the usefulness of a timeline would be to create–during the session/event–a timeline of THATCamp.

My current thinking:

As I’ve been preparing for THATCamp, I have gone ahead and evaluated as many of the timeline tools as I’ve had time for. I’ll be looking at another one or two tomorrow. I’ve gone ahead and created a spreadsheet listing the abilities of these different tools, along with some evaluation. Admittedly, some of the categories that I was using to evaluate the timelines stem from my deep involvement with the Simile widgets, and so the cases might not stack up as being completely fair to the competition.

Also, wanting to blend together both streams of data visualization that seemed valuable to me, I’ve also expanded on the original timelines that I designed for my courses by adding a Google Maps view this week. You can choose to either look at one view at a time or a dual view.

While a conversation could certainly be held about the different strengths and weaknesses of these different tools, most of the timeline tools that are available are going to be fairly easy for THATCampers to pick up and run with. The most complicated among them is the Simile tool, but I’ve heard there’s a fairly straightforward tutorial on building your own. Instead (or in addition to), I wonder if it could be possible to have a conversation about other possible research and pedagogical uses for timelines than those to which Jason and I have put them to use thus far. One obvious apporach would be to timeline a particular text (say, Slaughterhouse-Five) rather than a contextual time period. But what else could we do with timelines to make them valuable?

Moreover, I wonder if a discussion about visualizing time could be a part of a larger discussion about visualization that seems to be on the minds of other THATCampers (at least per their blog posts) such as Tonya Howe and Amanda Watson. How best can we use such visualizations in our research and/or teaching? At what point are there diminishing returns on such projects? Since these tools are relatively easy to learn (as opposed to programming languages), are they a good gateway tool for “traditional faculty” to begin comfortably integrating new technologies into their research/teaching? And, perhaps most broadly, what is the relationship between digital humanities and visualization

(I should meniton that while Jason and I proposed related ideas to THATCamp, this post is my own. So don’t hold him responsible for my shortcomings in expression.)

Visualization, Literary Study, and the Survey Class

Thursday, June 18th, 2009

I hope I’ve not missed the boat on the pre-un-conference-idea-generating-posts! In brief, I’d like to meet up people interested in a web project visually weighting by color simple semantic relations in literary texts and/or putting together an NEH grant for said project. Caveat: I’m not an expert on this. Here’s my initial proposal, though in retrospect it looks rather stilted and sad:

For the past year or so, I’ve been interested in putting together a small team of like-minded folks to help bring to fruition a data visualization project that could benefit less-prepared college students, teachers in the humanities, and researchers alike. Often, underprepared or at-risk educational populations struggle to connect literary study with the so-called “real world,” leading to a saddening lack of interest in the possibilities of the English language, much less literary study. I am currently working with Doug Eyman, a colleague at GMU, to develop a web application drawing on WordNet—and particularly the range of semantic similarity extensions built around WordNet—to visually mark up and weight by color the semantic patterns emerging from small uploaded portions of text. This kind of application can not only help students attend more fully to the structures of representation in literature and the larger world around them—through the means of a tool emphatically of the “real world”—but also enable scholars to unearth unexpected connections in larger bodies of text. Like literary texts to many students, the existing semantic similarity tools available through the open source community can seem inaccessible, even foreign, to a lay audience; this project seeks to lay open the language that so many fear, while enabling the critical thinking involved in literary analysis. Ultimately, we hope to extend this application with a collaborative and growing database of user-generated annotations, and perhaps with time, to fold in a historically-conscious dictionary as well. We are seeking an NEH Digital Humanities startup grant to pursue this project fully, and I’d like the opportunity to throw our idea into the ring at THATcamp to explore its problems as well as possibilities, even gathering more collaborators along the way.

Here’s a hand-colored version of something like what I’m thinking; I used WordNet::Similarity to generate the numbers indicating degree of relatedness, and then broke those numbers into a visual weighting system. Implementation hurdles do come out pretty clearly when you see how the numbers are generated, so I’m hoping someone out there will have better insights into the how of it all.

To a related, larger point: I always have the sneaking suspicion that this has been done before–Jodi Schneider mentioned LiveInk, a program that reformats text according to its semantic units, so that readers can more effectively grasp and retain content. This strikes me as simlar, as well, to the kinds of issues raised by Douglas Knox–using scale and format to retrieve “structured information.” Do the much-better-informed Campers out there know of an already-existing project like this? I wish the checklist of visual thinking tools that George Brett proposes were already here!

To a related, larger point: I always have the sneaking suspicion that this has been done before–Jodi Schneider mentioned LiveInk, a program that reformats text according to its semantic units, so that readers can more effectively grasp and retain content. This strikes me as simlar, as well, to the kinds of issues raised by Douglas Knox–using scale and format to retrieve “structured information.” Do the much-better-informed Campers out there know of an already-existing project like this? I wish the checklist of visual thinking tools that George Brett proposes were already here…

Here's what others are saying about THATCamp on Twitter