Gary Wolf and Kevin Kelly have been documenting an emerging phenomenon they call “the quantified self“. The term refers to a set experiments that people are conducting – primarily on themselves – to understand their own bodies and behavior. In an article for the New York Times magazine, Wolf details a range of these experiments. One engineer weans himself off coffee and compares his reported levels of concentration with and without caffiene. Others use sensors like the Zeo to track their sleep patterns, or the Fitbit to track physical activity. Some track what they eat and drink, how much they weigh, their emotional states.
Wolf acknowledges that some of the people profiled in the article sound obsessive and notes that people engaged in detailed self-tracking may be “outliers”. And he’s careful to offer testimonies from people who engaged in self-tracking and gave it up, feeling like the data they generated was relentless and remorseless. (As someone who’s had to engage in self-tracking of blood glucose levels as a type 1 diabetic for the past 25 years, “relentless and remorseless” are my words, not Wolf’s.) But he’s clearly a believer that tracking can be a tool for self-discovery, a way of learning what constitutes normal behavior for each of us, not just a tool for moving towards a goal, like increased fitness or better sleep.
The experiment in self-tracking that I’m considering is more about self knowledge than self improvement, though I’m finding it’s hard to separate the two. I’m looking for ways to monitor my personal information flow. I’d like to understand how I get information about the world – through television, the web, radio, email and the people I talk to. The hope is to use myself as a guinea pig, to see what’s possible as far as active and passive monitoring of information flows, in the hope of opening the experiment to a wider population.
I’ve made the case – in my recent TED talk and elsewhere – that many of us overestimate the amount of diverse, international information we encounter through the internet and other communications networks. We run the danger of being “imaginary cosmopolitans”, convinced we’re encountering information from all corners of the world, while we might be trapped in homogenous echo chambers.
There’s some data to support this theory, both from experiments colleagues and I have been carrying out looking at cosmopolitan and parochial consumption of media online, and there are terrific analyses like Pippa Norris and Ronald Inglehart’s “Cosmopolitan Communications“, which looks at vast data sets about communication flows across borders. But I’ve not been able to find much information that compares the media diets of individuals at a level that allows me to answer questions like, “What percentage of news encountered is local, national and international, on average? What media is most likely to make an individual seek out more information – a mainstream media story, a citizen media post or a personal recommendation?
Responding to an earlier blog post of mine, my friend David Sasaki proposed an experiment: keep a communications diary that tracked interactions via different media. If I’m going to argue that people’s uses of the Internet are disproportionately domestic, it would be good to compare those online interactions to other media. Sure, 90% of my Facebook interactions might be domestic, but perhaps that vastly outpaces my face to face interactions – that might then be an argument that the Internet is, on balance, more likely to help us interact with people from different nations (different religions, different political perspectives, take your pick) than other technologies.
Media diaries aren’t new – take an intro communications class at many universities, and you’re likely to be asked to keep one. They tend to be pretty superficial – it requires some serious obsessiveness to log the individual stories you encounter, rather than writing down “NPR – 7am – 7:20am. And the process of keeping a diary tends to shape your behavior – for the month Rachel and I were a Nielsen family (years back), we watched vastly more public television than we do in an average month.
It’s easier than ever to keep a diary with tools like Your Flowing Data, a Twitter-based service that allows you to send direct messages via the web or SMS. I just logged “d yfd listened WNNZ 0750 – 0830″, a syntax that I hope will let me start collecting information on what media I encounter offline, and who I interact with in the real world.
But what I really want is data on the dozen or more stories I heard on NPR during that morning drive – coding each in terms of subject and geography would mean either logging while driving or writing a tool that turns the name of a broadcast media source and an interval into a stream of metadata. (To a certain extent, this is one of the functions of MediaCloud, but we’re a long way from being able to do this with media that isn’t also creating RSS and Atom feeds.) Furthermore, I know that the process of logging my behavior will influence that behavior. I can already see myself tweeting “d yfd watched football 1130 – 2045″ on Sunday and the accompanying feelings of guilt, shame and, if the Packers lose, frustration.
Logging my media diet is clearly going to involve some diary work, but it would be great if I could automate collection as much as possible, both to minimize the time requirements and the influence logging will have on my behavior. And, if this is an experiment I hope others will repeat, logging needs to be as automatic as possible. So I’ve been looking for tools that will log and analyze my online behavior transparently.
My friend and colleague Judith Donath was responsible for a number of early tools designed to allow self-monitoring of email use, including Themail, developed with her student (and now, world-leading designer) Fernanda Viegas, Themail. I asked her advice on locating appropriate self-tracking tools to understand how I’m getting information through email, the web, Twitter and other media. Her suggestion: look at productivity tools.
Good advice. Most of the tools I’d been finding to track web use either are designed to allow bosses to monitor their workers or spouses to read each others’ email. Judith’s advice led me to Rescue Time, an amazing package that monitors everything you do on your computer… and nags you when it perceives you to be wasting time. I may break down and turn off the messages that urge me to account for every five minutes of inactivity, but I’m finding the ability to track what applications I’m using to be hugely helpful, if slightly dispiriting.
Apparently, I spent roughly twice as much time answering email as I do anything else. Writing (BBedit) comes in second place, though I apparently spend almost half as much time on Twitter as I do writing. And while I’d likely tell you I get most of my important news from Global Voices, the New York Times and Foreign Policy’s Passport, the logs tell the tale of my secret shame: a need to view every single goofy image posted on Reddit.
That’s not an easy balance to strike. I’ve been looking at mail analysis tools as well, since Themail no longer exists. Mail Trends gets the job done… if you’re a GMail user and if you don’t mind mucking about on the command line. (It’s a very elegant Python script, which needs the Cheetah templating library. In my experience, it chokes when I try to feed it more than 100,000 emails, but works like a champ on 50,000 or so.) Mail Trends does a great job in offering a topline summary – I now know that my primary research collaborator sends me roughly twice as much email as my wife… which may or may not tell me something helpful about both relationships. But it’s not able to tell me what URLs I follow within emails and which I ignore, data that I’d need to understand how I get information from mailing lists and individuals. My guess is that a tool specific enough to track the URLs I read would be almost unusable in terms of showing the overall patterns of email usage.
I’m having similar problems figuring out how to analyze Twitter. Tweetstats offers insights on who I retweet and who I reply to – good indicators of who I read closely within the set of 585 people I follow. And MMMeeja provides a pretty map of those 585 who have provided information about their location, letting me see that I follow a lot of Africans and not many South Americans. Again, what I’d really like is something that collected every URL presented to me via Twitter and tracked which ones I follow and which I ignore. Ditto for Facebook, though I use it lots less.
So – here are my open questions:
– What are tools I’ve not yet found that solve some of the problems I’ve described here? Is there a good tool that can turn an interval of radio or television into a stream of story metadata? Has anyone developed a tool that tracks every URL I encounter across applications and examines whether I’ve followed it?
– Has anyone come up with a way to make offline media tracking easier to do? Something like Shazam, which could listen to radio or television with me and tell me what stories I’m hearing? A microformat for tracking conversations with individuals?
– If I want anyone else to participate in this project – and I do -what’s the right balance between the overbroad and the spookily specific? If I’m not willing to start using Eyebrowse, what level of specificity is the right one? Your top eight sites, as Chrome present to you? The aggregate data of RescueTime? A world map that shows how often different corners of the world are presented to you in the course of a month?
– To make the process of media self-monitoring worth engaging in, there needs to be a reward, either in terms of self-knowledge or self-improvement. What sorts of knowledge would make you willing to participate in an experiment like this? Are there behaviors that you’d like to change that such an experiment would help you identify and address? Or have I simply descended too deep into the realm of the obsessive outlier?