Ethan Zuckerman’s online home, since 2003

Exploring the Chinese internet with WeiboScope

Scholars of social media spend a lot of time studying Twitter. Twitter’s not the largest social network in the world – Facebook has at least twice as many users – but it’s massive and influential, particularly in the world of journalism, where smart practitioners have learned to report on stories using accounts from Twitter. And Twitter is something of a model organism for social media researchers. Most relationships and content on Twitter are public, while relationships and content on Facebook are often private. There’s an ecosystem of tools that use Twitter’s API to understand popular topics and networks of influence on Twitter, and countless research projects that use Twitter’s API to understand behavioral dynamics on social networks.

By contrast, there’s little scholarly research in English on Sina Weibo, China’s most popular microblogging network. (The top article on Google Scholar that comes up for a search on “twitter” has 637 cites. Top article for “sina weibo” has 9 cites.) The service is structurally similar to Twitter, with @usernames, hashtags, reposting, and URL shortening (using the t.cn site instead of t.co used by Twitter.) In one sense, the service is richer than Twitter, as posts can contain both 140 characters (which may contain significantly more information than 140 alphanumeric characters, as the 140 characters in Chinese are ideograms), and an embedded image or video. And Sina Weibo offers an API and supports an ecosystem of tools and applications that interact with Weibo data. Oh, and Sina Weibo has almost as many users as Twitter – 250 million in October 2011, as compared to roughly 300 million for Twitter at the end of 2011.

The obvious reason for the lack of English language research is that most English-speaking social media scholars don’t read Chinese very well. But this a lame excuse for ignoring a powerful media tool. John Kelly of Morningside Analytics doesn’t speak Persian, but he’s done groundbreaking research mapping links in the Iranian blogosphere. Colleagues at the Berkman Center are using Media Cloud (built by researchers who speak no Russian) to understand conversations taking place in Russian blogs versus those in state-influenced media. Language is a powerful, but not insurmountable, barrier to researching a media space. In both the cases I mention above, English-speaking researchers worked with translators to understand novel social media phenomena.

I sometimes wonder whether English-speaking scholars pay insufficient attention to Chinese social media due to an assumption that Chinese media has been censored to the point of sterility. I often speak about internet censorship, and American audiences in particular are quick to share their knowledge of the “great firewall”, the “fifty cent party” and other aspects of Chinese internet censorship. Because Chinese censorship has been widely reported in American media, I suspect many Americans know more about what’s not on the Chinese internet than what’s present. (David Talbot of Technology Review wrote an excellent article about “China’s Internet Paradox” which makes the case that the Chinese internet is freer and more complicated than most audiences think.)

One of the best ways to get a sense for the complexity of Sina Weibo is through WeiboScope, a tool created by Cedric Sam and colleagues at the University of Hong Kong. WeiboScope uses Sina Weibo’s API to collect posts from 200,000 Sina Weibo users. His sample is a subset of Sina Weibo’s most popular users, and contains only users who have at least 1000 followers. (His blog, the Rice Cooker, offers lots of details on building and deploying the system.) Taking advantage of the fact that many Sina Weibo posts include images, WeiboScope offers a visual version of Weibo “trending topics”, showing the images associated with the most retweeted posts.

A first glance at WeiboScope offers a sense for what’s hot in the Chinese internet. There’s lots of images of pop stars, and lots of pretty women showing off cleavage. Dig a bit further and there’s some hope for the xenophiles amongst us: internet memes that need to translation. Sam the Seagull – a bird who steals Doritos from an Aberdeen convenience store – has been kicking around the internet since at least 2007, and an animated GIF of the thieving bird is the second most popular post today. Other memes appear to be shared in realtime – this comparison of pollution in a Chinese city versus the skies above Australia featured on WeiboScope today, and also appeared on Reddit this morning.

Dig a bit deeper and there’s quite a bit of political content. Take this deeply disconcerting image:

The face of the mammarilly-enhanced cow is that of Niu Gensheng, CEO of Mengniu Dairy, one of the companies implicated in the 2008 Melamine scandal, where companies apparently added a toxic chemical to milk powder to increase protein content in their products. Mengniu recently revealed that some of their milk is testing positive for another toxin, apparently because cows were fed moldy feed. The company’s share price dropped 24% on this news today, knocking more than $1 billion of the company’s value. The text accompanying the Gensheng cartoon warns the executive of the dangers of angering 1.3 billion people. Another post, the most popular today, links to an article on Songshuhui.net that argues that Chinese people should stop drinking milk. While the article doesn’t explicitly mention Mengniu, it references scandals about milk, and it’s likely that the conversation about eschewing milk is directly related to the Mengniu news. Another popular post suggests a boycott of Mengniu, reminding readers that Saatchi & Saatchi, which had worked to rebrand the company, left after the tainted milk scandal of 2008.

I suspect some readers will note that the story I’m featuring about popular dissent is about consumer issues, not about direct opposition to the government. It’s worth remembering that popular protest often focuses more on economic and social issues than on overtly political issues – the Occupy movement in the US has been triggered by frustration with banks at least as much as it is with frustration with US politics. And there’s more directly political content on Weibo as well – this post talks about a family’s house that’s demolished by the government and a man’s protests in Beijing. This isn’t to say that Sina Weibo isn’t censored – it is. But the speed of Weibo means that stories can be widely discussed before censors declare a topic off limits, as we saw with extensive online coverage of the July high speed train collision. And the popularity of Weibo gives Chinese authorities a classic Cute Cats problem – censoring the service too heavily would alienate the 250 million people who use it, including the majority who are largely interested in scantily dressed celebrities.

I should note: I don’t speak or read Chinese. That means that my interpretation of the Mengniu cow could be deeply mistaken. But it also means that it’s possible to puzzle out a breaking story in Chinese media using WeiboScope, Google Translate and a few web searches.

Here’s hoping tools like WeiboScope will help make the Chinese internet seem like less of a foreign land and more like a near neighbor.


Oiwan Lam at Global Voices has posted about online activism around Mengniu, with some wonderful (and generally less disturbing!) images. And An Xiao offers a great reaction post to the ideas I’m putting forward here, including a clever inversion of the Cute Cat Theory: “with Chinese political memes, the cute cats are the activist message.” Very interesting, something I’m still digesting.

3 Responses to “Exploring the Chinese internet with WeiboScope”

  1. kenyatta says:

    Hey Ethan.

    You should talk to Tricia Wang. She’s doing quite a bit of work on Weibo, most of which isn’t public yet. (Alex Madrigal has a small piece on her look at Weibo and dating.)

  2. Ethan,

    Just a niggling detail from a chemist. The melamine that was added to the milk was not done to increase the protein content but to give the appearance of having higher protein.

    Proteins contain the element nitrogen, whereas fats, carbs, and water do not, so an easy (but not overly accurate) method to measure protein ( = milk quality) is to just test for nitrogen. That’s where melamine comes in. It is loaded with nitrogen – 2/3 of every pound is nitrogen, and since it is very cheap, you can add a little to milk or whatever and up the overall nitrogen content.

    A slightly more sophisticated analytical test however, will easily differentiated between melamine and protein.

  3. Ethan says:

    John, thanks for that – a fascinating detail and an important one: it suggests that part of that scandal was the primitive nature of food safety inspection equipment in the parts of China where the tainted powder was found.

Trackbacks/Pingbacks

  1. Social Media Street Art: Censorship, China’s Political Memes and the Cute Cat Theory | An Xiao Studio: the virtual studio of an xiao mina - [...] Zuckerman just published a great post about the importance of studying Sina Weibo, the popular microblogging tool in China.  …
  2. Exploring the Chinese Internet with WeiboScope » OWNI.eu, News, Augmented - [...] This article originally appeared on Ethan Zuckerman’s blog. [...]
  3. January Month in Review! | An Xiao Studio: the virtual studio of an xiao mina - [...] but I wrote up a long response post about Sina Weibo and political memes on my blog, in response …
  4. Why Real Name Registration Matters in China–and What Might Be Lost | an xiao studio: the virtual studio of an xiao mina - [...] As Ethan Zuckerman blogged earlier (my response here), Sina Weibo and other Chinese social media are scarcely studied in …
  5. WeiboScope sfida la censura della Rete in Cina « EJO – European Journalism Observatory - [...] Kong University, infatti, sta sviluppando WeiboScope, un progetto che grazie a un software riesce a tracciare i contenuti che …

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

 
Powered by WordPress | Designed by Elegant Themes