Connectedness

Friday, 27 May 2005

Mining Social Networks from Email

Posted on 12:45 by Unknown
I recently acquired a couple of new toys--an IBM ThinkPad last month and a Canon Pixma multifunction printer/copier/fax/scanner just today. I go a while between upgrades, so when the new stuff comes in it really blows me away. Today's revelation is optical character recognition, or OCR. How OCR works I have no idea, but here's what it can do:

My regular readers may have already detected that I am a New Yorker magazine junkie. My friends can hardly fail to notice this, since I am always saying, "Yes, and that reminds me of an article I just read in the New Yorker," at which point I take over the conversation for a few minutes. In the olden times (before today) that was more than enough for my friends. But as of today it is just the beginning. Now I can go home to my personal NYer archives (dating from 9-11), grab the issue in question, put it through my scanner, and sit back while my computer receives the entire article in the form of a Word document (with columns, pages, and cartoons all properly configured) or a PDF (with text searching). I leave the rest of the story to your imagination, since this is a copyright-friendly blog.

If any of you just happen to be thinking about email right now, let me say--that reminds me of a great article I just read in the New York Times: "Enron Offers an Unlikely Boost to E-Mail Surveillance." I am a bit embarrassed to be mentioning this article now. It was published very prominently on Sunday. But I have been so preoccupied with my new ThinkPad that real life is apparently passing me by. So thanks to Jim Murphy for clipping the article and handing it to me, in a quaint nod to life before scanners. Jim's gift prompted me to check Patti Anklam's blog and read her review of the article, which she wrote the day after its publication.

The gist of the story is that a huge pile of Enron email is now publicly available. The email provides a detailed look at communication from before the California energy crisis right up to the final bankruptcy scandal. This is an unprecedented resource for sociologists and computer scientists, who have proceeded to demonstrate not only the power of textual analysis (how often do people say "Dynegy" or "bankruptcy" week by week) but also the power of network analysis (who sends email to whom and when, regardless of the content).
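To make those two kinds of analysis concrete, here is a minimal Python sketch. It assumes each message has already been boiled down to a (date, sender, recipients, body) record. The sample messages, the example.com addresses, and the function names are all invented for illustration; none of this comes from the Enron corpus or from the researchers' actual methods.

```python
from collections import Counter
from datetime import date

# Hypothetical sample records: (sent date, sender, recipients, body text).
MESSAGES = [
    (date(2001, 10, 1), "alice@example.com", ["bob@example.com"],
     "Dynegy called about the merger."),
    (date(2001, 10, 3), "bob@example.com",
     ["alice@example.com", "carol@example.com"],
     "No news on Dynegy yet."),
    (date(2001, 11, 28), "carol@example.com", ["alice@example.com"],
     "Is bankruptcy really on the table? Bankruptcy filings take time."),
]


def weekly_term_counts(messages, term):
    """Textual analysis: count occurrences of `term` per ISO (year, week)."""
    counts = Counter()
    needle = term.lower()
    for sent, _sender, _recipients, body in messages:
        year, week, _weekday = sent.isocalendar()
        counts[(year, week)] += body.lower().count(needle)
    return counts


def email_edges(messages):
    """Network analysis: count messages per (sender, recipient) pair.

    The body is never read, so this works regardless of content.
    """
    edges = Counter()
    for _sent, sender, recipients, _body in messages:
        for recipient in recipients:
            edges[(sender, recipient)] += 1
    return edges
```

Calling `weekly_term_counts(MESSAGES, "bankruptcy")` gives one count per week, and `email_edges(MESSAGES)` gives the weighted who-emails-whom pairs that a network diagram would draw as links.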

The article features a beautiful network diagram:

Note the use of a hierarchical circular layout that places people in three categories: (1) periphery, (2) mid-level, and (3) core. That's a great way not to distract people with unnecessary detail.
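I don't know how the diagram in the article was actually produced, but the general idea of a three-ring layout can be sketched in a few lines of Python. Everything here is an assumption made for illustration: people are sorted into rings by how many messages they exchange (their degree), the thresholds are arbitrary, and the core sits on the innermost ring.

```python
import math

# Ring radii are arbitrary; the core is drawn closest to the center.
RING_RADIUS = {"core": 1.0, "mid-level": 2.0, "periphery": 3.0}


def tier_for(degree, core_min=10, mid_min=3):
    """Assign a person to a ring by degree; thresholds are illustrative only."""
    if degree >= core_min:
        return "core"
    if degree >= mid_min:
        return "mid-level"
    return "periphery"


def circular_layout(degrees):
    """Return {person: (x, y)}, spacing each tier evenly around its ring."""
    rings = {}
    for person, degree in sorted(degrees.items()):
        rings.setdefault(tier_for(degree), []).append(person)

    positions = {}
    for tier, people in rings.items():
        radius = RING_RADIUS[tier]
        for i, person in enumerate(people):
            angle = 2 * math.pi * i / len(people)
            positions[person] = (radius * math.cos(angle),
                                 radius * math.sin(angle))
    return positions
```

Because every person in a tier lands at the same distance from the center, the reader sees three categories at a glance instead of hundreds of individual positions.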

The Enron analysis is being led by David Skillicorn, Kathleen Carley, and Michael Berry.

Want to try this at home? You can! Investigate your own email communication network by downloading Peter Gloor's TeCFlow.
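If you would rather poke at the raw data yourself first, a rough alternative is to read the headers with Python's standard library. This is only a sketch and has nothing to do with how TeCFlow works: it assumes your mail client can export an mbox file, and the function names and the sample message are hypothetical.

```python
import mailbox
from collections import Counter
from email import message_from_string
from email.utils import getaddresses


def edges_from_messages(messages):
    """Count (sender, recipient) pairs from the From, To, and Cc headers."""
    edges = Counter()
    for msg in messages:
        senders = [
            addr.lower()
            for _name, addr in getaddresses(msg.get_all("From", []))
            if addr
        ]
        recipients = [
            addr.lower()
            for _name, addr in getaddresses(
                msg.get_all("To", []) + msg.get_all("Cc", [])
            )
            if addr
        ]
        for sender in senders:
            for recipient in recipients:
                edges[(sender, recipient)] += 1
    return edges


def edges_from_mbox(path):
    """Read a local mbox export; `path` is whatever file your client produced."""
    return edges_from_messages(mailbox.mbox(path))


# Invented sample message, so the sketch can be tried without a mailbox file.
SAMPLE = message_from_string(
    "From: Alice <alice@example.com>\n"
    "To: Bob <bob@example.com>, carol@example.com\n"
    "Cc: Dave <dave@example.com>\n"
    "Subject: hi\n"
    "\n"
    "body\n"
)
```

Only the address headers are read, never the message bodies, which matches the "who sends email to whom" view of the network.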

