Connectedness


Friday, 27 May 2005

Mining Social Networks from Email

Posted on 12:45 by Unknown
I recently acquired a couple of new toys--an IBM ThinkPad last month and a Canon Pixma multifunction printer/copier/fax/scanner just today. I go a while between upgrades, so when the new stuff comes in it really blows me away. Today's revelation is optical character recognition, or OCR. How OCR works I have no idea, but here's what it can do:

My regular readers may have already detected that I am a New Yorker magazine junkie. My friends can hardly fail to notice this, since I am always saying, "Yes, and that reminds me of an article I just read in the New Yorker," at which point I take over the conversation for a few minutes. In the olden times (before today) that was more than enough for my friends. But as of today it is just the beginning. Now I can go home to my personal NYer archives (dating from 9-11), grab the issue in question, put it through my scanner, and sit back while my computer receives the entire article in the form of a Word document (with columns, pages, and cartoons all properly configured) or a PDF (with text searching). I leave the rest of the story to your imagination, since this is a copyright-friendly blog.

If any of you just happen to be thinking about email right now, let me say--that reminds me of a great article I just read in the New York Times: "Enron Offers an Unlikely Boost to E-Mail Surveillance." I am a bit embarrassed to be mentioning this article only now. It was published very prominently on Sunday. But I have been so preoccupied with my new ThinkPad that real life is apparently passing me by. So thanks to Jim Murphy for clipping the article and handing it to me, in a quaint nod to life before scanners. Jim's gift prompted me to check Patti Anklam's blog and read her review of the article, which she wrote the day after its publication.

The gist of the story is that a huge pile of Enron email is now publicly available. The email provides a detailed look at communication from before the California energy crisis right up to the final bankruptcy scandal. This is an unprecedented resource for sociologists and computer scientists, who have proceeded to demonstrate not only the power of textual analysis (how often people say "Dynegy" or "bankruptcy" week by week) but also the power of network analysis (who sends email to whom and when, regardless of the content).
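For the curious, here is a minimal Python sketch of those two kinds of analysis. It is only an illustration, not the researchers' method: the three sample messages, the names in them, and the helper functions are all made up for the example, and a real study would parse messages out of the actual archive.

```python
from collections import Counter
from datetime import date

# Hypothetical sample messages (invented for illustration, not real Enron data).
MESSAGES = [
    {"sender": "alice", "recipients": ["bob", "carol"],
     "date": date(2001, 11, 5), "body": "Dynegy merger talks continue."},
    {"sender": "bob", "recipients": ["alice"],
     "date": date(2001, 11, 6), "body": "Is bankruptcy on the table?"},
    {"sender": "alice", "recipients": ["bob"],
     "date": date(2001, 11, 28), "body": "Dynegy pulled out. Bankruptcy looks likely."},
]


def weekly_term_counts(messages, term):
    """Textual analysis: count messages mentioning `term`, keyed by ISO (year, week)."""
    counts = Counter()
    needle = term.lower()
    for msg in messages:
        if needle in msg["body"].lower():
            year, week, _ = msg["date"].isocalendar()
            counts[(year, week)] += 1
    return counts


def edge_counts(messages):
    """Network analysis: count messages per (sender, recipient) pair, ignoring content."""
    counts = Counter()
    for msg in messages:
        for recipient in msg["recipients"]:
            counts[(msg["sender"], recipient)] += 1
    return counts


if __name__ == "__main__":
    print(weekly_term_counts(MESSAGES, "Dynegy"))
    print(edge_counts(MESSAGES))
```

Note that the second function never looks at the message body, which is the whole point of the network view: the pattern of who writes to whom is informative on its own.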

The article features a beautiful network diagram:

Note the use of a hierarchical circular layout that places people in three categories: (1) periphery, (2) mid-level, and (3) core. That's a great way not to distract people with unnecessary detail.
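I don't know how the diagram's authors assigned people to the three rings, but the general idea can be sketched in a few lines of Python. In this sketch everything specific is my own assumption: people are tiered by their number of distinct contacts, the thresholds `core_min` and `mid_min` are arbitrary, and each tier is placed on its own concentric circle.

```python
import math
from collections import defaultdict

# Assumed radii: core in the middle, periphery on the outside.
RADIUS = {"core": 1.0, "mid-level": 2.0, "periphery": 3.0}


def tier_by_contacts(edges, core_min=3, mid_min=2):
    """Assign each person a tier from their count of distinct contacts.

    The thresholds are hypothetical; a real study would choose its own criterion.
    """
    contacts = defaultdict(set)
    for a, b in edges:
        contacts[a].add(b)
        contacts[b].add(a)
    tiers = {}
    for person, others in contacts.items():
        n = len(others)
        if n >= core_min:
            tiers[person] = "core"
        elif n >= mid_min:
            tiers[person] = "mid-level"
        else:
            tiers[person] = "periphery"
    return tiers


def circular_positions(tiers):
    """Spread the members of each tier evenly around that tier's circle."""
    by_tier = defaultdict(list)
    for person in sorted(tiers):
        by_tier[tiers[person]].append(person)
    positions = {}
    for tier, people in by_tier.items():
        for i, person in enumerate(people):
            angle = 2 * math.pi * i / len(people)
            r = RADIUS[tier]
            positions[person] = (r * math.cos(angle), r * math.sin(angle))
    return positions


if __name__ == "__main__":
    edges = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c")]
    tiers = tier_by_contacts(edges)
    print(tiers)
    print(circular_positions(tiers))
```

The resulting coordinates can be handed to any plotting tool; the reader sees three rings rather than every individual tie.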

The Enron analysis is being led by David Skillicorn, Kathleen Carley, and Michael Berry.

Want to try this at home? You can! Investigate your own email communication network by downloading Peter Gloor's TeCFlow.
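If you would rather tinker by hand, here is a rough Python sketch that counts who emails whom using only the standard library. This is not how TeCFlow works; it is just one way to get started, and it assumes your mail can be exported to a local mbox file (the path passed to `counts_from_mbox` is hypothetical). The sample message is invented so the sketch runs without any mailbox at all.

```python
import mailbox
from collections import Counter
from email import message_from_string
from email.utils import getaddresses, parseaddr


def sender_recipient_counts(messages):
    """Count messages per (sender address, recipient address) pair.

    Only the From, To, and Cc headers are read; message bodies are ignored.
    """
    counts = Counter()
    for msg in messages:
        _, sender = parseaddr(msg.get("From", ""))
        recipients = getaddresses(msg.get_all("To", []) + msg.get_all("Cc", []))
        for _, recipient in recipients:
            if sender and recipient:
                counts[(sender.lower(), recipient.lower())] += 1
    return counts


def counts_from_mbox(path):
    """Run the count over a local mbox export (path is whatever your mail client produced)."""
    return sender_recipient_counts(mailbox.mbox(path))


# Invented sample message so the sketch can be tried without a real mailbox.
SAMPLE = message_from_string(
    "From: Ann <ann@example.com>\n"
    "To: bob@example.com, Cy <cy@example.com>\n"
    "Subject: hi\n"
    "\n"
    "body\n"
)

if __name__ == "__main__":
    print(sender_recipient_counts([SAMPLE]))
```

The pair counts that come out are exactly the raw material for a network diagram: each pair is a tie, and the count is its strength.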