Connecte Dness

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Friday, 27 May 2005

Mining Social Networks from Email

Posted on 12:45 by Unknown
I recently acquired a couple new toys--an IBM Thinkpad last month and a Canon Pixma multifunction printer/copier/fax/scanner just today. I go a while between upgrades so when the new stuff comes in it really blows me away. Today's revelation is optical character recognition, or OCR. How OCR works I have no idea but here's what it can do:

My regular readers may have already detected that I am a New Yorker magazine junkie. My friends can hardly fail to notice this, since I am always saying, "Yes, and that reminds me of an article I just read in the New Yorker," at which point I take over the conversation for a few minutes. In the olden times (before today) that was more than enough for my friends. But as of today it is just the beginning. Now I can go home to my personal NYer archives (dating from 9-11), grab the issue in question, put it through my scanner, and sit back while my computer receives the entire article in the form of a Word document (with columns, pages, and cartoons all properly configured) or a PDF (with text searching). I leave the rest of the story to your imagination, since this is a copyright-friendly blog.

If any of you just happen to be thinking about email right now, let me say--that reminds me of a great article I just read in the New York Times: "Enron Offers an Unlikely Boost to E-Mail Surveillance." I am a bit embarassed to be mentioning this article now. It was published very prominently on Sunday. But I have been so preoccupied with my new ThinkPad that real life is apparently passing me by. So thanks to Jim Murphy for clipping the article and handing it to me, in a quaint nod to life before scanners. Jim's gift prompted me to check Patti Anklam's blog and see her review of the article which she wrote the day after its publication.

The gist of the story is that a huge pile of Enron email is now publically available. The email provides a detailed look at communication from before the California energy crisis right up to the final bankruptcy scandal. This is an unprecendented resource for sociologists and computer scientists, who have proceeded to demonstrate not only the power of textual analysis (how often do people say "Dynergy" or "bankruptcy" week by week) but also the power of network analysis (who sends email to whom and when, regardless of the content).

The article features a beautiful network diagram:

Note the use of a hierarchical circular layout that places people in three categories: (1) periphery, (2) mid-level, and (3) core. That's a great way not to distract people with unnecessary detail.

The Enron analysis is being led by David Skillicorn, Kathleen Carley, and Michael Berry.

Want to try this at home? You can! Investigate your own email communication network by downloading Peter Gloor's TeCFlow.
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Posted in | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • Happy, or at least healthy endings
    Yesterday was the 8th anniversary of my first Connectedness post , but it's been 3 years since I was even semi-active in this space. One...
  • Discussion with Valdis Krebs: What is a "social" network?
    Congratulations to Valdis Krebs for correctly identifying three out of four of my " mystery quotes " from last week. For those of...
  • Social capital in one easy lesson
    The power of social network analysis for business is getting a lot of press these days (like this big BusinessWeek article ). Without taking...
  • Evil-Doers at Sunbelt in San Diego
    Tomorrow I fly to San Diego to attend Sunbelt , the annual SNA extravaganza. The keynote address, by Phillip Bonacich , is "Using Socia...
  • Social Network Analysis article in "Wired"
    Thanks to Don Steiny for posting this reference to Nov 2004 Wired Magazine on the SOCNET mailing list: " Science's Next Big Score ...
  • How to build your network by Brian Uzzi and Shannon Dunlap
    Last week I analyzed the introductions underlying my professional network. Coincidentally, my colleague Steve Frigand sent me a nice foll...
  • Viewing network data in Excel... with banana
    Today I received an invitation from Harvard's Program on Networked Governance to watch Marc Smith demonstrate the powers of . NetMap -...
  • Web science, Webwhompers
    I have just unveiled Webwhompers , which bears the fruit of four years of my teaching Web science at Boston University. The site features a ...
  • Why math will rock your world (BusinessWeek)
    Click on the image below to read the latest cover story from BusinessWeek : " Why math will rock your world ." When you are ready ...
  • The Pulse-Taker, by Karen Stephenson
    Courtesy of Langemarks Cafe , here is a wonderful article about Karen Stephenson and her work in social network analysis, published by Booz...

Blog Archive

  • ►  2012 (1)
    • ►  June (1)
  • ►  2010 (3)
    • ►  June (2)
    • ►  May (1)
  • ►  2009 (22)
    • ►  December (1)
    • ►  September (2)
    • ►  August (2)
    • ►  July (1)
    • ►  June (5)
    • ►  May (4)
    • ►  March (2)
    • ►  February (4)
    • ►  January (1)
  • ►  2008 (36)
    • ►  December (3)
    • ►  November (2)
    • ►  October (1)
    • ►  September (6)
    • ►  August (4)
    • ►  July (2)
    • ►  June (8)
    • ►  May (4)
    • ►  April (3)
    • ►  February (1)
    • ►  January (2)
  • ►  2007 (42)
    • ►  December (1)
    • ►  November (1)
    • ►  October (2)
    • ►  September (6)
    • ►  August (6)
    • ►  July (5)
    • ►  June (8)
    • ►  May (4)
    • ►  March (3)
    • ►  February (1)
    • ►  January (5)
  • ►  2006 (63)
    • ►  December (4)
    • ►  October (2)
    • ►  September (2)
    • ►  August (3)
    • ►  July (7)
    • ►  June (10)
    • ►  May (10)
    • ►  April (4)
    • ►  March (8)
    • ►  February (6)
    • ►  January (7)
  • ▼  2005 (136)
    • ►  December (11)
    • ►  November (13)
    • ►  October (11)
    • ►  September (9)
    • ►  August (10)
    • ►  July (10)
    • ►  June (10)
    • ▼  May (12)
      • The Network Roundtable is off and running
      • Mining Social Networks from Email
      • Health Information Liquidity
      • Subtleties of Centrality
      • Grokker Maps the Information Community
      • Commercializing social networks
      • Barry Wellman's Net Lab: Community Central
      • Social Network Analysis Master Class June 13-15
      • Social Networks Get Serious
      • Annotated Bibliography of Social Network Analysis ...
      • The Tipping Point of Organizational Change
      • Stanley Wasserman and Visible Path
    • ►  April (13)
    • ►  March (15)
    • ►  February (9)
    • ►  January (13)
  • ►  2004 (99)
    • ►  December (9)
    • ►  November (18)
    • ►  October (13)
    • ►  September (16)
    • ►  August (15)
    • ►  July (20)
    • ►  June (8)
Powered by Blogger.

About Me

Unknown
View my complete profile