21st Century Library through Data Analytics

With the encouragement of my professor in Data Analytics at Wayne State University, Dr. John Heinrichs, I decided to post a few words and the analysis document of a project I just completed for that course. I used NodeXL (http://nodexl.codeplex.com/), an add-on for the Excel spreadsheet, and the VOSON application (http://voson.anu.edu.au/), which allows data downloads from websites, to download data about a network based on seven seed sites: ALA (http://www.ala.org/), IFLA (http://www.ifla.org/), Worldcat (http://www.worldcat.org/), 21st Century Library (https://21stcl.wordpress.com/), Information Science on Scoop.it! (http://www.scoop.it/t/infoscienceaddict), Qualitative and Quantitative Methods in Libraries (http://qqml.net/index.html), and Library Conference Planner (http://lcp.douglashasty.com/).

Knowing that our new site does not yet have a very prominent position on the web, I decided to check where it stands and present this analysis as one of the projects for the class. A characteristic graph from stage 4 of my analysis is presented here.

[Figure: GraphImage 5]

The graph shows the first five of the seven seed sites listed above (labeled) and their network of sites and pages.

The details of the analysis are aimed at readers who have at least basic knowledge of NodeXL and should find the analysis document below an easy read. For readers without knowledge of NodeXL, I would like to mention that I was surprised to find (among other things) that our site is ranked fourth in this network, with only ALA, IFLA, and Worldcat ranked higher than 21st Century Library. This was a surprising discovery, considering that our site had existed for only 39 days at the time this post was written.

The entire analysis is provided below: Selected Project 1 Analysis
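For readers who want a feel for how a site can be "ranked" within a link network, here is a minimal sketch in Python. It is not the NodeXL workflow used in the project, and the post does not say which measure produced the ranking. NodeXL reports several (degree, betweenness, PageRank, and others); this sketch assumes the simplest one, in-degree, meaning the number of pages linking to a site. The edge list and the `rank_by_in_degree` helper are hypothetical and purely illustrative, not data from the project.

```python
from collections import Counter

# Hypothetical hyperlink edges (linking page -> linked-to site).
# Illustrative only; these are NOT the data collected in the project.
edges = [
    ("a.example", "ala.org"),
    ("b.example", "ala.org"),
    ("c.example", "ala.org"),
    ("a.example", "ifla.org"),
    ("b.example", "ifla.org"),
    ("c.example", "worldcat.org"),
]


def rank_by_in_degree(edge_list):
    """Return (site, in_degree) pairs sorted from most to least linked-to."""
    in_degree = Counter(target for _, target in edge_list)
    # Sort by descending count, then by name, so ties are ordered deterministically.
    return sorted(in_degree.items(), key=lambda item: (-item[1], item[0]))


ranking = rank_by_in_degree(edges)
for position, (site, count) in enumerate(ranking, start=1):
    print(position, site, count)
```

A different measure, such as betweenness or PageRank, could order the same sites differently, so a rank is only meaningful together with the metric behind it.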


2 thoughts on “21st Century Library through Data Analytics”

  1. Thank you Florence!
    There were a number of reasons, listed below, why I made this choice.
    a. I don’t use social media like Facebook, Twitter, Flickr, etc.;
    b. LinkedIn currently is not included in NodeXL’s features;
    c. Downloading data from my e-mail would require a maneuver through Outlook, which I don’t care to have connected to my Hotmail;
    d. I thought it would be interesting to see where our site stands.
    The actual NodeXL workbooks of my project and the analysis that I have attached contain many more interesting facts.
    The next project will be on YouTube, which is included in the features of NodeXL.

