Partially Attended

an irregularly updated blog by Ian Mulvany

blog posts about big data

US phone companies are selling your real-time location data. File this one under “you do it to yourself”. Of course, the key thing here is that directly selling this data to law enforcement is illegal, but indirectly selling it is not. The market is acting as an evolutionary force to find ways to extract money by working around laws that are put in place to protect citizens. In this case the economy can move at a faster pace through business innovation than the law can to protect citizens. ... (more)

choice magazine podcast Questions

This week I was interviewed for the ALA Choice podcast, a weekly program featuring in-depth conversations about contemporary trends, best practices, and case studies important to academic librarians, hosted by Bill Mickey, the Editorial Director at Choice. The topic was trends in big data and the role of the library, and it was really fun to participate in. The panel I was on included Caroline Muglia from the University of Southern California Libraries as well as Andy Rutkowski and Eimmy Karina Solis from USC libraries. ... (more)

What do we mean when we talk about Big Data?

This blog post about this article provides the following definition of big data: “High volume data that frequently combines highly structured administrative data actively collected by public sector organisations with continuously and automatically collected structured and unstructured real-time data that are often passively created by public and private entities through their internet.” The article is behind a paywall, but the blog post is pretty clearly laid out. ... (more)

Textometrica, a tool review

A quick spin with Textometrica. Yesterday I had a good conversation with Simon Lindgren, the creator of Textometrica. I decided to try out the tool before chatting to him. Textometrica encapsulates a process for understanding the relationship and distribution of the occurrence of concepts in a body of plain text. It provides a multi-step online tool for the analysis. The advantage of using this tool is that you don’t need to be able to do any coding to get to a point where you have some quite interesting analysis of your corpus. ... (more)
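
For anyone who does want to code it, here is a rough sketch of the general idea of looking at how concepts are distributed and related in plain text. It is not Textometrica's actual method, which the excerpt above does not spell out. The function name `concept_cooccurrence`, the naive sentence splitting, and the sample text are all my own assumptions for illustration.

```python
import re
from collections import Counter
from itertools import combinations


def concept_cooccurrence(text, concepts):
    """Hypothetical helper: count the sentences each concept appears in,
    and the sentences in which each pair of concepts appears together."""
    concepts = {c.lower() for c in concepts}
    occurrences = Counter()
    pairs = Counter()
    # Assumption: a sentence ends at ., ! or ? followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        words = set(re.findall(r"[a-z']+", sentence.lower()))
        present = sorted(words & concepts)
        occurrences.update(present)
        # Each unordered pair of concepts found in the same sentence.
        pairs.update(combinations(present, 2))
    return occurrences, pairs


# Made-up sample text, purely for illustration.
sample = (
    "Libraries collect data. Big data raises privacy questions. "
    "Libraries care about privacy and data."
)
occurrences, pairs = concept_cooccurrence(sample, ["libraries", "data", "privacy"])
```

The pair counts could then be treated as edge weights in a network of concepts, which is one common way to look at relationships in a corpus.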