Snurblog — Axel Bruns

Making 'Big Data' Manageable

Snurb — Tuesday 16 October 2012 20:15
'Big Data' | CCC 2012

The next speaker at the CCC Symposium is Rasmus Helles, who takes us back to the problem of 'big data'. Such data lend themselves well to visualisation, but this also creates substantial new problems as we make sense of data through their visual representations: we may see the patterns in the data, but we still don't necessarily know what they mean.

To establish such meaning usually requires much more manual approaches to analysis, beyond (algorithmic) visualisation. This means content coding: a structured interpretation of data at a meaningful level, which cannot be done automatically at this point. But how can this be done effectively with big and complex datasets? One solution is to go deep, and engage in very labour-intensive studies that result in a very fine-grained coding of data; the other is to generalise and establish only broad categories which are applied to the data.

The latter approach may also reveal broad patterns which may not otherwise become apparent. For example, Rasmus's work has identified three major genres of Websites (content sites, citizen and consumer sites, and specialised services), which together account for some 95% of the time which Danes spend online. These site genres are as prevalent amongst the most popular sites in Denmark as they are in the long tail, even if the specific focus of the respective sites may be different.

So, in this case, the long tail is simply a tiered version of the top sites - long-tail specialisation simply follows geographical and topical diversification. This shows that the genres apply across the entire dataset, which is also of importance for further research: big data, in this case, may safely be made manageable by probability sampling.
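
To make the idea of probability sampling a little more concrete, here is a minimal sketch in Python. It is not Rasmus's actual procedure: the site identifiers, the sample size, and the toy coding rule are all assumptions made purely for illustration. It draws a simple random sample of sites from a large list, stands in for the manual coding step with a placeholder, and then estimates each genre's share from the coded sample.

```python
import random
from collections import Counter

# The three genres named in the talk; everything else below is hypothetical.
GENRES = ["content sites", "citizen and consumer sites", "specialised services"]


def sample_for_coding(site_ids, sample_size, seed=0):
    """Draw a simple random sample (without replacement) of sites to code by hand."""
    rng = random.Random(seed)  # fixed seed so the sample can be reproduced
    return rng.sample(site_ids, sample_size)


def genre_shares(coded):
    """Estimate each genre's share from a dict of {site_id: genre} manual codes."""
    counts = Counter(coded.values())
    total = sum(counts.values())
    return {genre: n / total for genre, n in counts.items()}


# Hypothetical population of 10,000 sites, far too many to code manually.
site_ids = [f"site-{i}" for i in range(10_000)]

# Only these 200 sites would be passed on to human coders.
sample = sample_for_coding(site_ids, 200)

# Placeholder for the manual coding step: a toy rule, NOT a real classifier.
coded = {s: GENRES[int(s.split("-")[1]) % 3] for s in sample}

shares = genre_shares(coded)
print(shares)
```

Because every site has the same chance of being selected, the shares in the coded sample serve as estimates for the full dataset, which is what makes the manual coding workload manageable.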




Except where otherwise noted, this work is licensed under a Creative Commons BY-NC-SA 4.0 Licence.