Skip to main content
Home
Snurblog — Axel Bruns

Main navigation

  • Home
  • Information
  • Blog
  • Research
  • Publications
  • Presentations
  • Press
  • Creative
  • Search Site

Matching Diverse Web Taxonomies

Snurb — Wednesday 25 May 2016 19:48
Produsage Communities | WebSci '16 |

The next session at Web Science 2016 starts with Natalia Boldyrev, whose focus is on Web taxonomies. There are a number of different approaches to taxonomies, from traditional librarian approaches to user-generated taxonomies, and from hierarchical catalogues of terms to unordered tag clouds. Such taxonomies are also culturally predicated: the taxonomy for football-related books in the German Amazon is much more detailed than it is in Amazon US, for instance.

Matching such diverse taxonomies in order to connect the datasets they describe is difficult. This is, on the face of it, an ontology matching problem, and can also be understood as a catalogue integration challenge; where catalogues in different languages come into play, multilingual matching also needs to be performed.

Such matching might begin by computing a ranked list of the most appropriate counterparts for any one term in the primary catalogue; this list can be created for instance by querying Wikipedia for the term at hand – but of course Wikipedia itself may also be ambiguous and adds further complexities. The approach here is to query Wikipedia for semantic labels.

Several alignment methods can be used to improve the matching quality. Constraints against misalignments can be introduced, but these need to be soft enough to not exclude valid solutions. In the absence of any prior ground truths, the results of such alignment must further be evaluated by human coders to test the quality of the matching.

This produces good results overall – but Wikipedia is not always available as a mediation source, especially for more obscure topics. What other, additional sources could be used here?

  • 1541 views
INFORMATION
BLOG
RESEARCH
PUBLICATIONS
PRESENTATIONS
PRESS
CREATIVE

Recent Work

Presentations and Talks

Beyond Interaction Networks: An Introduction to Practice Mapping (ACSPRI 2024)

» more

Books, Papers, Articles

Untangling the Furball: A Practice Mapping Approach to the Analysis of Multimodal Interactions in Social Networks (Social Media + Society)

» more

Opinion and Press

Inside the Moral Panic at Australia's 'First of Its Kind' Summit about Kids on Social Media (Crikey)

» more

Creative Work

Brightest before Dawn (CD, 2011)

» more

Lecture Series


Gatewatching and News Curation: The Lecture Series

Bluesky profile

Mastodon profile

Queensland University of Technology (QUT) profile

Google Scholar profile

Mixcloud profile

[Creative Commons Attribution-NonCommercial-ShareAlike 4.0 Licence]

Except where otherwise noted, this work is licensed under a Creative Commons BY-NC-SA 4.0 Licence.