Snurblog — Axel Bruns

Time-Sensitive Embeddings of News Content

Snurb — Monday 8 June 2026 00:15
Politics | Journalism | Industrial Journalism | 'Big Data' | ICA 2026 | Liveblog

The next speaker in this session at the 2026 International Communication Association conference in Cape Town is Rupert Kiddle, whose interest is in encoder-produced news embeddings. Such embeddings are increasingly commonly used to analyse and categorise news articles, both for internal journalistic purposes and for scholarly research. But they are not very sensitive to differences over time, and instead engage in a kind of temporal averaging; this can be addressed, but remains difficult.

Most models also remain opaque about their training data and weighting approaches, so there is a need to develop new approaches. This project draws on GDELT for its data; uses the Nomic Contrastors repository for its fully open embedding pipeline; and implements a training task called NewsCycle, which is designed to produce temporally sensitive embeddings.

GDELT is used to collect English-language news from 2020 to 2025, selecting some 2,200 articles per day and applying a range of filtering steps. The result of the NewsCycle task can then be tested by querying for specific content at particular points in time while hiding the timestamps of articles from the retrieval system; ideally, it will still select appropriate articles and reject those which represent news content from different timeframes, based only on the temporally distinct embeddings it has generated.
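To make the logic of this test more concrete, here is a minimal sketch in Python. It is not the NewsCycle implementation, and the talk did not specify the exact metric used; the names Article, retrieve, and temporal_precision_at_k, the seven-day window, and the toy two-dimensional embeddings are all assumptions made for illustration. The retrieval step ranks articles by cosine similarity alone, and the publication dates are consulted only afterwards, to score how many of the top results fall near the target date.

```python
from __future__ import annotations

import math
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class Article:
    """Hypothetical record; 'published' is used for scoring only."""
    doc_id: str
    published: date
    embedding: tuple[float, ...]


def cosine(a: tuple[float, ...], b: tuple[float, ...]) -> float:
    """Cosine similarity of two vectors (0.0 if either has zero length)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: tuple[float, ...], corpus: list[Article], k: int) -> list[Article]:
    """Rank by embedding similarity only; timestamps are never consulted here."""
    ranked = sorted(corpus, key=lambda a: cosine(query, a.embedding), reverse=True)
    return ranked[:k]


def temporal_precision_at_k(
    query: tuple[float, ...],
    target: date,
    corpus: list[Article],
    k: int = 3,
    window_days: int = 7,  # assumed tolerance, not taken from the talk
) -> float:
    """Share of the top-k results published within the window around 'target'."""
    hits = retrieve(query, corpus, k)
    if not hits:
        return 0.0
    window = timedelta(days=window_days)
    in_window = sum(1 for a in hits if abs(a.published - target) <= window)
    return in_window / len(hits)


# Toy corpus: two articles from March 2020, two from March 2023.
corpus = [
    Article("a1", date(2020, 3, 15), (1.0, 0.1)),
    Article("a2", date(2020, 3, 18), (0.9, 0.2)),
    Article("a3", date(2023, 3, 16), (0.1, 1.0)),
    Article("a4", date(2023, 3, 20), (0.2, 0.9)),
]

# A query embedding assumed to encode "this topic, as of March 2020".
query = (1.0, 0.0)
score_2020 = temporal_precision_at_k(query, date(2020, 3, 16), corpus, k=2)
score_2023 = temporal_precision_at_k(query, date(2023, 3, 16), corpus, k=2)
```

In this toy case the query sits close to the two 2020 articles, so scoring against a March 2020 target rewards the retrieval, while scoring the same results against a March 2023 target does not. An encoder that averages over time would place articles from both years close together, and a score of this kind would drop accordingly.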

To date the work is English-language and text-only, but it can be extended further; additional evaluation for a broader range of tasks beyond document retrieval is also necessary.

Except where otherwise noted, this work is licensed under a Creative Commons BY-NC-SA 4.0 Licence.