Local media and institutional news sources in Russia-held territories in Ukraine and beyond

A dataset with basic details about relevant online sources

Author

Giorgio Comai

Published

February 7, 2023

Modified

March 13, 2023

Work-in-progress

This page is still a work in progress. It is shared in the spirit of keeping the research process as open as possible, but it is still a draft document, possibly an early draft: incomplete, unedited, and possibly inaccurate. Datasets included may likewise not be fully verified.

Context

This document introduces a dataset with basic details about online sources that are relevant to this research. These include not only sources from or about Russia-held territories in Ukraine, but also other online sources that may be relevant either as a point of reference (e.g. local media in neighbouring Russian regions or contested territories elsewhere in the post-Soviet space) or because of their direct or indirect impact (e.g. national media in Russia and Ukraine).

This is a work in progress and is unlikely to ever reach a state where it can be considered complete. Indeed, I add sources that may be of relevance as I advance in my research or as I encounter them serendipitously.

Its purpose is to keep a systematic record of relevant sources that can serve as a starting point for more specific research questions, and that grows as those questions are pursued.

The full dataset can be downloaded as a .csv file. The full metadata specification is likewise available as a .csv file.
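As a minimal sketch of how the downloaded file might be read, the snippet below uses only Python's standard library. The helper name `load_sources` and the file name `sources.csv` are hypothetical, and the inline sample only stands in for the real file, using a few values from the tables on this page.

```python
import csv
import io


def load_sources(fileobj):
    """Read the dataset from an open text file; return one dict per row."""
    reader = csv.DictReader(fileobj)
    rows = []
    for row in reader:
        # Treat empty cells as missing values (None) rather than empty strings.
        rows.append({key: (value or None) for key, value in row.items()})
    return rows


# Inline sample standing in for the real file. With a local copy one would
# instead use (hypothetical file name):
#     with open("sources.csv", newline="", encoding="utf-8") as f:
#         sources = load_sources(f)
sample = io.StringIO(
    "domain,entity,medium,language,name_original,name_en\n"
    "dan-news.ru,Donetsk,website,ru,Донецкое Агентство Новостей,Donetsk News Agency\n"
    "kherson-news.ru,Kherson,website,ru,Лента новостей Херсона,\n"
)
sources = load_sources(sample)
print(sources[1]["name_en"])  # None: no English name recorded for this source
```

Mapping empty cells to `None` is a design choice of this sketch: it makes optional fields easy to test for, since most fields in the dataset are not mandatory.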

Note

Allegiance is provided only in reference to local media and institutions in contested territories when it is immediately apparent and obvious. Independent sources or others whose allegiance is not self-evident are all recorded as “other”. The field allegiance is not relevant for sources not specifically related to contested territories (e.g. it is not used for national Russian or Ukrainian media).
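The rule described in this note can be sketched as a small grouping step. This is an illustration under assumptions rather than the code used to build this page: the function name `group_by_allegiance` is hypothetical, the exact strings stored in the allegiance column ("Russia", "Ukraine", "other") are assumed, and an empty entity is used as a rough proxy for sources that are not tied to a specific territory.

```python
from collections import defaultdict


def group_by_allegiance(rows):
    """Group sources tied to a specific territory by their allegiance value.

    Rows without an entity (e.g. national media) are skipped, since the
    allegiance field is not used for them.
    """
    groups = defaultdict(list)
    for row in rows:
        if not row.get("entity"):
            continue  # applies to a whole state, not a specific territory
        # Assumption: a missing allegiance is treated like "other".
        allegiance = row.get("allegiance") or "other"
        groups[allegiance].append(row["domain"])
    return dict(groups)


# Illustrative rows; the allegiance strings are assumed, not taken from the file.
rows = [
    {"domain": "pravdnr.ru", "entity": "Donetsk", "allegiance": "Russia"},
    {"domain": "dn.gov.ua", "entity": "Donetsk", "allegiance": "Ukraine"},
    {"domain": "kremlin.ru", "entity": "", "allegiance": ""},
]
print(group_by_allegiance(rows))
# {'Russia': ['pravdnr.ru'], 'Ukraine': ['dn.gov.ua']}
```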

Media or other online sources reporting news from or about Russia-held territories in Ukraine

Allegiance to Russia

| domain | entity | medium | language | name_original | name_en |
|---|---|---|---|---|---|
| kherson-news.ru | Kherson | website | ru | Лента новостей Херсона | |
| dan-news.ru | Donetsk | website | ru | Донецкое Агентство Новостей | Donetsk News Agency |
| dontimes.ru | Donetsk | website | ru | Донецкое время | |

Allegiance to Ukraine

| domain | entity | medium | language | name_original | name_en |
|---|---|---|---|---|---|

No rows found

Other

| domain | entity | medium | language | name_original | name_en |
|---|---|---|---|---|---|

No rows found

Local institutions of Russia-held territories in Ukraine

Allegiance to Russia

| domain | entity | medium | language | name_original | name_en |
|---|---|---|---|---|---|
| pravdnr.ru | Donetsk | website | ru | Правительство Донецкой Народной Республики | Government of the DNR |
| днронлайн.рф | Donetsk | website | ru | Донецкая Народная Республика | Donetsk People's Republic - official website |

Allegiance to Ukraine

| domain | entity | medium | language | name_original | name_en |
|---|---|---|---|---|---|
| dn.gov.ua | Donetsk | website | uk | Донецька обласна державна адміністрація | Donetsk regional administration |

Other

| domain | entity | medium | language | name_original | name_en |
|---|---|---|---|---|---|

No rows found

Russian national media

| domain | medium | language | name_original | name_en |
|---|---|---|---|---|
| zavtra.ru | website | ru | Завтра | Zavtra |
| 1tv.ru | website | ru | Первый канал | First Channel (Russia) |
| smotrim.ru | website | ru | Смотрим | |
| anna-news.info | website | ru | Информационное агентство «ANNA NEWS» | News Agency «ANNA NEWS» |

Russian national institutions

| domain | medium | language | name_original | name_en |
|---|---|---|---|---|
| mil.ru | website | ru | Министерство обороны Российской Федерации | Ministry of Defence of the Russian Federation |
| kremlin.ru | website | ru | Президент России | President of Russia |
| en.kremlin.ru | website | en | President of Russia | |
| mid.ru | website | ru | Министерство иностранных дел Российской Федерации | Ministry of Foreign Affairs of the Russian Federation |
| mid.ru/en | website | en | Ministry of Foreign Affairs of the Russian Federation | Ministry of Foreign Affairs of the Russian Federation |
| duma.gov.ru | website | ru | Государственная Дума - Федерального Собрания Российской Федерации | Russian parliament |

Metadata specification

The metadata table gives a technical specification and a brief description for every field included in the full dataset available for download:

id, domain_id, wikidata_id, date_recorded, url, news_section_url, base_state, entity, allegiance, category, medium, language, name_original, self_described_as, name_en, description_en, activity_start, activity_end, earliest_available_online, license, telegram, vkontakte, twitter, facebook, instagram, tiktok, youtube, rutube, odnoklassniki, yandex_zen, tiktok, fediverse.

The full metadata specification is available as a .csv file, as well as in the table below.

| column_name | mandatory | format | description |
|---|---|---|---|
| id | true | numeric | Numeric identifier to be attributed to each language version of each source; to be used for reference and without intrinsic meaning. |
| domain_id | true | character string | A combination of the plain domain name, underscore, and the language code of the main language. E.g. 1tv.ru_ru for the Russian language version of the website of Russian TV channel 1tv.ru |
| wikidata_id | false | character string. Must start with capital Q followed by an integer | Wikidata identifier of the source. |
| date_recorded | true | a date in the YYYY-MM-DD format | Date when a given record was added, updated, or checked |
| url | true | a url, must start with http | Link to the home page or main page of the source in the relevant language |
| news_section_url | false | a url, must start with http | Link to the page where “all news” (or similar) are posted. |
| base_state | false | Two-letter country code, iso2c, fallback on Eurostat | State whose sovereignty over a territory is generally recognised. To be left empty in case of international media. |
| entity | false | character. Common name of entity or region. | The name of the region, territory, or entity. Generic name of region to be preferred to self-designations that apply to only a specific period. E.g. “Donetsk” would apply to pre-2014, 2014-2022, and post-2022. To be left empty if it applies to a whole state. |
| allegiance | false | character string. Common name. | The side in a conflict, or the de facto authorities, to which the source is explicitly loyal, if any (see e.g. P945 in Wikidata) |
| category | true | one of: media, local institution | Category a given source belongs to |
| medium | true | one of: website, telegram, [name of social media] | Type of source |
| language | true | WMF language code; typically a two-letter language code for most languages; reference: https://www.wikidata.org/wiki/Help:Wikimedia_language_codes/lists/all; use “und” for undetermined | Main language used in the source. Separate language versions of a source should be recorded separately; in case of mixed-language sources, values can be separated by a semicolon. |
| name_original | true | character string, original language | Name of the source or institution in its original form |
| self_described_as | false | character string, original language | The source as it describes itself, in its original language |
| name_en | true | character string, English | Name of the source, typically a transliteration of the original or common name if used |
| description_en | false | character string, English | A brief description of the source. It may include the translation of the original name. |
| activity_start | false | a date in the YYYY-MM-DD format (or only YYYY, or only YYYY-MM if exact date unknown) | Date when the source started publications |
| activity_end | false | a date in the YYYY-MM-DD format (or only YYYY, or only YYYY-MM if exact date unknown) | Date when the source ended publications or was closed |
| earliest_available_online | false | a date in the YYYY-MM-DD format (or only YYYY, or only YYYY-MM if exact date unknown) | Refers to the date of the earliest post available online |
| license | false | character string, acronym of the license or brief characterisation | If available, concise reference to the license (e.g. CC-BY) |
| telegram | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| vkontakte | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| twitter | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| facebook | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| instagram | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| tiktok | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| youtube | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| rutube | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| odnoklassniki | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| yandex_zen | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| tiktok | false | a url, must start with http | Direct link to main page of the given source on the respective service |
| fediverse | false | a url, must start with http | Direct link to main page of the given source on the respective service |
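
The format rules in the specification above lend themselves to simple automated checks. The following is a minimal sketch, assuming rows have been read from the .csv file as dictionaries of strings; the function names (`make_domain_id`, `is_valid_date`, `validate_row`) and the example values are hypothetical and not part of the dataset or its tooling. Only a subset of the rules is covered (mandatory fields, wikidata_id, dates, urls, category).

```python
import re
from datetime import datetime

MANDATORY = ("id", "domain_id", "date_recorded", "url", "category",
             "medium", "language", "name_original", "name_en")
PARTIAL_DATE_FIELDS = ("activity_start", "activity_end",
                       "earliest_available_online")
URL_FIELDS = ("url", "news_section_url", "telegram", "vkontakte", "twitter",
              "facebook", "instagram", "tiktok", "youtube", "rutube",
              "odnoklassniki", "yandex_zen", "fediverse")
CATEGORIES = {"media", "local institution"}
# strptime format paired with a strict shape, so "2023-2-7" is rejected.
DATE_SHAPES = (("%Y-%m-%d", r"[0-9]{4}-[0-9]{2}-[0-9]{2}"),
               ("%Y-%m", r"[0-9]{4}-[0-9]{2}"),
               ("%Y", r"[0-9]{4}"))


def make_domain_id(domain, language):
    """Plain domain name, underscore, language code (e.g. 1tv.ru_ru)."""
    return f"{domain}_{language}"


def is_valid_date(value, allow_partial=False):
    """Accept YYYY-MM-DD; with allow_partial also YYYY-MM or YYYY."""
    shapes = DATE_SHAPES if allow_partial else DATE_SHAPES[:1]
    for fmt, pattern in shapes:
        if re.fullmatch(pattern, value):
            try:
                datetime.strptime(value, fmt)  # rejects e.g. 2023-02-30
                return True
            except ValueError:
                return False
    return False


def validate_row(row):
    """Return a list of problems found in one record (empty if none)."""
    problems = []
    for field in MANDATORY:
        if not row.get(field):
            problems.append(f"missing mandatory field: {field}")
    wikidata_id = row.get("wikidata_id")
    if wikidata_id and not re.fullmatch(r"Q[0-9]+", wikidata_id):
        problems.append("wikidata_id must be Q followed by an integer")
    if row.get("date_recorded") and not is_valid_date(row["date_recorded"]):
        problems.append("date_recorded must be YYYY-MM-DD")
    for field in PARTIAL_DATE_FIELDS:
        if row.get(field) and not is_valid_date(row[field], allow_partial=True):
            problems.append(f"{field} must be YYYY, YYYY-MM, or YYYY-MM-DD")
    for field in URL_FIELDS:
        if row.get(field) and not row[field].startswith("http"):
            problems.append(f"{field} must start with http")
    if row.get("category") and row["category"] not in CATEGORIES:
        problems.append("category must be one of: media, local institution")
    return problems


# Illustrative record: id, date, url and activity_start are placeholders,
# not values taken from the dataset.
row = {
    "id": "1",
    "domain_id": make_domain_id("1tv.ru", "ru"),
    "date_recorded": "2023-02-07",
    "url": "https://www.1tv.ru/",
    "category": "media",
    "medium": "website",
    "language": "ru",
    "name_original": "Первый канал",
    "name_en": "First Channel (Russia)",
    "activity_start": "1995",
}
print(validate_row(row))  # []
```

Returning a list of problems rather than raising on the first one is a choice of this sketch: for a dataset that is explicitly not fully verified, a complete list per record is likely more useful when reviewing entries.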
Datasets available for download