Text as data & data in the text

Studying conflicts in post-Soviet spaces through structured analysis of textual contents available on-line

A project led by Giorgio Comai, researcher and data analyst at OBCT/CCI, carried out with the support of the Italian MFA (see below for details and disclaimers).


About this project

Posts and updates

No matching items

Review of literature

Datasets

Title Description Categories
Russian state institutions 2024 This is a collection of full-text datasets based on contents extracted from the websites of Russian institutions.  
Summary of a sample of Zavtra.ru articles published in 1996 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 1997 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 1998 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 1999 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2000 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2001 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2002 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2003 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2004 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2005 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2006 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2007 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2008 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2009 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2010 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2011 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2012 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2013 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2014 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2015 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2016 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2017 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2018 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2019 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2020 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2021 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2022 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2023 LLM-generated content, may include inaccuracies  
Summary of a sample of Zavtra.ru articles published in 2024 LLM-generated content, may include inaccuracies  
No matching items

Tutorials

The tutorials are mostly based on castarter - Content Analysis Starter Toolkit for the R programming language, and will target users with beginner or beginner-intermediate coding skills. As the package gains new features, the tutorials will become more accessible; eventually, some of them will be accessible to users with no coding experience at all.

A draft version of the documentation for the package castarter is already available online. Both documentation and functionalities of the package will mature in the coming months.


Funding and disclaimers

This project is hosted by OBCT/CCI. It is carried out with the support of the Italian Ministry of Foreign Affairs and International Cooperation under art. 23 bis, D.P.R. 18/1967. All opinions expressed within the scope of this project represent the opinion of their author and not those of the Ministry.

Le posizioni contenute nel presente report sono espressione esclusivamente degli autori e non rappresentano necessariamente le posizioni del Ministero degli Affari Esteri e della Cooperazione Internazionale”