Doctoral Candidate at UNICAEN
Connect
Follow
Søren Fomsgaard
DC 6 at UNICAEN
Coming from computational linguistics and philosophy technology, I am interested in how linguistic patterns or styles differ between humans and machines.
Textual online content generated and/or propagated by non-human agents has become an increasingly present phenomenon on social media platforms. When deployed with malicious intent, the behavior of this kind of agent can have impacts that are either directly harmful to people that are targeted by this behavior, for example, through toxic language use, or impacts that are more indirectly harmful, for example, by intervening in political discourse.
With the advent of contemporary large language models, the near future may feature more bots that employ increasingly sophisticated human-like harmful language online. Therefore, being able to detect linguistic patterns and idiosyncrasies will play a role in identifying harmful textual content that is not only spread but also generated by machines.
My research interests include authorship analysis, computational sociolinguistics, and the philosophical issue of what defines and differentiates human, non-human, and hybrid forms of agency. I joined the HYBRIDS project because of its focus on interdisciplinary research, interaction, and collaboration, both in its individual research tracks and at the network level.
ESR6 Individual Project
Detection of toxic bots in Twitter by analysing textual content
Research Objectives:
AI-Generated Text Classification for Detecting Political Bots
To design and develop classification strategies to detect AI-generated texts applied to detect different types of political bots that are able to influence election campaigns.
Twitter Data Collection for Campaign Analysis
To crawl and scrape Twitter data for a specific campaign and for a specific period.
Linguistic Analysis of Bot-Generated Messages
To analyse the linguistic content and relevant features of the messages sent by bots by focusing on main topics, extreme opinions, language patterns, etc.
Taxonomy Development for Bot Classification
To make a deep taxonomy of bots by considering different aspects, such as their aims, attitudes, linguistic register, or topics.
Expected Results:
Bot Detection and Classification System Development
Development of a system for bot detection and classification.
Fine-Grained Annotated Election Campaign Dataset Creation
Creation of a benchmark dataset related to a specific election campaign, provided with fine-grained annotations of different types of bots.
Comprehensive Evaluation Leveraging Knowledge and Datasets
Evaluation of the classification system by making use of the knowledge (not only the previously created dataset) acquired from the analysis.
Application to Diverse Use Cases: Brexit, Immigration, Climate, and Elections
Application on use cases related to European questions (Brexit and Euroscepticism), immigration, and climate emergency, as well as domestic political elections.
Supervisors
In our pursuit of academic excellence, HYBRIDS Doctoral Candidates are guided by a dedicated team of supervisors. Comprising the Main Supervisor, Co-Supervisor, and Inter-sectoral Supervisor, this team of professionals offers a wealth of knowledge, mentorship, and interdisciplinary insights.
Main Supervisor
Dr. Gaël Dias
University of Caen Normandy (UNICAEN)
Co-Supervisor
Dr. Berta Garcia Orosa
University of Santiago de Compostela (USC)
Inter-sectoral Supervisors
Dr. José Ramon Pichel
Factoría Software e Multimedia (IMAXIN)
Mr. Ettore Di Cesare
Fondazione OPENPOLIS