文章基本信息

标题：Towards an Automatic Recognition of Mixed Languages: The Case of Ukrainian-Russian Hybrid Language Surzhyk
其他标题：Towards an Automatic Recognition of Mixed Languages: The Case of Ukrainian-Russian Hybrid Language Surzhyk
本地全文：下载
作者：Nataliya Sira ; Giorgio Maria Di Nunzio ; Viviana Nosilia 等
期刊名称：Umanistica Digitale
电子版ISSN：2532-8816
出版年度：2020
卷号：9
页码：97-116
DOI：10.6092/issn.2532-8816/10740
出版社：University of Bologna
摘要：Il concetto di interferenza linguistica è diventato caratterizzante nelle società multilingue odierne, nelle quali più e più lingue entrano in contatto e portano alla creazione di lingue ibride. Tali lingue, assieme alle discussioni sul loro diritto di essere riconosciute a livello ufficiale, hanno fatto emergere il problema della loro identificazione e elaborazione automatica nell'area della linguistica computazionale. Nel presente articolo proponiamo un primo tentativo di identificare gli elementi di una lingua ibrida ucraino-russa, il suržyk, attraverso l'adozione di regole basate sugli esempi create con gli strumenti del linguaggio di programmazione R. Il nostro studio basato sugli esempi consiste in: 1) analisi delle registrazioni di suržyk parlato raccolte da Del Gaudio (2010) nell'area di Kyjiv e creazione di un corpus scritto; 2) produzione e implementazione di regole specifiche sull’identificazione degli elementi di suržyk; 3) test del codice creato e analisi di efficacia del classificatore della lingua ibrida.
其他摘要：Language interference is common in today’s multilingual societies where more languages are in contact, and as a global result leads to the creation of hybrid languages. These, together with doubts on their right to be officially recognised, emerged the problem of their automatic identification and further elaboration in the area of computational linguistics. In this paper, we propose a first attempt to identify the elements of a Ukrainian-Russian hybrid language, Surzhyk, through the adoption of the example-based rules created with the instruments of programming language R. Our example-based study consists of: 1) analysis of spoken samples of Surzhyk registered by Del Gaudio (2010) in Kyiv area and creation of the written corpus; 2) production of specific rules on the identification of Surzhyk patterns and their implementation; 3) testing the code and analysing the effectiveness of the hybrid language classifier.
关键词：Mixed Languages; Qualitative analysis; Quantitative analysis
其他关键词：Mixed Languages;Quantitative analysis;Qualitative analysis