Yezh Ar Vro - The language of the country: Building the appropriation of data collection applications
Abstract
The accelerated technologization of human relationships (Sayers et al., 2021) endangers the
practice of languages for which new linguistic technologies cannot be deployed. Building
digital resources that can be used in NLP is therefore an essential task in preserving human
linguistic diversity. For speech technologies, software solutions exist for
the participatory acquisition of data, such as Common Voice (Ardila et al., 2020). It is clear, however,
that appropriation of these tools by speaking communities remains insufficient. We present here a
pilot project in the Breton linguistic context. It aims to validate the following hypothesis:
early interdisciplinary collaboration with speaking communities for the design of data
acquisition tools significantly increases their appropriation and therefore the effectiveness of
the tools.
Main file
Antoine & al. 2024. Building_appropriation_of_data_mining_applications__Translation_of_LIFT2.pdf (90.87 KB)
Origin | Files produced by the author(s) |
---|---|