Towards educating and motivating the crowd – a crowdsourcing platform for harvesting the fruits of NLP students' labour (CROSBI ID 656901)
Conference paper in proceedings | original scientific paper | international peer review
Responsibility details
Jaworski, Rafał ; Seljan, Sanja ; Dunđer, Ivan
English
Towards educating and motivating the crowd – a crowdsourcing platform for harvesting the fruits of NLP students' labour
This paper presents an idea to bring crowdsourcing to a higher level, for the purpose of acquiring valuable machine translation and natural language processing resources. In the proposed scenario, students are educated in order to improve the quality and effectiveness of their natural language processing (NLP) related work. Their motivation is ensured by introducing an element of gamification: a ranking is kept, in which the best contributing users are decorated with medals. The ranking is available at all times to all users and is always up to date, so the effects of contributions are immediately visible to the users. This scenario was applied to a group of students enrolled in a Natural Language Processing course, who were given the task of collecting parallel corpora for less-resourced language pairs, in this case Croatian-English and English-Croatian. The whole experiment was supervised with the help of a custom-made open-source system named TMrepository, developed and maintained by the authors of this paper.
crowdsourcing, gamification, NLP, machine translation resources, parallel corpora, sentence alignment, less-resourced languages, Croatian, TMrepository
Contribution details
332-336.
2017.
published
Host publication details
Human language technologies as a challenge for computer science and linguistics: proceedings
Vetulani, Zygmunt ; Paroubek, Patrick
Poznań: Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu
978-83-64864-94-0
Conference details
8th Language & technology conference: human language technologies as a challenge for computer science and linguistics
lecture
17.11.2017-19.11.2017
Poznań, Poland