
Bibliographic record number: 915533

Towards educating and motivating the crowd – a crowdsourcing platform for harvesting the fruits of NLP students' labour


(Adam Mickiewicz University in Poznań, Poland) Jaworski, Rafał; Seljan, Sanja; Dunđer, Ivan
Towards educating and motivating the crowd – a crowdsourcing platform for harvesting the fruits of NLP students' labour // Human Language Technologies as a Challenge for Computer Science and Linguistics / Vetulani, Zygmunt ; Paroubek, Patrick (eds.).
Poznan: Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu, 2017. pp. 332-336 (lecture, international peer review, full paper (in extenso), scientific)


Title
Towards educating and motivating the crowd – a crowdsourcing platform for harvesting the fruits of NLP students' labour

Authors
Jaworski, Rafał ; Seljan, Sanja ; Dunđer, Ivan

Collaboration
Adam Mickiewicz University in Poznań, Poland

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Human Language Technologies as a Challenge for Computer Science and Linguistics / Vetulani, Zygmunt ; Paroubek, Patrick - Poznan : Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu, 2017, 332-336

ISBN
978-83-64864-94-0

Conference
8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics

Place and date
Poznan, Poland, 17-19 November 2017

Type of participation
Lecture

Type of review
International peer review

Keywords
Crowdsourcing, gamification, NLP, machine translation resources, parallel corpora, sentence alignment, less-resourced languages, Croatian, TMrepository

Abstract
This paper presents an idea for bringing crowdsourcing to a higher level, for the purpose of acquiring valuable machine translation and natural language processing (NLP) resources. In the proposed scenario, students are educated in order to improve the quality and effectiveness of their NLP-related work. Their motivation is ensured by introducing an element of gamification – a ranking is kept, in which the best contributing users are decorated with medals. The ranking is available at all times to all users and is always up to date, so the effects of contributions are immediately visible to the users. This scenario was applied to a group of students enrolled in a Natural Language Processing course, who were given the task of collecting parallel corpora for less-resourced language pairs, in this case Croatian-English and English-Croatian. The whole experiment was supervised with the help of a custom-made open-source system named TMrepository, developed and maintained by the authors of this paper.

Original language
English

Scientific fields
Computing, Information and Communication Sciences



AFFILIATIONS OF THE WORK


Institutions
Filozofski fakultet (Faculty of Humanities and Social Sciences), Zagreb