Pregled bibliografske jedinice broj: 931709
FAQIR – A Frequently Asked Questions Retrieval Test Collection
FAQIR – A Frequently Asked Questions Retrieval Test Collection // Proceedings of the 19th International Conference on Text, Speech, and Dialogue / Sojka, Petr ; Horák, Aleš ; Kopeček, Ivan ; Pala, Karel (ur.).
Brno: Springer, 2016. str. 74-81 doi:10.1007/978-3-319-45510-5_9 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 931709 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
FAQIR – A Frequently Asked Questions Retrieval Test Collection
Autori
Karan, Mladen ; Šnajder, Jan
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the 19th International Conference on Text, Speech, and Dialogue
/ Sojka, Petr ; Horák, Aleš ; Kopeček, Ivan ; Pala, Karel - Brno : Springer, 2016, 74-81
ISBN
978-3-319-45510-5
Skup
International Conference on Text, Speech, and Dialogue
Mjesto i datum
Brno, Češka Republika, 12.09.2016. - 16.09.2016
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Frequently asked questions, Information retrieval, Question answering
Sažetak
Frequently asked question (FAQ) collections are commonly used across the web to provide information about a specific domain (e.g., services of a company). With respect to traditional information retrieval, FAQ retrieval introduces additional challenges, the main ones being (1) the brevity of FAQ texts and (2) the need for topic-specific knowledge. The primary contribution of our work is a new domain-specific FAQ collection, providing a large number of queries with manually annotated relevance judgments. On this collection, we test several unsupervised baseline models, including both count based and semantic embedding based models, as well as a combined model. We evaluate the performance across different setups and identify potential venues for improvement. The collection constitutes a solid basis for research in supervised machine-learning-based FAQ retrieval.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
036-1300646-1986 - Otkrivanje znanja u tekstnim podacima (Dalbelo-Bašić, Bojana, MZO ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb