Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Improved Methods of Word Acquisition in developing Hascheck Spell Checker Web Service System (CROSBI ID 547487)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Pavlek, Jakov ; Dembitz, Šandor ; Matasić, Marko Improved Methods of Word Acquisition in developing Hascheck Spell Checker Web Service System // X International PhD Workshop OWD 2008 / Grzegorz Kłapyta (ur.). Gliwice: PTETiS, 2008. str. 029-034

Podaci o odgovornosti

Pavlek, Jakov ; Dembitz, Šandor ; Matasić, Marko

engleski

Improved Methods of Word Acquisition in developing Hascheck Spell Checker Web Service System

Public service Hascheck (Croatian Academic Spell CHECKer) is a free Web service on the global level with continually growing base of its users and with rapidly increasing service volume. In this paper we discuss methods used for processing and learning new, previously unknown words to the Hascheck system. Interface for manual word acquisition has been developed using Google Web Search engine from appropriate given domains as a part of the improvement of the Hascheck service. In this matter already existing systematized knowledge resources, specifically Wikipedia and Croatian Spell Checker for MS Word, have been intensively used. Program modules for automatic retrieval and classification of word types based on information about domain, language, and way of spelling have been developed. As a result, some 135000 of new word types have been processed and classified into adequate classes using the developed software. We also evaluate earlier methods used in the same process and compare them to the new ones regarding their accuracy, efficiency and the time they take to process words. Combining new methods the processing of word types, that is, supervised learning in the Hascheck system, has been accelerated and the time of decision-making process has been significantly reduced.

spell checker; word acquisition; web service; Google Search; Wikipedia

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

029-034.

2008.

objavljeno

Podaci o matičnoj publikaciji

X International PhD Workshop OWD 2008

Grzegorz Kłapyta

Gliwice: PTETiS

Podaci o skupu

X International PhD Workshop OWD 2008

predavanje

18.10.2008-21.10.2008

Wisła, Poljska

Povezanost rada

Elektrotehnika