Improved sentence retrieval using local context and sentence length (CROSBI ID 193820)
Journal contribution | original scientific paper | international peer review
Responsibility details
Doko, Alen ; Štula, Maja ; Šerić, Ljiljana
English
In this paper we propose improved variants of TF-ISF, a sentence retrieval method that adapts TF-IDF (Term Frequency – Inverse Document Frequency) to sentences. The improvement is achieved by using a context of neighboring sentences while also promoting the retrieval of longer sentences. We thoroughly compare the new, modified TF-ISF methods with the TF-ISF baseline, with tfmix, an earlier attempt to include context in TF-ISF, and with 3MMPDS, a language-modeling-based method that uses context and promotes the retrieval of long sentences. Experimental results show that the TF-ISF method can be improved by using local context and, separately, by promoting the retrieval of longer sentences. Finally, we show that the best results are achieved when both modifications are combined. All new methods (TF-ISF variants) also give statistically significantly better results than the other tested methods.
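For readers unfamiliar with the baseline, the following is a minimal sketch of TF-ISF scoring. This record does not state the formula, so the ranking function used here (a product of log-scaled query term frequency, log-scaled sentence term frequency, and an inverse sentence frequency term) is an assumption based on a commonly used formulation and may differ from the paper's. The function names `tokenize` and `tf_isf_scores` are hypothetical. The context and sentence-length variants are not reproduced, since the abstract does not specify them.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive lowercase whitespace tokenizer (an assumption for illustration);
    # a real system would also handle punctuation, stop words and stemming.
    return text.lower().split()


def tf_isf_scores(query, sentences):
    """Score every sentence against the query with an assumed TF-ISF formula."""
    n = len(sentences)
    sentence_tfs = [Counter(tokenize(s)) for s in sentences]

    # sf[t] = number of sentences that contain term t at least once
    sf = Counter()
    for tf in sentence_tfs:
        sf.update(tf.keys())

    query_tf = Counter(tokenize(query))
    scores = []
    for tf in sentence_tfs:
        score = 0.0
        for term, q_count in query_tf.items():
            # Inverse sentence frequency: rarer terms get a larger weight
            isf = math.log((n + 1) / (0.5 + sf[term]))
            # A term absent from the sentence contributes log(0 + 1) = 0
            score += math.log(q_count + 1) * math.log(tf[term] + 1) * isf
        scores.append(score)
    return scores
```

Calling `tf_isf_scores("cat", ["the cat sat on the mat", "dogs bark loudly", "a cat chased a cat"])` returns one score per sentence; sentences that share no term with the query score zero, and repeated query terms raise a sentence's score.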
sentence retrieval; TF-ISF; context; sentence length
Publication details
49 (6)
2013.
1301-1312
published
0306-4573
10.1016/j.ipm.2013.06.004
Related fields
Computer science