Bibliographic record no. 884405

Detection of Near-duplicate Documents Using Simhash Algorithm


Yagüe Gonzalez, Daniel
Detection of Near-duplicate Documents Using Simhash Algorithm, 2017, undergraduate thesis, Fakultet elektrotehnike i računarstva, Zagreb


CROSBI ID: 884405

Title
Detection of Near-duplicate Documents Using Simhash Algorithm
(Otkrivanje sličnih dokumenata koristeći algoritam simhash)

Authors
Yagüe Gonzalez, Daniel

Type, subtype and category of work
Theses, graduation thesis, undergraduate

Faculty
Fakultet elektrotehnike i računarstva (Faculty of Electrical Engineering and Computing)

Place
Zagreb

Date
25 June

Year
2017

Pages
27

Mentor
Vladimir, Klemo

Keywords
simhash algorithm; Hamming distance; near-duplicate detection; document fingerprint

Abstract
The thesis describes methods for detecting near-duplicate textual documents and explains the Simhash algorithm and Hamming distance. It presents a C++ implementation of the Simhash algorithm, tested on a collection of texts, and evaluates the method and its efficiency.
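The two building blocks named in the abstract can be illustrated with a minimal C++ sketch. This is not the thesis's implementation, which the record does not reproduce. It assumes whitespace tokenization, unweighted tokens, a 64-bit fingerprint, and FNV-1a as the per-token hash; the function names `fnv1a64`, `simhash`, and `hamming_distance` are hypothetical.

```cpp
#include <array>
#include <bitset>
#include <cstdint>
#include <sstream>
#include <string>

// 64-bit FNV-1a hash of a token (an assumed choice; any well-mixed hash works).
std::uint64_t fnv1a64(const std::string& token) {
    std::uint64_t h = 14695981039346656037ULL;  // FNV offset basis
    for (unsigned char c : token) {
        h ^= c;
        h *= 1099511628211ULL;  // FNV prime
    }
    return h;
}

// Simhash fingerprint: each token votes +1/-1 on every bit position,
// and a bit is set in the fingerprint when its tally is positive.
// Assumption: tokens are whitespace-separated and all have weight 1.
std::uint64_t simhash(const std::string& text) {
    std::array<int, 64> tally{};
    std::istringstream words(text);
    std::string token;
    while (words >> token) {
        const std::uint64_t h = fnv1a64(token);
        for (int bit = 0; bit < 64; ++bit) {
            tally[bit] += ((h >> bit) & 1ULL) ? 1 : -1;
        }
    }
    std::uint64_t fingerprint = 0;
    for (int bit = 0; bit < 64; ++bit) {
        if (tally[bit] > 0) fingerprint |= (1ULL << bit);
    }
    return fingerprint;
}

// Hamming distance: number of bit positions in which two fingerprints differ.
int hamming_distance(std::uint64_t a, std::uint64_t b) {
    return static_cast<int>(std::bitset<64>(a ^ b).count());
}
```

In a sketch like this, two documents would be treated as near-duplicates when `hamming_distance(simhash(a), simhash(b))` is at or below a chosen threshold. The record does not state the threshold or tokenization used in the thesis, so both would need to be chosen for the collection at hand.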

Original language
English

Scientific fields
Computing



WORK ASSOCIATIONS


Institutions:
Fakultet elektrotehnike i računarstva, Zagreb

Profiles:

Klemo Vladimir (mentor)


Cite this publication:

Yagüe Gonzalez, Daniel
Detection of Near-duplicate Documents Using Simhash Algorithm, 2017, undergraduate thesis, Fakultet elektrotehnike i računarstva, Zagreb
Yagüe Gonzalez, D. (2017) 'Detection of Near-duplicate Documents Using Simhash Algorithm', undergraduate thesis, Fakultet elektrotehnike i računarstva, Zagreb.
@phdthesis{phdthesis,
  author = {Yag\"{u}e Gonzalez, Daniel},
  title = {Detection of Near-duplicate Documents Using Simhash Algorithm},
  year = {2017},
  pages = {27},
  keywords = {simhash algorithm, hamming distance, near-duplicate detection, document fingerprint},
  publisherplace = {Zagreb}
}



