Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1102549

Using Machine Learning for Web Page Classification in Search Engine Optimization


Matošević, Goran; Dobša, Jasminka; Mladenić, Dunja
Using Machine Learning for Web Page Classification in Search Engine Optimization // Future Internet, 13 (2021), 1; 9, 20 doi:10.3390/fi13010009 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1102549 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Using Machine Learning for Web Page Classification in Search Engine Optimization

Autori
Matošević, Goran ; Dobša, Jasminka ; Mladenić, Dunja

Izvornik
Future Internet (1999-5903) 13 (2021), 1; 9, 20

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
search engine optimization ; SEO optimization ; on-page optimization ; classification ; machine learning

Sažetak
This paper presents a novel approach of using machine learning algorithms based on experts’ knowledge to classify web pages into three predefined classes according to the degree of content adjustment to the search engine optimization (SEO) recommendations. In this study, classifiers were built and trained to classify an unknown sample (web page) into one of the three predefined classes and to identify important factors that affect the degree of page adjustment. The data in the training set are manually labeled by domain experts. The experimental results show that machine learning can be used for predicting the degree of adjustment of web pages to the SEO recommendations—classifier accuracy ranges from 54.59% to 69.67%, which is higher than the baseline accuracy of classification of samples in the majority class (48.83%). Practical significance of the proposed approach is in providing the core for building software agents and expert systems to automatically detect web pages, or parts of web pages, that need improvement to comply with the SEO guidelines and, therefore, potentially gain higher rankings by search engines. Also, the results of this study contribute to the field of detecting optimal values of ranking factors that search engines use to rank web pages. Experiments in this paper suggest that important factors to be taken into consideration when preparing a web page are page title, meta description, H1 tag (heading), and body text—which is aligned with the findings of previous research. Another result of this research is a new data set of manually labeled web pages that can be used in further research.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove:
Fakultet organizacije i informatike, Varaždin,
Sveučilište Jurja Dobrile u Puli

Profili:

Avatar Url Jasminka Dobša (autor)

Avatar Url Goran Matošević (autor)

Poveznice na cjeloviti tekst rada:

doi www.mdpi.com

Citiraj ovu publikaciju:

Matošević, Goran; Dobša, Jasminka; Mladenić, Dunja
Using Machine Learning for Web Page Classification in Search Engine Optimization // Future Internet, 13 (2021), 1; 9, 20 doi:10.3390/fi13010009 (međunarodna recenzija, članak, znanstveni)
Matošević, G., Dobša, J. & Mladenić, D. (2021) Using Machine Learning for Web Page Classification in Search Engine Optimization. Future Internet, 13 (1), 9, 20 doi:10.3390/fi13010009.
@article{article, author = {Mato\v{s}evi\'{c}, Goran and Dob\v{s}a, Jasminka and Mladeni\'{c}, Dunja}, year = {2021}, pages = {20}, DOI = {10.3390/fi13010009}, chapter = {9}, keywords = {search engine optimization, SEO optimization, on-page optimization, classification, machine learning}, journal = {Future Internet}, doi = {10.3390/fi13010009}, volume = {13}, number = {1}, issn = {1999-5903}, title = {Using Machine Learning for Web Page Classification in Search Engine Optimization}, keyword = {search engine optimization, SEO optimization, on-page optimization, classification, machine learning}, chapternumber = {9} }
@article{article, author = {Mato\v{s}evi\'{c}, Goran and Dob\v{s}a, Jasminka and Mladeni\'{c}, Dunja}, year = {2021}, pages = {20}, DOI = {10.3390/fi13010009}, chapter = {9}, keywords = {search engine optimization, SEO optimization, on-page optimization, classification, machine learning}, journal = {Future Internet}, doi = {10.3390/fi13010009}, volume = {13}, number = {1}, issn = {1999-5903}, title = {Using Machine Learning for Web Page Classification in Search Engine Optimization}, keyword = {search engine optimization, SEO optimization, on-page optimization, classification, machine learning}, chapternumber = {9} }

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • Emerging Sources Citation Index (ESCI)
  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font