Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1126245

Improving public sector efficiency using advanced text mining in the procurement process


Modrušan, Nikola; Rabuzin, Kornelije; Mršić, Leo
Improving public sector efficiency using advanced text mining in the procurement process // Proceedings of the 9th International Conference on Data Science, Technology and Applications / Hammoudi, Slimane ; Quix, Christoph ; Bernardino, Jorge (ur.).
Setúbal: SCITEPRESS, 2020. str. 200-206 doi:10.5220/0009823102000206 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 1126245 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Improving public sector efficiency using advanced text mining in the procurement process

Autori
Modrušan, Nikola ; Rabuzin, Kornelije ; Mršić, Leo

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the 9th International Conference on Data Science, Technology and Applications / Hammoudi, Slimane ; Quix, Christoph ; Bernardino, Jorge - Setúbal : SCITEPRESS, 2020, 200-206

ISBN
978-989-758-440-4

Skup
9th International Conference on Data Science, Technology and Applications (DATA 2020)

Mjesto i datum
Portugal, 07.07.2020. - 09.07.2020

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
Text Mining, Natural Language Processing, Rule Extraction, Automatic Extraction, Data Mining, Knowledge Discovery, Fraud Detection ; Corruption Indices ; Public Procurement ; Big Data
(Fraud Detection ; Corruption Indices ; Public Procurement ; Big Data)

Sažetak
The analysis of the Public Procurement Processes (PPP) and the detection of suspicious or corrupt procedures is an important topic, especially for improving the process’s transparency and for protecting public financial interests. Creating a quality model as a foundation to perform a quality analysis largely depends on the quality and volume of data that is analyzed. It is important to find a way to identify anomalies before they occur and to prevent any kind of harm that is of public interest. For this reason, we focused our research on an early phase of the PPP, the preparation of the tender documentation. During this phase, it is important to collect documents, detect and extract quality content from it, and analyze this content for any possible manipulation of the PPP’s outcome. Part of the documentation related to defining the rules and restrictions for the PPP is usually within a specific section of the documents, often called “technical and professional ability.” In previous studies, the authors extracted and processed these sections and used extracted content in order to develop a prediction model for indicating fraudulent activities. As the criteria and conditions can also be found in other parts of the PPP’s documentation, the idea of this research is to detect additional content and to investigate its impact on the outcome of the prediction model. Therefore, our goal was to determine a list of relevant terms and to develop a data science model finding and extracting terms in order to improve the predictions of suspicious tender. An evaluation was conducted based on an initial prediction model trained with the extracted content as additional input parameters. The training results show a significant improvement in the output metrics. This study presents a methodology for detecting the content needed to predict suspicious procurement procedures, for measuring the relevance of extracted terms, and for storing the most important information in a relational structure in a database.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove:
Fakultet organizacije i informatike, Varaždin

Profili:

Avatar Url Kornelije Rabuzin (autor)

Avatar Url Leo Mršić (autor)

Poveznice na cjeloviti tekst rada:

doi www.scitepress.org

Citiraj ovu publikaciju:

Modrušan, Nikola; Rabuzin, Kornelije; Mršić, Leo
Improving public sector efficiency using advanced text mining in the procurement process // Proceedings of the 9th International Conference on Data Science, Technology and Applications / Hammoudi, Slimane ; Quix, Christoph ; Bernardino, Jorge (ur.).
Setúbal: SCITEPRESS, 2020. str. 200-206 doi:10.5220/0009823102000206 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Modrušan, N., Rabuzin, K. & Mršić, L. (2020) Improving public sector efficiency using advanced text mining in the procurement process. U: Hammoudi, S., Quix, C. & Bernardino, J. (ur.)Proceedings of the 9th International Conference on Data Science, Technology and Applications doi:10.5220/0009823102000206.
@article{article, author = {Modru\v{s}an, Nikola and Rabuzin, Kornelije and Mr\v{s}i\'{c}, Leo}, year = {2020}, pages = {200-206}, DOI = {10.5220/0009823102000206}, keywords = {Text Mining, Natural Language Processing, Rule Extraction, Automatic Extraction, Data Mining, Knowledge Discovery, Fraud Detection, Corruption Indices, Public Procurement, Big Data}, doi = {10.5220/0009823102000206}, isbn = {978-989-758-440-4}, title = {Improving public sector efficiency using advanced text mining in the procurement process}, keyword = {Text Mining, Natural Language Processing, Rule Extraction, Automatic Extraction, Data Mining, Knowledge Discovery, Fraud Detection, Corruption Indices, Public Procurement, Big Data}, publisher = {SCITEPRESS}, publisherplace = {Portugal} }
@article{article, author = {Modru\v{s}an, Nikola and Rabuzin, Kornelije and Mr\v{s}i\'{c}, Leo}, year = {2020}, pages = {200-206}, DOI = {10.5220/0009823102000206}, keywords = {Fraud Detection, Corruption Indices, Public Procurement, Big Data}, doi = {10.5220/0009823102000206}, isbn = {978-989-758-440-4}, title = {Improving public sector efficiency using advanced text mining in the procurement process}, keyword = {Fraud Detection, Corruption Indices, Public Procurement, Big Data}, publisher = {SCITEPRESS}, publisherplace = {Portugal} }

Časopis indeksira:


  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font