Pregled bibliografske jedinice broj: 1079139
A dictionary for translation from natural to formal data model language
A dictionary for translation from natural to formal data model language // Computational intelligence, 37 (2021), 1; 87-127 doi:10.1111/coin.12393 (međunarodna recenzija, članak, znanstveni)
CROSBI ID: 1079139 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
A dictionary for translation from natural to
formal data model language
Autori
Šuman, Sabrina ; Jakupović, Alen ; Marinac, Mladen
Izvornik
Computational intelligence (0824-7935) 37
(2021), 1;
87-127
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
Data modeling ; Dictionary ; Formal language ; Knowledge‐based system ; Linguistic corpus ; Machine translation ; Pattern recognition
Sažetak
The paper describes our current research activities and results related to developing knowledge‐based systems to support the creation of entity‐relationship (ER) models. The authors based obtaining an ER model in textual form on translation from one language into another, that is, from an English controlled natural language into the formalized language of an ER data model. Our translation method consisted of creating translation rules of sentential form parts into ER model constructs based on the textual and character patterns detected in the business descriptions. To enable the computer analyses necessary for creating translation mechanisms, we created a linguistic corpus that contains lists of the business descriptions and the texts of other business materials. From the corpus, we then created a specific dictionary and linguistic rules to automate the business descriptions' translation into the ER data model language. Before that, however, the corpus was enriched by adding annotations to the words related to ER data model constructs. In this paper, we also present the main issues uncovered during the translation process and offer a possible solution with utility evaluation: applying information‐extraction performance measures to a set of sentences from the corpus.
Izvorni jezik
Engleski
Znanstvena područja
Interdisciplinarne tehničke znanosti, Informacijske i komunikacijske znanosti, Interdisciplinarne humanističke znanosti
Citiraj ovu publikaciju:
Časopis indeksira:
- Current Contents Connect (CCC)
- Web of Science Core Collection (WoSCC)
- Science Citation Index Expanded (SCI-EXP)
- SCI-EXP, SSCI i/ili A&HCI
- Scopus