Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1101764

Od specijaliziranih mrežnih korpusa do rječnika za neizvorne govornike


Srdanović, Irena
Od specijaliziranih mrežnih korpusa do rječnika za neizvorne govornike // Rasprave Instituta za hrvatski jezik i jezikoslovlje, 46 (2020), 2; 1059-1083 doi:10.31724/rihjj.46.2.31 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1101764 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Od specijaliziranih mrežnih korpusa do rječnika za neizvorne govornike
(From Specialized Web Corpora of Tourism to a Learner’s Dictionary)

Autori
Srdanović, Irena

Izvornik
Rasprave Instituta za hrvatski jezik i jezikoslovlje (1331-6745) 46 (2020), 2; 1059-1083

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
izgradnja korpusa ; tehnologija BootCat ; područje turizma ; rječnik za neizvorne govornike ; Sketch Engine ; specijalizirani mrežni korpus...
(corpus building ; BootCat technology ; tourism domain ; learners’s dictionary ; Sketch Engine ; specialized web corpus of Croatian tourism in Japanese)

Sažetak
This paper presents the two approaches used in creating specialized web corpora of Croatian tourism in Japanese for their usage in building a specialized learners’ dictionary. Both approaches use the WebBootCat technology (Baroni et al. 2006, Kilgarriff et al. 2014) to automatically create specialized web corpora. The first approach creates the corpora from the selected seed words most relevant to the topic. The second approach specifies a number of web pages that cover tourism-oriented information on specified regions, cities, and sites in Croatia available in Japanese, which are then used for web corpora creation inside the Sketch Engine platform. Both approaches provide specialized web corpora small in size, but quite useful for lexical profiling in the specific field of tourism. In the process of dictionary creation, the second approach has proven to be especially useful for the selection of lexical items, while both approaches have proven to be highly useful for the exploration and selection of authentic examples from the corpora. The research exposes some shortcomings in Japanese language processing, such as errors in the lemmatization of some culturally specific terms and indicates the need to refine existing language processing tools in Japanese. The Japanese- Croatian bilingual learner’s dictionary (Srdanović 2018) is currently in the pilot phase and is being used and built by learners and teachers through the open-source dictionary platform Lexonomy (Mechura 2017). In addition to the fact that work on the bilingual dictionary is useful as a means for training students in language analysis and description using modern technologies (e.g. corpora, corpus query systems, dictionary editing platform), the dictionary is also important in educating new personnel capable of working in tourism using the Japanese language, which is strongly needed. In future, the same approach could be used for creating specialized corpora and dictionaries for Japanese and other language pairs.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Pedagogija, Filologija, Interdisciplinarne humanističke znanosti



POVEZANOST RADA


Ustanove:
Sveučilište Jurja Dobrile u Puli

Profili:

Avatar Url Irena Srdanović (autor)

Citiraj ovu publikaciju:

Srdanović, Irena
Od specijaliziranih mrežnih korpusa do rječnika za neizvorne govornike // Rasprave Instituta za hrvatski jezik i jezikoslovlje, 46 (2020), 2; 1059-1083 doi:10.31724/rihjj.46.2.31 (međunarodna recenzija, članak, znanstveni)
Srdanović, I. (2020) Od specijaliziranih mrežnih korpusa do rječnika za neizvorne govornike. Rasprave Instituta za hrvatski jezik i jezikoslovlje, 46 (2), 1059-1083 doi:10.31724/rihjj.46.2.31.
@article{article, author = {Srdanovi\'{c}, Irena}, year = {2020}, pages = {1059-1083}, DOI = {10.31724/rihjj.46.2.31}, keywords = {izgradnja korpusa, tehnologija BootCat, podru\v{c}je turizma, rje\v{c}nik za neizvorne govornike, Sketch Engine, specijalizirani mre\v{z}ni korpus...}, journal = {Rasprave Instituta za hrvatski jezik i jezikoslovlje}, doi = {10.31724/rihjj.46.2.31}, volume = {46}, number = {2}, issn = {1331-6745}, title = {Od specijaliziranih mre\v{z}nih korpusa do rje\v{c}nika za neizvorne govornike}, keyword = {izgradnja korpusa, tehnologija BootCat, podru\v{c}je turizma, rje\v{c}nik za neizvorne govornike, Sketch Engine, specijalizirani mre\v{z}ni korpus...} }
@article{article, author = {Srdanovi\'{c}, Irena}, year = {2020}, pages = {1059-1083}, DOI = {10.31724/rihjj.46.2.31}, keywords = {corpus building, BootCat technology, tourism domain, learners’s dictionary, Sketch Engine, specialized web corpus of Croatian tourism in Japanese}, journal = {Rasprave Instituta za hrvatski jezik i jezikoslovlje}, doi = {10.31724/rihjj.46.2.31}, volume = {46}, number = {2}, issn = {1331-6745}, title = {From Specialized Web Corpora of Tourism to a Learner’s Dictionary}, keyword = {corpus building, BootCat technology, tourism domain, learners’s dictionary, Sketch Engine, specialized web corpus of Croatian tourism in Japanese} }

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • Emerging Sources Citation Index (ESCI)
  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font