:::台灣日、西、德語學習者語料庫:::

中文 | English

「臺灣多國語言學習者語料庫系統」是2008.8~2011.7執行成大頂尖計畫「多國語言語料庫之建構與研究」之部分研究成果。目前（2018.1~）日文和西文語料庫相關工具之開發與研究仍在科技部專題研究計畫補助下持續延伸與運作。

建構之目的

近二十年來以語料庫為本之相關研究在語言學不同領域的發展中，逐漸成為趨勢與潮流；如：「語料庫語言學」的研究即是國科會人文學研究中心所公佈的【語言學門熱門及前瞻學術研究議題調查計畫】中「台灣語言學門的前瞻議題」之一。而語語料庫的建構則可透過資源共享，結合語言學次領域的研究焦點，讓語言研究學者間產生更多的互動，進而擴大語言學的研究層面。但相較於英語之研發成果，其它外語則落後甚多。有鑑於此，「成功大學多國語語料庫」將跨中、德、日、西語言的整合模式涵蓋東西不同語種、語言形式及語言特色的語料庫的建構工作，並以其為本進行不同層面的語言分析、理論發展。此外，更進一步，以建構與研究之成果為基礎建置網路課程，希望成功結合網路科技的多元發展於語料庫的建構和應用中。預期語料庫建構與以之為本的研究成果將可提供相關學者參考，而開發出的語料庫為本的數位教材更可嘉惠學子，使其融入資訊科技和外語的學習，符合時代知識的發展潮流。

The Aim of the Construction for the Multilingual Corpora

In the last twenty years, there have been a growing number of studies based on corpus linguistics. One example is the report "Taiwanese Prospective Linguistic Issues" that was published as part of the Linguistics Field and Academic Research Investigation Project, produced by the National Science Council's Department of Humanities and Social Sciences in Taiwan. Through sharing resources and combining the research efforts in the subfield of linguistics, our construction of corpus linguistics aims to create more interactions between researchers in order to extend the reach of linguistics studies. To date, most studies on linguistic corpora focus on English, and thus the "National Cheng Kung University Multilingual Corpora" project will include Chinese, German, Japanese and Spanish materials, examining various linguistic forms and features, and conducting both linguistic analysis and theoretical development. In addition, based on the results of this Multilingual Corpora, this project aims to construct web-based lessons, and the results are expected to act as a reference for other researchers who aim to further develop digital teaching materials.

特色

跨語言之多國語語料庫在語料庫之開發與研究領域獨具特色。在全世界尚未有跨東方、西方語種且提供公開檢索服務之第二外語學習者語料庫；在亞洲僅有英語或日語之學習者語料庫，而且甚少收錄台灣學習者之語料；在台灣的成功大學，以現有資源及人力得以進行具註記之多國語整合計畫實具有嶄新的學術價值和開創性意義。

Features

This project is unique in the corpus linguistics field, as to date there have been no such works that combine both. Western and Asian languages and also allow public searches. Moreover, in Asia, there are few English or Japanese corpora linguistics projects, and they rarely collect material from Taiwanese learners. However, National Cheng Kung University in Taiwan, with its abundant resources, is able to execute the integration of multiple corpora linguistics in this innovative project.

現階段規劃與未來發展

本整合計畫涵蓋中、德、日、西四個語言，語料對象的收集來源是第二外語的學習者，現階段的語料類型以書面語語料為收集重點。日後，將進行跨語言口語形式之語料庫建構。此外，各語言亦將根據研究的分析成果發展理論，結合文字與聲音的多媒體語料庫，輔以學習與網路科技之整合，編撰建置而成以通過教育部認證為目標之線上數位多媒體的外語學習課程。

Present plans and future developments

This integrative project covers four languages: Chinese, German, Japanese and Spanish. The material will be obtained from second language learners, and the current stage of this project focuses on the collection of written material for the corpus. Later on, this project will move on to the construction of the corpora linguistics by focusing on the oral form in all four languages. In addition, this project will develop original theories based on the research results, focusing on multimedia-based corpora linguistics to assist in the integration of learning and internet-based technology so as to meet the standards set by the Minister of Education with regard to on-line language learning materials.

團隊成員

鄭錦全院士（顧問）
成功大學：
            外文系  盧慧娟（西班牙文）
            外文系  黃淑妙（日文）
            外文系  謝菁玉（德文）
            外文系  劉繼仁（教學科技）
            資工系  盧文祥（語料庫技術）
東京大學：楊凱栄（中文）

Project Members

Chin-Chuan Cheng academician （consultant）
National Cheng Kung University：
Department of Foreign Languages
Hui-Chuan Lu（Spanish）
Su-Miao Huang（Japanese）
Ching-Yu Hsieh（German）
Gi-Zen Liu（Digital Language Learning）
Department of Computer Science
Wen-Hsiang Lu（Corpus Technology）
Tokyo University： Kai-Rong Yang（Chinese）