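Construction of a Database of Chinese Radical Composition and Configuration and Its Application to Character-Recognition Instruction (中文部件組字與形構資料庫之建立及其在識字教學的應用)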
Date
2011-11-??
Publisher
國立臺灣師範大學教育心理學系
Department of Educational Psychology, NTNU
Abstract
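In recent years, the role of character components in teaching Chinese characters has received growing attention, which has raised the applied and research value of these components. The Chinese radical (部件) is the basic unit from which Chinese characters are built: each character is formed from one or more radicals arranged in a two-dimensional square space according to character-composition rules. Contemporary psychological research indicates that the more radicals and strokes a character has, the more complex its form and the more elaborate the mental processing a learner needs. However, few complete databases of radical composition and configuration have been built for traditional Chinese, and few exhaustive analyses of radical frequency, position, and structure exist, so teachers and learners cannot gain a comprehensive understanding of radical properties in traditional Chinese. This study therefore decomposed 6,097 frequently used Chinese characters, established 439 basic radicals, and identified 11 spatial configurations, and used them to construct a database of Chinese radical composition and configuration. Based on this database, the study derived the following indices: (1) radical frequency; (2) the frequency of each configuration among the frequent characters; (3) the frequency of each combination of radical and configuration; and (4) the derived character family of each radical, that is, all characters containing a given radical (for example, the derived characters of 「麻」 include 「嘛麼(嬤)摩麾磨(蘑)糜靡魔」). By establishing these indices and this body of knowledge about Chinese characters, the study aims to provide a reference for character and word research and for compiling teaching materials.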
Radicals, the components of Chinese characters, and their configurations are integral parts of Chinese orthography. Existing studies have established the psychological reality and the pedagogical value of radicals; however, little research has been done on the properties of radicals themselves. The present study aims to develop a data-driven, exhaustively searchable knowledge base, the Chinese Orthography Database, which consists of a radical set and a traditional Chinese character set. Four hundred and thirty-nine radicals, together with 11 configuration symbols, are used to decompose 6097 frequent characters, which are the union of two sets of frequent characters defined by the Big-5 encoding method and by the Chinese Knowledge and Information Processing group. These frequent characters are described in terms of their constituent radicals and configurations and analyzed exhaustively to produce several orthographic indices: (a) radical frequency by type and by token, (b) configuration frequency, (c) position-based radical frequency, and (d) neighborhood sizes of radicals. Several applications of the Chinese Orthography Database are discussed to assist researchers in constructing experimental materials and educators in teaching Chinese.
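The indices listed in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the record layout, the configuration and position labels, the function name `build_indices`, and the toy decompositions (which treat 「麻」 as a single unit) are hypothetical and do not reproduce the database's actual 439 radicals, 11 configuration symbols, or entries. Only type-based counts are shown; a token-based count would additionally weight each character by its usage frequency, which is not available here.

```python
from collections import Counter, defaultdict

# Hypothetical toy records, NOT taken from the database:
# (character, configuration label, [(radical, position), ...])
RECORDS = [
    ("嘛", "left-right", [("口", "left"), ("麻", "right")]),
    ("摩", "top-bottom", [("麻", "top"), ("手", "bottom")]),
    ("磨", "top-bottom", [("麻", "top"), ("石", "bottom")]),
    ("魔", "top-bottom", [("麻", "top"), ("鬼", "bottom")]),
    ("明", "left-right", [("日", "left"), ("月", "right")]),
]


def build_indices(records):
    """Compute four type-based indices from decomposition records."""
    radical_freq = Counter()      # characters containing each radical
    config_freq = Counter()       # characters using each configuration
    position_freq = Counter()     # occurrences of (radical, position) pairs
    families = defaultdict(set)   # radical -> set of characters containing it

    for char, config, parts in records:
        config_freq[config] += 1
        for radical, position in parts:
            position_freq[(radical, position)] += 1
            families[radical].add(char)
        # Count each character once per distinct radical (type frequency).
        for radical in {r for r, _ in parts}:
            radical_freq[radical] += 1

    return radical_freq, config_freq, position_freq, families


radical_freq, config_freq, position_freq, families = build_indices(RECORDS)

# Neighborhood size of a radical: the size of its derived character family.
neighborhood_size = {radical: len(chars) for radical, chars in families.items()}
```

In this toy data, `families["麻"]` gathers the characters that contain 「麻」, mirroring the derived character family described in the abstract, and `neighborhood_size["麻"]` is the size of that family.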