Institutional Repository

Experimental Bootstrapping of Morphological Analysers for Nguni Languages

Show simple item record

dc.contributor.author Bosch, Sonja E.
dc.contributor.author Pretorius, Laurette
dc.contributor.author Fleisch, Axel
dc.date.accessioned 2016-09-30T12:38:42Z
dc.date.available 2016-09-30T12:38:42Z
dc.date.issued 2008
dc.identifier.citation Bosch, Sonja, Pretorius, Laurette, Fleisch, Axel. 2008. Experimental Bootstrapping of Morphological Analysers for Nguni Languages. Nordic Journal of African Studies 17(2):66-88. ISSN 1459-9465 en
dc.identifier.issn 1459-9465
dc.identifier.uri http://hdl.handle.net/10500/21557
dc.description.abstract This paper addresses the experimental bootstrapping of the development of broad-coverage finite-state morphological analysers for Xhosa, Swati and (Southern) Ndebele by using an existing prototype of a morphological analyser for Zulu. These languages are both morphologically complex and resource-scarce. The research question is whether bootstrapping is feasible across the language boundaries between these closely related varieties. The objective is an assessment of the recognition rates yielded by the Zulu morphological analyser for the three related languages. The strategy is to use bootstrapping techniques that consist of the following steps: applying the analyser to corpus data from all languages, identifying (types of) failures, and implementing the respective changes in the analyser. The results show that the high degree of shared typological properties and formal similarities among the Nguni varieties warrants a modular bootstrapping approach. Word forms in these languages that were recognized by the Zulu analyser were mostly adequately analysed. Therefore, the focus lies on providing the necessary adaptations based on an analysis of the failure output for each language. As a result, the development of analysers for Xhosa, Swati and Ndebele is considerably faster than the creation of the Zulu prototype. The paper concludes with comments on the feasibility of the experiment, and the results of the evaluation. en
dc.language.iso en en
dc.publisher Nordic Association of African Studies en
dc.relation.ispartofseries Nordic Journal of African Studies;17(2)
dc.subject Nguni languages, broad-coverage finite-state morphological analysis, agglutinating morphological structure, resource-scarce languages en
dc.title Experimental Bootstrapping of Morphological Analysers for Nguni Languages en
dc.type Article en
dc.description.department African Languages en


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search UnisaIR


Browse

My Account

Statistics