LingBaW. Linguistics Beyond and Within, 2020, Vol. 6
Permanent URI for this collection
Browse
Browsing LingBaW. Linguistics Beyond and Within, 2020, Vol. 6 by Author "Drienkó, László"
Now showing 1 - 1 of 1
Results Per Page
Sort Options
- ItemWord-based largest chunks for Agreement Groups processing: Cross-linguistic observations(Wydawnictwo KUL, 2020) Drienkó, LászlóThe present study reports results from a series of computer experiments seeking to combine word-based Largest Chunk (LCh) segmentation and Agreement Groups (AG) sequence processing. The AG model is based on groups of similar utterances that enable combinatorial mapping of novel utterances. LCh segmentation is concerned with cognitive text segmentation, i.e. with detecting word boundaries in a sequence of linguistic symbols. Our observations are based on the text of Le petit prince (The little prince) by Antoine de Saint-Exupéry in three languages: French, English, and Hungarian. The data suggest that word-based LCh segmentation is not very efficient with respect to utterance boundaries, however, it can provide useful word combinations for AG processing. Typological differences between the languages are also reflected in the results.