Word-based largest chunks for Agreement Groups processing: Cross-linguistic observations

dc.contributor.authorDrienkó, László
dc.date.accessioned2024-05-17T11:26:38Z
dc.date.available2024-05-17T11:26:38Z
dc.date.issued2020
dc.description.abstractThe present study reports results from a series of computer experiments seeking to combine word-based Largest Chunk (LCh) segmentation and Agreement Groups (AG) sequence processing. The AG model is based on groups of similar utterances that enable combinatorial mapping of novel utterances. LCh segmentation is concerned with cognitive text segmentation, i.e. with detecting word boundaries in a sequence of linguistic symbols. Our observations are based on the text of Le petit prince (The little prince) by Antoine de Saint-Exupéry in three languages: French, English, and Hungarian. The data suggest that word-based LCh segmentation is not very efficient with respect to utterance boundaries, however, it can provide useful word combinations for AG processing. Typological differences between the languages are also reflected in the results.
dc.identifier.citation"Linguistics Beyond and Within", 2020, Vol. 6, pp. 60-73
dc.identifier.doi10.31743/lingbaw.11831
dc.identifier.issn2450-5188
dc.identifier.urihttps://hdl.handle.net/20.500.12153/7091
dc.language.isoen
dc.publisherWydawnictwo KUL
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectcognitive computer modelling
dc.subjectsegmentation
dc.subjectsyntactic processing
dc.subjectlanguage acquisition
dc.titleWord-based largest chunks for Agreement Groups processing: Cross-linguistic observations
dc.typeinfo:eu-repo/semantics/article
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Drienko_Laszlo_Word-based_largest_chunks_for_Agreement_Groups_processing.pdf
Size:
157.55 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.81 KB
Format:
Item-specific license agreed upon to submission
Description: