A Robust statistical model of word frequencies

dc.contributor.author: Ingelby, Michael
dc.contributor.author: Sharoff, Serge
dc.date.accessioned: 2021-05-07T12:04:35Z
dc.date.available: 2021-05-07T12:04:35Z
dc.date.issued: 2019-08
dc.description: Paper presented at the 5th Strathmore International Mathematics Conference (SIMC 2019), 12-16 August 2019, Strathmore University, Nairobi, Kenya
dc.description.abstract: For the purposes of language teaching or automatic language processing, it is important to know how frequent a word is. However, simply counting the number of times a word occurs in a collection of texts produces many unwanted artefacts, because some words occur very often in only a small number of texts, leading to frequency bursts. In this paper we introduce a statistical model that uses methods from robust statistics to estimate the frequencies of words in a collection of texts.
dc.description.sponsorship: University of Leeds, United Kingdom.
dc.identifier.uri: http://hdl.handle.net/11071/10467
dc.language.iso: en_US
dc.publisher: Strathmore University
dc.subject: Robust statistics
dc.subject: Word frequencies
dc.subject: Core lexicon
dc.title: A Robust statistical model of word frequencies
dc.type: Article
Files
Original bundle
Name: A robust statistical model of word frequencies.pdf
Size: 89 KB
Format: Adobe Portable Document Format
Description: Abstract - SIMC Conference paper, 2019