• No results found

V Exploring natural language processing for single-word and multi-word lexical complexity from a second language learner perspective

N/A
N/A
Protected

Academic year: 2021

Share "V Exploring natural language processing for single-word and multi-word lexical complexity from a second language learner perspective"

Copied!
1
0
0

Loading.... (view fulltext now)

Full text

(1)

D avid A lfter / Exploring natural language pr ocessing for single-wor d and multi-wor d lexical complexity fr om a second language learner perspectiv e

31 • 2021

Exploring natural language processing for

single-word and multi-word lexical complexity

from a second language learner perspective

David Alfter

Data linguistica

David Alfter

V

ocabulary is the building block of many language learning ad- ventures. The central question concerns when to learn what.

Traditionally, learners rely on textbook authors to decide on the or- der of vocabulary items per proficiency level. Frequency is also often chosen as deciding factor, meaning that more frequent words are learned earlier.

In his thesis, David Alfter investigates different methods for au- tomatically classifying Swedish single and multi-word expressions into proficiency levels using computer models. In the first part, he presents a machine learning model trained on multiple textbooks capable of producing proficiency estimations for unseen words. In the second part, he investigates crowdsourcing as a way to rank ex- pressions according to difficulty. Finally, he shows how the proposed resources and tools for language learning can be used in real-life sce- narios.

ISBN 978-91-87850-79-0 ISSN 0347-948X

References

Related documents

In explorations of the data it was found the the same verbatim can be mapped to different classes and in some cases it seemed as if the mappings were based on more information then

Furthermore, we have shown how multi-slot semantics for call-routing systems allows straight- forward division of categories into routing catego- ries and disambiguation

The Chinese 東西 can be used, according to Tōhō Chūgokugo Jiten (2004), in the same way as the Japanese 東西, to express the meaning ”east and west”, if you tweak

The first was to extract data from The Swedish Sign Language Corpus (Mesch et al., 2012), the second generating a co-occurence matrix with these utterances, the third to cluster

We have implemented various prototype applications for (1) the automatic prediction of words based on the feature-engineering machine learning method, (2) practical implementations

We have implemented various prototype applications for (1) the automatic prediction of words based on the feature-engineering machine learning method, (2) language learning

The various tools in GoLingual : live chat, messaging, lectures, tests, web TV, web radio, and games, must fulfill the principles and at the same time be consistent in design

In chapter 3 described gold standard dataset and in chapter 4 presented features are used as training and testing data to carry out a number of experiments in order to (I) select