Журнал Сибирского федерального университета. Математика и физика. Journal of Siberian Federal University, Mathematics & Physics / №2 2016

Topic Categorization Based on Collectives of Term Weighting Methods for Natural Language Call Routing (150,00 руб.)

Первый автор	Sergienko
Авторы	Muhammad Shan
Страниц	11

150,00р

ID	453732
Аннотация	Natural language call routing is an important data analysis problem which can be applied in diﬀerent domains including airspace industry. This paper presents the investigation of collectives of term weighting methods for natural language call routing based on text classiﬁcation. The main idea is that collectives of diﬀerent term weighting methods can provide classiﬁcation eﬀectiveness improvement with the same classiﬁcation algorithm. Seven diﬀerent unsupervised and supervised term weighting methods were tested and compared with each other for classiﬁcation with k-NN. After that diﬀerent combinations of term weighting methods were formed as collectives. Two approaches for the handling of the collectives were considered: the meta-classiﬁer based on the rule induction and the majority vote procedure. The numerical experiments have shown that the best result is provided with the vote of all seven diﬀerent term weighting methods. This combination provides a signiﬁcant increasing of classiﬁcation eﬀectiveness in comparison with the most eﬀective term weighting methods.
УДК	004.93

Sergienko, RomanB. Topic Categorization Based on Collectives of Term Weighting Methods for Natural Language Call Routing / RomanB. Sergienko, Shan Muhammad // Журнал Сибирского федерального университета. Математика и физика. Journal of Siberian Federal University, Mathematics & Physics .— 2016 .— №2 .— С. 107-117 .— URL: https://rucont.ru/efd/453732 (дата обращения: 21.04.2025)

Вы уже смотрели

Гуманитарий Юга России №2 2013 449,00 руб

Труд №53 2013 11,58 руб

Особенности лексического состава нижнеуд...

К вопросу о влиянии корректного определения таможенной стоимости товаров на экономическую эффективность экспортно-импортных операций

К вопросу о влиянии корректного определе... 190,00 руб

Фармация №7 2019 945,00 руб

Российская газета - Неделя. Северо-Запад №33(6305) 2014

Российская газета - Неделя. Северо-Запад... 1,34 руб

Предпросмотр (выдержки из произведения)

Mathematics & Physics 2016, 9(2), 235–245 УДК 004.93 Topic Categorization Based on Collectives of TermWeighting Methods for Natural Language Call Routing Roman B. Sergienko∗ Muhammad Shan† Wolfgang Minker‡ Institute of Telecommunication Engineering Ulm University Albert-Einstein-Allee, 43, Ulm, 89081 Germany Eugene S. Semenkin§ Informatics and Telecommunications Institute Siberian State Aerospace University Krasnoyarskiy Rabochiy, 31, Krasnoyarsk, 660037 Russia Received 26.12.2015, received in revised form 11.01.2016, accepted 20.02.2016 Natural language call routing is an important data analysis problem which can be applied in different domains including airspace industry. <...> This paper presents the investigation of collectives of term weighting methods for natural language call routing based on text classification. <...> The main idea is that collectives of different term weighting methods can provide classification effectiveness improvement with the same classification algorithm. <...> Seven different unsupervised and supervised term weighting methods were tested and compared with each other for classification with k-NN. <...> After that different combinations of term weighting methods were formed as collectives. <...> Two approaches for the handling of the collectives were considered: the meta-classifier based on the rule induction and the majority vote procedure. <...> The numerical experiments have shown that the best result is provided with the vote of all seven different term weighting methods. <...> The first one is speech recognition of calls and the second one is topic categorization of users’ utterances for further routing. <...> Topic categorization of users’ utterances can be also useful for multi-domain ∗roman.sergienko@uni-ulm.de †muhammad.shan@uni-ulm.de ‡wolfgang.minker@uni-ulm.de §eugenesemenkin@yandex.ru ⃝ Siberian Federal University. <...> All rights reserved c – 235 – Roman B. Sergienko, Muhammad Shan, Wolfgang Minker, Eugene S. Semenkin Topic Categorization . . . spoken dialogue system design [2]. <...> In this work we treat call routing as an example of a text classification application In the vector space model [3] text classification is considered as a machine learning problem. <...> The complexity of text categorization with a vector space model is compounded by the need to extract the numerical data from text information before applying machine learning algorithms <...>

Облако ключевых слов *

* - вычисляется автоматически


	Для выхода нажмите Esc или