Text Categorization Using Support Vector Machines
DOI:
https://doi.org/10.15381/risi.v10i1.5711
Keywords:
text categorization, text classification, support vector machines, linear classifiers.
Abstract
Text categorization is an application within the discipline of natural language processing and is closely related to the concept of classification. Given the abundance of existing information, it becomes necessary to organize, maintain, and process the available information on the basis of a deeper knowledge of the language. Support vector machines (SVM) belong to the family of linear classifiers and can be used to solve the text categorization (CT) problem, which consists of labeling a text or document with one or several predefined thematic categories. The problem is addressed because of its application in different scenarios in the area of information retrieval (IR), such as the automatic organization and filtering of documents. The SVM approach basically considers the following: given a set of documents D and a set of categories C, the goal is to find a function that maps a document d taken from D to a particular category c in C.
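To make the idea of a function from documents to categories concrete, the following is a minimal sketch and not the authors' implementation. It assumes a binary problem with two hypothetical categories ("sports" and "finance"), a made-up toy corpus, bag-of-words counts as features, and a linear SVM trained by plain subgradient descent on the regularized hinge loss. The names `featurize`, `train_linear_svm`, and `categorize` and all hyperparameter values are illustrative assumptions.

```python
from collections import Counter

# Toy corpus (an assumption for illustration): each document d in D
# is paired with one of two hypothetical categories.
DOCS = [
    ("the team won the match", "sports"),
    ("the striker scored a goal", "sports"),
    ("the coach praised the team", "sports"),
    ("the bank raised interest rates", "finance"),
    ("stocks fell as the market closed", "finance"),
    ("investors bought shares in the bank", "finance"),
]

def tokenize(text):
    """Lowercase and split on whitespace."""
    return text.lower().split()

def featurize(text, vocab):
    """Bag-of-words count vector over a fixed vocabulary."""
    counts = Counter(tokenize(text))
    return [float(counts[word]) for word in vocab]

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Subgradient descent on the L2-regularized hinge loss.

    Labels in y must be +1 or -1. Returns the weights w and bias b.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            # Regularization step: shrink the weights slightly.
            w = [wj * (1.0 - lr * lam) for wj in w]
            if margin < 1.0:
                # Hinge loss is active: move toward the correct side.
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b

def categorize(text, vocab, w, b):
    """Map a document to a category using the sign of the linear score."""
    x = featurize(text, vocab)
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return "sports" if score >= 0 else "finance"

# Build the vocabulary and training matrix, then train.
vocab = sorted({tok for text, _ in DOCS for tok in tokenize(text)})
X = [featurize(text, vocab) for text, _ in DOCS]
y = [1 if label == "sports" else -1 for _, label in DOCS]
w, b = train_linear_svm(X, y)

print(categorize("the team scored a goal", vocab, w, b))
print(categorize("the bank raised rates as stocks fell", vocab, w, b))
```

Labeling a document with several categories, as the abstract allows, would require one such binary classifier per category (a one-vs-rest arrangement), which this sketch leaves out.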
License
Copyright (c) 2013 Augusto Cortez Vasquez, Luzmila Pró Concepción, Oswaldo Rojas Lazo, Robero Calmet Agnelli
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
AUTHORS RETAIN THEIR RIGHTS:
a. Authors retain their trademark and patent rights, as well as their rights to any process or procedure described in the article.
b. Authors retain their right to share, copy, distribute, perform, and publicly communicate their article (e.g., to place it in an institutional repository or publish it in a book), with an acknowledgment of its initial publication in the Revista de investigación de Sistemas e Informática.
c. Authors retain their right to make a subsequent publication of their work and to use the article or any part thereof (e.g., in a compilation of their papers, lecture notes, a thesis, or a book), always indicating its initial publication in the Revista de investigación de Sistemas e Informática (the originator of the work, journal, volume, number, and date).