Bilgilendirme: Kurulum ve veri kapsamındaki çalışmalar devam etmektedir. Göstereceğiniz anlayış için teşekkür ederiz.

Publication:
Feature Selection in Text Classification

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Research Projects

Organizational Units

Journal Issue

Abstract

In recent years, text classification have been widely used. Dimension of text data has increased more and more. Working of almost all classification algorithms is directly related to dimension. In high dimension data set, working of classification algorithms both takes time and occurs over fitting problem. So feature selection is crucial for machine learning techniques. In this study, frequently used feature selection metrics Chi Square (CHI), Information Gain (IG) and Odds Ratio (OR) have been applied. At the same time the method Relevancy Frequency (RF) proposed as term weighting method has been used as feature selection method in this study. It is used for tf.idf term as weighting method, Sequential Minimal Optimization (SMO) and Naive Bayes (NB) in the classification algorithm. Experimental results show that RF gives successful results. © 2016 IEEE.

Description

Citation

WoS Q

N/A

Scopus Q

N/A

Source

-- 24th Signal Processing and Communication Application Conference, SIU 2016 -- 2016-05-16 Through 2016-05-19 -- Zonguldak -- 122605

Volume

Issue

Start Page

1777

End Page

1780

Endorsement

Review

Supplemented By

Referenced By