Efficient Text Classification of 20 Newsgroup Dataset using Classification Algorithm

Karishma Borkar, Prof. Nutan Dhande

doi:10.17762/ijritcc.v5i6.934

Published: Jun 30, 2017

Abstract

Text classification is the task of automatically sorting a set of documents into categories from a predefined set. It is a data mining technique used to predict group membership for data instances within a given dataset, assigning data to different classes subject to certain constraints. Instead of the traditional feature selection techniques used for text document classification, we present a new model based on probability and the overall class frequency of a term. The Naive Bayesian classifier is based on Bayes' theorem with independence assumptions between predictors. A Naive Bayesian model is easy to build, with no complicated iterative parameter estimation, which makes it particularly useful for large datasets. The paper shows that the new probabilistic interpretation of tf×idf term weighting may lead to a better understanding of statistical ranking mechanisms.
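As an illustration of the classifier described in the abstract, the following is a minimal sketch of a multinomial Naive Bayes text classifier with add-one (Laplace) smoothing. It is not the authors' implementation: the whitespace tokenizer, the function names, the smoothing choice, and the four toy training sentences are all assumptions made for this example. Only the two class labels are borrowed from the 20 Newsgroups category names.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def train_naive_bayes(docs, labels):
    """Collect per-class document counts, per-class term counts, and the vocabulary."""
    class_doc_counts = Counter(labels)
    term_counts = defaultdict(Counter)  # class label -> term -> count
    vocab = set()
    for text, label in zip(docs, labels):
        tokens = tokenize(text)
        term_counts[label].update(tokens)
        vocab.update(tokens)
    return class_doc_counts, term_counts, vocab


def predict(text, model):
    """Return the class with the highest log posterior.

    Terms are treated as conditionally independent given the class,
    which is the "naive" assumption.
    """
    class_doc_counts, term_counts, vocab = model
    n_docs = sum(class_doc_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_label in class_doc_counts.items():
        score = math.log(n_label / n_docs)  # log prior P(class)
        total_terms = sum(term_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # skip terms never seen in training
            # Add-one smoothing keeps unseen (class, term) pairs from zeroing the product.
            p = (term_counts[label][token] + 1) / (total_terms + len(vocab))
            score += math.log(p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training data, invented for this example (not taken from the dataset).
train_docs = [
    "the team won the hockey game",
    "the goalie saved the puck",
    "the rocket reached orbit",
    "nasa launched a new satellite",
]
train_labels = ["rec.sport.hockey", "rec.sport.hockey", "sci.space", "sci.space"]
model = train_naive_bayes(train_docs, train_labels)
```

Training here is a single counting pass over the documents, which reflects the abstract's point that the model needs no iterative parameter estimation. Calling `predict("hockey puck", model)` selects `rec.sport.hockey`, and `predict("satellite orbit", model)` selects `sci.space`.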

How to Cite

Borkar, K., & Dhande, N. (2017). Efficient Text Classification of 20 Newsgroup Dataset using Classification Algorithm. International Journal on Recent and Innovation Trends in Computing and Communication, 5(6), 1236–. https://doi.org/10.17762/ijritcc.v5i6.934

Issue

Vol. 5 No. 6 (2017): June (2017) Issue

Section

Articles
