Class information for:
Level 1: TEXT CATEGORIZATION//TEXT CLASSIFICATION//SPAM FILTERING

Basic class information

Class id #P Avg. number of
references
Database coverage
of references
11493 978 28.2 28%



Bar chart of Publication_year

Last years might be incomplete

Hierarchy of classes

The table includes all classes above and classes immediately below the current class.



Cluster id Level Cluster label #P
9 4 COMPUTER SCIENCE, THEORY & METHODS//COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE//COMPUTER SCIENCE, INFORMATION SYSTEMS 1247339
20 3       COMPUTER SCIENCE, INFORMATION SYSTEMS//COMPUTER SCIENCE, THEORY & METHODS//COMPUTER SCIENCE, SOFTWARE ENGINEERING 118625
164 2             RECOMMENDER SYSTEMS//COLLABORATIVE FILTERING//COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE 22632
11493 1                   TEXT CATEGORIZATION//TEXT CLASSIFICATION//SPAM FILTERING 978

Terms with highest relevance score



rank Term termType Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 TEXT CATEGORIZATION authKW 1726891 12% 46% 119
2 TEXT CLASSIFICATION authKW 1270294 12% 34% 119
3 SPAM FILTERING authKW 1048595 5% 73% 46
4 SPAM authKW 522414 5% 36% 47
5 ANTI SPAM FILTERING authKW 312207 1% 100% 10
6 SPAM DETECTION authKW 204917 2% 41% 16
7 SPAM CLASSIFICATION authKW 156104 1% 100% 5
8 TERM WEIGHTING authKW 152690 2% 33% 15
9 UNSOLICITED COMMERCIAL EMAIL authKW 124883 0% 100% 4
10 DOCUMENT CATEGORIZATION authKW 115620 1% 37% 10

Web of Science journal categories



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 Computer Science, Artificial Intelligence 38418 52% 0% 504
2 Computer Science, Information Systems 20761 38% 0% 370
3 Computer Science, Theory & Methods 4837 20% 0% 193
4 Operations Research & Management Science 2527 12% 0% 116
5 Information Science & Library Science 2132 8% 0% 74
6 Computer Science, Software Engineering 1556 10% 0% 94
7 Engineering, Electrical & Electronic 927 20% 0% 195
8 Computer Science, Hardware & Architecture 789 5% 0% 52
9 Computer Science, Interdisciplinary Applications 477 7% 0% 64
10 Computer Science, Cybernetics 174 1% 0% 13

Address terms



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 KNOWLEDGE MANAGEMENT DATA ANAL 62441 0% 100% 2
2 MAIN LIB 3087 62441 0% 100% 2
3 BERLIN BRANDENBURG DISTRIBUTED INFORMAT 41626 0% 67% 2
4 DEUSTOTECH COMP S3 41626 0% 67% 2
5 ADV STUDIES LINGUIST 31221 0% 100% 1
6 AUTOMAT REASONING SYST 31221 0% 100% 1
7 AUTOMAT SIMULAT 31221 0% 100% 1
8 COMP ORAN 31221 0% 100% 1
9 COMP SCI COMMUN DICOM 31221 0% 100% 1
10 COPERN 31221 0% 100% 1

Journals



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 EXPERT SYSTEMS WITH APPLICATIONS 27195 10% 1% 96
2 INFORMATION PROCESSING & MANAGEMENT 22987 4% 2% 39
3 KNOWLEDGE-BASED SYSTEMS 8810 3% 1% 28
4 INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY 8297 1% 2% 13
5 LECTURE NOTES IN ARTIFICIAL INTELLIGENCE 7924 7% 0% 65
6 INFORMATION RETRIEVAL 7623 1% 3% 9
7 LECTURE NOTES IN COMPUTER SCIENCE 6766 14% 0% 140
8 KNOWLEDGE AND INFORMATION SYSTEMS 4985 1% 1% 13
9 COMPUTERS & SECURITY 4397 1% 1% 13
10 ACM TRANSACTIONS ON INFORMATION SYSTEMS 2171 1% 1% 6

Author Key Words



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
LCSH search Wikipedia search
1 TEXT CATEGORIZATION 1726891 12% 46% 119 Search TEXT+CATEGORIZATION Search TEXT+CATEGORIZATION
2 TEXT CLASSIFICATION 1270294 12% 34% 119 Search TEXT+CLASSIFICATION Search TEXT+CLASSIFICATION
3 SPAM FILTERING 1048595 5% 73% 46 Search SPAM+FILTERING Search SPAM+FILTERING
4 SPAM 522414 5% 36% 47 Search SPAM Search SPAM
5 ANTI SPAM FILTERING 312207 1% 100% 10 Search ANTI+SPAM+FILTERING Search ANTI+SPAM+FILTERING
6 SPAM DETECTION 204917 2% 41% 16 Search SPAM+DETECTION Search SPAM+DETECTION
7 SPAM CLASSIFICATION 156104 1% 100% 5 Search SPAM+CLASSIFICATION Search SPAM+CLASSIFICATION
8 TERM WEIGHTING 152690 2% 33% 15 Search TERM+WEIGHTING Search TERM+WEIGHTING
9 UNSOLICITED COMMERCIAL EMAIL 124883 0% 100% 4 Search UNSOLICITED+COMMERCIAL+EMAIL Search UNSOLICITED+COMMERCIAL+EMAIL
10 DOCUMENT CATEGORIZATION 115620 1% 37% 10 Search DOCUMENT+CATEGORIZATION Search DOCUMENT+CATEGORIZATION

Core articles

The table includes core articles in the class. The following variables is taken into account for the relevance score of an article in a cluster c:
(1) Number of references referring to publications in the class.
(2) Share of total number of active references referring to publications in the class.
(3) Age of the article. New articles get higher score than old articles.
(4) Citation rate, normalized to year.



Rank Reference # ref.
in cl.
Shr. of ref. in
cl.
Citations
1 GUZELLA, TS , CAMINHAS, WM , (2009) A REVIEW OF MACHINE LEARNING APPROACHES TO SPAM FILTERING.EXPERT SYSTEMS WITH APPLICATIONS. VOL. 36. ISSUE 7. P. 10206-10222 38 75% 91
2 KAYA, Y , ERTUGRUL, OF , (2016) A NOVEL APPROACH FOR SPAM EMAIL DETECTION BASED ON SHIFTED BINARY PATTERNS.SECURITY AND COMMUNICATION NETWORKS. VOL. 9. ISSUE 10. P. 1216 -1225 21 91% 0
3 UYSAL, AK , (2016) AN IMPROVED GLOBAL FEATURE SELECTION SCHEME FOR TEXT CLASSIFICATION.EXPERT SYSTEMS WITH APPLICATIONS. VOL. 43. ISSUE . P. 82 -92 16 73% 5
4 GHAREB, AS , ABU BAKAR, A , HAMDAN, AR , (2016) HYBRID FEATURE SELECTION BASED ON ENHANCED GENETIC ALGORITHM FOR TEXT CATEGORIZATION.EXPERT SYSTEMS WITH APPLICATIONS. VOL. 49. ISSUE . P. 31 -47 17 68% 2
5 UYSAL, AK , GUNAL, S , (2012) A NOVEL PROBABILISTIC FEATURE SELECTION METHOD FOR TEXT CLASSIFICATION.KNOWLEDGE-BASED SYSTEMS. VOL. 36. ISSUE . P. 226-235 17 71% 28
6 CHEN, KW , ZHANG, ZP , LONG, J , ZHANG, H , (2016) TURNING FROM TF-IDF TO TF-IGM FOR TERM WEIGHTING IN TEXT CLASSIFICATION.EXPERT SYSTEMS WITH APPLICATIONS. VOL. 66. ISSUE . P. 245 -260 16 76% 0
7 PINHEIRO, RHW , CAVALCANTI, GDC , REN, TI , (2015) DATA-DRIVEN GLOBAL-RANKING LOCAL FEATURE SELECTION METHODS FOR TEXT CATEGORIZATION.EXPERT SYSTEMS WITH APPLICATIONS. VOL. 42. ISSUE 4. P. 1941 -1949 13 87% 5
8 LIU, YN , WANG, YW , FENG, LZ , ZHU, XD , (2016) TERM FREQUENCY COMBINED HYBRID FEATURE SELECTION METHOD FOR SPAM FILTERING.PATTERN ANALYSIS AND APPLICATIONS. VOL. 19. ISSUE 2. P. 369 -383 16 70% 1
9 MENDEZ, JR , REBOIRO-JATO, M , DIAZ, F , DIAZ, E , FDEZ-RIVEROLA, F , (2012) GRINDSTONE4SPAM: AN OPTIMIZATION TOOLKIT FOR BOOSTING E-MAIL CLASSIFICATION.JOURNAL OF SYSTEMS AND SOFTWARE. VOL. 85. ISSUE 12. P. 2909-2920 14 88% 3
10 TUTKAN, M , GANIZ, MC , AKYOKUS, S , (2016) HELMHOLTZ PRINCIPLE BASED SUPERVISED AND UNSUPERVISED FEATURE SELECTION METHODS FOR TEXT MINING.INFORMATION PROCESSING & MANAGEMENT. VOL. 52. ISSUE 5. P. 885 -910 13 87% 0

Classes with closest relation at Level 1



Rank Class id link
1 35708 TREE KERNELS//RATIONAL KERNELS//STRING KERNELS
2 16823 MULTI LABEL CLASSIFICATION//MULTI LABEL LEARNING//MULTIPLE INSTANCE LEARNING
3 18232 LATENT SEMANTIC INDEXING//SEMIDISCRETE DECOMPOSITION//LATENT SEMANTIC ANALYSIS
4 12112 CROSS LANGUAGE INFORMATION RETRIEVAL//STEMMING//INTER INFORMAT
5 8373 INFORMATION PROCESSING & MANAGEMENT//JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE//INFORMATION RETRIEVAL
6 25585 SEMI SUPERVISED LEARNING//CO TRAINING//TRI TRAINING
7 9249 SENTIMENT ANALYSIS//OPINION MINING//TOPIC MODEL
8 17718 AUTHORSHIP ATTRIBUTION//STYLOMETRY//LITERARY AND LINGUISTIC COMPUTING
9 30552 PHISHING//PHISHING ATTACKS//ANTI PHISHING
10 34252 RHINO FDN NAT NE INDIA//BROADBAND SWITCH ENGN GRP//CONTEXT SENSITIVE SIMILARITY DISCOVERY

Go to start page