Class information for:
Level 1: WRAPPER INDUCTION//WRAPPER GENERATION//WEB DATA EXTRACTION

Basic class information

| ID | Publications | Average number of references | Avg. shr. active ref. in WoS |
|---|---|---|---|
| 17512 | 532 | 26.5 | 22% |



[Figure: bar chart of publications by publication year. The most recent years might be incomplete.]

Classes in level above (level 2)



| ID, lev. above | Publications | Label for level above |
|---|---|---|
| 366 | 16658 | INFORMATION RETRIEVAL//INFORMATION PROCESSING & MANAGEMENT//COMPUTATIONAL LINGUISTICS |

Terms with highest relevance score



| Rank | Term | Type of term | Relevance score (tfidf) | Class's shr. of term's tot. occurrences | Shr. of publ. in class containing term | Num. of publ. in class |
|---|---|---|---|---|---|---|
| 1 | WRAPPER INDUCTION | Author keyword | 14 | 65% | 2% | 13 |
| 2 | WRAPPER GENERATION | Author keyword | 6 | 48% | 2% | 10 |
| 3 | WEB DATA EXTRACTION | Author keyword | 5 | 32% | 2% | 12 |
| 4 | DATA MINING WEB BASED INFORMATION | Author keyword | 3 | 100% | 1% | 3 |
| 5 | WEB WEB BASED INFORMATION SYSTEMS | Author keyword | 3 | 100% | 1% | 3 |
| 6 | INFORMAT COMMUN COMP TECHNOL | Address | 2 | 67% | 0% | 2 |
| 7 | INFORMATIVE BLOCK | Author keyword | 2 | 67% | 0% | 2 |
| 8 | KNOWLEDGE FRAME | Author keyword | 2 | 67% | 0% | 2 |
| 9 | SUBLANGUAGE ANALYSIS | Author keyword | 2 | 67% | 0% | 2 |
| 10 | TEXT DENSITY | Author keyword | 2 | 67% | 0% | 2 |
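
The report does not define how "Shr. of publ. in class containing term" is computed. The reported values are consistent with dividing "Num. of publ. in class" by the 532 publications listed in the basic class information and rounding to a whole percent. The sketch below checks that assumption against a few rows; the function name `share_of_publications` is hypothetical and not part of the report.

```python
# Assumption (not stated in the report): the share column equals the number of
# class publications containing the term divided by the total number of
# publications in the class, rounded to a whole percent.
CLASS_PUBLICATIONS = 532  # "Publications" value from the basic class information


def share_of_publications(num_publ: int, total: int = CLASS_PUBLICATIONS) -> int:
    """Return the rounded percentage of class publications containing a term."""
    return round(100 * num_publ / total)


# Rows copied from the term table: (term, num. of publ. in class, reported share %)
rows = [
    ("WRAPPER INDUCTION", 13, 2),
    ("WRAPPER GENERATION", 10, 2),
    ("WEB DATA EXTRACTION", 12, 2),
    ("DATA MINING WEB BASED INFORMATION", 3, 1),
    ("TEXT DENSITY", 2, 0),
]

for term, num_publ, reported in rows:
    computed = share_of_publications(num_publ)
    print(f"{term}: computed {computed}%, reported {reported}%")
```

Under this reading, the 0% entries are rounding effects (2 of 532 publications is roughly 0.4%) and do not mean the term is absent from the class.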

Web of Science journal categories

Author Key Words



| Rank | Author keyword | Relevance score (tfidf) | Class's shr. of term's tot. occurrences | Shr. of publ. in class containing term | Num. of publ. in class |
|---|---|---|---|---|---|
| 1 | WRAPPER INDUCTION | 14 | 65% | 2% | 13 |
| 2 | WRAPPER GENERATION | 6 | 48% | 2% | 10 |
| 3 | WEB DATA EXTRACTION | 5 | 32% | 2% | 12 |
| 4 | DATA MINING WEB BASED INFORMATION | 3 | 100% | 1% | 3 |
| 5 | WEB WEB BASED INFORMATION SYSTEMS | 3 | 100% | 1% | 3 |
| 6 | INFORMATIVE BLOCK | 2 | 67% | 0% | 2 |
| 7 | KNOWLEDGE FRAME | 2 | 67% | 0% | 2 |
| 8 | SUBLANGUAGE ANALYSIS | 2 | 67% | 0% | 2 |
| 9 | TEXT DENSITY | 2 | 67% | 0% | 2 |
| 10 | WEB CLIPPING | 2 | 67% | 0% | 2 |

Key Words Plus



| Rank | Key Words Plus term | Relevance score (tfidf) | Class's shr. of term's tot. occurrences | Shr. of publ. in class containing term | Num. of publ. in class |
|---|---|---|---|---|---|
| 1 | WRAPPER INDUCTION | 33 | 66% | 6% | 31 |
| 2 | DATA EXTRACTION | 8 | 33% | 4% | 19 |
| 3 | INFORMATION EXTRACTION | 7 | 15% | 8% | 42 |
| 4 | DATA RECORDS | 3 | 50% | 1% | 5 |
| 5 | SEARCH INTERFACES | 3 | 50% | 1% | 4 |
| 6 | DEEP WEB | 2 | 44% | 1% | 4 |
| 7 | TUPLES EXTRACTION | 2 | 67% | 0% | 2 |
| 8 | HTML | 1 | 33% | 0% | 2 |
| 9 | LOGIC WRAPPERS | 1 | 50% | 0% | 1 |
| 10 | SERVICES MASHUPS | 1 | 50% | 0% | 1 |

Journals

Reviews



| Title | Publ. year | Cit. | Active references | % act. ref. to same field |
|---|---|---|---|---|
| Open Information Extraction from the Web | 2008 | 44 | 2 | 100% |
| A connection between topological properties and information towards an abstract representation | 2009 | 0 | 2 | 100% |
| Automatic review identification on the web using pattern recognition | 2013 | 0 | 4 | 75% |
| Alignment and dataset identification of linked data in Semantic Web | 2014 | 1 | 6 | 17% |
| Adaptive information extraction | 2006 | 0 | 8 | 50% |
| Review of information extraction technologies and applications | 2014 | 0 | 16 | 13% |

Address terms



| Rank | Address term | Relevance score (tfidf) | Class's shr. of term's tot. occurrences | Shr. of publ. in class containing term | Num. of publ. in class |
|---|---|---|---|---|---|
| 1 | INFORMAT COMMUN COMP TECHNOL | 2 | 67% | 0.4% | 2 |
| 2 | WEB SEARCH MIN GRP | 2 | 50% | 0.6% | 3 |
| 3 | COMP SCI TECHNOL PROGRAM | 1 | 50% | 0.4% | 2 |
| 4 | ROLLINS EBUSINESS | 1 | 50% | 0.4% | 2 |
| 5 | ADV INFORMAT SYST | 1 | 11% | 1.7% | 9 |
| 6 | VIRTUAL REAL TECHNOL | 1 | 20% | 0.8% | 4 |
| 7 | CT SE 5 | 1 | 50% | 0.2% | 1 |
| 8 | DATABASE SYST INFORMAT MANAGEMENT GRP DIMA | 1 | 50% | 0.2% | 1 |
| 9 | DPTO BIBLIOTECONOMIA DOCUMENTAC | 1 | 50% | 0.2% | 1 |
| 10 | ELECT POWER TECHNOL GRP | 1 | 50% | 0.2% | 1 |

Related classes at same level (level 1)



| Rank | Relatedness score | Related classes |
|---|---|---|
| 1 | 0.0000179279 | TREE EDIT DISTANCE//LARGEST COMMON SUBTREE//UNORDERED TREES |
| 2 | 0.0000127168 | GOOGLE MATRIX//PAGERANK//TOPICAL CRAWLERS |
| 3 | 0.0000120413 | STRING KERNELS//SEMANTIC RELATION EXTRACTION//SOCIAL QA |
| 4 | 0.0000103524 | AUTOMATIC KEYWORD EXTRACTION//BURBEA RAO DIVERGENCE//COMPUTATIONAL INFORMATION GEOMETRY |
| 5 | 0.0000098351 | WEB USAGE MINING//HOTLINK ASSIGNMENT//WEB ROBOT DETECTION |
| 6 | 0.0000095267 | WORD SENSE DISAMBIGUATION//NATURAL LANGUAGE ENGINEERING//COMPUTATIONAL LINGUISTICS |
| 7 | 0.0000087115 | AIFB//ONTOLOGY LEARNING//INFORMAT TELECOMMUN SYST ENGN |
| 8 | 0.0000081090 | KNOWLEDGE BASE INTEROPERABILITY//FON//ARTIFICIAL INTELLIGENCE PL INFORMAT |
| 9 | 0.0000080317 | KNOWLEDGE ACQUISIT SHARING GRP//TEXT KNOWLEDGE ENGN//KNOWLEDGE ACQUISITION FROM TEXTS |
| 10 | 0.0000077803 | TEXT CHUNKING//SHALLOW PARSING//HIDDEN SEMI CRF |