Publication

Text Mining Scientific Papers: A Survey on FCA-Based Information Retrieval Research

Poelmans, Jonas
Ignatov, Dmitry I
Viaene, Stijn
Dedene, Guido
Kuznetsov, Sergei O
Citations
Altmetric:
Publication Type
Conference Proceeding
Editor
Supervisor
Publication Year
2012
Journal
Book
Lecture Notes in Computer Science
Publication Volume
7377
Publication Issue
Publication Begin page
Publication End page
Publication Number of pages
Abstract
Formal Concept Analysis (FCA) is an unsupervised clustering technique and many scientific papers are devoted to applying FCA in Information Retrieval (IR) research. We collected 103 papers published between 2003-2009 which mention FCA and information retrieval in the abstract, title or keywords. Using a prototype of our FCA-based toolset CORDIET, we converted the pdf-files containing the papers to plain text, indexed them with Lucene using a thesaurus containing terms related to FCA research and then created the concept lattice shown in this paper. We visualized, analyzed and explored the literature with concept lattices and discovered multiple interesting research streams in IR of which we give an extensive overview. The core contributions of this paper are the innovative application of FCA to the text mining of scientific papers and the survey of the FCA-based IR research.
Research Projects
Organizational Units
Journal Issue
Keywords
4605 Data Management and Data Science, 46 Information and Computing Sciences, 4609 Information Systems, 4610 Library and Information Studies
Citation
Knowledge Domain/Industry
Embedded videos