The Text Mining Handbook
Text mining tries to solve the crisis of information overload by combining techniques from data mining, machine learning, natural language processing, information retrieval, and knowledge management. In addition to providing an in-depth examination of core text mining and link detection algorithms and operations, this book examines advanced pre-processing techniques, knowledge representation considerations, and visualization approaches. Finally, it explores current real-world, mission-critical applications of text mining and link detection in such varied fields as M&A business intelligence, genomics research and counter-terrorism activities.
- The first comprehensive compilation of algorithms, methodologies, practical approaches and applications
- Co-authored by one of the founding figures in the field of text mining
- Detailed description of core text mining algorithms for identifying patterns such as frequent sets, distributions and proportions and associations
Reviews & endorsements
"...buy the book. This book is definitely worth having in your book shelf as a handy reference."
L. Venkata Subramaniam IAPR Newsletter
"A good introduction to text mining written by leading experts in the field. The book is well written and addresses both the theory and practice of text mining, which makes it appealing for researchers and practitioners alike... Highly recommended to those who would like to start delving into the area of text mining without having any previous background in computational linguistics."
Rada Mihalcea, University of North Texas, for Computational Linguistics
Product details
October 2007Adobe eBook Reader
9780511331855
0 pages
0kg
This ISBN is for an eBook version which is distributed on our behalf by a third party.
Table of Contents
- 1. Introduction to text mining
- 2. Core text mining operations
- 3. Text mining preprocessing techniques
- 4. Categorization
- 5. Clustering
- 6. Information extraction
- 7. Probabilistic models for Information extraction
- 8. Preprocessing applications using probabilistic and hybrid approaches
- 9. Presentation-layer considerations for browsing and query refinement
- 10. Visualization approaches
- 11. Link analysis
- 12. Text mining applications
- Appendix
- Bibliography.