The Text Mining Handbook
Text mining is a new and exciting area of computer science research that tries to solve the crisis of information overload by combining techniques from data mining, machine learning, natural language processing, information retrieval, and knowledge management. Similarly, link detection – a rapidly evolving approach to the analysis of text that shares and builds upon many of the key elements of text mining – also provides new tools for people to better leverage their burgeoning textual data resources. The Text Mining Handbook presents a comprehensive discussion of the state-of-the-art in text mining and link detection. In addition to providing an in-depth examination of core text mining and link detection algorithms and operations, the book examines advanced pre-processing techniques, knowledge representation considerations, and visualization approaches. Finally, the book explores current real-world, mission-critical applications of text mining and link detection in such varied fields as M&A business intelligence, genomics research and counter-terrorism activities.
- The first comprehensive compilation of algorithms, methodologies, practical approaches and applications
- Co-authored by one of the founding figures in the field of text mining
- Detailed description of core text mining algorithms for identifying patterns such as frequent sets, distributions and proportions and associations
Reviews & endorsements
' … buy the book. This book is definitely worth having in your book shelf as a handy reference.' IAPR Newsletter
Product details
October 2007Adobe eBook Reader
9780511331855
0 pages
0kg
This ISBN is for an eBook version which is distributed on our behalf by a third party.
Table of Contents
- 1. Introduction to text mining
- 2. Core text mining operations
- 3. Text mining preprocessing techniques
- 4. Categorization
- 5. Clustering
- 6. Information extraction
- 7. Probabilistic models for Information extraction
- 8. Preprocessing applications using probabilistic and hybrid approaches
- 9. Presentation-layer considerations for browsing and query refinement
- 10. Visualization approaches
- 11. Link analysis
- 12. Text mining applications
- Appendix
- Bibliography.