The Text Mining Handbook
Text mining is a new and exciting area of computer science research that tries to solve the crisis of information overload by combining techniques from data mining, machine learning, natural language processing, information retrieval, and knowledge management. Similarly, link detection – a rapidly evolving approach to the analysis of text that shares and builds upon many of the key elements of text mining – also provides new tools for people to better leverage their burgeoning textual data resources. The Text Mining Handbook presents a comprehensive discussion of the state-of-the-art in text mining and link detection. In addition to providing an in-depth examination of core text mining and link detection algorithms and operations, the book examines advanced pre-processing techniques, knowledge representation considerations, and visualization approaches. Finally, the book explores current real-world, mission-critical applications of text mining and link detection in such varied fields as M&A business intelligence, genomics research and counter-terrorism activities.
- The first comprehensive compilation of algorithms, methodologies, practical approaches and applications
- Co-authored by one of the founding figures in the field of text mining
- Detailed description of core text mining algorithms for identifying patterns such as frequent sets, distributions and proportions and associations
Reviews & endorsements
' … buy the book. This book is definitely worth having in your book shelf as a handy reference.' IAPR Newsletter
Product details
February 2007Hardback
9780521836579
424 pages
254 × 178 × 24 mm
0.95kg
Available
Table of Contents
- 1. Introduction to text mining
- 2. Core text mining operations
- 3. Text mining preprocessing techniques
- 4. Categorization
- 5. Clustering
- 6. Information extraction
- 7. Probabilistic models for Information extraction
- 8. Preprocessing applications using probabilistic and hybrid approaches
- 9. Presentation-layer considerations for browsing and query refinement
- 10. Visualization approaches
- 11. Link analysis
- 12. Text mining applications
- Appendix
- Bibliography.