Taming Text

Taming Text
Authors
Grant S. Ingersoll, Thomas S. Morton, Andrew L. Farris
Publisher
Manning Publications
Tags
reference
ISBN
9781933988382
Date
2011-07-01T00:00:00+00:00
Size
9.93 MB
Lang
en
Downloaded: 150 times

It is no secret that the world is drowning in text and data. This causes real problems for everyday users who need to make sense of all the information available, and for software engineers who want to make their text-based applications more useful and user-friendly. Whether building a search engine for a corporate website, automatically organizing email, or extracting important nuggets of information from the news, dealing with unstructured text can be daunting.

Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. It explores how to automatically organize text, using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. This book gives examples illustrating each of these topics, as well as the foundations upon which they are built.

Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.