Text Processing in Java

Author: Mitzi Morris
Publisher: Colloquial Media Corporation
ISBN:

9780988208728

Pages: 328
Publication Date: 01 January 2014
Format: Paperback
Availability: In stock

We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Our Price $39.60 Quantity:

Share |

Customer Reviews (0)

Overview

This book teaches you how to master the subtle art of multilingual text processing and prevent text data corruption. It provides an introduction to natural language processing using Lucene and Solr. It gives you tools and techniques to manage large collections of text data, whether they come from news feeds, databases, or legacy documents. Each chapter contains executable programs that can also be used for text data forensics. Topics covered: -Unicode code points -Character encodings from ASCII and Big5 to UTF-8 and UTF-32LE -Character normalization using International Components for Unicode (ICU) -Java I/O, including working directly with zip, gzip, and tar files -Regular expressions in Java -Transporting text data via HTTP -Parsing and generating XML, HTML, and JSON -Using Lucene 4 for natural language search and text classification -Search, spelling correction, and clustering with Solr 4 Other books on text processing presuppose much of the material covered in this book. They gloss over the details of transforming text from one format to another and assume perfect input data. The messy reality of raw text will have you reaching for this book again and again.

Full Product Details

Author: Mitzi Morris
Publisher: Colloquial Media Corporation
Imprint: Colloquial Media Corporation
Dimensions: Width: 19.00cm , Height: 1.80cm , Length: 23.50cm
Weight: 0.567kg
ISBN:

9780988208728

ISBN 10: 0988208725
Pages: 328
Publication Date: 01 January 2014
Audience: General/trade , General
Format: Paperback
Publisher's Status: Active
Availability: In stock

We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Reviews

Author Information

Tab Content 6

Author Website:

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions

Latest Reading Guide

Shopping Cart

Your cart is empty

Mailing List

Text Processing in Java

9780988208728

Availability Information

Overview

Full Product Details

9780988208728

Table of Contents

Reviews

Author Information

Tab Content 6

Customer Reviews

Recent Reviews

Countries Available

Sign up now