Power Java


This book is no longer available for sale.

Power Java

About the Book

This book is based on the author's experience as a developer and consultant and consists of seven chapters:

  1. Network programming techniques for the Internet of Things (IoT)
  2. Natural Language Processing using OpenNLP including using existing models and creating your own models
  3. Machine learning using the Spark mllib library (document custering, logistic regression, word2vec similarity)
  4. Anomaly detection machine learning example
  5. Web scraping and information gathering
  6. Using rich semantic and linked data sources on the web to enrich the data models you use in your applications
  7. Java Strategies for Knowledge Management using local and cloud data

The first chapter on IoT is a tutorial on network programming techniques for IoT development. I have also used these same techniques for multiplayer game development and distributed virtual reality systems. This chapter stands on its own and is not connected to any other material in this book. To be clear, this chapter covers some of the network programming techniques you will need for IoT applications and does not cover development using IoT devices.

The second chapter shows you how to use the OpenNLP library to use machine learning to train your own maximum entropy classifiers and to segment sentences, tag parts of speech, and generally process English language text. Both this chapter and the next chapter on machine learning using the Spark MLlib library use machine learning techniques. The Spark MLlib is convenient to use for development on your laptop and you can use the same code you develop on Spark clusters to get near real time processing of big data.

The last two chapters are for information architects or developers who would like to develop information design and knowledge management skills. I stress the idea of leveraging both cloud data (e.g., Microsoft Office 365 and Google Drive) and local data sources. In order to simplify the final example program in the book, I use Google Takeout to export my data (Microsoft Word and Excel file formats, mailbox, and iCal calendar files). It is left as a project for the reader to extend the example program to interface with the cloud data sources their organization uses.

About the Author

Mark Watson
Mark Watson

Please check out my web site markwatson.com for information about the author Mark Watson.

Mark is the author of 16 published books on Artificial Intelligence, Java, Ruby, Common LISP, Clojure, JavaScript, Semantic Web, NLP, C++, Linux, Scheme, and Windows.

Mark also maintains a general technology blog and also a blog specifically for cognition and machine learning technologies.

Table of Contents

  • Preface
    • Book Outline
    • If You Did Not Buy This Book
  • Network Programming Techniques for the Internet of Things
    • Motivation for IoT
    • Running the example programs
    • Design Pattern
    • Directory Lookups
    • User Data Protocol Network Programming
    • Multicast/Broadcast Network Programming
    • Wrap Up on IoT
  • Natural Language Processing Using OpenNLP
    • Using OpenNLP Pre-Trained Models
    • Training a New Categorization Model for OpenNLP
    • Using Our New Trained Classification Model
    • Using the OpenNLP Parsing Model
  • Machine Learning Using Apache Spark
    • Setting Up Spark On Your Laptop
    • Hello Spark - a Word Count Example
    • Introducing the Spark MLlib Machine Learning Library
    • MLlib Logistic Regression Example Using University of Wisconsin Cancer Database
    • MLlib SVM Classification Example Using University of Wisconsin Cancer Database
    • MLlib K-Means Example Program
    • Converting Text to Numeric Feature Vectors
    • Using K-Means to Cluster Wikipedia Articles
    • Using SVM for Text Classification
    • Using word2vec To Find Similar Words In Documents
    • Chapter Wrap Up
  • Anomaly Detection Machine Learning Example
    • Motivation for Anomaly Detection
    • Math Primer for Anomaly Detection
    • AnomalyDetection Utility Class
    • Example Using the University of Wisconsin Cancer Data
  • Deep Learning Using Deeplearning4j
    • Deep Belief Networks
    • Deep Belief Example
    • Deep Learning Wrapup
  • Web Scraping Examples
    • Motivation for Web Scraping
    • Using the jsoup Library
    • Wrap Up
  • Linked Data
    • Example Code
    • Overview of RDF and SPARQL
    • SPARQL Query Client
    • DBPedia Entity Lookup
    • Annotate Text with DBPedia Entity URIs
    • Resolving Named Entities in Text to Wikipedia URIs
    • Combining Data from Public and Private Sources
    • Wrap Up for Linked Data
  • Java Strategies for Working with Cloud Data: Knowledge Management-Lite
    • Motivation for Knowledge Management
    • Using Google Drive Cloud Takeout Service
    • Using Postgres as a Local Document Store with Text Search
    • Wrap Up
  • Book Wrap Up

The Leanpub 60 Day 100% Happiness Guarantee

Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.

Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.

You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!

So, there's no reason not to click the Add to Cart button, is there?

See full terms...

80% Royalties. Earn $16 on a $20 book.

We pay 80% royalties. That's not a typo: you earn $16 on a $20 sale. If we sell 5000 non-refunded copies of your book or course for $20, you'll earn $80,000.

(Yes, some authors have already earned much more than that on Leanpub.)

In fact, authors have earnedover $13 millionwriting, publishing and selling on Leanpub.

Learn more about writing on Leanpub

Free Updates. DRM Free.

If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).

Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.

Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.

Learn more about Leanpub's ebook formats and where to read them

Write and Publish on Leanpub

You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!

Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.

Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.

Learn more about writing on Leanpub