jsoup is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jquery-like methods.
If you use Maven to manage the dependencies in your Java project, you do not need to download; just place the following into your POM’s section:
<!-- jsoup HTML parser library @ http://jsoup.org/ -->
// jsoup HTML parser library @ http://jsoup.org/
jsoup is entirely self contained and has no dependencies. jsoup runs on Java 1.5 and up, Scala, Android, OSGi, Lambda, and Google App Engine.