jsoup

jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.

jsoup Java HTML Parser
Developer(s): Jonathan Hedley
Stable release: 1.13.1 / February 29, 2020[1]
Repository:
Written in: Java
Operating system: Cross-platform
Platform: Java (JVM)
Type: HTML parser
License: MIT license
Website: jsoup.org

History

jsoup was written in 2009 by Jonathan Hedley, a software development manager at Amazon in Seattle. It is distributed under the MIT License, a permissive free software license similar to the Creative Commons Attribution license.

Hedley's stated intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

Projects powered by jsoup

jsoup is used in a number of projects,[2] including the OpenRefine data-wrangling tool (formerly Google Refine).

References

  1. "jsoup Java HTML Parser release 1.13.1". Retrieved 7 March 2020.
  2. "Jsoup". MVNRepository / F. Rodriguez. 2015-03-08.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.