tldr/pages/common/pup.md
2017-10-10 19:00:49 -04:00

559 B

pup

Command line HTML parsing tool.

  • Transform a raw HTML file into a cleaned, indented, and colored format:

cat {{index.html}} | pup --color

  • Filter HTML by element tag name:

cat {{index.html}} | pup '{{div}}'

  • Filter HTML by id:

cat {{index.html}} | pup '{{div#id}}'

  • Filter HTML by attribute value:

cat {{index.html}} | pup '{{input[type="text"}}'

  • Print all text from the filtered HTML elements and their children:

cat {{index.html}} | pup '{{div}} text{}'

  • Print HTML as JSON:

cat {{index.html}} | pup '{{div}} json{}'