[Golang] Auto-Detect and Convert Encoding of HTML to UTF-8
Given an URL, auto-detect and convert the encoding of the HTML document to UTF-8 if it is not UTF-8 encoded in Go.
read more »Given an URL, auto-detect and convert the encoding of the HTML document to UTF-8 if it is not UTF-8 encoded in Go.
read more »Given an URL, determine the encoding of the HTML document in Go using golang.org/x/net/html and golang.org/x/text packages.
read more »Remove HTML inline style, i.e., remove style attribute from HTML node via Go net/html package.
read more »Guess metadata from HTML webpage and convert it to reStructuredText format.
read more »Add basic HTML table to reStructuredText support for online HTML to reStructuredText service on Google App Engine Go.
read more »Online service on Google App Engine Go, which helps you extract title, image URL from Taobao item webpage, and output in reStructuredText format.
read more »Online HTML to reStructuredText service on Google App Engine Go.
read more »Convert HTML to restructuredtext format via net/html package in Go programming language. (Not fully supported)
read more »Given a webpage URL. Fetch the title of the webpage and output reStructuredText link on Google App Engine Go.
read more »Get HTML title via net/html package in Go programming language.
read more »Convert HTML unordered (bulleted) list, HTML link, and HTML image to restructuredtext format via net/html package in Go programming language.
read more »Convert HTML unordered (bulleted) list and HTML link to restructuredtext format via net/html package in Go programming language.
read more »Get the number of child nodes via Go net/html package.
read more »Equivalent of JavaScript getElementById via Go net/html package.
read more »Convert HTML table to reStructuredText list-table via Go net/html package. This is experimental and not robust implementation.
read more »Iterate over all DOM elements in HTML via Go programming language. Use net/html package to parse and iterate all elements in HTML. Search for HTML links and output them in reStructuredText format.
read more »