Skip to content
#

html-parsing

Here are 69 public repositories matching this topic...

adbar
adbar commented Jan 3, 2020

I have mostly tested htmldate on a set of English, German and French web pages I had run into by surfing or during web crawls. There are definitely further web pages and cases in other languages for which the extraction of a date doesn't work so far.

Please install the dateparser library beforehand as it significantly extends linguistic coverage: pipor pip3 install -U dateparser or `pi

xBioDreadx
xBioDreadx commented Oct 31, 2019

Hi, im currently parse website that require authentication for access to some content.
So I log in with browser, copy all created cookies and add them with each parse request.

    void editRequest(BoundRequestBuilder req, * *, * *) {
        cookieService.cookies.each { req.addCookie(it) }
    }

That works like a charm until website change cookies in response - short after this my c

Simple example of a web scrapper using python. In this case, we ask the user using the console for the name of a band/artist and using selenium webdriver and beautifulsoup we print information about the discography of that artist/band

  • Updated Apr 16, 2020
  • Python

Improve this page

Add a description, image, and links to the html-parsing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the html-parsing topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.