Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

bs4 is able to parse some malformed documents that libxml2 chokes on.

For these cases it can be useful to do the reverse, and use the BeautifulSoup HTML parser as an alternative parser backend for the lxml package: https://lxml.de/elementsoup.html



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: