-
Notifications
You must be signed in to change notification settings - Fork 294
Broken using beautifulsoup4 #276
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
The verisoning number is hard to make sane at this point. It started off as a bit of a joke but it's rather ended up farcically… (It was never intended to get anywhere beyond 0.9999!) Essentially, 0.99999999 contains most of the API breakage that's needed to reach a point where there's a sane, stable API for 1.0 (and I think I can genuinely say we're actually, finally, realistically close…); some of which was needed to fix long-standing bugs and others that we decided to ship in the same release to just have a single breaking release rather than many. As with many projects, while we've been 0.y we've never guaranteed API stability (though it has indeed been a while since we last broke anything). I believe the majority of the breakage around BS4 is going to simply be |
@gsnedders No worries, thanks for the quick reply. Yeah, I tried disabling the html5lib parser inside BS4 (just defaulting to the built-in Is it possible, within the
Kinda ghetto and a crap workaround. If you want, I can submit a pull-request over the weekend, and while I am philosophically against it, it might save some time with others, esp if they're using both libs. Or if anything, in the README, just stick a note that for use with bs4, use the |
We fixed it forcing version on our project requirements. It has already been reported on https://bugs.launchpad.net/beautifulsoup/+bug/1603299 |
@nabeelio I debated putting something like that, but it essentially just becomes a ticking time-bomb to the point at which it is removed (and probably that would be 1.0, and I hope that isn't far off). That's also an approach that only works for some of the API changes, as some are pretty fundamental and impossible to workaround (the sanitizer changes come to mind here), so it would still leave plenty of breakage. I'll try to add something to the README later today. |
@gsnedders yeah, I agree. Thanks! |
#278 is an attempt at that. |
Started with 1.0b9
File "/Users/.../.tox/py35/lib/python3.5/site-packages/bs4/builder/_html5lib.py", line 70, in <module> class TreeBuilderForHtml5lib(html5lib.treebuilders._base.TreeBuilder): AttributeError: module 'html5lib.treebuilders' has no attribute '_base'
To replicate, just install
beautifulsoup4==4.4.1
, and thenhtml5lib
, justfrom bs4 import BeautifulSoup
, and it'll break.Not sure if this is a bs4 issue, or a html5lib problem, but bs4 hasn't been updated in quite a while.
Can the versioning also be a little more sane? It's hard to see how many 9's there are, took much longer to figure out that it was a newer version that was getting installed. With a (super-) minor version change, I don't think this interface should have really changed, but it's just hard to tell with the versioning being abnormal
The text was updated successfully, but these errors were encountered: