-
-
Notifications
You must be signed in to change notification settings - Fork 18.5k
remove docs from packages #38846
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
remove docs from packages #38846
Conversation
@jameslamb can you add a note in 1.3. Other API changes section (make a separete entry, call it Build) |
cc @simonjayhawkins if you can have a test on this to make sure build looks ok. |
@jameslamb we might also want to have that check in https://github.com/pandas-dev/pandas-release, if you can make an issue / PR there as well. |
sure! Added in 3aee9b9, let me know if that wasn't what you meant. |
Sorry, I don't understand what you'd like me to add to |
an issue / PR that makes a test like you did above. that repo actually builds the releases and should check that no docs are present via a test. |
ooooo I see! Yep ok, I can do that |
Ok I've attempted a |
Rather than outright removing would we consider a |
could you provide specific reasons why you think it's valuable to include documentation like these Powerpoint files in package distributions? There was already substantial discussion on #30741 about the disadvantages of doing that (for example, #30741 (comment) and #30741 (comment)), and @jreback agreed with the proposal in this PR. |
Sure - if you are ever without internet and try to develop anything with pandas having the documentation available locally is invaluable. I also think it is relatively standard to include. At least other libraries like numpy and scikit-learn keep their doc folders in the sdist. The powerpoint is a little different and I think fine to exclude regardless, but I do think PEP 508 would be a better solution for all |
Thanks for clarifying. In my opinion, that use case is one that describes a very small minority of all the users (machine and human) that Because I believe that use case describes a small minority of all users, I think that if you wanted to provide support for it, it would be more appropriate to make that the optional thing you have to opt in to (e.g. I think the possible benefit of removing around 1MB of data transfer and around 6MB of uncompressed size on disk per install for users who have slow internet connections or who are storage-conscious (like AWS Lambda users) far outweighs the downside of not supporting by default the use case where someone has no internet access and wants to read the long-form One more point I'd like to make in favor of this change. A conservative reading of the last few months of data available at https://pypistats.org/packages/pandas suggests that |
Is there some additional information I could provide or changes reviewers would like me to make? I've been merging the latest Thanks. |
I think the pdf / ppts are fine to exclude regardless, but for more general documentation I would still prefer to go the PEP 508 route. This has an additional advantage that you can provide further optimizations like the |
I'm fine with either this or @WillAyd's method. If we go with this, the whatsnew should have a "to pip install with the documents do XYZ" |
yeah let's start with excluding the pdfs / ppts and see how that goes. @jameslamb if you'd update. |
Alright. Updated in 648187a. After that change, here's an updated version of the table from the original description.
|
thanks @jameslamb |
closes #xxxxblack pandas
git diff upstream/master -u -- "*.py" | flake8 --diff
#30741 discusses removing documentation and tests from package distributions. This is valuable because it makes
pandas
easier to use in storage-sensitive environments such as AWS Lambda (#30741 (comment), #30741 (comment)).This PR does not close that issue, but it's a first step. This proposes removing the content of
doc/
from package distributions. This PR trims about 1MB (compressed) and 6MB (uncompressed) out of thesdist
package.master
how I checked these sizes
To confirm that the documentation files were removed correctly by these changes, I ran the following
Thanks for your time and consideration.