Strange "BadZipfile: Bad CRC-32" problem
I might have just found the problem and the solution, but unfortunately I had to replace Python's zipfile
module with a hacked one of my own (called myzipfile
here).
$ diff -u ~/run/lib/python2.7/zipfile.py myzipfile.py
--- /home/msabramo/run/lib/python2.7/zipfile.py 2010-12-22 17:02:34.000000000 -0800
+++ myzipfile.py 2011-04-11 11:51:59.000000000 -0700
@@ -5,6 +5,7 @@
import binascii, cStringIO, stat
import io
import re
+import copy
try:
import zlib # We may need its compression method
@@ -877,7 +878,7 @@
# Only open a new file for instances where we were not
# given a file object in the constructor
if self._filePassed:
- zef_file = self.fp
+ zef_file = copy.copy(self.fp)
else:
zef_file = open(self.filename, 'rb')
The problem in the standard zipfile
module is that when passed a file object (not a filename), it uses that same passed-in file object for every call to the open
method. This means that tell
and seek
are getting called on the same file and so trying to open multiple files within the zip file is causing the file position to be shared and so multiple open
calls result in them stepping all over each other. In contrast, when passed a filename, open
opens a new file object. My solution is for the case when a file object is passed in, instead of using that file object directly, I create a copy of it.
This change to zipfile
fixes the problems I was seeing:
$ ./test_zip_file.py ~/data.zip StringIO 1
Processed 250 items.
$ ./test_zip_file.py ~/data.zip BytesIO 1
Processed 250 items.
$ ./test_zip_file.py ~/data.zip direct 1
Processed 250 items.
but I don't know if it has other negative impacts on zipfile
...
EDIT: I just found a mention of this in the Python docs that I had somehow overlooked before. At http://docs.python.org/library/zipfile.html#zipfile.ZipFile.open, it says:
Note: If the ZipFile was created by passing in a file-like object as the first argument to the constructor, then the object returned by
open()
shares the ZipFile’s file pointer. Under these circumstances, the object returned byopen()
should not be used after any additional operations are performed on the ZipFile object. If the ZipFile was created by passing in a string (the filename) as the first argument to the constructor, thenopen()
will create a new file object that will be held by the ZipExtFile, allowing it to operate independently of the ZipFile.
what i did was update setup tools then re download and it works now
https://pypi.python.org/pypi/setuptools/35.0.1