我在写一个需要合并很多单页pdf文件的小脚本。我希望脚本能够使用Python3运行,并尽可能少地依赖。用Python3合并PDF文件
对于PDF合并部分,我尝试使用PyPdf。但是,Python 3的支持似乎是错误的;它无法处理Inkscape生成的PDF文件(我需要)。我安装PyPdf当前Git版本,下面的测试脚本不起作用:
import PyPDF2
output_pdf = PyPDF2.PdfFileWriter()
with open("testI.pdf", "rb") as input:
input_pdf = PyPDF2.PdfFileReader(input)
output_pdf.addPage(input_pdf.getPage(0))
with open("test.pdf", "wb") as output:
output_pdf.write(output)
这将引发以下堆栈跟踪:
Traceback (most recent call last):
File "test.py", line 7, in <module>
output.addPage(input.getPage(0))
File "/usr/lib/python3.3/site-packages/pyPdf/pdf.py", line 420, in getPage
self._flatten()
File "/usr/lib/python3.3/site-packages/pyPdf/pdf.py", line 574, in _flatten
self._flatten(page.getObject(), inherit)
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 165, in getObject
return self.pdf.getObject(self).getObject()
File "/usr/lib/python3.3/site-packages/pyPdf/pdf.py", line 616, in getObject
retval = readObject(self.stream, self)
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 66, in readObject
return DictionaryObject.readFromStream(stream, pdf)
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 526, in readFromStream
value = readObject(stream, pdf)
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 57, in readObject
return ArrayObject.readFromStream(stream, pdf)
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 152, in readFromStream
obj = readObject(stream, pdf)
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 86, in readObject
return NumberObject.readFromStream(stream)
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 231, in readFromStream
return FloatObject(name.decode("ascii"))
File "/usr/lib/python3.3/site-packages/pyPdf/generic.py", line 207, in __new__
return decimal.Decimal.__new__(cls, str(value), context)
TypeError: optional argument must be a context
相同的脚本,但是,与完美的作品Python 2.7。
我在这里做错了什么?这是图书馆的一个错误吗?我可以在不触摸PyPDF库的情况下解决它吗?
它经历了吗?我患有同样的问题。 – PascalVKooten 2013-03-30 21:29:19
这[回购](https://github.com/colemana/PyPDF2)是一个固定版本。 – janoliver 2013-04-02 20:11:10
这个回购是1岁?你确定它有效吗? – PascalVKooten 2013-04-03 05:01:02