convert()
: add option to get engine document (e.g. pikepdf.Pdf)
#203
|
@ -1075,7 +1075,7 @@ class pdfdoc(object):
|
|||
self.tostream(stream)
|
||||
return stream.getvalue()
|
||||
|
||||
def tostream(self, outputstream=None, return_engine_doc=False):
|
||||
def todoc(self):
|
||||
|
||||
if self.engine == Engine.pikepdf:
|
||||
PdfArray = pikepdf.Array
|
||||
PdfDict = pikepdf.Dictionary
|
||||
|
@ -1267,12 +1267,11 @@ class pdfdoc(object):
|
|||
self.writer.addobj(metadata)
|
||||
self.writer.addobj(iccstream)
|
||||
|
||||
if return_engine_doc:
|
||||
return self.writer, self.output_version
|
||||
|
||||
# now write out the PDF
|
||||
if outputstream is None:
|
||||
raise TypeError("pdfdoc.tostream() requires outputstream unless return_engine_doc is True.")
|
||||
def tostream(self, outputstream):
|
||||
# write out the PDF
|
||||
self.todoc() # finalize self.writer
|
||||
if self.engine == Engine.pikepdf:
|
||||
kwargs = {}
|
||||
if pikepdf.__version__ >= "6.2.0":
|
||||
|
@ -1281,6 +1280,7 @@ class pdfdoc(object):
|
|||
outputstream, min_version=self.output_version, linearize=True, **kwargs
|
||||
)
|
||||
elif self.engine == Engine.pdfrw:
|
||||
from pdfrw import PdfName, PdfArray
|
||||
self.writer.trailer.Info = self.writer.docinfo
|
||||
# setting the version attribute of the pdfrw PdfWriter object will
|
||||
# influence the behaviour of the write() function
|
||||
|
@ -2803,11 +2803,11 @@ def convert(*images, **kwargs):
|
|||
)
|
||||
|
||||
if kwargs["outputstream"]:
|
||||
pdf.tostream(outputstream=kwargs["outputstream"])
|
||||
pdf.tostream(kwargs["outputstream"])
|
||||
return
|
||||
|
||||
if kwargs["return_engine_doc"]:
|
||||
josch
commented
Please do not change the signature of the convert() function. This is necessary to preserve API stability. You have to extract "outputstream" from kwargs. Please do not change the signature of the convert() function. This is necessary to preserve API stability. You have to extract "outputstream" from kwargs.
mara0004
commented
It's not clear to me why this is supposed to break the API -- can you explain? It's not clear to me why this is supposed to break the API -- can you explain?
AFAICS, `kwargs` is internal, and no callee expects `kwargs["outputstream"]`, right?
mara0004
commented
Relatedly, it looks like the Relatedly, it looks like the `_default_kwargs` strategy silently ignores nonexistent parameters, which is problematic. (And of course it breaks IDE completion.)
~~I see you're using kwargs to unify access of `{crop,bleed,trim,art}border`, which is fine, but why can't any others params be in the signature directly?~~
mara0004
commented
Okay, self-answering the question: a SO search yielded that specifying optional params after a The issue with invalid kwargs assumably being ignored without error still stands, though. > It's not clear to me why this is supposed to break the API -- can you explain?
Okay, self-answering the question: a SO search yielded that specifying optional params after a `*capture` raises a `SyntaxError` on Python 2. Never having written code for Python 2, I did not know that. I'll change to extract from kwargs as you said, then.
The issue with invalid kwargs assumably being ignored without error still stands, though.
|
||||
return pdf.tostream(return_engine_doc=True)
|
||||
return pdf.todoc()
|
||||
|
||||
return pdf.tostring()
|
||||
|
||||
|
|
You split
tostream()
intofinalize()
andtostream()
but then why does the newtostream()
not callfinalize()
?Because I though the embedder of
convert_to_docobject()
should not have to invokefinalize()
.Instead,
finalize()
technically belongs intoconvert_to_docobject()
itself, after all image pages have been added. Sotostream()
cannot alsofinalize()
as that would result in a double call.