Skip to content

Broken merged pdfs. #4571

@wz93672

Description

@wz93672

Description of the bug

Hello.

Pymupdf 1.26.0 and 1.26.1 makes broken merged pdf files - Acrobat Reader error 14, with source files made in LibreOffice and "garbage" save option 3 or 4. The same with LO files merged using PDF Arranger (libqpdf, pikepdf), opened and saved with pymupdf.

LO source files:
table1.pdf
table2.pdf
test1.pdf
test2.pdf

LO files merged with PDF Arranger:
in1.pdf

How to reproduce the bug

import pymupdf

#o = {'garbage':3, 'clean':False} # - good
o = {'garbage':4, 'clean':False} # - bad
#o = {'garbage':4, 'clean':True} # - bad
#o = {'garbage':3, 'clean':True} # - good

files = [
    'table1.pdf',
    'table2.pdf',
    'test1.pdf',
    'test2.pdf',
]

with pymupdf.open() as newdoc:
    for f in files:
        with pymupdf.open(f) as doc:
            newdoc.insert_pdf(doc)
    newdoc.save('out1.pdf', **o)


#o = {'garbage':3, 'clean':False} # - bad
o = {'garbage':4, 'clean':False} # - bad
#o = {'garbage':4, 'clean':True} # - bad
#o = {'garbage':3, 'clean':True} # - good

fin = 'in1.pdf'

with pymupdf.open(fin) as doc:
    doc.save('out2.pdf', **o)

PyMuPDF version

1.26.0

Operating system

Windows

Python version

3.13

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions