img2pdf

Author	SHA1	Message	Date
ooBJ3u	244600065d	Strip end-of-page and end-of-file segments from JBIG2 As noted by @phmccarty in #184 (comment) and subsequent comments, we were not properly stripping end-of-page and end-of-file segments. These are valid segments in a JBIG2 file, but not when embedded in PDF. From the PDF spec: "The JBIG2 file header, end-of-page segments, and end-of-file segment shall not be used in PDF." We were already stripping out the JBIG2 file header, but not yet the end-of-page and end-of-file segments. For this, I'm expanding the approach that we were already taking, of only supporting a narrow subset of JBIG2 files. We assert that the input file has such a footer, and then we strip it. We validated that the issue raised by @phmccarty is indeed resolved by running the following commands before and after applying this commit: `src/img2pdf.py src/tests/input/mono.jb2 > test.pdf` followed by `pdfimages -tiff test.pdf img`. Before this commit, this returned "Syntax Error (1143): Unknown segment type in JBIG2 stream". After this commit, the error is gone.	2024-10-30 00:00:00 +00:00
ooBJ3u	e2369eb59a	Add support for JBIG2 (generic coding) Implements the proposal detailed at https://gitlab.mister-muffin.de/josch/img2pdf/issues/112#issuecomment-1304 This is a limited implementation of JBIG2, which can be extended to support multiple pages, symbol tables, and other features of the format in the future. Added a test case based on mono.tif. Updated the README.md based on https://gitlab.mister-muffin.de/josch/img2pdf/pulls/184/files#issuecomment-1334	2024-09-25 00:00:00 +00:00
Johannes Schauer Marin Rodrigues	819b366bf5	release version 0.5.1	2023-11-26 06:33:10 +01:00
Johannes Schauer Marin Rodrigues	cc8c708295	HACKING: how to bisect	2023-11-25 09:47:53 +01:00
Johannes Schauer Marin Rodrigues	fb9537d8b7	src/img2pdf.py: allow PNG input without dpi units but non-square dpi aspect ratio Closes: #181	2023-11-25 09:47:52 +01:00
Johannes Schauer Marin Rodrigues	7678435eb7	validate icc profile and no default location on windows closes: #179	2023-11-07 18:50:07 +01:00
Johannes Schauer Marin Rodrigues	ba7a360866	release version 0.5.0	2023-10-28 08:35:54 +02:00
Johannes Schauer Marin Rodrigues	7f0bf47ff3	src/img2pdf.py: reformat with black	2023-10-28 08:35:53 +02:00
Leo	5cd0918d50	Issue #175 related. The original was SmartAlbums, but another case with 'Adobe PS', so delete the exif_software check part	2023-10-18 13:33:44 +08:00
Leo	f157ced05d	ignore RGB icc profile for grayscale jpegs produced by SmartAlbums closes: #175	2023-10-17 11:32:25 +02:00
Johannes Schauer Marin Rodrigues	09064e8e70	jp2: rudimentary support for raw jpeg2000 without jp2 boxes	2023-08-08 07:40:38 +02:00
Johannes Schauer Marin Rodrigues	2f736d7891	allow 'matte' to be missing in MIFF	2023-08-06 19:43:19 +02:00
Johannes Schauer Marin Rodrigues	e05580a49a	src/img2pdf_test.py: IM7 dropped 'baseType' in json output, so use 'type' instead which works for both IM6 and IM7	2023-08-06 19:27:01 +02:00
Johannes Schauer Marin Rodrigues	acc25a4926	Support JPEG2000 images with transparency Closes: #173	2023-08-05 16:06:30 +02:00
Johannes Schauer Marin Rodrigues	f597887088	The GIMP ICC bug does not only apply to 1-bit tiff but also to black/white palette PNG https://gitlab.gnome.org/GNOME/gimp/-/issues/3438 Closes: #159	2023-08-05 14:43:18 +02:00
Johannes Schauer Marin Rodrigues	3e832fbcc2	add information about how to convert images to 8 bit (closes: #170)	2023-08-05 14:43:07 +02:00
Johannes Schauer Marin Rodrigues	1e8557cef1	src/img2pdf_test.py: drop check for endianness for tests where it does not matter IM7 defaults to big-endian on architectures other than x86 even if they are little endian: https://github.com/ImageMagick/ImageMagick/issues/6300 Closes: #152	2023-08-05 14:42:48 +02:00
Johannes Schauer Marin Rodrigues	29921eeabd	the default PDF/A icc profile is /usr/share/color/icc/sRGB.icc, /usr/share/color/icc/OpenICC/sRGB.icc or /usr/share/color/icc/colord/sRGB.icc depending on which one exists	2023-06-11 21:56:21 +02:00
Johannes Schauer Marin Rodrigues	33139612f8	src/img2pdf_test.py: make endianness dependent on sys.byteorder (closes: #152)	2023-06-11 14:45:09 +02:00
Johannes Schauer Marin Rodrigues	64d27f4a8b	src/img2pdf_test.py: allow Bilevel as well as Grayscale type for png_gray1_img (closes: #161)	2023-06-11 13:24:30 +02:00
Johannes Schauer Marin Rodrigues	85cbe1d128	factor out argparse.ArgumentParser to allow for generating completions via shtab	2023-06-11 08:09:46 +02:00
Johannes Schauer Marin Rodrigues	b25429a4c1	src/img2pdf_test.py: add tests for timestamps	2023-06-11 08:01:36 +02:00
Johannes Schauer Marin Rodrigues	c703e9df06	fix date(1) based timestamp parser	2023-06-11 07:48:23 +02:00
Johannes Schauer Marin Rodrigues	79e9985f35	src/img2pdf_test.py: black	2023-06-11 07:47:22 +02:00
Johannes Schauer Marin Rodrigues	cb2644c34f	do not include thumbnails in the output by default unless --include-thumbnails is used This is relevant for the MPO format which otherwise would result in PDF files containing the same image in different sizes multiple times. With this change, the default is to only have a single page containing the full MPO. This means that extracting that MPO also gets the thumbnails back. With the --include-thumbnails option, each frame gets stored on its own page as it is done for multi-frame GIF, for example. Closes: #135	2023-06-11 07:31:07 +02:00
Patrick McCarty	81502f21af	Convert creation/modification dates to UTC (fixes #155) Ensure that timezones are correctly interpreted in the input by calling `.astimezone()` as appropriate on datetime objects, and store the resulting date fields as UTC. One could argue that datetimes in the local timezone be stored in the PDF, but then the date string handling becomes more complicated; the PDF and XMP date specs both use the `Z` suffix to indicate UTC time, but other +/- offsets require different syntax between the two specs.	2023-06-10 17:53:03 -07:00
Johannes Schauer Marin Rodrigues	0cbcb8fa12	avoid converting palette PNG with alpha to RGB (closes: #158)	2023-06-08 08:54:37 +02:00
Johannes Schauer Marin Rodrigues	e9e04b6dd9	extend comments around dropping ICC profile stored by GIMP for bilevel input	2023-06-08 08:53:22 +02:00
Johannes Schauer Marin Rodrigues	fc059ee471	use quotes around caret in examples for windows users Closes: #167	2023-06-08 07:14:17 +02:00
Johannes Schauer Marin Rodrigues	25466113e9	another small fixup for the last commit	2023-05-30 08:06:36 +02:00
Johannes Schauer Marin Rodrigues	7405635b72	only check whether icc profile can be dropped if there is any	2023-05-30 07:10:32 +02:00
Johannes Schauer Marin Rodrigues	aea472101b	strip off RGB color profile from bilevel TIFF images produced by gimp (closes: #164)	2023-05-30 06:25:26 +02:00
Johannes Schauer Marin Rodrigues	7fa67bb337	demote print() to logger.debug()	2023-05-29 09:25:21 +02:00
Johannes Schauer Marin Rodrigues	7d40569aa1	Inform the user what is happening when running without any arguments and suggest using --help to get the help text (closes: #156)	2023-05-28 15:25:28 +02:00
Johannes Schauer Marin Rodrigues	83f9c32328	appveyor.yml: try out --console --nowindowed	2023-05-28 15:25:28 +02:00
Johannes Schauer Marin Rodrigues	be8369373f	pass deterministic_id=True to writer.save() for pikepdf >= 6.2.0 Closes: #150	2022-10-16 14:13:35 +02:00
Johannes Schauer Marin Rodrigues	10c6901fa3	src/img2pdf_test.py: do not test the depth attribute and rely on baseDepth closes: #119	2022-09-23 23:10:53 +02:00
Johannes Schauer Marin Rodrigues	57d7e07e6b	Support imagemagick 7.1.0-48 - the output of -metric PSNR changed - CMYK output can now be exactly compared closes: #148	2022-09-15 04:36:16 +02:00
Johannes Schauer Marin Rodrigues	272fe0433f	allow pathlib.Path objects by allowing objects implementing read_bytes function	2022-07-02 21:19:34 +02:00
Johannes Schauer Marin Rodrigues	ef7b9e739d	add miff tests for cmyk8 and rgb8	2022-07-02 20:39:18 +02:00
Johannes Schauer Marin Rodrigues	af6fe27d53	avoid match/case for now until python 3.10 is available on more platforms	2022-06-28 14:22:14 +01:00
Johannes Schauer Marin Rodrigues	bad6fcae39	support for MIFF which allows 16 bit CMYK images closes: #144	2022-06-27 13:22:07 +01:00
Johannes Schauer Marin Rodrigues	d9b90499f3	README.md: compare to econvert (closes: #143)	2022-05-18 13:08:05 +02:00
Johannes Schauer Marin Rodrigues	edb0d29a14	README.md: fix link	2022-05-13 21:27:12 +02:00
Johannes Schauer Marin Rodrigues	bb3e8b0098	README.md: document that img2pdf.exe can now be downloaded via release	2022-05-13 21:25:37 +02:00
Johannes Schauer Marin Rodrigues	f454ebc6a6	release version 0.4.4	2022-04-07 22:40:36 +02:00
mara0004	c3db273e23	Remove outdated readme entry concerning JP2 colorspace If I understood the code in `jp2.py` correctly, this should now work. Moreover, Pillow should usually be able to open JP2 files, so `jp2.py` is only a fallback.	2022-04-07 22:08:41 +02:00
Johannes Schauer Marin Rodrigues	87afabd3cf	add .mailmap	2022-04-07 22:08:18 +02:00
homocomputeris	5045282cc2	Add B and JB paper sizes	2022-04-07 22:02:16 +02:00
Johannes Schauer Marin Rodrigues	fb4b96452a	reformat with black	2022-04-07 21:58:34 +02:00


433 commits
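
The UTC conversion described in commit 81502f21af can be illustrated with a minimal sketch. This is not the code from `src/img2pdf.py`; the helper names `to_utc`, `pdf_date`, and `xmp_date` are hypothetical and exist only for this example. It assumes the approach the commit message describes: call `.astimezone()` on the datetime and emit the result with the `Z` suffix, which both the PDF and XMP date formats accept for UTC.

```python
from datetime import datetime, timedelta, timezone


def to_utc(dt):
    # Hypothetical helper: convert any datetime to an aware UTC datetime.
    # For a naive datetime, astimezone() assumes the system local timezone.
    return dt.astimezone(timezone.utc)


def pdf_date(dt):
    # PDF date string form with the UTC designator: D:YYYYMMDDHHmmSSZ
    return to_utc(dt).strftime("D:%Y%m%d%H%M%SZ")


def xmp_date(dt):
    # XMP date string form with the UTC designator: YYYY-MM-DDThh:mm:ssZ
    return to_utc(dt).strftime("%Y-%m-%dT%H:%M:%SZ")


# Example input: the timestamp of the commit itself, 2023-06-10 17:53:03 -07:00
stamp = datetime(2023, 6, 10, 17, 53, 3, tzinfo=timezone(timedelta(hours=-7)))
print(pdf_date(stamp))  # D:20230611005303Z
print(xmp_date(stamp))  # 2023-06-11T00:53:03Z
```

Storing UTC keeps both formatters trivial. A non-UTC offset would have to be written as `+HH'mm` in the PDF string but as `+hh:mm` in the XMP string, which is the extra complexity the commit message mentions.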