K2pdfopt Changelog

What's new in K2pdfopt 2.33

Nov 19, 2015

NEW FEATURES:
Compiled with GCC v5.2.0 and MuPDF v1.7a (released May 7, 2015). The MuPDF upgrade involved modifying a significant amount of the MuPDF interface code in the willus library since Artifex changed the APIs on several functions, but the bulk of the logic did not change. I uncovered a bug in the pdf_dict_del() function as well (reported).
The -i option displays information about the source PDF file. Added to MS Windows GUI also.
Added -fr option to rotate wide-aspect-ratio figures to landscape. http://www.mobileread.com/forums/showthread.php?p=3060339#post3060339
Added Kindle Paperwhite 3 (2015 release) and Pocketbook Basic 2 to dev list (from http://www.mobileread.com/forums/showthread.php?t=253579)
Smarter sorting of red regions on a multiple-column page. See pageregion_sort() function in pageregions.c.
New -ibox option has same format as -cbox, but these boxes are ignored by k2pdfopt--they are "whited out" in the source file. For native output, the contents may still be visible in the output.
The -neg option now attempts to only negate text passages to white on black and to leave figures alone. Use -neg+ to negate everything. http://www.mobileread.com/forums/showthread.php?p=3104536#post3104536
Added option -ehl to erase horizontal lines in the document. Works exactly like the -evl option.
Added -author and -title options to specify the author and title of the output PDF. http://www.mobileread.com/forums/showthread.php?p=3112052#post3112052
Added -px option to exclude a set of pages, e.g. -px 4,7,10-20. http://www.mobileread.com/forums/showthread.php?p=3112052#post3112052
User can use color markings to tell k2pdfopt where to apply page breaks to the output file. http://www.mobileread.com/forums/showthread.php?p=3152988#post3152988
The -? option can now be followed by a (wildcard) matching string to show the usage of a particular option, e.g. -? -ws.
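As an illustration of the page-list format used by -px above (e.g. 4,7,10-20), here is a minimal Python sketch of how such a list could be expanded and applied. It is not taken from the k2pdfopt source (which is written in C); the function names parse_page_list and pages_to_process are hypothetical, and the real option may accept forms that this sketch does not handle.

```python
def parse_page_list(spec):
    """Expand a page list such as "4,7,10-20" into a sorted list of page numbers.

    Assumption: items are separated by commas and a range is written as
    first-last. Anything beyond that is outside this sketch.
    """
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            first, last = part.split("-", 1)
            lo, hi = int(first), int(last)
            if lo > hi:
                lo, hi = hi, lo  # accept reversed ranges such as 20-10
            pages.update(range(lo, hi + 1))
        else:
            pages.add(int(part))
    return sorted(pages)


def pages_to_process(page_count, exclude_spec):
    """Return the 1-based source pages that remain after exclusion."""
    excluded = set(parse_page_list(exclude_spec))
    return [p for p in range(1, page_count + 1) if p not in excluded]


if __name__ == "__main__":
    # A 12-page document with pages 4, 7 and 10-20 excluded.
    print(pages_to_process(12, "4,7,10-20"))
```

Using a set for the excluded pages keeps the membership test cheap and makes overlapping entries (for example 4,4-6) harmless.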
BUG FIXES:
With notes options turned on (-nl / -nr), k2pdfopt will still search for multiple columns if no notes are found on the page. In addition, the -crgh option now more directly affects column divider finding. See textrows_remove_small_rows() call in bmpregion_find_multicolumn_divider(). http://www.mobileread.com/forums/showthread.php?p=3148589#post3148589
Fixed multiple file select (broke when I converted to wide chars in v2.30).
Modified bmpregion_hyphen_detect() to be less strict about rejecting hyphens that aren't exactly centered. Also modified calculation of lcheight in bmpregion_calc_bbox()--see the function. http://www.mobileread.com/forums/showthread.php?p=3119501#post3119501
The k2pdfopt web site and help pages work again from the help menu.
Turned off some debugging text from the bmp_autocrop2 function in k2bmp.c.
Not really a bug fix, but the command-line help is now shown in Courier New in MS Windows (a mono-spaced font).
In info_update() in wmupdf.c in the willus library, I now check whether the Info dictionary can be resolved, i.e. whether it can be parsed correctly. If not, I discard the dictionary. This fixes a bug that a user reported to me by e-mail on 15 April 2015; the user had a PDF file with a corrupt "Info" dictionary.
WPDFOUTLINE structures correctly freed.
MuPDF v1.7 stores ligatured characters differently than previous versions in its internal character arrays, so I had to compensate for this.

New in K2pdfopt 2.32 (Mar 7, 2015)

New in K2pdfopt 2.31 (Dec 29, 2014)

New in K2pdfopt 2.30 (Nov 27, 2014)

New in K2pdfopt 2.21 (Jul 26, 2014)

New in K2pdfopt 2.18 (Jul 4, 2014)

New in K2pdfopt 2.17a (Jun 3, 2014)

New in K2pdfopt 2.17 (May 19, 2014)

New in K2pdfopt 2.16 (May 5, 2014)

New in K2pdfopt 2.15 (Mar 28, 2014)

New in K2pdfopt 2.14 (Jan 3, 2014)

New in K2pdfopt 2.12 (Dec 3, 2013)

New in K2pdfopt 2.10 (Nov 25, 2013)

NEW FEATURES:
The PDF "Outlines" tree (often called "bookmarks" by PDF viewers) that helps you navigate the PDF file and is usually shown in the left pane of the PDF viewer is now preserved in the converted file. Or you can create your own bookmarks from a simple text file if your PDF source file doesn't have one (or if you want to change it). See the -toc, -toclist, and -tocsave command-line options. (toc = Table of Contents.) Destination page breaks are forced at outline anchor pages by default (see -bp option).
A new -cbox option allows you to specify a crop box to be applied to each page. You can specify more than one, and each separate crop box will be rendered to a different output page, similar to the way the -grid option works. See -cbox in the command usage. Using -mode crop with -cbox, you can crop a source PDF file to a destination PDF file. You can specify different crop boxes for even and odd pages, as well.
The -bpl option now allows you to specify a list of source pages where destination page breaks will be forced.
Three new modes: -mode trim causes the source page to be trimmed and the destination to be sized to the trimmed source. -mode fitpage is similar, but squeezes the trimmed source page into the specified device output screen size. -mode crop is a complement to the -cbox option and causes each cropped box to be placed on a new page the size of the cropped box.
ENHANCEMENTS:
Windows versions are compiled with gcc 4.8.2.
The Win64 binary is now compressed with UPX 3.91w, which can finally compress the Win64/PE format.
BUG FIXES:
In native output, consecutive streams now delimited by white space. http://www.mobileread.com/forums/showthread.php?p=2655550#post2655550
Pages with no "/Contents" entry are correctly handled.
Re-wrote masterinfo_break_point() to make use of bmpregion_find_textrows() so that decisions on where to break pages in the "fitwidth" mode should be more consistent and also will be affected by the -gtr option. http://www.mobileread.com/forums/showthread.php?p=2686067#post2686067
Removed last vestiges of -pi option (interactive menu 'w' option was incorrectly still using it).
The vert_line_erase() function in k2bmp.c now correctly handles the cbmp pointer when it is an 8-bit bitmap.
Fixed a flow problem in k2file.c (k2pdfopt_proc_one() function) which was causing the GUI preview not to work with -mode copy.
The textrows_remove_small_rows() function no longer includes figures (REGION_TYPE_FIGURE) when doing statistics on the row heights.
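To show why the native-output fix in the list above delimits consecutive streams with white space, here is a minimal Python sketch. It assumes the problem was that the last token of one content stream fused with the first token of the next when they were joined; the changelog does not state the exact failure, and the sample stream contents and the name join_content_streams are made up for illustration.

```python
def join_content_streams(streams):
    """Concatenate decoded content streams, separated by a newline.

    The separator guarantees that the last token of one stream and the
    first token of the next remain two separate tokens.
    """
    return b"\n".join(streams)


# Two tiny, made-up content streams: each saves the graphics state (q),
# sets a transform (cm), and restores the state (Q).
first = b"q 1 0 0 1 0 0 cm Q"
second = b"q 0.5 0 0 0.5 10 10 cm Q"

naive = first + second                         # "...cm Qq 0.5..." fuses Q and q
joined = join_content_streams([first, second])

if __name__ == "__main__":
    print(naive)
    print(joined)
```

A newline is used here, but any PDF white-space character would serve as the delimiter.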

New in K2pdfopt 2.03 (Sep 23, 2013)

New in K2pdfopt 2.02 (Sep 19, 2013)

New in K2pdfopt 2.01 (Sep 16, 2013)

New in K2pdfopt 1.66 (Jul 24, 2013)

New in K2pdfopt 1.65 (Apr 9, 2013)

NEW FEATURES / OPTIONS:
Added Kobo Glo and Kobo Touch device settings. (http://www.mobileread.com/forums/showpost.php?p=2441354&postcount=336)
Re-vamped the bmp_source_page_add() function so that the logic that breaks the page out into displayable rectangular regions can be used in other places (e.g. by the OCR fill-in function).
Added option -ocrcols which sets the max number of columns for processing with OCR (if different from the -col value). You would use this if you want to OCR a PDF file using -mode copy, but the file has multiple columns of text. (http://www.mobileread.com/forums/showpost.php?p=2442523&postcount=341)
Added option -rsf (row-split figure-of-merit) which controls a new algorithm which goes back and looks for rows of text which should be split into two (or three) separate rows. This is meant to help catch those cases where k2pdfopt should have split apart two rows of text but did not because of a small amount of overlap. See breakinfo_find_doubles() in breakinfo.c.
LIBRARY UPDATES:
Compiled with latest versions of major libraries: MuPDF 1.2, DjVu 3.5.25.3, FreeType 2.4.11, Turbo JPEG 1.2.1, PNG 1.5.14, Z-lib 1.2.7.
Linux version now compiled with gcc 4.7.2 in Ubuntu 12.
TWEAKS:
Clarified usage for -vb in k2usage.c.
Changed "destination" to "E-reader" in places on the k2 interactive menu and device menu.
Added a "disclaimer" to the OCR usage text which clarifies its purpose.
Default crop margins are now zero (was 0.25 inches). This was confusing too many people. (http://www.mobileread.com/forums/showpost.php?p=2456032&postcount=352)
In bmp_region_vertically_break(), different width regions and regions with different ending/starting row heights cause a vertical gap to be inserted in the output.
BUG FIXES:
Call k2pdfopt_settings_sanity_check() once per source document. This fixes a crash when converting multiple files. (Certain vars weren't getting correctly initialized on the 2nd, 3rd, etc. conversion files.) (http://www.mobileread.com/forums/showpost.php?p=2409726&postcount=317)
Fixed array-out-of-bounds access in k2proc.c (bmpregion_find_multicolumn_divider function) which occasionally caused k2pdfopt to terminate abnormally (typically when converting mostly blank pages). (http://www.mobileread.com/forums/showpost.php?p=2456548&postcount=356)
Fixed k2pdfopt_proc_one() in k2file.c so that native PDF output is turned off if the source file is not PDF (e.g. DjVu conversion).
Fixed spacing between regions with -vb -2 or -vb -1 (gap between pages where new chapter starts, for example--font change, etc.). (http://www.mobileread.com/forums/showpost.php?p=2373550&postcount=292)
Minimum width in vertical line detection is now 1 pixel. (http://www.mobileread.com/forums/showpost.php?p=2452356&postcount=345)
Better diagnostic output on TESSDATA_PREFIX env var.
Fixed native PDF output so that scientific notation is not allowed in PDF clipping commands. This was causing native conversions not to work correctly in some cases. (http://www.mobileread.com/forums/showpost.php?p=2467063&postcount=371)
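The last fix above concerns numbers written into PDF clipping commands. PDF real numbers must be written in plain decimal form, so a value such as 1e-05 is not valid in a content stream. The Python sketch below shows one way to format such values in fixed-point notation. It is an illustration only: the helper names pdf_number and clip_rect and the choice of five decimal places are assumptions, not the C code used in k2pdfopt.

```python
def pdf_number(value, places=5):
    """Format a number for a PDF content stream without an exponent.

    Fixed-point formatting never produces "e" notation; trailing zeros
    and a dangling decimal point are trimmed to keep the output short.
    """
    text = f"{value:.{places}f}"
    if "." in text:
        text = text.rstrip("0").rstrip(".")
    if text in ("-0", ""):
        text = "0"  # tiny negative values round to zero
    return text


def clip_rect(x, y, width, height):
    """Build a rectangle clipping command: "x y w h re W n"."""
    numbers = " ".join(pdf_number(v) for v in (x, y, width, height))
    return numbers + " re W n"


if __name__ == "__main__":
    print(str(0.00001))                       # default conversion uses an exponent
    print(clip_rect(0.00001, 2e-7, 612, 792))
```

Values smaller than the chosen precision collapse to 0, which is usually acceptable for page coordinates measured in points.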

New in K2pdfopt 1.64a (Jan 11, 2013)

New in K2pdfopt 1.64 (Jan 11, 2013)

New in K2pdfopt 1.33 (Nov 29, 2011)