v3.4.0
Version 3.4.0 adds support for predefined CMaps, which significantly improves text extraction for many languages, notably those using Asian fonts. It also includes many fixes and improvements.
Pull requests included:
- #246 Add predefined CMaps for Type 0 composite fonts (@adrg)
- #247 Ignore optional inline image field parameters (@gunnsth)
- #238 Add tolerance for seeking EOF marker between read data buffers (@adrg)
- #233 Improve outline destination parsing (@adrg)
- #232 Extend buffer used for searching EOF marker (@adrg)
- #231 Follow object indirections in PdfPage.GetMediaBox (@samuel)
- #229 Improve outline extraction (@adrg)
- #228 Improve traversal of outline item nodes in the GetOutlinesFlattened method (@adrg)
- #216 Unify and optimize number parsing (@samuel, @gunnsth, @adrg)
- #225 Attempt decryption for invalid crypt filter dictionary type (@adrg)
- #224 Fix Chapter component SetShowNumbering method (@adrg)
- #223 Use UTF-16 encoding for serializing outline item titles (@adrg)