Skip to content

v3.4.0

Compare
Choose a tag to compare
@gunnsth gunnsth released this 10 Feb 14:57
· 235 commits to master since this release
11e21cd

Version 3.4.0 adds support for pre-defined CMaps which improves text extraction significantly for many languages, notably for ones using Asian fonts. In addition there are many fixes and improvements.

Pull requests included:

  • #246 Add predefined CMaps for Type 0 composite fonts (@adrg)
  • #247 Ignore optional inline image field parameters (@gunnsth)
  • #238 Add tolerance for seeking EOF marker between read data buffers (@adrg)
  • #233 Improve outline destination parsing (@adrg)
  • #232 Extend buffer used for searching EOF marker (@adrg)
  • #231 Follow object indirections in PdfPage.GetMediaBox (@samuel)
  • #229 Improve outline extraction (@adrg)
  • #228 Improve traversal of outline item nodes in the GetOutlinesFlattened method (@adrg)
  • #216 Unify and optimize number parsing (@samuel, @gunnsth, @adrg)
  • #225 Attempt decryption for invalid crypt filter dictionary type (@adrg)
  • #224 Fix Chapter component SetShowNumbering method (@adrg)
  • #223 Use utf-16 encoding for serializing outline item titles (@adrg)