Include name
property in outputs of page.images
#1201
djr2015
started this conversation in
Show and tell
Replies: 1 comment
-
Thanks! That's a relatively straightforward add, now available on |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
When working with images inside pdfs, how about also recording the object's id when computing
page.images
via:attr["name"]=obj.name
here: https://github.com/jsvine/pdfplumber/blob/stable/pdfplumber/page.py#L369
Doing this enables mapping images' coordinates and dimensions to their identifiers as recorded in the pdf (e.g.
/Im0
) which is useful for downstream tasks involving other pdf parsing libraries.Beta Was this translation helpful? Give feedback.
All reactions