NAME

PDF::Make::Extract::Word - A word with position and font information

METHODS

text()

The word's text content (UTF-8).

x0(), y0(), x1(), y1()

Bounding box coordinates in PDF user space units (1/72 inch by default).

bbox()

Returns (x0, y0, x1, y1) as a list.

width()

Width of the word bounding box.

height()

Height of the word bounding box.
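
The relationship between bbox(), width() and height() can be sketched with plain numbers. This is only an illustration: the coordinate values below are made up, and it assumes that width and height are simply the horizontal and vertical extents of the box, which this page does not state explicitly.

```perl
use strict;
use warnings;

# Hypothetical values standing in for the list returned by $word->bbox().
my ($x0, $y0, $x1, $y1) = (72, 700, 120.5, 712);

# Assumption: width and height are the extents of the box along each axis.
# abs() keeps the result non-negative whichever corner comes first.
my $width  = abs($x1 - $x0);
my $height = abs($y1 - $y0);

printf "width=%.1f height=%.1f\n", $width, $height;
```

With a real Word object, the same values would presumably be available directly from width() and height().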

font_size()

The font size used to render this word.

mcid()

For tagged PDFs, the marked-content identifier (MCID) of the BDC/EMC block that contained this word's glyphs. Returns undef when the word is not inside a marked-content block.

tag()

For tagged PDFs, the PDF structure role (e.g. H1, P, Figure) associated with this word via /StructTreeRoot. Returns undef when the word is not tagged.
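
Because both mcid() and tag() may return undef, callers generally need to check for definedness before using the result. The following is a minimal sketch of grouping words by structure role. Local::FakeWord is a hypothetical stand-in that exposes only the accessors documented above; it is not part of this distribution, and real Word objects are produced by the extractor rather than constructed like this.

```perl
use strict;
use warnings;

# Hypothetical stand-in exposing only the accessors documented here.
package Local::FakeWord {
    sub new  { my ($class, %args) = @_; return bless {%args}, $class }
    sub text { $_[0]{text} }
    sub mcid { $_[0]{mcid} }
    sub tag  { $_[0]{tag} }
}

my @words = (
    Local::FakeWord->new(text => 'Overview', mcid => 0, tag => 'H1'),
    Local::FakeWord->new(text => 'Hello',    mcid => 1, tag => 'P'),
    Local::FakeWord->new(text => 'world',    mcid => 1, tag => 'P'),
    Local::FakeWord->new(text => '3'),    # untagged, e.g. a page number
);

# Bucket word text by structure role; undef tags go under a placeholder key.
my %by_tag;
for my $word (@words) {
    my $tag = $word->tag // '(untagged)';
    push @{ $by_tag{$tag} }, $word->text;
}

# An MCID of 0 is valid, so test with defined() rather than truthiness.
my $with_mcid = grep { defined $_->mcid } @words;

for my $tag (sort keys %by_tag) {
    print "$tag: @{ $by_tag{$tag} }\n";
}
print "words inside marked content: $with_mcid\n";
```

The defined-or operator (//) and defined() are used instead of a plain boolean test so that an MCID of 0 is not mistaken for a missing value.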

SEE ALSO

PDF::Make::Extract::Line