Package org.apache.pdfbox.pdfparser
The pdfparser package contains classes to parse PDF documents and objects within the document.
Interface Summary
Interface               Description
PDFXRef
Class Summary
Class                   Description
BaseParser              This class is used to contain parsing logic that will be used by both the PDFParser and the COSStreamParser.
COSParser               PDF-Parser which first reads startxref and xref tables in order to know valid objects and parse only these objects.
FDFParser
PDFObjectStreamParser   This will parse a PDF 1.5 object stream and extract all of the objects from the stream.
PDFParser
PDFStreamParser         This will parse a PDF byte stream and extract operands and such.
PDFXRefStream
PDFXrefStreamParser     This will parse a PDF 1.5 (or better) Xref stream and extract the xref information from the stream.
XrefTrailerResolver     This class will collect all XRef/trailer objects and create correct xref/trailer information after all objects are read using startxref and 'Prev' information (unused XRef/trailer objects are discarded).
Enum Summary
Enum                            Description
XrefTrailerResolver.XRefType    The XRefType of a trailer.
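
The COSParser description says parsing starts from startxref. As a rough illustration of that first step, the sketch below locates the last "startxref" keyword in a byte buffer and reads the decimal offset that follows it. This is not PDFBox code: the class name StartXrefSketch and the method findStartXref are hypothetical, the whole input is assumed to fit in memory, and no repair of damaged files is attempted.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; it does not use or mirror the PDFBox API.
public class StartXrefSketch {

    /**
     * Returns the byte offset written after the last "startxref" keyword,
     * or -1 if the keyword or a valid number after it cannot be found.
     */
    public static long findStartXref(byte[] pdf) {
        // ISO-8859-1 maps every byte to exactly one char, so string
        // indices correspond to byte offsets in the input.
        String text = new String(pdf, StandardCharsets.ISO_8859_1);
        String keyword = "startxref";
        int pos = text.lastIndexOf(keyword);
        if (pos < 0) {
            return -1;
        }
        int i = pos + keyword.length();
        // Skip the whitespace (usually an end-of-line) after the keyword.
        while (i < text.length() && Character.isWhitespace(text.charAt(i))) {
            i++;
        }
        // Collect the run of decimal digits that forms the offset.
        int start = i;
        while (i < text.length() && text.charAt(i) >= '0' && text.charAt(i) <= '9') {
            i++;
        }
        if (i == start) {
            return -1; // keyword present, but no number follows it
        }
        try {
            return Long.parseLong(text.substring(start, i));
        } catch (NumberFormatException e) {
            return -1; // digit run too long to be a usable offset
        }
    }

    public static void main(String[] args) {
        // Made-up tail of a PDF file; 116 is an arbitrary example offset.
        String tail = "trailer\n<< /Size 4 /Root 1 0 R >>\nstartxref\n116\n%%EOF\n";
        System.out.println(findStartXref(tail.getBytes(StandardCharsets.ISO_8859_1)));
        // prints 116
    }
}
```

The returned offset points at the most recent cross-reference section. A resolver in the spirit of XrefTrailerResolver would then follow each trailer's 'Prev' entry back to earlier sections; that part is left out of this sketch.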