Package org.apache.pdfbox.pdfparser
Class FDFParser
java.lang.Object
org.apache.pdfbox.pdfparser.BaseParser
org.apache.pdfbox.pdfparser.COSParser
org.apache.pdfbox.pdfparser.FDFParser
-
Field Summary
FieldsFields inherited from class org.apache.pdfbox.pdfparser.COSParser
EOF_MARKER, fileLen, initialParseDone, OBJ_MARKER, securityHandler, source, SYSPROP_EOFLOOKUPRANGE, SYSPROP_PARSEMINIMAL, TMP_FILE_PREFIX, xrefTrailerResolver
Fields inherited from class org.apache.pdfbox.pdfparser.BaseParser
A, ASCII_CR, ASCII_LF, B, D, DEF, document, E, ENDOBJ_STRING, ENDSTREAM_STRING, J, M, MAX_LENGTH_LONG, N, O, R, S, seqSource, STREAM_STRING, T
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprivate void
init()
private void
The initial parse will first parse only the trailer, the xrefstart and all xref tables to have a pointer (offset) to all the pdf's objects.protected final boolean
isCatalog
(COSDictionary dictionary) Tell if the dictionary is a FDF catalog.void
parse()
This will parse the stream and populate the COSDocument object.Methods inherited from class org.apache.pdfbox.pdfparser.COSParser
checkPages, getAccessPermission, getDocument, getEncryption, getStartxrefOffset, isLenient, lastIndexOf, parseCOSStream, parseDictObjects, parseFDFHeader, parseObjectDynamically, parseObjectDynamically, parsePDFHeader, parseTrailerValuesDynamically, parseXref, parseXrefTable, rebuildTrailer, retrieveTrailer, setEOFLookupRange, setLenient
Methods inherited from class org.apache.pdfbox.pdfparser.BaseParser
isClosing, isClosing, isDigit, isDigit, isEndOfName, isEOL, isEOL, isSpace, isSpace, isWhitespace, isWhitespace, parseBoolean, parseCOSArray, parseCOSDictionary, parseCOSName, parseCOSString, parseDirObject, readExpectedChar, readExpectedString, readExpectedString, readGenerationNumber, readInt, readLine, readLong, readObjectNumber, readString, readString, readStringNumber, skipSpaces, skipWhiteSpaces
-
Field Details
-
LOG
private static final org.apache.commons.logging.Log LOG
-
-
Constructor Details
-
FDFParser
Constructs parser for given file using memory buffer.- Parameters:
filename
- the filename of the pdf to be parsed- Throws:
IOException
- If something went wrong.
-
FDFParser
Constructs parser for given file using given buffer for temporary storage.- Parameters:
file
- the pdf to be parsed- Throws:
IOException
- If something went wrong.
-
FDFParser
Constructor.- Parameters:
input
- input stream representing the pdf.- Throws:
IOException
- If something went wrong.
-
-
Method Details
-
isCatalog
Tell if the dictionary is a FDF catalog. -
init
private void init() -
initialParse
The initial parse will first parse only the trailer, the xrefstart and all xref tables to have a pointer (offset) to all the pdf's objects. It can handle linearized pdfs, which will have an xref at the end pointing to an xref at the beginning of the file. Last the root object is parsed.- Throws:
IOException
- If something went wrong.
-
parse
This will parse the stream and populate the COSDocument object.- Throws:
IOException
- If there is an error reading from the stream or corrupt data is found.
-