L
LtCommander
Hi all,
I am using the CAM:DF module to extract text from PDF files. (It's an
AMAZING module!!) You can pretty much do everything with it.
My snippet for extracting text from a PDF file is:
$pdf = CAM:DF->new($Fil);
$page = $pdf->getPageText($Pg);
This works fine for all pages without any graphics. I am able to print
the contents of $page without any problems whatsoever. However, if a
particular page has some sort of an inline graphic, the $page returns
an empty value!
I was wondering:
- If somebody knows how to remove all the graphics from the $pdf object
before running the next line of code. I think that should fix it and no
longer return any empty string.
I've tried searching plenty but no luck so far!
Would be grateful for your help.
Vince
I am using the CAM:DF module to extract text from PDF files. (It's an
AMAZING module!!) You can pretty much do everything with it.
My snippet for extracting text from a PDF file is:
$pdf = CAM:DF->new($Fil);
$page = $pdf->getPageText($Pg);
This works fine for all pages without any graphics. I am able to print
the contents of $page without any problems whatsoever. However, if a
particular page has some sort of an inline graphic, the $page returns
an empty value!
I was wondering:
- If somebody knows how to remove all the graphics from the $pdf object
before running the next line of code. I think that should fix it and no
longer return any empty string.
I've tried searching plenty but no luck so far!
Would be grateful for your help.
Vince