Giovanni Azua
Hello!
I have a strong need for the following: given a set of PDF files
scattered across multiple directories, build a global index that lists,
for every index term, the file names and the pages where that term
occurs. A really nice-to-have would be to "parse" formulas, but I
guess these are stored as images ...
Before I go ahead and build a solution using Apache PDFBox and/or iText,
can anyone advise whether such a solution already exists, even a
commercial one? I have already googled for this ...
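To make the requirement concrete, here is a minimal sketch in plain Java of the index I have in mind. It assumes the text of each page has already been extracted into one string per page (with PDFBox this could probably be done by running a PDFTextStripper restricted to a single page via setStartPage and setEndPage, but that step is not shown here). The class name PdfTermIndex, the build method, the tokenization rule and the sample file names are all made up for illustration.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical helper, not part of PDFBox or iText.
class PdfTermIndex {

    // Input:  file name -> text of each page, in page order (extraction not shown).
    // Output: term -> (file name -> sorted 1-based page numbers).
    static Map<String, Map<String, SortedSet<Integer>>> build(Map<String, List<String>> pagesByFile) {
        Map<String, Map<String, SortedSet<Integer>>> index = new TreeMap<>();
        for (Map.Entry<String, List<String>> file : pagesByFile.entrySet()) {
            List<String> pages = file.getValue();
            for (int i = 0; i < pages.size(); i++) {
                // Naive tokenization (an assumption): lower-case the page text
                // and split on anything that is not a letter or a digit.
                String[] terms = pages.get(i).toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+");
                for (String term : terms) {
                    if (term.length() < 3) {
                        continue; // skip empty and very short tokens
                    }
                    index.computeIfAbsent(term, t -> new TreeMap<>())
                         .computeIfAbsent(file.getKey(), f -> new TreeSet<>())
                         .add(i + 1);
                }
            }
        }
        return index;
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for extracted page text.
        Map<String, List<String>> pagesByFile = new LinkedHashMap<>();
        pagesByFile.put("lectures/week1.pdf",
                List.of("Markov chains and stationary distributions", "Mixing time of a Markov chain"));
        pagesByFile.put("papers/survey.pdf", List.of("A survey of mixing bounds"));
        build(pagesByFile).forEach((term, postings) -> System.out.println(term + " -> " + postings));
    }
}
```

A real solution would also need stop-word filtering, stemming, and probably multi-word terms; formulas stored as images would not show up in the extracted text at all.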
My use case is a very critical open-book exam, but there are no
books, only a bunch of dense PDF papers and lectures (a lot of them).
If I get such an index, I might get an edge here.
TIA,
Best regards,
Giovanni
-- Giovanni