Perl and modifying a PDF document

Thread starter January Weiner
Start date Oct 30, 2007

January Weiner

Oct 30, 2007

Hello,

I have this cool little script for content analysis of some documents.
Currently, it takes simple text and finds certain phrases, then creates a
html which shows the phrases in bold in a context.

However, the documents are initilally all PDF. Right now I just run
pdftotext first and proceed from there. Of course, all images, formatting
etc. are lost and if you want to precisely track back what happens where,
you need to go back to the original document (preferably with a printout
and a pencil).

What I would like to do is to take directly the PDF file and modify it in
such a way that the phrases are shown in red.

Can this be done? If yes, could you point me to which of the numerous PDF
Perl modules should I use? Or, even better, give me examples how this
can be done?

Regards,

January

--

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads

Digital Signature field form in PDF generated document from HTML	5	Nov 16, 2022
Creating a direct download div link for pdf file	3	Mar 19, 2023
How to use PDF-lib and how to center each line of texts on the page?	1	Aug 16, 2023
How can I view / open / render / display a pdf file with c code?	0	Sep 23, 2023
PDF, Excel, LaTeX, and possibly R and sweave	7	Nov 1, 2013
modifying a time.struct_time	8	Dec 16, 2011
Cam::pdf, document page number	0	Mar 6, 2007
PDF::Create - text colour	1	Jun 4, 2009

Facebook Twitter Reddit Pinterest Tumblr WhatsApp Email Link

Members online

RogerDoger

Total: 90 (members: 3, guests: 87)
Robots: 453

Forum statistics

Threads: 473,995

Messages: 2,570,230

Members: 46,819

Latest member: masterdaster

Latest Threads

How to Merge to div with each other as shadow effect?
- Started by treekmostly22
- Today at 7:58 AM
Syntax error
- Started by RGIANNETTI
- Yesterday at 7:13 PM
SYNTAX ERROR
- Started by RGIANNETTI
- Yesterday at 7:10 PM
Right or wrong
- Started by Tobi1987
- Yesterday at 6:34 AM
Hello , Im Emilio
- Started by Mercury_Dev
- Yesterday at 5:55 AM
Anyone want to balance my browser-based clicker game?
- Started by timo
- Wednesday at 11:16 PM
DNS Disaster: The Server Downfall
- Started by Infinityhost
- Tuesday at 5:04 AM
Web-Based RAM Management: Real-Time Server Control in Windows 10
- Started by Infinityhost
- Tuesday at 5:00 AM
Create and Preview HTML & PDF with Custom Encryption and Micro Cloud Storage
- Started by Infinityhost
- Tuesday at 4:53 AM
Demonstration of a Self-Written HTTP Server for an Online Store
- Started by Infinityhost
- Tuesday at 4:50 AM

Top