Pythonise this algorithm ?

news · Jan 19, 2006

Don't you hate the *.ps/*.pdf texts which are arranged in columns
as if it was a newspaper ? Especially when you want to email
a section after using 'pdftotxt'.

I'm guessing that an algorithm to extract colums could work
like this : [assume 2 column, but 3, 4.. should be similar, remember
that the RHS-colm of pageN continues to the LHS-colm of pageN+1]

Initialise;
Repeat (* NextBlok or exit DO *)
BeginBloks:-
Mark the TopLeftCorner -> get(StartRow,StartColm);
Mark the BotmRightCorner -> get(EndRow,EndColm);
Extract the Blok's text :-
For Row = StartRow to EndRow;
For Colm = StartColm to EndColm
PutCharToBufr;
DoLineTerminator;
Until ExitBloks.

Obviously the nesting is: Bloks > Rows > Colms.

Then it can be morphed to clean up the ">>>>" in newsgroup
threads as the lines get too long for the extra ">" ?

Thanks for any input,

== Chris Glur.

Can't seem to start on this	0	Jan 3, 2013
Which is a better way to implement this algorithm?	19	Apr 20, 2008
Algorithm for performing a rollup	25	Mar 17, 2007
Saving array to a file	0	Nov 25, 2004
Qquestion on Shortest paths algorithm	5	Jul 26, 2006
A Practical Introduction to Data Structures and Algorithm Analysis2Ed by Shaffer	0	Feb 4, 2010
Cryptographic service provider (CSP) could not be found for this algorithm.	3	Sep 7, 2004
Can I perform this Parse in perl? (Non standard address)	2	Aug 12, 2007

Pythonise this algorithm ?

news

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads