isplit

bearophileHUGS · Jan 26, 2006

I have a file of lines that contains some extraneous chars, this the
basic version of code to process it:

IDtable = "".join(map(chr, xrange(256)))
text = file("...", "rb").read().translate(IDtable, toRemove)
for raw_line in file(file_name):
line = raw_line.translate(IDtable, toRemove)
...

A faster alternative:

IDtable = "".join(map(chr, xrange(256)))
text = file(file_name).read().translate(IDtable, toRemove)
for line in text.split("/n"):
...

But text.split requires some memory if the text isn't small.
Probably there are simpler solutions (solutions with the language as it
is now), but one seems the following, an:

str.isplit()
or
str.itersplit()
or
str.xsplit()
Like split, but iterative.

(Or even making str.split() itself an iterator (for Py3.0), and
str.listsplit() to generate lists.)
(At the moment a simple RE can probably work as the isplit.)

Bye,
bearophile

String multi-replace	2	Nov 18, 2010
hex dump w/ or w/out utf-8 chars	40	Jul 8, 2013
Newbie...	2	Feb 24, 2011
Output confusion	2	Mar 9, 2023
Simple converter of files into their hex components... but i can'tarrange utf-8 parts!	2	Jun 9, 2013
imap vs map	1	Mar 5, 2010
performance of script to write very long lines of random chars	15	Apr 11, 2013
Trying to write a first plain ruby script	2	Jul 11, 2008

isplit

bearophileHUGS

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads