Reading the first line of a file (in a zipfile)

mike.aldrich · Apr 11, 2007

Hi folks,
I am trying to read the first occurence of non-whitespace in a file,
within a zipfile. Here is my code:

zipnames = glob.glob("<search_dir>*")
for zipname in zipnames:
z = zipfile.ZipFile(zipname, "r")
for filename in z.namelist():
count = len(z.read(filename).split('\n'))
if fnmatch.fnmatch(filename, "*AUDIT*"):
test = filename.split(' ')
print 'File:', test[0],
bytes = z.read(filename)
print 'has', len(bytes), 'bytes'
print 'and', count, 'lines'

The first line in the file I am examining will be a number followed by
more whitespace. Looks like I cannot split by whitespace?

Larry Bates · Apr 11, 2007

Hi folks,
I am trying to read the first occurence of non-whitespace in a file,
within a zipfile. Here is my code:

zipnames = glob.glob("<search_dir>*")
for zipname in zipnames:
z = zipfile.ZipFile(zipname, "r")
for filename in z.namelist():
count = len(z.read(filename).split('\n'))
if fnmatch.fnmatch(filename, "*AUDIT*"):
test = filename.split(' ')
print 'File:', test[0],
bytes = z.read(filename)
print 'has', len(bytes), 'bytes'
print 'and', count, 'lines'

The first line in the file I am examining will be a number followed by
more whitespace. Looks like I cannot split by whitespace?

You have told split to split on single blank space not whitespace.
To split on whitespace use .split() (e.g. no arguments)

-Larry

Gabriel Genellina · Apr 11, 2007

En Wed said:
Hi folks,
I am trying to read the first occurence of non-whitespace in a file,
within a zipfile. Here is my code:

zipnames = glob.glob("<search_dir>*")
for zipname in zipnames:
z = zipfile.ZipFile(zipname, "r")
for filename in z.namelist():
count = len(z.read(filename).split('\n'))
if fnmatch.fnmatch(filename, "*AUDIT*"):
test = filename.split(' ')
print 'File:', test[0],
bytes = z.read(filename)
print 'has', len(bytes), 'bytes'
print 'and', count, 'lines'

The first line in the file I am examining will be a number followed by
more whitespace. Looks like I cannot split by whitespace?

Your code does nothing with the first line on the file; you only split the
*filename* on whitespace. And you extract the file twice.
You don't even try to find "the first occurence of non-whitespace". Surely
an example of file contents and what output you really expect from it
would be adequate.

mike.aldrich · Apr 11, 2007

En Wed, 11 Apr 2007 16:13:42 -0300, <[email protected]> escribió:

Hi folks,
I am trying to read the first occurence of non-whitespace in a file,
within a zipfile. Here is my code:

Click to expand...

zipnames = glob.glob("<search_dir>*")
for zipname in zipnames:
z = zipfile.ZipFile(zipname, "r")
for filename in z.namelist():
count = len(z.read(filename).split('\n'))
if fnmatch.fnmatch(filename, "*AUDIT*"):
test = filename.split(' ')
print 'File:', test[0],
bytes = z.read(filename)
print 'has', len(bytes), 'bytes'
print 'and', count, 'lines'

Click to expand...

The first line in the file I am examining will be a number followed by
more whitespace. Looks like I cannot split by whitespace?

Click to expand...

Your code does nothing with the first line on the file; you only split the
*filename* on whitespace. And you extract the file twice.
You don't even try to find "the first occurence of non-whitespace". Surely
an example of file contents and what output you really expect from it
would be adequate.

The file contents have leading whitespace, then a number:
123456 \n
I expect to return '123456'

Gabriel Genellina · Apr 11, 2007

En Wed said:
The file contents have leading whitespace, then a number:
123456 \n
I expect to return '123456'

And nothing following the number?

py> line = " 123456 \n"
py> print line.strip()
123456

mike.aldrich · Apr 13, 2007

And nothing following the number?

py> line = " 123456 \n"
py> print line.strip()
123456

That works fine if I am using the interpreter, but I get 'cannot open
file' when i try to read from an archive..
Does that make sense? Sorry, this is my 2nd python script.

Gabriel Genellina · Apr 15, 2007

En Fri said:
That works fine if I am using the interpreter, but I get 'cannot open
file' when i try to read from an archive..
Does that make sense? Sorry, this is my 2nd python script.

Try a small, failing example and post the code and the full error
traceback - else it's hard to tell what's happening.

7stud · Apr 15, 2007

Hi folks,

The first line in the file I am examining will be a number followed by
more whitespace. Looks like I cannot split by whitespace?

but I get 'cannot open
file' when i try to read from an archive..

....and that led you to conclude that you cannot split by whitespace?

zipfile extracting png files corrupt	3	Oct 17, 2009
I made a blockchain and want to make a cryptocurrency, but my code doesn't verify hash of each block	2	Jun 2, 2024
Python 3.2 bug? Reading the last line of a file	10	May 25, 2011
Trouble with prediction code, for the life of me I can't figure out why it isnt running properly. Help would be appreciated.	0	Jul 8, 2023
Zipfile module errors	2	Jun 4, 2008
When the first line of a file tells something about the other lines	1	Aug 16, 2010
Define a class containing methods for reading a file and then storelines in external variable	1	Aug 15, 2013
Help for my project in the last minute	0	Apr 23, 2022

Reading the first line of a file (in a zipfile)

mike.aldrich

Larry Bates

Gabriel Genellina

mike.aldrich

Gabriel Genellina

mike.aldrich

Gabriel Genellina

7stud

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads