Info regarding Zlib::GzipReader

J-H Johansen · Jun 15, 2007

Hi,

I'm trying to parse through a gzip'ed proxy access log with
Zlib::GzipReader and I'm having some difficulties.

f = File.open(file, "r")
gz = Zlib::GzipReader.new(f)
gz.readlines.each do |block|
puts block
end

What this piece of code will do is to read the first 6 lines of the
proxy log before it reaches (what it believes to be) the end of the
file. These few lines happens to be the info header which contains:

#Software: ......
#Version: ......
#Start-date: ......
#Date: ......
#Fields: ....................
#Remark: ........

The access log contains a wee bit more than that though (980796 lines).
By just using File.open(file) it seems I can read the whole file.

I'm speculating here but I think that maybe the gzip file may have
been buffered. I.e. first 6 lines has been gzip'ed and then the rest
of the file has been gzip'ed and appended to it afterwards.

One way of fixing the problem is to gunzip the file and then gzip the
output into a new file. Problem solved (sort of).

Do any of you know of any other way to do this without actually
modifying the access logs ?

I'm thinking of something along the lines of breaking up the file into
smaller file handles which in turn can be used by GzipReader, but I
don't know how this is done.

Anyone know how this can be done or if there is any better ways of doing it ?

Thanks

Zlib::GzipReader and multiple compressed blobs in a single stream	10	Jan 28, 2011
Zlib::GzipReader doesn't work as expected	5	Apr 25, 2012
Most simple usage of zlib or pr-zlib	4	Mar 9, 2011
open-uri + Zlib: not in gzip format	0	Oct 23, 2010
Zlib file decompression EOL issue	0	Apr 11, 2008
Custom Minecraft launcher client error; I think regarding java	0	Sep 7, 2022
unzipping a gzipped string	3	Aug 22, 2006
Speed gap between zcat and zlib's GzipReader	3	Oct 19, 2004

Info regarding Zlib::GzipReader

J-H Johansen

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads