[QUIZ] hexdump (#171)

M

Matthew Moss

-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

The three rules of Ruby Quiz 2:

1. Please do not post any solutions or spoiler discussion for this
quiz until 48 hours have passed from the time on this message.

2. Support Ruby Quiz 2 by submitting ideas as often as you can! (A
permanent, new website is in the works for Ruby Quiz 2. Until then,
please visit the temporary website at

<http://splatbang.com/rubyquiz/>.

3. Enjoy!

Suggestion: A [QUIZ] in the subject of emails about the problem
helps everyone on Ruby Talk follow the discussion. Please reply to
the original quiz message, if you can.

-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

## hexdump (#171)

_Quiz idea provided by Robert Dober._

This week's quiz should be quick and easy for experienced Rubyists,
and a good lesson for beginners. Your task this week is to write a
utility that outputs a hex dump of the input.

There are a number of hex dump utilities in existence, that go by the
names `hd`, `od`, `hexdump`... I'm sure there are more. Pick one you'd
like to reproduce: If you're on any variety of Unix or BSD (including
Mac OS X), you can get man pages from the command-line to see how they
work. On Windows, if you don't have one installed, you can check out
this [man page for hexdump][1] and use that as a model.

You are not required to implement all the various command-line
switches, but I should be able to run your script on a file and, as a
minimum, see output resembling this (view with fixed-width font for
best results):

0000000 6573 2074 6c68 0a73 7973 746e 7861 6f20
0000010 0a6e 6f63 6f6c 7372 6863 6d65 2065 6564
0000020 6573 7472 0a0a 6573 2074 7865 6170 646e
0000030 6174 0a62 6573 2074 6174 7362 6f74 3d70
0000040 0a32 6573 2074 6873 6669 7774 6469 6874
0000050 323d 220a 6573 2074 6574 7478 6977 7464
0000060 3d68 3836 0a0a 2022 2051 6f63 6d6d 6e61
0000070 2064 6f74 7220 6665 726f 616d 2074 6170
0000080 6172 7267 7061 7368 6120 646e 6c20 7369
0000090 2e74 6e0a 6f6e 6572 616d 2070 2051 7167
00000a0 0a7d 0a0a
00000a4

Your submission should accept input either from a named file (part of
the command-line arguments) or from standard input if no filename is
provided.

Finally, when submitting, make sure to describe what existing hex dump
program you are emulating/reproducing (if any), and what arguments to
your script are needed, if any, to produce the basic output above.


[1]: http://unixhelp.ed.ac.uk/CGI/man-cgi?hexdump+1
 
J

James Gray

There are a number of hex dump utilities in existence, that go by the
names `hd`, `od`, `hexdump`... I'm sure there are more. Pick one you'd
like to reproduce=85

xxd is my favorite.

James Edward Gray II
 
M

Mikael Høilund

Will you be accepting golfed solutions? Of course you will. :)

-- a,b=%Q=Z,O^NPO\r4_PV\\PI\x15^-\x0\v=,email=%\%%%c\%115%%# Mikael
Hoilund, CTO
okay=%#;hmm=(0...a.size).map{|i|((a-email+2)%128).# of Meta.io
ApS from
chr}.join;!email.gsub!'o',"%c%c"%[3+?0.<<(2),?G.~@];aha=#############
Denmark
hmm.scan(/#{'(.)'*5}/);!puts(email[1..-12]+aha.shift.zip(*aha).join)#
Ruby <3
 
C

Chris Shea

    0000000 6573 2074 6c68 0a73 7973 746e 7861 6f20
    0000010 0a6e 6f63 6f6c 7372 6863 6d65 2065 6564
    0000020 6573 7472 0a0a 6573 2074 7865 6170 646e
    0000030 6174 0a62 6573 2074 6174 7362 6f74 3d70
    0000040 0a32 6573 2074 6873 6669 7774 6469 6874
    0000050 323d 220a 6573 2074 6574 7478 6977 7464
    0000060 3d68 3836 0a0a 2022 2051 6f63 6d6d 6e61
    0000070 2064 6f74 7220 6665 726f 616d 2074 6170
    0000080 6172 7267 7061 7368 6120 646e 6c20 7369
    0000090 2e74 6e0a 6f6e 6572 616d 2070 2051 7167
    00000a0 0a7d 0a0a
    00000a4


Is this really what you dumped, Matthew? I was hoping for something a
little more... comprehensible.

Chris

---

es tlh
systnxao
nocolsrhcme eedestr

es txeapdnat
bes tatsbot=p
2es thsfiwtdiht2="
es tettxiwtd=h86

" Qocmmna dotr feroam taparrgpasha dnl si.tn
oneram p Qqg
}
 
M

Matthew Moss

Is this really what you dumped, Matthew? I was hoping for something a
little more... comprehensible.

Chris

---

es tlh
systnxao
nocolsrhcme eedestr

es txeapdnat
bes tatsbot=3Dp
2es thsfiwtdiht2=3D"
es tettxiwtd=3Dh86

=A0" Qocmmna dotr feroam taparrgpasha dnl si.tn
oneram p Qqg

I believe your endianness is off, sir.
 
M

Matthew Moss

Will you be accepting golfed solutions? Of course you will. :)

Well, sure... Though in this case, I'd somewhat prefer to see nicely
written solutions that offered up more command-line options, such as
those provided by the various utilities. Things like grouping by 1, 2
or 4 bytes; ASCII display; binary/octal; etc.

But golfed solutions are okay, as usual...
 
R

Robert Dober

Well here goes my reference implementation, in good ol' RQ tradition.
Nothing fancy here just 16 bytes per line
with hexaddresses and ASCII output at the right, like the System V hd command.

http://pastie.org/242020

Robert
 
M

Mikael Høilund

Oh hi, I just thought I'd golf a solution. I'm sure other people can =20
do a much better job than I making a full hexdumping suite, so I just =20=

had some fun. Can't seem to get it lower than 78 characters, =20
unfortunately.

i=3D0;$<.read.scan(/.{0,16}/m){puts"%08x "%i+$&.unpack('H4'*8).join(' =20=

');i+=3D16}

Expanded and parenthesified, clarified:

i =3D 0
ARGF.read.scan(/.{0,16}/m) {
puts(("%08x " % i) + $&.unpack('H4'*8).join(' '))
i +=3D 16
}

ARGF (aliased as $<) is the file handle of all file names given in the =20=

arguments concatenated, STDIN if none =97 exactly what we need. The =20
regex to scan matches between 0 and 16 characters (including newline) =20=

greedily. Change it to 1,16 if you don't want the empty line at the end.

Instead of letting the block to scan take an argument, I used a trick =20=

I picked up from the last Ruby Quiz I participated in (Obfuscated =20
Email), and use $& inside the block, which is the last regex match. =20
Saves two characters \o/

The unpack returns an array of eight strings, each of four characters, =20=

with the hexadecimal representation of the ASCII value of two =20
consecutive characters. Fun, fun, fun.
 
M

Martin Boese

I added an ascii column to your solution... now it's about twice the size ;=
=2D)

i =3D 0
$<.read.scan(/.{0,16}/m) {
puts(("%08x " % i) + $&.unpack('H4'*8).join(' ') + ' ['+
$&.split(//).collect { |c| c.inspect[1] =3D=3D 92 ? '.' :c }.join + =
']' )
i +=3D 16
}
 
A

Adam Shelly

Oh hi, I just thought I'd golf a solution. I'm sure other people can
do a much better job than I making a full hexdumping suite, so I just
had some fun. Can't seem to get it lower than 78 characters,
unfortunately.

I added an ascii column to your solution... now it's about twice the size= ;-)

i =3D 0
$<.read.scan(/.{0,16}/m) {
puts(("%08x " % i) + $&.unpack('H4'*8).join(' ') + ' ['+
$&.split(//).collect { |c| c.inspect[1] =3D=3D 92 ? '.' :c }.join += ']' )
i +=3D 16
}

I can't resist golf: I got Martin's solution down to 95 bytes (If you
take out the ascii column it's down to 71).

i=3D0;$<.read.scan(/.{0,16}/m){puts"%08x0 "%i+$&.unpack('H4'*8)*' '+' |
'+$&.tr('^ -~','.');i+=3D1}

Tricks: *' ' is a shorter version of .join(' ') for arrays,
and $&.tr('^ -~','.') says translate any character not between ' ' and
'~' (32 to 126) to a '.' That saved a ton over the
split/collect/inspect method. (By the way, map and dump save a few
bytes over collect and inspect)

I also did a more full-featured version that supports some command line opt=
ions

-Adam
----------------------------------------------------
#hexdump utility for RubyQuiz#171
USAGE=3D<<USAGE

Usage:
#{$0.split(/[\/\\]/)[-1]} [-n length] [-s skip] [-g group] [-w
width] [-a] file

Dumps <length> bytes of <file> in hex format, starting at offset <skip>.
Prints <width> bytes per line in groups of size <group>.
Prints the ascii on the right unless <-a> specified

Default is all bytes of $stdin in 16/2 format.

USAGE
begin
width=3D16
group=3D2
skip=3D0
length=3DFloat::MAX
do_ascii =3D true
file =3D $stdin

while (opt=3DARGV.shift)
if opt[0]=3D=3D?-
case opt[1]
when ?n
length=3DARGV.shift.to_i
when ?s
skip=3DARGV.shift.to_i
when ?g
group =3D ARGV.shift.to_i
when ?w
width =3D ARGV.shift.to_i
when ?a
do_ascii =3D false
else
raise ArgumentError,"invalid Option #{opt}"
end
else
file =3D File.new(opt)
end
end

n=3D0
ascii=3D''
file.read(skip)
file.each_byte{|b|
if n%width =3D=3D 0
print "%s\n%08x "%[ascii,n+skip]
ascii=3D'| ' if do_ascii
end
print "%02x"%b
print ' ' if (n+=3D1)%group=3D=3D0
ascii << "%s"%b.chr.tr('^ -~','.') if do_ascii
break if n>length
}
puts ' '*(((2+width-ascii.size)*(2*group+1))/group.to_f).ceil+ascii
#this is probably the most complicated line
#it pads out the line to get the remaining ascii to align:
# (2+width-ascii.size) is the number of bytes missing (the 2 is for the '=
| ')
# *(2*group+1) is the width of a group of bytes with the space
# /group.to_f divides by the number of groups
# .ceil rounds up, otherwise we misalign on partial groups

rescue =3D>x
puts USAGE, "ERROR: #{x}"
end
 
M

Mikael Høilund

I can't resist golf: I got Martin's solution down to 95 bytes (If you
take out the ascii column it's down to 71).

i=3D0;$<.read.scan(/.{0,16}/m){puts"%08x0 "%i+$&.unpack('H4'*8)*' '+' = |
'+$&.tr('^ -~','.');i+=3D1}

That's pretty neat! I'd totally forgotten about that trick. The way =20
you handle the counter is ;)-ish ;)

--=20
# Mikael H=F8ilund
def method_missing(m, a=3D0) a +
m.to_s[/[a-z]+/].size * 2; end
p What is the meaning of life?
 
M

Michael Morin

Matthew said:
## hexdump (#171)

_Quiz idea provided by Robert Dober._

This week's quiz should be quick and easy for experienced Rubyists,
and a good lesson for beginners. Your task this week is to write a
utility that outputs a hex dump of the input.

I did something a little different, I made a module that can be used to
extend IO objects. This means you can extend any File or socket objects
to become hex writers. Since I don't think you can "un-extend" an
object, it would probably be best if you dup the IO object if you need
to switch between hex and normal output.

#!/usr/bin/env ruby
# UziMonkey <[email protected]>

module HexWriter
def self.extend_object(o)
class << o
alias_method :eek:ld_write, :write
end

super
end

def write(s)
s.each_byte do|b|
if @bytes % 16 == 0 and @address != 0
end_line
new_line
end

write_byte b
end
end

def new_file
@address = 0
new_line
end

def end_line
old_write " " * (16 - @bytes)
old_write " #{@ascii}\n"
end

def new_line
@bytes = 0
@address ||= 0
@ascii = ""

old_write "%08x" % @address
end

def write_byte(b)
old_write " %02x" % b

@ascii << ((b.chr =~ /[[:print:]]/).nil? ? '.' : b.chr)

@bytes += 1
@address += 1
end
end

hex = STDOUT.dup.extend HexWriter
ARGV.each do|f|
puts "File: #{f}"
hex.new_file
hex.write File.read(f)
hex.end_line
end

--
Michael Morin
Guide to Ruby
http://ruby.about.com/
Become an About.com Guide: beaguide.about.com
About.com is part of the New York Times Company
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads


Members online

No members online now.

Forum statistics

Threads
473,961
Messages
2,570,131
Members
46,689
Latest member
liammiller

Latest Threads

Top