Newbie Questions: Swithing from Perl to Python

T

Todd Stephens

This isn't Perl, everything is a 'my' variable unless you explictly ask
otherwise, so there's no need to restate that fact in your variable
names.
Having 'my' as a prefix to every name is bad style in Python.

Thanks. Normally I name variable by what they are, I was using this as a
method for them to be easily read (and typed) for example purposes.
 
A

Aahz

What do you mean by "a static method on the list type"? Will I be able
to do:

for key in myDict.keys().sorted():
print key

Nope:

for key in list.sorted(myDict.keys()):
print key
If that's what you're talking about, there's an obvious downside, which
is that now we'll have list.sort() and list.sorted() which do two
different things. This will be confusing.

Yup. I pointed that out; I preferred copysort(), but the fact that you
have to actually use the list object to access the method separates the
namespace at least. I'm -0 on the current name, but Guido likes it.
Is there a PEP on this I could read? A quick look at the PEP index
didn't show anything that looked appropos.

No, this was considered a small enough addition to warrant restricting
the discussion to python-dev. If other people pipe up with objections,
I'll forward that back to python-dev.
I certainly understand the efficiency aspects of in-place sorting, but
this has always seemed like premature optimization to me. Most of the
time (at least in the code I write), the cost of an extra copy is
inconsequential. I'll be happy to burn a few thousand CPU cycles if it
lets me avoid an intermediate assignment or a couple of extra lines of
code. When things get too slow, then is the time to do some profiling
and figure out where I can speed things up.

It's not so much that as that it's much easier to get a copying sort (by
making your own copy) than it is to get a non-copying sort. The
efficiency issue here is less the CPU cycles than the RAM, and people
*do* sort lists with thousands or millions of elements. Python is in
many ways about keeping people from shooting themselves in the "worst
case".
 
A

Alex Martelli

Roy Smith wrote:
...
Actually a classmethod -- only relevant if you subclass list, e.g.:

class mylist(list):
''' whatever tweaks '''

foo = mylist.sorted(bar)

now foo will be an instance of mylist, NOT just of list. Marginal issue, of
course.
What do you mean by "a static method on the list type"? Will I be able
to do:

for key in myDict.keys().sorted():
print key

Nope. You will, if you insist, be able to code:

for key in list.sorted(myDict.keys()):
print key

making a silly and useless copy of the keys list. Or, you will be able to
avoid the useless copy by coding list.sorted(myDict) or also, if you prefer,
list.sorted(myDict.iterkeys()).

You can bind srt=list.sorted somewhere in your code if you want more
concision, but it will still be called as srt(somesequence), just as, say,
len(somesequence), NOT with method syntax on somesequence.
and get what I expect? If so, then I think that's the behavior that
most people have found wanting in the current implementation and thus
will make a lot of people happy. It certainly will make me happy :)

The list.sorted classmethod is more powerful than an instance method
of lists would be: it will let you build a sorted list out of any iterable
at all, AND support smoothly and transparently _subclasses_ of list.
If that's what you're talking about, there's an obvious downside, which
is that now we'll have list.sort() and list.sorted() which do two
different things. This will be confusing.

Keeping one as an instance method and the other as a classmethod
should ease things. I.e., in practice, the former will always be called
as somelist.sort(), the latter as list.sorted(somesequence).

In theory, classmethods do allow somewhat confusing use, e.g.:

myDict = {}
another = myDict.fromkeys(range(7))

this leaves myDict untouched and doesn't even look into it, except
for ascertaining its class. Apparently, people aren't actually prone
to falling into such confusions (haven't seen _any_ occurrence wrt
fromkeys either here, on (e-mail address removed), on python-dev, or any
other venue where I help, advise and teach).
Is there a PEP on this I could read? A quick look at the PEP index
didn't show anything that looked appropos.

Unfortunately I don't think a PEP number has been assigned (I do
hope Raymond Hettinger, who's developing the patch for this as
well as other useful tweaks to the sort method for 2.3, has started the
PEP-number-request procedure).
I certainly understand the efficiency aspects of in-place sorting, but
this has always seemed like premature optimization to me. Most of the
time (at least in the code I write), the cost of an extra copy is
inconsequential. I'll be happy to burn a few thousand CPU cycles if it
lets me avoid an intermediate assignment or a couple of extra lines of
code. When things get too slow, then is the time to do some profiling
and figure out where I can speed things up.

But the "couple of extra lines of code" is an even more trivial price to pay
than the cost of a copy, in Python's general worldview. The key is
consistency: all mutating methods of lists and dicts work inplace, and do
NOT return a list or dict (neither the original nor a copy) because Guido
does NOT like "chaining" -- such methods return None except when they
have a specifically useful item to return, such as the .pop methods.

Having .sort specified in a subtly different manner than .reverse, .update,
..extend etc would be inconsistent and harder to remember. Specifying
them all to work in-place and return the original list would encourage
chaining, which Guido definitely does NOT want (saving a couple of
lines of code by making everything much denser is NOT what Python
is all about: "sparse is better than dense" is a Pythonic principle).
Having them all make a preliminary copy and return the mutated copy
would be nicely functional, but unacceptable for all trivial operations
such as reverse -- slowdowns of over 100% are NOT within the realm
of "premature optimization" accusations (some operations would even
go up in big-O terms -- eek). AND it would NOT let you "speed things
up" if the only way to e.g. extend a list copied it (try a loop of
a.extend(b) vs one of a = a + b over a few thousand sublists b to
get an idea...).

So, I disagree with your characterization of having all mutators work
in-place and NOT return the mutated object as "premature optimization".
It's a design that nicely achieves consistency, simplicity, and promotion
of Guido's favourite sparse-is-better-than-dense style in the "one obvious
way to do it" mold.

So, now (actually for 2.4 -- I predict more than a year, so don't hold your
breath) we get to try and allow a hopefully controlled amount of style
variation towards somewhat higher "conceptual" density -- what has
been described (not technically accurately, but suggestively) as "a more
declarative style", suitable to be read as
"for k ranging over the sorted list corresponding to thedict,"...
rather than as
"get the list of thedict's keys; sort it; now, for k ranging over that
list,"...
which "feels" more imperative, stylistically speaking, because it is a
sequence of obviously-imperative steps rather than an expression
conjoining "not obviously imperative" sub-steps. It's a fine thread to
walk, since Python's roots ARE deeply imperative, of course;-).


Alex
 
A

Alex Martelli

Luther Barnum wrote:
...
really. It's just that I read that you cannot change a dictionary value

I suspect that may be a misreading of something else: "you cannot
use a mutable value as a dictionary key" (actually, you cannot use
a *non-hashable* value -- mutable values can still be hashable, but
normally they shouldn't be, unless the mutable part is just some kind
of implementation detail, such as a cache, not relevant from the
point of view of checking if an object "equals" another... but, this is
somewhat of a quibble:).

Basically, this means you can't use (e.g.) a list, a dict, or a file object
as the key in a dictionary. You CAN use a tuple (with hashable items), a
string, a number, and so on, since in Python those are immutable.


Alex
 
J

John J. Lee

Alex Martelli said:
In theory, classmethods do allow somewhat confusing use, e.g.:

myDict = {}
another = myDict.fromkeys(range(7))

this leaves myDict untouched and doesn't even look into it, except
for ascertaining its class. Apparently, people aren't actually prone
to falling into such confusions (haven't seen _any_ occurrence wrt
fromkeys either here, on (e-mail address removed), on python-dev, or any
other venue where I help, advise and teach).
[...]

That might be because people just don't know about it's existence yet.
I didn't, until you mentioned it.


John
 
H

Harry George

Luther Barnum said:
I am a new Python programmer and I am having a few difficulties. I love Perl
and I am just trying to learn Python because it is used heavily at work. It
looks pretty cool so I am diving in. I'm sure they are easy but I not sure
how to proceed.

1. How can I run a program and modify the output on the fly then send it to
standard output.

Example in Perl:

ex. open(LS_PIPE, "/usr/bin/ls |");
while(<LS_PIPE>) {
s/this/that/g;
print;
}
close(LS_PIPE);


2. How can I sort and print out a hash.

Example in Perl:

ex. foreach $string (sort keys %hash) {
print("$string = $hash{$string}\n");
}

In Perl these are very easy tasks, but I am finding it a little difficult to
understand.

Like most perl-to-python migrants, yoiu are still thinkiing in perl.
Here is a module to show how to do the std sorts of things:
http://www.seanet.com/~hgg9140/comp/pyperlish-1.2/doc/manual.html
available at:
http://www.seanet.com/~hgg9140/comp/index.html
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,169
Messages
2,570,919
Members
47,458
Latest member
Chris#

Latest Threads

Top