for_some(), for_all()?

aurora · Sep 22, 2004

I am trying to test a list to see if some or all elements pass a test. For
example I can use reduce() to test if any element is 0.

lst = [1,2,0,3]
test = lambda x: x == 0
reduce(operator.__or__,[test(x) for x in lst])

Click to expand...

Click to expand...

True

However I bet reduce() does not exploit the short circuit logic. Also it
is a little clumsy to create another list to pass into reduce. Is there
some equivalent of

for_some(test, lst)
or
for_all(test, lst)?

Larry Bates · Sep 22, 2004

Use list comprehension and then test for what you want:

for_some

result=[x for x in lst if <insert your test here>]

if result:
#
# At least some of the items passed the test
#
else:
#
# None passed
#

for_all

result=[x for x in lst if <insert your test here>]

if len(result) == len(lst):
#
# All items passed the test
#
else:
#
# At least one didn't pass
#

I think it is easier to read and maintain.

Larry Bates

aurora said:
I am trying to test a list to see if some or all elements pass a test. For
example I can use reduce() to test if any element is 0.

lst = [1,2,0,3]
test = lambda x: x == 0
reduce(operator.__or__,[test(x) for x in lst])

Click to expand...

Click to expand...

True

However I bet reduce() does not exploit the short circuit logic. Also it
is a little clumsy to create another list to pass into reduce. Is there
some equivalent of

for_some(test, lst)
or
for_all(test, lst)?

Michael Hoffman · Sep 22, 2004

aurora said:
However I bet reduce() does not exploit the short circuit logic. Also
it is a little clumsy to create another list to pass into reduce. Is
there some equivalent of

for_some(test, lst)
or
for_all(test, lst)?

There are some recipes for any() and all() in the itertools documents
that do what you want:

http://www.python.org/doc/current/lib/itertools-example.html
.... "Returns True if pred(x) is True at least one element in the
iterable"
.... return True in imap(pred, seq)
....

>>> def equals_0(x): return x == 0 ....
>>> items = range(500000)
>>> timeit.Timer("reduce(operator.__or__, [equals_0(x) for x in

Click to expand...

Click to expand...

items])", setup="from __main__ import operator, equals_0, items").timeit(3)
4.1040000915527344itertools, any, equals_0, items").timeit(3)
....itertools, any, equals_0, items").timeit(3) # short-circuit
0.0

>>> del items[0]
>>> timeit.Timer("any(items, equals_0)", setup="from __main__ import

Click to expand...

Click to expand...

itertools, any, equals_0, items").timeit(3) # no short-circuit, still faster
1.6640000343322754

Alex Martelli · Sep 22, 2004

aurora said:
I am trying to test a list to see if some or all elements pass a test. For
example I can use reduce() to test if any element is 0.

lst = [1,2,0,3]
test = lambda x: x == 0
reduce(operator.__or__,[test(x) for x in lst])

Click to expand...

Click to expand...

True

However I bet reduce() does not exploit the short circuit logic. Also it
is a little clumsy to create another list to pass into reduce. Is there
some equivalent of

for_some(test, lst)
or
for_all(test, lst)?

Simplest:

def for_some(test, lst):
for item in lst:
if test(item): return True

def for_all(test, lst):
for item in lst:
if not test(item): return False
return True

You can play tricks, but you can't beat the simplicity of these, and
even with the cleverest tricks you can't beat their speed by much. And
simplicity is a great quality for software to have.

Alex

Raymond Hettinger · Sep 23, 2004

[Michael Hoffman]

There are some recipes for any() and all() in the itertools documents
that do what you want:

http://www.python.org/doc/current/lib/itertools-example.html

... "Returns True if pred(x) is True at least one element in the
iterable"
... return True in imap(pred, seq)

I chose that one for the docs because it gave the best balance of clarity and
speed.

For pure speed, the following is faster and gives short-circuit behavior:
.... for elem in ifilter(pred, seq):
.... return True
.... return False

Raymond Hettinger

Michael Hoffman · Sep 23, 2004

Raymond said:
For pure speed, the following is faster and gives short-circuit behavior:

... for elem in ifilter(pred, seq):
... return True
... return False

Nice! I've always found the itertools examples exceptionally useful. Is
there a reason why they aren't in the standard library? I find myself
copying and pasting the snippets all the time.

Michele Simionato · Sep 23, 2004

Raymond Hettinger said:
I chose that one for the docs because it gave the best balance of clarity and
speed.

I always wondered why "any" and "all" are given as recipes but are not
part of the itertools module. What is the rationale?
I think they are so common that they deserve to be in the module.
At least, we would have standard names.

Michele Simionato

Steven Bethard · Sep 23, 2004

Raymond Hettinger said:
For pure speed, the following is faster and gives short-circuit behavior:

... for elem in ifilter(pred, seq):
... return True
... return False

So, the other one also has short-circuit behavior:
.... return True in imap(pred, seq)
........ return x > 3
....[5, 6, 7, 8, 9]

Could you explain what makes the second one faster? I'm guessing it's
something like that comparing True to each element in the iterable costs more
than binding each element in the iterable to elem...?

Thanks,

STeve

Raymond Hettinger · Sep 23, 2004

[Michael Hoffman]

Nice! I've always found the itertools examples exceptionally useful. Is
there a reason why they aren't in the standard library? I find myself
copying and pasting the snippets all the time.

[Michele Simionato]

I always wondered why "any" and "all" are given as recipes but are not
part of the itertools module. What is the rationale?
I think they are so common that they deserve to be in the module.
At least, we would have standard names.

Random thoughts on itertools recipes and module design:

* The recipes are formatted to make it easy to paste the whole thing into a
module (it's already there for the taking).

* Adding more tools to the basic set makes the module itself harder to learn. I
theorize that the difficulty of creating a new composite tool grows
exponentially with the number of available fundamental building blocks.

* The recipes have as much value as a teaching tool as they do for practical
applications. Study the page of recipes and you've mastered itertools for life.

* Once in the standard library, the API is set in concrete. But as part of the
docs, I'm free to backport, improve variable names, improve function names,
substitute alternate approaches, and drop recipes that don't prove themselves in
battle.

* Publishing the recipes is a step toward standardizing the naming and
collecting best practices. IOW, we already get some of the benefits without
having to include the functions in the standard library.

* If included in the library, some of the functions would grow more complex.
For example, the pred=bool default needs to be followed by "if pred is bool:
pred=None" to take advantage of the ifilter's faster internal call to bool.

* Until now, there has been no popular demand. If the demand grows, I'll oblige
and include some of them in Py2.5. I prefer to be careful. Being conservative
about what goes in the module has contributed substantially to its success.

* Some derived tools like padnone() arise infrequently and are rarely the right
solution for a problem (not many functions respond well to a data stream where
one field suddenly starts yielding None). This function takes more than a
modicum of skill to use correctly. It is there to show how it is done and to
provide an iterator solution for folks relying on map()'s None fill-in feature.
IOW, it is not deserving of being in the standard library.

* Leaving the derived tools as recipes creates a sense that that more
combinations are open for invention. Putting them in the module suggests that
the problem is closed. This of course isn't strictly true, but it does affect
the mood.

Raymond Hettinger

Raymond Hettinger · Sep 24, 2004

For pure speed, the following is faster and gives short-circuit behavior:
[Steven Bethard]

So, the other one also has short-circuit behavior:

Right. It was the reduce(operator.__or__) version that didn't short-circuit.

... return True in imap(pred, seq)
...... return x > 3
...[5, 6, 7, 8, 9]

Could you explain what makes the second one faster? I'm guessing it's
something like that comparing True to each element in the iterable costs more
than binding each element in the iterable to elem...?

Close. The imap() version has two layers of iteration:

for elem in seq:
yield pred(elem)

for elem in imapresults:
if elem == True:
return True
return False

The ifilter() version has only one layer:

for elem in seq:
if pred(elem):
yield elem

Note, it make look like there is an outer layer in the ifilter() version, but
closer inspection shows that it never runs more than one iteration:

for elem in ifilterresults:
return True # because this returns, the for never loops back
return False

Raymond Hettinger

Alex Martelli · Sep 24, 2004

Raymond Hettinger said:
[Michele Simionato]

I always wondered why "any" and "all" are given as recipes but are not
part of the itertools module. What is the rationale?
I think they are so common that they deserve to be in the module.
At least, we would have standard names.

Click to expand...

Random thoughts on itertools recipes and module design:

[snipped lots of valuable considerations]

and one more, specific thing...: 'any' and 'all', like (say) other
potentially useful functions such as count, average, etc, are
_accumulators_ -- they _consume_ an iterator, producing a 'scalar'
result.

itertools is about filters and producers -- calling itertools.whatever
_returns_ an iterator.

I'd like a small module of accumulators. But putting accumulators into
itertools is the wrong placing. A general rule that helps users
remember what-goes-where, such as 'itertools stuff returns iterators',
is too precious to throw away lightly, IMHO.

Alex

List comprehension vs filter()	6	Apr 20, 2011
HMAC encription issue	4	Aug 31, 2013
Minimum Total Difficulty	0	Nov 15, 2023
"and" and "or" on every item in a list	7	Oct 29, 2007
After MultiIndex DataFrame object is created with additional information	2	Apr 4, 2023
Printing with interspersed element	5	Oct 30, 2008
Make 'Image X' spin 360deg on y-Axis until reset button clicked	1	Jan 22, 2023
Trying to build a SARIMAX model to forecast the S&P500 trend	0	Nov 5, 2023

for_some(), for_all()?

aurora

Larry Bates

Michael Hoffman

Alex Martelli

Raymond Hettinger

Michael Hoffman

Michele Simionato

Steven Bethard

Raymond Hettinger

Raymond Hettinger

Alex Martelli

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads