iterator clone

Y

Yosifov Pavel

Whats is the way to clone "independent" iterator? I can't use tee(),
because I don't know how many "independent" iterators I need. copy and
deepcopy doesn't work...

--pavel
 
P

Peter Otten

Yosifov said:
Whats is the way to clone "independent" iterator? I can't use tee(),
because I don't know how many "independent" iterators I need. copy and
deepcopy doesn't work...

There is no general way. For "short" sequences you can store the items in a
list which is also the worst-case behaviour of tee().

What are you trying to do?

Peter
 
Y

Yosifov Pavel

There is no general way. For "short" sequences you can store the items in a
list which is also the worst-case behaviour of tee().

What are you trying to do?

Peter

I try to generate iterators (iterator of iterators). Peter, you are
right! Thank you. For example, it's possible to use something like
this:

def cloneiter( it ):
"""return (clonable,clone)"""
return tee(it)

and usage:

clonable,seq1 = cloneiter(seq)

...iter over seq1...
then clone again:

clonable,seq2 = cloneiter(clonable)

...iter over seq2...

Or in class:

class ReIter:
def __init__( self, it ):
self._it = it
def __iter__( self ):
self._it,ret = tee(self._it)
return ret

and usage:

ri = ReIter(seq)
...iter over ri...
...again iter over ri...
...and again...

But I think (I'm sure!) it's deficiency of Python iterators! They are
not very good...

--Pavel
 
P

Peter Otten

Yosifov said:
I try to generate iterators (iterator of iterators). Peter, you are
right! Thank you. For example, it's possible to use something like
this:

def cloneiter( it ):
"""return (clonable,clone)"""
return tee(it)

[snip]

That is too abstract, sorry. What concrete problem are you trying to solve
with your cloned iterators? There might be a way to rearrange your setup in
a way that doesn't need them.
But I think (I'm sure!) it's deficiency of Python iterators! They are
not very good...

Well, I think Python's iterators, especially the generators, are beautiful.
More importantly, I think there is no general way to make iterators
copyable, regardless of the programming language. The problem is that most
of the useful ones depend on external state.

Peter
 
Y

Yosifov Pavel

Well, I think Python's iterators, especially the generators, are beautiful.
More importantly, I think there is no general way to make iterators
copyable, regardless of the programming language. The problem is that most
of the useful ones depend on external state.

Peter

Hmm, but tee() de facto do it (clone iterator) and ignore side-effects
of iterator ("external" state). And tee() create independent
**internal** state of iterator (current position). But **external**
state - is headache of programmer. So, iterator/generator have to be
method for copy itself (the tee() implementation) or be "re-
startable". Why not?

Concrete problem was to generate iterators (iterator of slices). It
was solved with ReIter.

--Best regards,
--pavel
 
M

Marc 'BlackJack' Rintsch

Hmm, but tee() de facto do it (clone iterator) and ignore side-effects
of iterator ("external" state). And tee() create independent
**internal** state of iterator (current position).

`tee()` doesn't copy the iterator or its internal state but just caches
it's results, so you can iterate over them again. That makes only sense
if you expect to use the two iterators in a way they don't get much out of
sync. If your usage pattern is "consume iterator 1 fully, and then
re-iterate with iterator 2" `tee()` has no advantage over building a list
of all results of the original iterator and iterate over that twice.
`tee()` would be building this list anyway.
But **external** state - is headache of programmer. So,
iterator/generator have to be method for copy itself (the tee()
implementation) or be "re- startable". Why not?

Because it's often not possible without generating a list with all
results, and the advantage of a low memory footprint is lost.

Ciao,
Marc 'BlackJack' Rintsch
 
Y

Yosifov Pavel

`tee()` doesn't copy the iterator or its internal state but just caches
it's results, so you can iterate over them again. That makes only sense
if you expect to use the two iterators in a way they don't get much out of
sync. If your usage pattern is "consume iterator 1 fully, and then
re-iterate with iterator 2" `tee()` has no advantage over building a list
of all results of the original iterator and iterate over that twice.
`tee()` would be building this list anyway.

It's interesting and a concrete answer. Thanks a lot.
Because it's often not possible without generating a list with all
results, and the advantage of a low memory footprint is lost.

Ciao,
Marc 'BlackJack' Rintsch

Seems like "monada". But I think is possible to determine when there
is a bounded external state (side-effects) or not, may be is needed
some new class-protocol for it... or something else. Or another way:
iterators may be re-iterable always, but if programmer need to point
to the extra- (external) state, he has to raise some a special
exception in __iter)) method... OK, it's only fantasies about language
design :)

--pavel
 
B

bruno.desthuilliers

def cloneiter( it ):
"""return (clonable,clone)"""
return tee(it)

This might as well be written as

cloneiter = tee

Or yet better, just remove the above code and s/cloneiter/tee/g in the
remaining...
 
K

Kay Schluehr

cloning of iterators in this manner is bad, more good is to use one,
single list(my_iter) instead of (seehttp://aquagnu.blogspot.com/2008/07/self-repair-iterator-in-python.html).

This won't work for "big" iterators as mentioned by Peter Otten. With
this recipe you can't even clone generator objects ( which are
iterators ) that produce Fibonaccis in a lazy manner.

Regards, Kay
 
Y

Yosifov Pavel

Kay, can you show example of such generator? ReIter, for example, work
with usual generators.

But for "big" iterator, I think is no any good solutions. IMHO we can
discern 2 types of iterators: re-startable (based on internal Python
objects) and not re-startable (with an external state, side-
effects)...

Best regards, Pavel
 
M

Marc 'BlackJack' Rintsch

Kay, can you show example of such generator? ReIter, for example, work
with usual generators.

But for "big" iterator, I think is no any good solutions. IMHO we can
discern 2 types of iterators: re-startable (based on internal Python
objects) and not re-startable (with an external state, side-
effects)...

Has nothing to do with internal vs. external.
Examples: ``itertools.count(1)``, ``itertools.cycle(iterable)``, or

def fib():
a, b = 0, 1
while True:
yield a
a, b = b, a + b

Ciao,
Marc 'BlackJack' Rintsch
 
Y

Yosifov Pavel

Has nothing to do with internal vs. external.
Examples: ``itertools.count(1)``, ``itertools.cycle(iterable)``, or

def fib():
a, b = 0, 1
while True:
yield a
a, b = b, a + b

Ciao,
Marc 'BlackJack' Rintsch

Yes. So, I'm disconcerted: what Python "means" about iterators. Why
iterator's are not clonable in default?.. ``itertools.count(it) ``
"suppose" ``it`` will be restarted after count? So ``count(it)``
"suppose" ``it`` is restarable and therefore clonable (why not?!).
Generator is only function, so can be restarted/cloned. Generator
keeps "position" and can't be iterated again. But this position can be
reseted (main rule: if generator function has not side-effects/
external state, see below)!

Iterator, I think, is more general method (for naturally serial
access). For example, you can read values from some device (RS323 or
other) and represent it in program like iterator. In this case,
``__iter__()`` make some preparation, ``next()`` read next value from
RS232. In this case, iterator can't be restarted and really cloned. It
has external state (state of device) and can't to return it at start.
But when series of values are generated (are born) in the program: no
problem to have the feature of clone/restart iterator.

And is very interesting to research Icon iterators. Is possible to
create something in Python such theirs, for non-deterministic solving
purpose... :)

--pavel
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
474,222
Messages
2,571,137
Members
47,754
Latest member
Armand37T7

Latest Threads

Top