Pyrex list/array

J

Jim Lewis

Jun 3, 2006

#1

I'm trying to move a function into pyrex for speed. The python side
needs to pass a list to the pyrex function. Do I need to convert to
array or something so pyrex can generate tight code? I'm not clear how
to do this.

J

John Machin

Jun 4, 2006

#2

I'm trying to move a function into pyrex for speed.

You probably didn't expect the Inquisition; nobody does. But here it is,
nice red uniforms and all:

1. What is your speed requirement and how far short of that are you at
the moment?
2. Are you sure there is no Python or third-party module that does what
you want?
3. Is your algorithm the best possible?
4. Is your Python implementation of that algorithm the best possible?
Have you exposed it to the critical gaze of the speed-freaks in this
newsgroup?
5. Does your architecture support psyco? If so, have you tried that and
what were the results?

The python side
needs to pass a list to the pyrex function. Do I need to convert to
array or something so pyrex can generate tight code? I'm not clear how
to do this.

The question might be better asked on the Pyrex mailing list.

You don't need to convert a list to a C array, and it may not even be
possible, depending on what type(s) of data you have in the list.

Almost any Python code is also valid Pyrex code. For a start, just
compile your function with Pyrex and compare the speed. What you do next
is going to depend very much on what operations you are performing on
the list and the objects it contains. Watch out for Python built-ins
like range, xrange, ord, chr, abs, bool, int(a_number), float(a_number),
divmod, max/min (two_numeric_args). In almost all cases you get cheap
wins by replacing use of these by simple C-like code -- provided of
course you are absolutely sure you know what types you are dealing with.

HTH,
John

J

Jim Lewis

Jun 4, 2006

#3

Thanks for your comments.

You probably didn't expect the Inquisition...

Correct ;-)

1. What is your speed requirement and how far short of that are you at the moment?

~10 times faster.

2. Are you sure there is no Python or third-party module that does what you want?
Yes.

3. Is your algorithm the best possible?

I think so although of course one can never be certain.

4. Is your Python implementation of that algorithm the best possible? Have you exposed it to the critical gaze of the speed-freaks in this newsgroup?

Thanks for the good suggestion but I want to try pyrex first.

5. Does your architecture support psyco? If so, have you tried that and what were the results?

Already using psyco.

The question might be better asked on the Pyrex mailing list.

I did not find it - where is it?

Almost any Python code is also valid Pyrex code. For a start, just compile your function with Pyrex and compare the speed.

It's slower.

What you do next is going to depend very much on what operations you are performing on the list and the objects it contains.

Simple list of ints. Comparing sections of lists between each other.

S

skip

Jun 4, 2006

#4

Jim> Already using psyco.

Is it substantially faster with psyco than without? If psyco is performing
its magic on the critical section of code already, you are going to lose
that when switching to Pyrex.

Skip

J

Jim Lewis

Jun 4, 2006

#5

Is it substantially faster with psyco than without? If psyco is performing

its magic on the critical section of code already, you are going to lose
that when switching to Pyrex.

Yes but from what I read Pyrex can be a lot faster than psyco under the
right circumstances.

S

skip

Jun 4, 2006

#6

Jim> Yes but from what I read Pyrex can be a lot faster than psyco under
Jim> the right circumstances.

I'm sure that's true. That also means under the wrong circumstances it
might not. ;-) Can you post the code that's running too slowly?

Also, note that psyco learned some new tricks at the recent NeedForSpeed
sprint. You might want to check out the latest version from Subversion and
give it a whirl.

Skip

J

John Machin

Jun 4, 2006

#7

Thanks for your comments.

Correct ;-)

Nobody does

I did not find it - where is it?

Evidently somewhere near the Hall of the Mountain King. A reference to
it is cunningly concealed in the last place one would think of finding
it: under the heading "Mailing List" on the Pyrex home page

Here:
http://lists.copyleft.no/mailman/listinfo/pyrex ... get in quick before
the pirate moves it again.

It's slower.

Simple list of ints. Comparing sections of lists between each other.

Do you mean alist[x:x+n] == alist[y:y+n] ?
If so, that's creating two new lists each of size n, and then comparing
those two lists. I doubt that psyco would recognize that it didn't need
to copy the two slices. The first step might be to write functions to
compare without copying, e.g.:

def py_slices_cmp_eq(py_list, start1, start2, size):
"""Return 1 if py_list[start1+size] == py_list[start2+size]
else 0"""
offset = start2 - start1
for i in xrange(start1, start1+size):
if py_list != py_list[i+offset]:
return 0
return 1

See what psyco makes of that.
Then turn that into a cdef function for Pyrex.

If that's still not fast enough, then you might be in for some harder work:

Allocate memory for a C array, unpack your list into it, write
comparison functions c_slices_cmp_* that operate on your array of ints.
There should be no Python stuff in there, only C constructs. You can
even use memcmp() for the cmp_eq function.

Which brings us back to your original question "Do I need to convert to
array or something so pyrex can generate tight code?" ...
1. Some of the above may help you to determine whether you need to.
2. Without any knowledge of the size of your list or what you are doing,
we can't help you much more on whether you need to.
3. AFAICT, Pyrex doesn't do much in the way of optimisation, leaving
that up to the C compiler. Generating tight code would depend more on
you replacing appropriately-chosen Pythonisms with C-isms.

As for *how* to make your C array, something like this:

cdef extern from "Python.h":
void PyMem_Free(void *p)
void* PyMem_Malloc(int n) except NULL
# untested -- compiles OK
cdef int * int_array_from_list(object ilist):
cdef int n, i
cdef int * arrayp
n = len(ilist)
arrayp = <int *>PyMem_Malloc(n * sizeof(int))
for i from 0 <= i < n:
arrayp = ilist
return &arrayp[0]

Hoping some of this helps,
John

J

Jim Lewis

Jun 4, 2006

#8

cunningly concealed in the last place one would think of finding it: under the heading "Mailing List" on the Pyrex home page

Hmmm - maybe I should try the scroll bar occassionally ;-)

Do you mean alist[x:x+n] == alist[y:y+n] ?

OK, probably you an Skip are right - let's see if I missed something at
the Python level.

There are essentially two differences from your snip above. I am trying
to compute n and there are multiple (under 10) lists. Size of lists are
typically under 100 ints.

...See what psyco makes of that.

I'm doing a similar straightforward loop approach but it's too slow.

Jim

Pyrex: problem with blanks in string	1	Jun 4, 2007
Pyrex char *x[]	0	May 15, 2007
Opinion on Pyrex	4	Sep 30, 2005
Lack of += and -= operators in Pyrex?	0	Jun 7, 2007
Copy string from 2D array to a 1D array in C	1	Nov 1, 2023
ANN: Pyrex 0.9.8.3	0	Jun 8, 2008
ANN: Pyrex 0.9.5.1a	0	Feb 1, 2007
ANN: Pyrex 0.9.8.1	0	May 17, 2008

Jim Lewis

John Machin

Jim Lewis

skip

Jim Lewis

skip

John Machin

Jim Lewis

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads