Another quest for speed

michael.goossens · Jan 28, 2008

BBox::Expand(float delta){
Vector d = Vector(delta, delta, delta);
p_min -= d;
p_max += d;
}

BBox::Expand(float delta){
p_min -= Vector(delta, delta, delta);
p_max += Vector(delta, delta, delta);
}

Logically the first method would only make 1 Vector object and the
second 2, so method two would take more cpu instructions? Or does c++
do some intern stuff to optimise this?

Alf P. Steinbach · Jan 28, 2008

* (e-mail address removed):

BBox::Expand(float delta){
Vector d = Vector(delta, delta, delta);
p_min -= d;
p_max += d;
}

BBox::Expand(float delta){
p_min -= Vector(delta, delta, delta);
p_max += Vector(delta, delta, delta);
}

Logically the first method would only make 1 Vector object and the
second 2, so method two would take more cpu instructions? Or does c++
do some intern stuff to optimise this?

Depends on your compiler and compiler settings.

Measure.

If it matters.

Cheers, & hth.,

- Alf

Jensen Somers · Jan 28, 2008

BBox::Expand(float delta){
Vector d = Vector(delta, delta, delta);
p_min -= d;
p_max += d;
}

BBox::Expand(float delta){
p_min -= Vector(delta, delta, delta);
p_max += Vector(delta, delta, delta);
}

Logically the first method would only make 1 Vector object and the
second 2, so method two would take more cpu instructions? Or does c++
do some intern stuff to optimise this?

Any C++ compiler will always try to optimize things as best as it can.
Check the documentation of your compiler to see what and how it
optimizes different things.

- Jensen

Phil Endecott · Jan 28, 2008

BBox::Expand(float delta){
Vector d = Vector(delta, delta, delta);
p_min -= d;
p_max += d;
}

BBox::Expand(float delta){
p_min -= Vector(delta, delta, delta);
p_max += Vector(delta, delta, delta);
}

Logically the first method would only make 1 Vector object and the
second 2, so method two would take more cpu instructions? Or does c++
do some intern stuff to optimise this?

It's really easy to measure this sort of thing: write the code (looks
like you've already done it), add a simple main() that calls it a few
zillion times, compile with all available optimisations enabled, and
measure execution time. That will give you a more accurate answer for
your compiler and hardware than all the people here can offer, and it's
quicker.

FWIW, my guess is that as long as the caller can see Vector's
constructor and BBox::Expand and the rest inline, then you'll get
essentially optimal code from both.

If you're coding for an x86 system and performance is vital, then you
may like to investigate using SIMD instructions for this sort of thing,
i.e. processing the x, y and z components in parallel. How to do that
is off-topic for this group, but it's likely to make more impact than
tweaking the details of the C++ coding style.

Phil.

Weird Behavior with Rays in C and OpenGL	4	Feb 13, 2024
GET NEIL DEGRASSES TYSON, I ripped a hole with this one...	0	Nov 10, 2022
Whose fault is my problem?	9	Aug 4, 2008
An idea for heap allocation at near stack allocation speed	14	Feb 13, 2011
speed versus OO	8	Jan 26, 2008
PC configuration for fastest compiles (synthesis, place and route,etc)	1	Feb 15, 2008
[SUMMARY] Texas Hold'Em (#24)	0	Mar 24, 2005
Generic programming in C	46	Apr 17, 2010

Another quest for speed

michael.goossens

Alf P. Steinbach

Jensen Somers

Phil Endecott

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads