I want to set up several Benchmark Tests for PowerPC Processors using
Altivec.
The MP-Scheduler Benchmark works for Core2 Duo and on PS3 (without
Altivec)
fine.
But how can i set up the Benchmark for PowerPC Processors (e.g. Cell BE)
using the Altivec Extensions?
There are no options given to use Altivec, or is there a special
benchmark-code?
On Fri, Apr 30, 2010 at 09:08:59AM +0200, matty wrote:
I want to set up several Benchmark Tests for PowerPC Processors using
Altivec.
The MP-Scheduler Benchmark works for Core2 Duo and on PS3 (without Altivec)
fine.
But how can i set up the Benchmark for PowerPC Processors (e.g. Cell BE)
using the Altivec Extensions?
There are no options given to use Altivec, or is there a special
benchmark-code?
I’m pretty sure that it uses Altivec by default on PPC. (The only
hand coded Altivec code is in the gr_fir_fff filter.)
here is my mp-benchmark of PS3! Looks a little bit deformed at pipe 6.
I’m
running at the moment another test, to check out.
OK, i think this is with altivec. But now, how can i perform this test
without altivec like here: http://gnuradio.org/images/perf-data-images/ps3.png
Is it generally possible to use qa_fft.py or the CGRAN gcellized FFT as
offloaded portion,
because i want to analyse the benefits of the SPEs on some GNU Radio
code.
Therefore a benchmark result like this ( http://gnuradio.org/redmine/attachments/104/R-10231-ps3-20090115-0226.png)
would be convincing, because you can see the linear dependence between
speedup and # of spe’s.
The cgran version shows the speedup pretty well, but you need to be
using big FFTs to see the win, where big >= 4096 points.
I laugh heartily at your puny FFTs of only 4096 points
I regularly do FFTs with 1Hz resolution over bandwidths of several MHz
with Gnu Radio.
They run in “real time” on a reasonably-snappy “normal” CPU. But I
guess I could be
doing them on a GPU at some point
Something that requires big crunchies in my application space is
coherent de-dispersion, which requires
the construction of a (usually largish) complex FFT filter. Seems
that might benefit from
GPU speedup. Right now, I have to decide whether I want an RFI filter
(an FFT notch filter
basically, and it doesn’t have to be all that long), or a
de-dispersion filter (generally much
longer). But oh boy, can I have both, please?
Actually, I’m probably going to start playing with a Phenom II X6 1090T
some time this summer, and
I may get back enough crunchies to do both coherent de-dispersion and
RFI notch filtering in
real time. Yay!
–
Marcus L.
Principal Investigator
Shirleys Bay Radio Astronomy Consortium
On Wed, May 05, 2010 at 06:50:36PM +0200, matty wrote:
Is it generally possible to use qa_fft.py or the CGRAN gcellized FFT as
offloaded portion,
because i want to analyse the benefits of the SPEs on some GNU Radio code.
Therefore a benchmark result like this ( http://gnuradio.org/redmine/attachments/104/R-10231-ps3-20090115-0226.png)
would be convincing, because you can see the linear dependence between
speedup and # of spe’s.
Best Regards
Matty
The cgran version shows the speedup pretty well, but you need to be
using big FFTs to see the win, where big >= 4096 points.
Eric
This forum is not affiliated to the Ruby language, Ruby on Rails framework, nor any Ruby applications discussed here.