Solved

Parallel Computing: Clusters or Graphics?

Posted on 2008-06-09
11
235 Views
Last Modified: 2013-11-08
I'm working on my AI research, which could benefit greatly from parallel computing. As I'm sure there's someone here who knows more than I do on the different platforms, which one would bring better results? (I know I could use both... but time's not really on my side.)

- nVidia CUDA [w/ GF8800s]
- OpenMPI [w/ about 50-100 computers, running P4D]

Some pages linking to benchmarks would be useful.

[Sorry for the low point count, I ran out. :(]

Thanks!
0
Comment
Question by:holobyted
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 6
  • 5
11 Comments
 
LVL 69

Expert Comment

by:Callandor
ID: 21751070
One Tesla card supposedly can run at 518 gigaflops http://xtreview.com/addcomment-id-2756-view-Nvidia-Tesla-c870,D870-and-s870.html+tesla+nvidia+benchmarks&hl=en&ct=clnk&cd=2&gl=us, which is compared to the throughput of 40 x86 processors.  There is a 4-card version for servers that is that much more powerful.   Graphics cards are designed for parallel processing of textures and have a much higher transistor count than cpus, so it is not surprising that they can outperform general purpose processors for certain applications.
0
 

Author Comment

by:holobyted
ID: 21751739
What would the higher-end Tesla card compare to? Ie, one "normal" Tesla card compares to 40 x86 CPUs (which CPUs?), what would the other be?

0
 
LVL 69

Accepted Solution

by:
Callandor earned 50 total points
ID: 21752812
One Tesla card (c870) is 518 gigaflops, the d870 is two c870 cards and is over a teraflop, and the s870 is four c870 cards and is over 2 teraflops.  The system scales linearly with additional cards, so by extrapolation that means 80 x86 cpus and 160 x86 cpus, respectively.
0
Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

 

Author Comment

by:holobyted
ID: 21752976
How would 35 Pentium 4 D @ 2.00GHz compare? What would be the "rated" Xflops? Assuming peak performance.
0
 
LVL 69

Expert Comment

by:Callandor
ID: 21754526
0
 

Author Comment

by:holobyted
ID: 21754707
If I recall correctly, P4D's went up to 3.2GHz... According to Wikipedia though, (http://en.wikipedia.org/wiki/Pentium_D), you're right.

What would be the approx. flops be for such a cluster? I'll try getting in touch w/ the owner of the 35 CPUs so I can get a real speed value. (Running OpenMPI)
0
 
LVL 69

Expert Comment

by:Callandor
ID: 21760050
A single PentiumD 3.2 clocks in at about 600 megaflops, so 35 of them will be around 21 gigaflops.  The PentiumD cpus are much lower in performance than the newer Core2 cpus, easily trounced by even AMD's X2 offerings.
0
 

Author Comment

by:holobyted
ID: 21760797
Wow. That's actually pretty depressing... 35 systems can't even match up to one graphics card. Too bad CUDA is a pain to implement...
0
 
LVL 69

Expert Comment

by:Callandor
ID: 21761480
Modern graphics cards are very powerful, and the ability to use them in non-graphics applications is very nice.  Think about a $200 card giving you the power of 10 modern cpus - that's quite a good deal.
0
 

Author Comment

by:holobyted
ID: 21761524
Yeah, I know. What's the GFlops on a "normal" GF8800 though? The tesla is outstanding, but that's cause it's a "small supercomputer for your workstation."

0
 
LVL 69

Expert Comment

by:Callandor
ID: 21765742
It's about the same - 500 gigaflops: http://en.wikipedia.org/wiki/GeForce_8_Series#8800_GT, though I don't know if all of that is available for number crunching.
0

Featured Post

Independent Software Vendors: We Want Your Opinion

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
C++ Language error 28 278
User cannot delete a folder from the desktop. 3 75
intel CPUs suffix question.. 4 62
c++ aadding data to a list box vs2005 mfc project 7 7
I don't know if many of you have made the great mistake of using the Cisco Thin Client model with the management software VXC. If you have then you are probably more then familiar with the incredibly clunky interface, the numerous work arounds, and …
Windows 7 does not have the best desktop search built in. This is something Windows 7 users have struggled with. You type something in, and your search results don’t always match what you are looking for, or it doesn’t actually work at all. There ar…
The goal of the video will be to teach the user the difference and consequence of passing data by value vs passing data by reference in C++. An example of passing data by value as well as an example of passing data by reference will be be given. Bot…
The viewer will be introduced to the technique of using vectors in C++. The video will cover how to define a vector, store values in the vector and retrieve data from the values stored in the vector.

735 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question