Solved

Casting between two structs in C (CUDA)

Posted on 2010-08-12
Last Modified: 2012-05-10
The CUDA library effectively defines its own complex number data type, cuComplex, as follows:

  struct __builtin_align__(8) float2
  {
    float x, y;
  };
  typedef float2 cuFloatComplex;
  typedef cuFloatComplex cuComplex;

I am already using my own complex data type:

  typedef struct  { float re; float im; } complex;

and I wish to be able to cast between the two. Here's an example of what I'm currently trying:

  cuComplex cu_data;
  ...
  complex c_data;
  c_data = (complex)cu_data;

but I'm getting the (nvcc) error:

  'no suitable user-defined conversion from "cuComplex" to "complex" exists'.

So how do I do this? Equivalently: is there a way to do a typedef that simply renames the members of the struct (i.e. rename 'x' to 're' and 'y' to 'im')?

Thanks!
Question by:InteractiveMind
13 Comments
 
Expert Comment
by: jkr
You could provide your own operator, e.g.
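  // CUDA's definition, with the alignment attribute commented out
  // so that this compiles with a plain C++ compiler
  struct /*__builtin_align__(8)*/ float2
  {
    float x, y;
  };
  typedef float2 cuFloatComplex;
  typedef cuFloatComplex cuComplex;

  // Named struct, so that the conversion operator can be defined outside of it
  struct complex { float re; float im; operator cuComplex(); };

  // User-defined conversion from complex to cuComplex
  complex::operator cuComplex() {
    cuComplex out;

    out.x = re;
    out.y = im;

    return out;
  }

  int main () {
    cuComplex out;
    complex in = { 1.0f, 2.0f };

    out = (cuComplex) in;  // calls complex::operator cuComplex()

    return 0;
  }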

Accepted Solution
by: jkr (earned 167 total points)
Ooops, seems that I missed the 'C' part - well, in that case, make that a function:
  /* CUDA's definition, with the alignment attribute commented out */
  struct /*__builtin_align__(8)*/ float2
  {
    float x, y;
  };
  typedef struct float2 cuFloatComplex;
  typedef cuFloatComplex cuComplex;

  /* Plain C struct: no conversion operator here */
  typedef struct { float re; float im; } complex;

  /* Copies the members across one by one */
  cuComplex to_cuComplex(complex* in) {
    cuComplex out;

    out.x = in->re;
    out.y = in->im;

    return out;
  }

  int main () {
    cuComplex out;
    complex in = { 1.0f, 2.0f };

    out = to_cuComplex(&in);

    return 0;
  }

Expert Comment
by: sridhard

Hi,

Try using pointers. Both pointers will then refer to the same underlying values. Not sure whether this helps.

  cuComplex *cu_data = (cuComplex *) malloc(sizeof(cuComplex));
  ...
  complex *c_data;
  c_data = (complex *) cu_data;

Both pointers refer to the same allocation, so free it only once, through either pointer.
 
Author Comment
by: InteractiveMind
Thank you, both. I think I should mention that I am in fact casting pointers/arrays of such types in my code (but I get the same problem).

@jkr, would your method work for casting pointers? (I can't test it at the moment). If so, my code needs to be as fast as possible, so given that I'm going to need to do [probably] millions of such casts per second, would it be quicker to just do a typedef:

   typedef cuComplex complex;

and then a global search and replace of '.re' with '.x' etc? (I'm reluctant to do so)

Also, can you think of any reason why what I'm already doing wouldn't work? Is it the __builtin_align__(8) that's causing problems?
 
Expert Comment
by: jkr
>>@jkr, would your method work for casting pointers?

If you use the indirection appropriately, yes.
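
A minimal sketch of what that could look like for arrays, assuming the element-wise to_cuComplex function from the accepted solution. The cuComplex definition here is only a stand-in so that the sketch compiles without the CUDA headers, and to_cuComplex_array is a hypothetical helper name, not part of any library.

```c
#include <stddef.h>

/* Stand-in for CUDA's cuComplex (assumption: real code would get this
   type from the CUDA headers instead of defining it here). */
typedef struct { _Alignas(8) float x; float y; } cuComplex;

/* The asker's own type, as in the question. */
typedef struct { float re; float im; } complex;

/* Element-wise conversion through a pointer, as in the accepted solution. */
cuComplex to_cuComplex(const complex *in)
{
    cuComplex out;
    out.x = in->re;
    out.y = in->im;
    return out;
}

/* Hypothetical helper: converts n elements of in[] into out[]. */
void to_cuComplex_array(const complex *in, cuComplex *out, size_t n)
{
    for (size_t i = 0; i < n; ++i)
        out[i] = to_cuComplex(&in[i]);
}
```

This copies every element, so it costs one pass over the data per conversion, which is exactly the cost the typedef approach discussed in the thread avoids.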
 
Assisted Solution
by: Infinity08 (earned 167 total points)
>> given that I'm going to need to do [probably] millions of such casts per second, would it be quicker to just do a typedef:

Yes. Because no conversion would be needed then. Performing no conversion is faster than performing a conversion ;)


>> Also, can you think of any reason why what I'm already doing wouldn't work?

The error says it:

>>   'no suitable user-defined conversion from "cuComplex" to "complex" exists'.

The compiler doesn't know how to perform the conversion from cuComplex to complex, because you didn't tell it how to do that (see jkr's post for that).


>> I should mention that I am in fact casting pointers/arrays of such types in my code (but I get the same problem).

Casting pointers wouldn't give you the same problem. It would give you a different problem in C++: the type system won't implicitly convert a cuComplex* to a complex*, because the two types are not related, i.e. the conversion is not safe.
If you shut the compiler up and force the cast, then the __builtin_align__(8) can indeed cause a problem. If the packing of the complex type is not the same as that of the cuComplex type, then something will go badly wrong.
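
One way to guard against a packing mismatch is to let the compiler check the layout. The following is a sketch only: the cuComplex definition is a stand-in for the CUDA one, copy_to_cuComplex is a hypothetical helper name, and the checks use C11 _Static_assert. The bulk copy uses memcpy, which needs the two layouts to match but does not depend on the two types having the same alignment.

```c
#include <stddef.h>
#include <string.h>

/* Stand-in for CUDA's cuComplex (assumption: real code would use the
   definition from the CUDA headers). */
typedef struct { _Alignas(8) float x; float y; } cuComplex;

/* The asker's own type, as in the question. */
typedef struct { float re; float im; } complex;

/* Compilation fails here if the two layouts ever stop matching. */
_Static_assert(sizeof(complex) == sizeof(cuComplex), "size mismatch");
_Static_assert(offsetof(complex, re) == offsetof(cuComplex, x), "re/x offset mismatch");
_Static_assert(offsetof(complex, im) == offsetof(cuComplex, y), "im/y offset mismatch");

/* Hypothetical helper: copies n elements in one block.
   Only meaningful while the static checks above hold. */
void copy_to_cuComplex(cuComplex *dst, const complex *src, size_t n)
{
    memcpy(dst, src, n * sizeof *dst);
}
```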
 
Assisted Solution
by: ikework (earned 166 total points)
If you use the same alignment for your struct (8 bytes) then the cast shouldn't be a problem.
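
A sketch of that idea, under some assumptions: the cuComplex definition is a stand-in for the CUDA one, the alignment is requested with C11 _Alignas (with nvcc one would more likely use CUDA's __align__(8) on the struct), and as_cuComplex is a hypothetical helper name. Note that accessing one struct type through a pointer to a different struct type is formally outside what the C standard guarantees, even when the layouts match, so this relies on how compilers behave in practice.

```c
#include <stddef.h>

/* Stand-in for CUDA's cuComplex (assumption: real code would use the
   definition from the CUDA headers). */
typedef struct { _Alignas(8) float x; float y; } cuComplex;

/* Own type, now declared with the same 8-byte alignment. */
typedef struct { _Alignas(8) float re; float im; } complex;

/* Compilation fails if alignment or size ever differ. */
_Static_assert(_Alignof(complex) == _Alignof(cuComplex), "alignment mismatch");
_Static_assert(sizeof(complex) == sizeof(cuComplex), "size mismatch");

/* Hypothetical helper: reinterprets the pointer without copying anything. */
cuComplex *as_cuComplex(complex *p)
{
    return (cuComplex *)p;
}
```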
 
Expert Comment
by: ikework
The safest strategy is to use only cuda's struct in the first place, also in your code, so there is no cast needed anymore.
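
A sketch of how that could look while keeping the 're'/'im' naming, assuming a typedef plus two accessor macros. The macro names RE and IM and the complex_add function are hypothetical, and the cuComplex definition is again a stand-in for the CUDA one. C has no way to rename struct members through a typedef, so the macros are just one possible workaround.

```c
/* Stand-in for CUDA's cuComplex (assumption: real code would use the
   definition from the CUDA headers). */
typedef struct { _Alignas(8) float x; float y; } cuComplex;

/* Same type under the old name: no cast or conversion is ever needed. */
typedef cuComplex complex;

/* Hypothetical accessor macros that keep the re/im spelling. */
#define RE(z) ((z).x)
#define IM(z) ((z).y)

/* Example of existing-style code written against the accessors. */
complex complex_add(complex a, complex b)
{
    complex r;
    RE(r) = RE(a) + RE(b);
    IM(r) = IM(a) + IM(b);
    return r;
}
```

With this arrangement a complex value can be passed straight to anything that expects a cuComplex, at the price of replacing '.re' and '.im' with RE(...) and IM(...) in existing code.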
 
Expert Comment
by: ambience
>> so given that I'm going to need to do [probably] millions of such casts per second, would it be quicker to just do a typedef:

My vote -> YES
 
Expert Comment
by: ikework
@ambience, did you consider the different alignments as previous posts said?
 
Expert Comment
by: ikework
@ambience, sorry, I misunderstood your post. I thought you were suggesting to just cast anyway. Never mind :)
 
Expert Comment
by: soscpd
Hi

Your answer is a function pointer. There (in the function) you can read/write the cuComplex/complex structs any way you like.

Rafael
 
Author Closing Comment
by: InteractiveMind
Thanks