This is a yes or not question, please :
Is the content of jit.gen processed by GPU?
is it by CPU?
What I am doing is "unpacking " matrices and then performing matrix operations, then packing
again the matrix. I need it to be processed by GPU. Any suggestions?