need to use col-major tiles, e.g. BTAS::Tensor with col-major storage. This will allow also direct interoperation with ScaLAPACK.
need to use col-major tiles, e.g. BTAS::Tensor with col-major storage. This will allow also direct interoperation with ScaLAPACK.