DPCT1086#

メッセージ#

__activemask() は 0xffffffff に置き換えられます。ソースコードの調整が必要な場合があります。

詳細な説明#

現在、SYCL* には、__activemask() と同等の機能はありません。コードにスレッドを非アクティブにするスロー制御がある場合、スレッドのロジックを書き直す必要があります。

例えば、以下のオリジナル CUDA* コードについて考えてみます。

   __device__ inline int SHFL_SYNC(unsigned mask, int val, unsigned offset, 
   unsigned w = warpSize) { 
     return __shfl_down_sync(mask, val, offset, w); 
   } 
 
   __global__ void kernel(int *array) { 
     unsigned int tid = threadIdx.x; 
     if (tid >= 8) 
       return; 
    unsigned mask = __activemask(); 
    array[tid] = SHFL_SYNC(mask, array[tid], 4); 
  }

このコードは、以下の SYCL* コードに移行されます。

   inline int SHFL_SYNC(unsigned mask, int val, unsigned offset, 
   const sycl::nd_item<3> &item_ct1, unsigned w = 0) { 
   /* 
   DPCT1023:0: The SYCL sub-group does not support mask options for 
   dpct::shift_sub_group_left.You can specify 
   "--use-experimental-features=masked-sub-group-operation" to use the 
   experimental helper function to migrate __shfl_down_sync.
   */ 
     if (!w) w = item_ct1.get_sub_group().get_local_range().get(0); 
  // This call will wait for all work-items to arrive which will never happen since only work-items with tid < 8 will encounter this call.   return dpct::shift_sub_group_left(item_ct1.get_sub_group(), val, offset, w); 
  } 
 
  void kernel(int *array, const sycl::nd_item<3> &item_ct1) { 
    unsigned int tid = item_ct1.get_local_id(2); 
    if (tid >= 8) 
      return; 
  /* 
  DPCT1086:1: __activemask() is migrated to 0xffffffff. You may need to adjust 
  the code.
  */ 
    unsigned mask = 0xffffffff; 
    array[tid] = SHFL_SYNC(mask, array[tid], 4, item_ct1); 
  }

このコードは次のように書き換えられます。

   // remove mask parameter, as it is not used 
   inline int SHFL_SYNC(int val, unsigned offset, 
   const sycl::nd_item<3> &item_ct1, unsigned w = 0) { 
     if (!w) w = item_ct1.get_sub_group().get_local_range().get(0); 
       unsigned int tid = item_ct1.get_local_id(2); 
   // Use a temporary variable to save the result of sycl::shift_group_left() to make sure all work-items can encounter this call. 
     int v_tmp = sycl::shift_group_left(item_ct1.get_sub_group(), val, offset); 
     return (tid < 8) ? v_tmp : val; 
   } 
 
  void kernel(int *array, const sycl::nd_item<3> &item_ct1) { 
    unsigned int tid = item_ct1.get_local_id(2); 
  // remove mask parameter, as it is not used 
    array[tid] = SHFL_SYNC(array[tid], 4, item_ct1); 
  }

修正方法の提案#

__activemask() の代わりに 0xffffffff を使用できるか確認します。使用できない場合は、スレッドのロジックを再設計します。

インテル® DPC++
互換性ツール・
デベロッパー・ガイド
およびリファレンス

DPCT1086

目次

DPCT1086#

メッセージ#

詳細な説明#

修正方法の提案#

インテル® DPC++互換性ツール・デベロッパー・ガイドおよびリファレンス

DPCT1086

目次

DPCT1086#

メッセージ#

詳細な説明#

修正方法の提案#

インテル® DPC++
互換性ツール・
デベロッパー・ガイド
およびリファレンス