DPCT1019#

Message#

local_mem_size in SYCL* is not a complete equivalent of <variable name> in CUDA*. You may need to adjust the code.

Detailed Help#

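In CUDA*, sharedMemPerBlock reports the size, in bytes, of the shared memory available per block. A SYCL* work-group is the equivalent of a CUDA* block, and SYCL* local memory is the equivalent of CUDA* shared memory. SYCL* does not impose a limit on the size of local memory per work-group. Instead, the limit applies to the maximum size of local memory, in bytes, available per compute unit, which can be queried with the SYCL* info::device::local_mem_size device descriptor. Because the two values are measured against different units (block versus compute unit), a comparison that was valid for sharedMemPerBlock may not carry the same meaning after migration.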

Suggestions to Fix#

Verify that the migrated code is correct.

For example, consider the following original CUDA* code:

void foo() {
  cudaDeviceProp prop;
  cudaGetDeviceProperties(&prop, 0);
  if (prop.sharedMemPerBlock >= threshold) {
    // submit the task
    Code piece A
  } else {
    // change the block size or block number
    Code piece B
  }
}

This code is migrated to the following SYCL* code:

void foo() {
  dpct::device_info prop;
  dpct::dev_mgr::instance().get_device(0).get_device_info(prop);
  /*
  DPCT1019:0: local_mem_size in SYCL is not a complete equivalent of
  sharedMemPerBlock in CUDA. You may need to adjust the code.
  */
  if (prop.get_local_mem_size() >= threshold) {
    // submit the task
    Code piece A
  } else {
    // change the block size or block number
    Code piece B
  }
}

Once the comparison has been verified, this code can be rewritten as follows:

void foo() {
  dpct::device_info prop;
  dpct::dev_mgr::instance().get_device(0).get_device_info(prop);
  if (prop.get_local_mem_size() >= threshold) {
    // submit the task
    Code piece A
  } else {
    // change the block size or block number
    Code piece B
  }
}
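
The "change the block size" branch (Code piece B) is left open in the example above. One possible approach, shown here only as a minimal sketch, is to shrink the work-group size until its local-memory footprint fits the value reported by get_local_mem_size(). The helper name fit_work_group_size and its parameters are hypothetical and are not part of SYCL* or the dpct helper headers. The sketch also assumes that the kernel's local-memory use scales linearly with the number of work-items, which may not hold for every kernel.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper (not a SYCL or dpct API): returns the largest
// work-group size, no larger than `preferred`, whose local-memory
// footprint fits in `local_mem_bytes`.
// Returns 0 when even a single work-item does not fit.
// Assumes local-memory use grows linearly with the work-group size.
std::size_t fit_work_group_size(std::size_t local_mem_bytes,
                                std::size_t bytes_per_item,
                                std::size_t preferred) {
  assert(preferred > 0 && "preferred work-group size must be positive");
  if (bytes_per_item == 0) {
    return preferred;  // the kernel uses no local memory
  }
  // Integer division rounds down, so the result never exceeds the budget.
  const std::size_t max_items = local_mem_bytes / bytes_per_item;
  return max_items < preferred ? max_items : preferred;
}
```

In migrated code, the first argument would typically be prop.get_local_mem_size(). For instance, with 65536 bytes of local memory, 512 bytes per work-item, and a preferred size of 256, the helper returns 128. A return value of 0 signals that the kernel needs a different strategy, such as reducing the per-item footprint.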
