alias: l2, block id: 17
alias: l2_per_channel, block id: 18
Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100
Target: MI100
Command: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: ['l2', 'l2_per_channel']

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.2s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/12][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:12.724044 134850529345344 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.305617 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:12.733935 134850529345344 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:12.946883 134850529345344 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:13.081236 134850529345344 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.347302 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.120531 134850529345344 generateRocpd.cpp:582] writing SQL database for process 2385133 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:13.121816 134850529345344 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385133_results.db (UUID=00004318-470a-770a-9df8-ccede583e243)
   |-> [rocprofiler-sdk] W20260526 16:53:13.212407 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014367 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.213540 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001103 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.215763 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002194 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.220892 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003109 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.278098 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.057178 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.281057 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002927 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.281086 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.296719 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015618 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.296746 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.296758 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.296770 134850529345344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.296970 134850529345344 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000185 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.297390 134850529345344 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.176860 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.303138 134850529345344 simple_timer.cpp:55] [rocprofv3] output generation ::     0.219402 sec
   |-> [rocprofiler-sdk] W20260526 16:53:13.303223 134850529345344 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.221936 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385133_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/12][Approximate profiling time left: 32 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:15.620033 135591392800576 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.313248 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:15.629101 135591392800576 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:15.840920 135591392800576 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:15.972346 135591392800576 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.343245 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.012001 135591392800576 generateRocpd.cpp:582] writing SQL database for process 2385143 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:16.013296 135591392800576 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385143_results.db (UUID=00004318-5253-7253-9e0c-48408dbcf1ae)
   |-> [rocprofiler-sdk] W20260526 16:53:16.102085 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014443 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.103229 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001113 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.105699 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002442 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.110590 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003014 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.210878 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.100256 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.213700 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002791 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.213729 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.229965 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016220 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.230007 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.230020 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.230032 135591392800576 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.230266 135591392800576 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000220 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.230847 135591392800576 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.218846 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.236871 135591392800576 simple_timer.cpp:55] [rocprofv3] output generation ::     0.262064 sec
   |-> [rocprofiler-sdk] W20260526 16:53:16.236970 135591392800576 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.264572 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385143_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/12][Approximate profiling time left: 28 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_10.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:18.497998 139616773181248 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.304634 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:18.507795 139616773181248 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:18.718774 139616773181248 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:18.846808 139616773181248 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.339013 sec
   |-> [rocprofiler-sdk] W20260526 16:53:18.886074 139616773181248 generateRocpd.cpp:582] writing SQL database for process 2385153 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:18.887355 139616773181248 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385153_results.db (UUID=00004318-5d99-7d99-9d62-d858e9675388)
   |-> [rocprofiler-sdk] W20260526 16:53:18.976441 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014612 sec
   |-> [rocprofiler-sdk] W20260526 16:53:18.977557 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001085 sec
   |-> [rocprofiler-sdk] W20260526 16:53:18.979764 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002179 sec
   |-> [rocprofiler-sdk] W20260526 16:53:18.984775 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003080 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.038963 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.054159 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.041768 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002747 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.041797 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.058135 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016324 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.058169 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.058181 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.058193 139616773181248 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.058426 139616773181248 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000219 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.059021 139616773181248 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.172948 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.065009 139616773181248 simple_timer.cpp:55] [rocprofv3] output generation ::     0.215717 sec
   |-> [rocprofiler-sdk] W20260526 16:53:19.065103 139616773181248 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.218244 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385153_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/12][Approximate profiling time left: 24 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_11.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:21.300160 134346728689472 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.300622 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:21.309922 134346728689472 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.522021 134346728689472 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:21.653537 134346728689472 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.343616 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.692545 134346728689472 generateRocpd.cpp:582] writing SQL database for process 2385163 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:21.693822 134346728689472 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385163_results.db (UUID=00004318-6890-7890-a5e9-131cb85c487a)
   |-> [rocprofiler-sdk] W20260526 16:53:21.782663 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.013867 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.783774 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001081 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.785949 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002147 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.790965 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003099 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.793830 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.002828 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.796265 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002407 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.796294 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.811614 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015306 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.811642 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.811654 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.811666 134346728689472 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.811855 134346728689472 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000175 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.812216 134346728689472 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.119672 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.818021 134346728689472 simple_timer.cpp:55] [rocprofv3] output generation ::     0.161983 sec
   |-> [rocprofiler-sdk] W20260526 16:53:21.818095 134346728689472 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.164507 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385163_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/12][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:24.143925 133747215466304 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.312239 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:24.153794 133747215466304 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.366090 133747215466304 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:24.500200 133747215466304 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.346406 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.539758 133747215466304 generateRocpd.cpp:582] writing SQL database for process 2385173 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:24.541052 133747215466304 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385173_results.db (UUID=00004318-73a0-73a0-962b-554b4c624eeb)
   |-> [rocprofiler-sdk] W20260526 16:53:24.631620 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014714 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.632906 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001255 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.635349 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002415 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.640417 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003110 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.740606 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.100160 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.743527 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002887 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.743557 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.759231 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015659 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.759259 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.759271 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.759283 133747215466304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.759494 133747215466304 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000194 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.759928 133747215466304 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.220171 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.765813 133747215466304 simple_timer.cpp:55] [rocprofv3] output generation ::     0.263146 sec
   |-> [rocprofiler-sdk] W20260526 16:53:24.765901 133747215466304 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.265650 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385173_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/12][Approximate profiling time left: 17 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] W20260526 16:53:27.056266 134724065120064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.305520 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] W20260526 16:53:27.064822 134724065120064 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.276521 134724065120064 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] W20260526 16:53:27.407448 134724065120064 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.342627 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.440006 134724065120064 generateRocpd.cpp:582] writing SQL database for process 2385194 on node 2710291163
   |-> [rocprofiler-sdk] E20260526 16:53:27.441009 134724065120064 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385194_results.db (UUID=00004318-7f07-7f07-b702-baf88e93ff99)
   |-> [rocprofiler-sdk] W20260526 16:53:27.514805 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.010768 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.515799 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.000970 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.517821 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.521952 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.002441 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.563858 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.041877 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.566242 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002360 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.566265 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.579110 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.012834 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.579130 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.579140 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.579149 134724065120064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.579301 134724065120064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000141 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.579641 134724065120064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.139637 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.584031 134724065120064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.173997 sec
   |-> [rocprofiler-sdk] W20260526 16:53:27.584100 134724065120064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.176596 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385194_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/12][Approximate profiling time left: 14 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:29.852457 127845502680896 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.305509 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:29.862187 127845502680896 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.074548 127845502680896 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:30.205252 127845502680896 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.343065 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.244352 127845502680896 generateRocpd.cpp:582] writing SQL database for process 2385205 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:53:30.245629 127845502680896 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385205_results.db (UUID=00004318-89f3-79f3-b2a1-35479374f811)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.334961 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014325 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.336109 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.338689 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002552 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.343738 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003118 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.398476 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.054710 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.401285 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002775 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.401315 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.417841 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016512 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.417875 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.417887 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.417899 127845502680896 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.418129 127845502680896 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000216 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.418700 127845502680896 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.174349 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.424690 127845502680896 simple_timer.cpp:55] [rocprofv3] output generation ::     0.216946 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:30.424782 127845502680896 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.219479 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385205_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/12][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:32.689798 128239515971392 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.304525 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:32.699152 128239515971392 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:32.911239 128239515971392 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:33.041798 128239515971392 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.342646 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.080773 128239515971392 generateRocpd.cpp:582] writing SQL database for process 2385215 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:53:33.082086 128239515971392 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385215_results.db (UUID=00004318-9509-7509-aeba-0b0229b77dde)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.158750 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.010783 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.159711 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.000937 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.161734 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.166018 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.002624 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.207574 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.041535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.209992 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.210015 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.222478 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.012453 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.222498 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.222507 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.222516 128239515971392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.222666 128239515971392 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000135 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.223004 128239515971392 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.142233 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.227694 128239515971392 simple_timer.cpp:55] [rocprofv3] output generation ::     0.183429 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:33.227757 128239515971392 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.185903 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385215_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/12][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_6.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:35.586768 133150120640320 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.309265 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:35.596532 133150120640320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:35.810858 133150120640320 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:35.943272 133150120640320 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.346741 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:35.982312 133150120640320 generateRocpd.cpp:582] writing SQL database for process 2385225 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:53:35.983572 133150120640320 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385225_results.db (UUID=00004318-a056-7056-9ef7-59a667f1ed21)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.072073 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014733 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.073228 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001126 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.075767 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002510 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.080745 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003066 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.180760 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.099987 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.183573 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002779 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.183602 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.199662 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016045 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.199693 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.199705 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.199717 133150120640320 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.199947 133150120640320 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.200532 133150120640320 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.218221 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.206481 133150120640320 simple_timer.cpp:55] [rocprofv3] output generation ::     0.260780 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:36.206581 133150120640320 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.263257 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385225_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/12][Approximate profiling time left: 5 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_7.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:38.439852 136358003949376 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.297702 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:38.449732 136358003949376 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.660468 136358003949376 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:38.791046 136358003949376 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.341314 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.830160 136358003949376 generateRocpd.cpp:582] writing SQL database for process 2385237 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:53:38.831433 136358003949376 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385237_results.db (UUID=00004318-ab86-7b86-82f3-19e0a77751a3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.918592 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014015 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.919710 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001087 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.921910 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002172 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.926899 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.934365 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.007435 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.936769 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002376 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.936798 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.952905 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.952932 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.952944 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.952956 136358003949376 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.953165 136358003949376 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000179 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.953525 136358003949376 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.123367 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.959208 136358003949376 simple_timer.cpp:55] [rocprofv3] output generation ::     0.165713 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:38.959280 136358003949376 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.168175 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385237_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/12][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_8.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:41.279958 125138910142272 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.311284 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:41.289718 125138910142272 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.502594 125138910142272 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:41.637594 125138910142272 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.347876 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.677361 125138910142272 generateRocpd.cpp:582] writing SQL database for process 2385247 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:53:41.678660 125138910142272 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385247_results.db (UUID=00004318-b691-7691-b394-977e241754c7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.768277 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014817 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.769458 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001149 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.772062 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002575 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.777103 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003069 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.877736 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.100605 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.880605 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002833 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.880634 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.896345 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015696 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.896373 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.896385 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.896397 125138910142272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.896603 125138910142272 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000192 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.897059 125138910142272 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.219698 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.902819 125138910142272 simple_timer.cpp:55] [rocprofv3] output generation ::     0.262669 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:41.902904 125138910142272 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.265257 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385247_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/12][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/perfmon/pmc_perf_9.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:44.194635 128930676363072 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.303798 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:44.204179 128930676363072 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.414771 128930676363072 tool.cpp:2422] HSA version 8.20.1 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 16:53:44.548926 128930676363072 simple_timer.cpp:55] [rocprofv3] '/home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/vcopy -n 1048576 -b 256 -i 3' ::     0.344747 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.588295 128930676363072 generateRocpd.cpp:582] writing SQL database for process 2385257 on node 2710291163
   |-> [rocprofiler-sdk] [m[0;31mE20260526 16:53:44.589618 128930676363072 generateRocpd.cpp:605] Opened result file: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/dl385-20-mi100-3c48/2385257_results.db (UUID=00004318-c1fb-71fb-8786-fa835578c746)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.679486 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.014285 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.680614 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.683184 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002542 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.688111 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.003027 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.742258 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.054113 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.745100 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002812 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.745131 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.760683 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015538 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.760710 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.760723 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.760735 128930676363072 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.760935 128930676363072 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000187 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.761362 128930676363072 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.173068 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.767334 128930676363072 simple_timer.cpp:55] [rocprofv3] output generation ::     0.215945 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 16:53:44.767422 128930676363072 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.218437 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/ipblocks_TCC/MI100/out/pmc_1/2385257_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Skipping roofline
