Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/dispatch_2/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: ['2']
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:57.533018 125854215790400 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192632 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:57.533625 125854215790400 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:57.728505 125854215790400 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:57.810352 125854215790400 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276727 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:57.833097 125854215790400 generateRocpd.cpp:583] writing SQL database for process 2523015 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:56:57.833887 125854215790400 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523015_results.db (UUID=0001fa72-49d9-79d9-a6a1-39e247c350b5)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:57.916747 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007909 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:57.917860 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:57.919452 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001577 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:57.929669 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008262 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.253540 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.323857 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.255808 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002249 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.255825 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.265046 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009213 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.265060 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.265066 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.265073 125854215790400 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.265198 125854215790400 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000113 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.265441 125854215790400 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.432344 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.268410 125854215790400 simple_timer.cpp:55] [rocprofv3] output generation ::     0.456205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:58.268522 125854215790400 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.458117 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523015_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:59.799516 128289833054016 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191973 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:56:59.800119 128289833054016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:56:59.994627 128289833054016 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:00.076617 128289833054016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276498 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.099283 128289833054016 generateRocpd.cpp:583] writing SQL database for process 2523024 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:00.100099 128289833054016 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523024_results.db (UUID=0001fa72-52b4-72b4-924a-1873019aa9a2)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.183651 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008016 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.184769 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.186732 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001948 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.197005 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008350 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.508654 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.311634 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.511023 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002338 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.511047 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.520380 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009325 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.520394 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.520400 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.520406 128289833054016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.520513 128289833054016 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.520718 128289833054016 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.421435 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.523625 128289833054016 simple_timer.cpp:55] [rocprofv3] output generation ::     0.445292 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:00.523720 128289833054016 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447049 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523024_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:02.056049 134620646752064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192381 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:02.056657 134620646752064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.250950 134620646752064 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:02.334376 134620646752064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277720 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.356969 134620646752064 generateRocpd.cpp:583] writing SQL database for process 2523033 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:02.357758 134620646752064 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523033_results.db (UUID=0001fa72-5b84-7b84-a51c-3d4df43e2249)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.441108 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007989 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.442308 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001185 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.443875 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001551 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.454299 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008447 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.762010 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.307698 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.764262 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002234 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.764279 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.773532 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009246 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.773547 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.773553 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.773560 134620646752064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.773675 134620646752064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.773908 134620646752064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.416939 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.776879 134620646752064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.440674 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:02.776974 134620646752064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.442552 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523033_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:04.302600 125387798310720 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.195230 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:04.303225 125387798310720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:04.495967 125387798310720 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:04.578550 125387798310720 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275325 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:04.601218 125387798310720 generateRocpd.cpp:583] writing SQL database for process 2523041 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:04.602058 125387798310720 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523041_results.db (UUID=0001fa72-6448-7448-be80-8f14b8cdcaa0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:04.686148 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008033 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:04.687376 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:04.689352 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001961 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:04.699747 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.006066 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.306305 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.008388 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002300 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.008405 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.018103 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009691 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.018117 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.018123 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.018129 125387798310720 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.018235 125387798310720 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.018442 125387798310720 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.417224 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.021344 125387798310720 simple_timer.cpp:55] [rocprofv3] output generation ::     0.441043 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:05.021439 125387798310720 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.442833 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523041_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:06.583615 140481882300224 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191802 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:06.584209 140481882300224 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:06.778087 140481882300224 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:06.879640 140481882300224 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.295431 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:06.902100 140481882300224 generateRocpd.cpp:583] writing SQL database for process 2523049 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:06.902878 140481882300224 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523049_results.db (UUID=0001fa72-6d34-7d34-9da0-fbe1ec7b922c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:06.987306 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007992 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:06.988505 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:06.990197 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001677 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.000909 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008512 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.280404 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.279479 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.284378 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.003958 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.284395 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.293755 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009353 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.293770 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.293776 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.293782 140481882300224 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.293892 140481882300224 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.294107 140481882300224 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.392007 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.297008 140481882300224 simple_timer.cpp:55] [rocprofv3] output generation ::     0.415646 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:07.297103 140481882300224 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.417419 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523049_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:08.788201 124327346528064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183366 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:08.788821 124327346528064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:08.984260 124327346528064 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:09.065728 124327346528064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276907 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.088385 124327346528064 generateRocpd.cpp:583] writing SQL database for process 2523057 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:09.089214 124327346528064 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523057_results.db (UUID=0001fa72-75d9-75d9-896a-b6d28db8b7e2)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.173060 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007687 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.174303 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001226 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.175965 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001647 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.186668 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008480 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.195199 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008517 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.197306 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.197324 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.205865 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.205880 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.205886 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.205893 124327346528064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.205991 124327346528064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000087 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.206192 124327346528064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.117807 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.208952 124327346528064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.141454 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:09.209000 124327346528064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.143228 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523057_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:10.717083 134509318934336 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190709 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:10.717670 134509318934336 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:10.914532 134509318934336 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:10.996525 134509318934336 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278855 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.019058 134509318934336 generateRocpd.cpp:583] writing SQL database for process 2523066 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:11.019842 134509318934336 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523066_results.db (UUID=0001fa72-7d5b-7d5b-b328-0cd07896816f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.102122 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008053 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.103327 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001189 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.105469 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002127 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.115967 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008359 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.524504 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.408522 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.526858 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002336 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.526876 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.536113 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009230 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.536128 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.536134 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.536141 134509318934336 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.536251 134509318934336 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.536467 134509318934336 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.517410 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.539449 134509318934336 simple_timer.cpp:55] [rocprofv3] output generation ::     0.541154 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:11.539569 134509318934336 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.542996 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523066_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:13.091757 133682368044864 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191421 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:13.092356 133682368044864 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.286494 133682368044864 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:13.378012 133682368044864 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285656 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.400521 133682368044864 generateRocpd.cpp:583] writing SQL database for process 2523075 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:13.401340 133682368044864 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523075_results.db (UUID=0001fa72-86a1-76a1-8058-8b4ec1178417)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.485203 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008050 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.486393 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.488375 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001967 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.498922 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008540 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.901404 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.402466 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.903696 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002273 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.903714 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.913064 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009343 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.913079 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.913086 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.913092 133682368044864 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.913200 133682368044864 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.913413 133682368044864 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.512893 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.916545 133682368044864 simple_timer.cpp:55] [rocprofv3] output generation ::     0.536975 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:13.916654 133682368044864 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.538563 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523075_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:15.490255 126936325046080 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.196317 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:15.490846 126936325046080 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:15.684882 126936325046080 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:15.770885 126936325046080 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280040 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:15.793232 126936325046080 generateRocpd.cpp:583] writing SQL database for process 2523083 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:15.794028 126936325046080 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523083_results.db (UUID=0001fa72-8ffa-7ffa-81ef-dc82cccd2493)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:15.877338 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:15.878538 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001184 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:15.880740 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002188 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:15.891401 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008534 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.474628 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.583211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.476918 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002259 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.476936 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.485650 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008707 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.485664 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.485671 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.485678 126936325046080 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.485819 126936325046080 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.486117 126936325046080 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.692885 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.489021 126936325046080 simple_timer.cpp:55] [rocprofv3] output generation ::     0.716507 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:16.489171 126936325046080 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.718232 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523083_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:18.017887 135084497911616 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189094 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:18.018534 135084497911616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.213671 135084497911616 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:18.294205 135084497911616 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275672 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.316373 135084497911616 generateRocpd.cpp:583] writing SQL database for process 2523091 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:18.317182 135084497911616 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523091_results.db (UUID=0001fa72-99e1-79e1-8114-1d7bf8df9fe1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.400165 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008022 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.401384 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.403563 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002161 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.414055 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008398 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.756739 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.342667 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.759085 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002329 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.759105 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.768868 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009749 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.768883 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.768889 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.768896 135084497911616 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.769004 135084497911616 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.769218 135084497911616 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.452846 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.772175 135084497911616 simple_timer.cpp:55] [rocprofv3] output generation ::     0.476517 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:18.772291 135084497911616 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.478027 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523091_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:20.310411 131509439053632 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192259 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:20.311049 131509439053632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:20.506267 131509439053632 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:20.591588 131509439053632 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280540 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:20.614642 131509439053632 generateRocpd.cpp:583] writing SQL database for process 2523099 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:20.615457 131509439053632 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523099_results.db (UUID=0001fa72-a2d2-72d2-a83b-b6b05ac9aceb)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:20.698654 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008041 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:20.699870 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:20.702061 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:20.712649 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008394 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.044636 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.331972 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.046966 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002313 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.046984 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.056071 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009080 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.056085 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.056092 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.056098 131509439053632 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.056220 131509439053632 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.056469 131509439053632 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.441827 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.059503 131509439053632 simple_timer.cpp:55] [rocprofv3] output generation ::     0.466036 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:21.059605 131509439053632 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.467967 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523099_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:22.628635 139839295672128 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198473 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:22.629216 139839295672128 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:22.823967 139839295672128 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:22.907336 139839295672128 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:22.929682 139839295672128 generateRocpd.cpp:583] writing SQL database for process 2523109 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:22.930477 139839295672128 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523109_results.db (UUID=0001fa72-abdb-7bdb-a43d-b60b0c9e7e87)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.012852 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008433 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.014063 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001195 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.016208 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002131 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.027196 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008826 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.547154 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.519944 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.549502 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002329 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.549520 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.558448 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008921 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.558463 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.558469 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.558476 139839295672128 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.558604 139839295672128 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.558865 139839295672128 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.629183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.561845 139839295672128 simple_timer.cpp:55] [rocprofv3] output generation ::     0.653050 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:23.561969 139839295672128 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.654588 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523109_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/dispatch_2/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:25.091304 128498669076288 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189985 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:25.091889 128498669076288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.284983 128498669076288 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:57:25.366712 128498669076288 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274823 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.389213 128498669076288 generateRocpd.cpp:583] writing SQL database for process 2523117 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:57:25.390008 128498669076288 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523117_results.db (UUID=0001fa72-b582-7582-aa7b-b57ea3594743)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.473620 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007944 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.474815 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001180 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.476550 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001719 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.487191 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008457 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.806743 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.319538 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.809082 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002322 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.809099 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.819100 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009994 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.819117 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.819124 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.819131 128498669076288 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.819275 128498669076288 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.819523 128498669076288 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.430310 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.822538 128498669076288 simple_timer.cpp:55] [rocprofv3] output generation ::     0.454110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:57:25.822652 128498669076288 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.455888 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_2/MI200/out/pmc_1/2523117_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/dispatch_2/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
