Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/dispatch_0_1/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: ['1:3']
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:22.650023 133556255006528 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190212 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:22.650640 133556255006528 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:22.844898 133556255006528 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:22.939581 133556255006528 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288942 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:22.962228 133556255006528 generateRocpd.cpp:583] writing SQL database for process 2523476 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:22.963026 133556255006528 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523476_results.db (UUID=0001fa74-80b8-70b8-a45c-f775129e5ada)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.047016 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.048211 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001170 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.050152 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001926 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.060414 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008301 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.384473 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.324044 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.386734 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002244 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.386752 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.396353 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009594 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.396368 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.396374 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.396380 133556255006528 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.396523 133556255006528 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.396737 133556255006528 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.434510 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.399883 133556255006528 simple_timer.cpp:55] [rocprofv3] output generation ::     0.458548 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:23.399987 133556255006528 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.460357 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523476_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:24.941503 134691697549120 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188593 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:24.942080 134691697549120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.134387 134691697549120 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:25.229905 134691697549120 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287825 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.252202 134691697549120 generateRocpd.cpp:583] writing SQL database for process 2523485 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:25.252979 134691697549120 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523485_results.db (UUID=0001fa74-89ad-79ad-9f4f-bfb4b42eaf1c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.335883 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007959 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.337097 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.339016 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001905 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.349309 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008298 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.662917 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.313593 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.665199 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002264 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.665216 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.674211 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008988 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.674226 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.674233 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.674239 134691697549120 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.674366 134691697549120 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000115 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.674617 134691697549120 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.422415 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.677655 134691697549120 simple_timer.cpp:55] [rocprofv3] output generation ::     0.446274 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:25.677759 134691697549120 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447815 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523485_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:27.233499 131032983813952 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190375 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:27.234096 131032983813952 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.427935 131032983813952 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:27.514752 131032983813952 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280656 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.537323 131032983813952 generateRocpd.cpp:583] writing SQL database for process 2523493 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:27.538131 131032983813952 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523493_results.db (UUID=0001fa74-92a0-72a0-b7b9-9773fd03a874)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.620793 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.621987 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001177 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.623601 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001599 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.634015 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008415 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.934143 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.300113 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.936517 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002355 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.936534 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.945466 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008925 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.945481 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.945488 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.945494 131032983813952 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.945649 131032983813952 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000117 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.945901 131032983813952 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.408579 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.948901 131032983813952 simple_timer.cpp:55] [rocprofv3] output generation ::     0.432542 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:27.948995 131032983813952 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.434194 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523493_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:29.501377 139737554026304 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.193188 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:29.501960 139737554026304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:29.699028 139737554026304 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:29.795232 139737554026304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.293273 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:29.817771 139737554026304 generateRocpd.cpp:583] writing SQL database for process 2523502 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:29.818570 139737554026304 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523502_results.db (UUID=0001fa74-9b79-7b79-b296-a50d2907304d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:29.901904 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008077 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:29.903139 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001219 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:29.905290 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002136 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:29.915995 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008518 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.202601 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.286589 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.204943 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002316 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.204960 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.214213 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009246 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.214227 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.214234 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.214240 139737554026304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.214361 139737554026304 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.214600 139737554026304 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.396830 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.217568 139737554026304 simple_timer.cpp:55] [rocprofv3] output generation ::     0.420516 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:30.217661 139737554026304 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.422384 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523502_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:31.758366 139394758696768 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191249 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:31.758944 139394758696768 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:31.961133 139394758696768 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:32.063605 139394758696768 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.304661 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.086271 139394758696768 generateRocpd.cpp:583] writing SQL database for process 2523510 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:32.087066 139394758696768 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523510_results.db (UUID=0001fa74-a44c-744c-95c0-e1cd4632fea4)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.170117 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.171320 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001187 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.172904 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001569 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.183381 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008442 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.463440 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.280042 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.465750 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002290 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.465767 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.474821 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009045 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.474835 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.474842 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.474849 139394758696768 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.475003 139394758696768 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.475262 139394758696768 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.388991 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.478288 139394758696768 simple_timer.cpp:55] [rocprofv3] output generation ::     0.413005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:32.478383 139394758696768 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.414733 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523510_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:34.000100 131833225559872 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182706 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:34.000725 131833225559872 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.195011 131833225559872 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:34.291875 131833225559872 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.291150 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.314248 131833225559872 generateRocpd.cpp:583] writing SQL database for process 2523519 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:34.315064 131833225559872 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523519_results.db (UUID=0001fa74-ad16-7d16-b5b2-e2806b752cbf)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.398141 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007844 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.399351 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001192 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.401052 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001686 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.411677 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008449 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.420165 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008473 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.422270 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002091 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.422288 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.430968 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008673 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.430982 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.430988 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.430994 131833225559872 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.431109 131833225559872 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.431301 131833225559872 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.117054 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.434265 131833225559872 simple_timer.cpp:55] [rocprofv3] output generation ::     0.140843 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:34.434313 131833225559872 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.142387 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523519_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:35.952588 127068946517824 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192196 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:35.953241 127068946517824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.149021 127068946517824 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:36.248489 127068946517824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.295248 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.271090 127068946517824 generateRocpd.cpp:583] writing SQL database for process 2523527 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:36.271875 127068946517824 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523527_results.db (UUID=0001fa74-b4ad-74ad-8c43-2a978cb4e419)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.355332 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008151 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.356553 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.358690 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.369398 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008596 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.779414 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.410001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.781791 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002351 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.781809 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.790793 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008976 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.790807 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.790813 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.790820 127068946517824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.790975 127068946517824 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.791257 127068946517824 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.520168 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.794269 127068946517824 simple_timer.cpp:55] [rocprofv3] output generation ::     0.543957 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:36.794390 127068946517824 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.545854 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523527_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:38.350654 135430725443392 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190097 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:38.351280 135430725443392 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:38.547408 135430725443392 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:38.643313 135430725443392 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292032 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:38.665593 135430725443392 generateRocpd.cpp:583] writing SQL database for process 2523535 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:38.666395 135430725443392 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523535_results.db (UUID=0001fa74-be0d-7e0d-a432-0028c00c801c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:38.749803 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:38.750999 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001179 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:38.752982 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001968 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:38.763360 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008362 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.164786 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.401410 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.167173 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002355 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.167190 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.176653 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009456 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.176668 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.176675 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.176681 135430725443392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.176844 135430725443392 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.177113 135430725443392 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.511520 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.180169 135430725443392 simple_timer.cpp:55] [rocprofv3] output generation ::     0.535535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:39.180298 135430725443392 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.536937 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523535_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:40.743941 129356410535744 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199412 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:40.744595 129356410535744 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:40.937839 129356410535744 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:41.032374 129356410535744 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287779 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.054944 129356410535744 generateRocpd.cpp:583] writing SQL database for process 2523543 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:41.055759 129356410535744 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523543_results.db (UUID=0001fa74-c75d-775d-bdea-64b87918b9e3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.139373 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008295 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.140587 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001199 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.142771 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002169 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.153758 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008783 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.738830 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.585057 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.741147 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002292 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.741170 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.750148 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008971 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.750164 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.750170 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.750178 129356410535744 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.750316 129356410535744 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000130 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.750601 129356410535744 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.695657 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.753561 129356410535744 simple_timer.cpp:55] [rocprofv3] output generation ::     0.719359 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:41.753697 129356410535744 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.721273 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523543_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:43.304398 137122441338688 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189606 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:43.304987 137122441338688 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:43.498278 137122441338688 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:43.593021 137122441338688 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288035 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:43.615170 137122441338688 generateRocpd.cpp:583] writing SQL database for process 2523551 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:43.615948 137122441338688 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523551_results.db (UUID=0001fa74-d167-7167-9564-118316ea6fb1)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:43.699210 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007997 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:43.700431 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001204 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:43.702380 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001935 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:43.712890 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008499 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.056691 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.343786 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.058938 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002231 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.058956 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.067897 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008934 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.067913 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.067919 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.067925 137122441338688 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.068046 137122441338688 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000113 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.068280 137122441338688 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.453110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.071284 137122441338688 simple_timer.cpp:55] [rocprofv3] output generation ::     0.476757 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:44.071397 137122441338688 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.478310 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523551_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:45.608240 124454502268736 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191333 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:45.608828 124454502268736 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:45.802052 124454502268736 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:45.891323 124454502268736 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282495 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:45.914082 124454502268736 generateRocpd.cpp:583] writing SQL database for process 2523559 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:45.914876 124454502268736 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523559_results.db (UUID=0001fa74-da65-7a65-9c0f-f23ba6d6dc93)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:45.998687 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:45.999922 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001219 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.001920 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001982 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.012310 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008370 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.347689 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.335365 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.350041 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002335 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.350059 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.359229 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009163 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.359243 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.359250 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.359257 124454502268736 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.359409 124454502268736 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.359653 124454502268736 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.445572 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.362576 124454502268736 simple_timer.cpp:55] [rocprofv3] output generation ::     0.469413 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:46.362675 124454502268736 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.471300 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523559_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:47.907893 126586972487488 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198821 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:47.908511 126586972487488 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.101149 126586972487488 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:48.190363 126586972487488 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281852 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.212360 126586972487488 generateRocpd.cpp:583] writing SQL database for process 2523568 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:48.213157 126586972487488 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523568_results.db (UUID=0001fa74-e359-7359-a42d-2377ec13e008)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.295311 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008116 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.296499 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001171 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.298479 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001965 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.309169 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008702 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.825016 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.515832 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.827371 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002319 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.827389 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.836193 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008798 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.836209 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.836215 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.836222 126586972487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.836361 126586972487488 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000128 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.836648 126586972487488 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.624288 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.839704 126586972487488 simple_timer.cpp:55] [rocprofv3] output generation ::     0.648007 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:48.839836 126586972487488 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.649431 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523568_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/dispatch_0_1/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:50.399693 136640222084928 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188645 sec
   |-> [rocprofiler-sdk] [rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:50.400325 136640222084928 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:50.594797 136640222084928 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] vcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:59:50.681236 136640222084928 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280911 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:50.703209 136640222084928 generateRocpd.cpp:583] writing SQL database for process 2523576 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:59:50.704001 136640222084928 generateRocpd.cpp:606] Opened result file: tests/workloads/dispatch_0_1/MI200/out/pmc_1/smc4124-25-mi210-3c48/2523576_results.db (UUID=0001fa74-ed20-7d20-86d9-a55b91762b85)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:50.787614 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008022 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:50.788811 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001180 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:50.790559 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001733 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:50.801246 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008519 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.121859 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.320597 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.124303 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002420 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.124320 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.133125 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008797 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.133140 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.133146 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.133153 136640222084928 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.133264 136640222084928 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.133463 136640222084928 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.430254 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.136447 136640222084928 simple_timer.cpp:55] [rocprofv3] output generation ::     0.453830 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:59:51.136540 136640222084928 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.455263 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/dispatch_0_1/MI200/out/pmc_1/2523576_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/dispatch_0_1/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
