Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/kernel_inv_int/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: ['42']
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:56.361244 125761752211264 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191420 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:56.361867 125761752211264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:56.557590 125761752211264 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:56.648960 125761752211264 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:56.671568 125761752211264 generateRocpd.cpp:583] writing SQL database for process 2522227 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:56.672366 125761752211264 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522227_results.db (UUID=0001fa6e-9bc6-7bc6-845c-986a4daea18a)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:56.756024 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008028 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:56.757242 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001189 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:56.759208 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001950 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:56.769387 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008215 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.098902 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.329501 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.101249 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002326 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.101267 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.110201 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008927 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.110216 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.110222 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.110229 125761752211264 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.110350 125761752211264 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000113 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.110606 125761752211264 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.439039 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.113567 125761752211264 simple_timer.cpp:55] [rocprofv3] output generation ::     0.463259 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:57.113672 125761752211264 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.464663 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522227_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:58.647142 135973650833216 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190467 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:58.647731 135973650833216 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:58.840595 135973650833216 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:52:58.940406 135973650833216 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292675 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:58.962703 135973650833216 generateRocpd.cpp:583] writing SQL database for process 2522237 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:52:58.963508 135973650833216 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522237_results.db (UUID=0001fa6e-a4b5-74b5-b504-ec3383faeb3a)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.047052 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008031 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.048272 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.050405 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002117 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.060764 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008238 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.373672 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.312892 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.376008 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002318 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.376027 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.384608 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008564 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.384624 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.384637 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.384676 135973650833216 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.384802 135973650833216 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.385053 135973650833216 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.422350 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.388002 135973650833216 simple_timer.cpp:55] [rocprofv3] output generation ::     0.445964 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:52:59.388114 135973650833216 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447659 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522237_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:00.928945 137674040426304 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189508 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:00.929581 137674040426304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.125276 137674040426304 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:01.226733 137674040426304 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.297153 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.248972 137674040426304 generateRocpd.cpp:583] writing SQL database for process 2522261 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:01.249781 137674040426304 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522261_results.db (UUID=0001fa6e-ada0-7da0-8311-270121b40daa)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.332443 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.333560 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.335170 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001595 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.345390 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008247 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.644612 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.299207 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.646755 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002124 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.646772 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.655423 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008644 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.655438 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.655444 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.655451 137674040426304 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.655584 137674040426304 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.655823 137674040426304 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.406851 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.658787 137674040426304 simple_timer.cpp:55] [rocprofv3] output generation ::     0.430604 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:01.658889 137674040426304 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.432111 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522261_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:03.183144 133943283433280 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189647 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:03.183776 133943283433280 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.377637 133943283433280 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:03.476816 133943283433280 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.293040 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.499087 133943283433280 generateRocpd.cpp:583] writing SQL database for process 2522270 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:03.499879 133943283433280 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522270_results.db (UUID=0001fa6e-b66e-766e-9370-675fc1eb70d7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.582245 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007954 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.583362 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.585306 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001928 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.595628 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008317 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.882635 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.286992 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.884737 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002078 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.884755 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.893337 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008575 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.893351 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.893357 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.893364 133943283433280 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.893475 133943283433280 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.893697 133943283433280 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.394610 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.896644 133943283433280 simple_timer.cpp:55] [rocprofv3] output generation ::     0.418420 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:03.896740 133943283433280 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.419876 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522270_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:05.430848 129783795236672 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188513 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:05.431449 129783795236672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:05.623871 129783795236672 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:05.706154 129783795236672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274705 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:05.728289 129783795236672 generateRocpd.cpp:583] writing SQL database for process 2522279 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:05.729025 129783795236672 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522279_results.db (UUID=0001fa6e-bf37-7f37-8b77-927dd41bb670)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:05.812214 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007916 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:05.813432 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001201 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:05.815018 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001571 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:05.825236 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008218 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.105450 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.280198 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.107788 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002310 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.107805 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.116350 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008537 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.116364 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.116371 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.116377 129783795236672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.116484 129783795236672 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.116685 129783795236672 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.388397 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.119656 129783795236672 simple_timer.cpp:55] [rocprofv3] output generation ::     0.411768 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:06.119735 129783795236672 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.413547 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522279_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:07.620869 137301408685888 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.183952 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:07.621530 137301408685888 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:07.817705 137301408685888 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:07.899385 137301408685888 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277855 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:07.921745 137301408685888 generateRocpd.cpp:583] writing SQL database for process 2522287 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:07.922544 137301408685888 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522287_results.db (UUID=0001fa6e-c7c9-77c9-9d07-a862cbe81509)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.005399 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007740 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.006652 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001236 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.008350 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001683 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.019075 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008487 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.027709 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008619 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.029832 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.029849 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.038416 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008559 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.038429 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.038435 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.038442 137301408685888 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.038546 137301408685888 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.038739 137301408685888 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.116994 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.041552 137301408685888 simple_timer.cpp:55] [rocprofv3] output generation ::     0.140542 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:08.041601 137301408685888 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.142174 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522287_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:09.576153 139679354601280 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191551 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:09.576736 139679354601280 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:09.772233 139679354601280 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:09.853250 139679354601280 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276514 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:09.875752 139679354601280 generateRocpd.cpp:583] writing SQL database for process 2522295 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:09.876559 139679354601280 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522295_results.db (UUID=0001fa6e-cf65-7f65-9d75-ca9b987a3ada)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:09.957765 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007947 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:09.958938 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001157 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:09.960958 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:09.971356 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008430 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.380505 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.409134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.382802 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002266 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.382819 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.392346 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009520 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.392360 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.392366 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.392373 139679354601280 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.392477 139679354601280 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.392668 139679354601280 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.516915 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.395555 139679354601280 simple_timer.cpp:55] [rocprofv3] output generation ::     0.540594 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:10.395660 139679354601280 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.542363 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522295_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:11.954664 138349510373184 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189497 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:11.955299 138349510373184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.147634 138349510373184 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:12.231146 138349510373184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275848 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.253611 138349510373184 generateRocpd.cpp:583] writing SQL database for process 2522303 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:12.254396 138349510373184 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522303_results.db (UUID=0001fa6e-d8b2-78b2-8b29-4d0838428b3f)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.337192 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.338346 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001137 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.340316 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001955 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.350505 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008215 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.751891 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.401371 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.754254 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002339 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.754271 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.762707 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008429 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.762722 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.762728 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.762735 138349510373184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.762885 138349510373184 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.763134 138349510373184 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.509523 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.766015 138349510373184 simple_timer.cpp:55] [rocprofv3] output generation ::     0.533088 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:12.766135 138349510373184 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.534942 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522303_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:14.323414 139361266523968 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.195249 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:14.324000 139361266523968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:14.516083 139361266523968 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:14.604583 139361266523968 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280583 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:14.627178 139361266523968 generateRocpd.cpp:583] writing SQL database for process 2522312 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:14.627976 139361266523968 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522312_results.db (UUID=0001fa6e-e1ed-71ed-8350-f58b63232103)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:14.711288 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008243 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:14.712486 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:14.714609 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:14.725182 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008419 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.311167 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.585970 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.313364 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002169 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.313381 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.322156 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008768 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.322170 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.322176 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.322183 139361266523968 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.322297 139361266523968 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000106 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.322535 139361266523968 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.695358 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.325560 139361266523968 simple_timer.cpp:55] [rocprofv3] output generation ::     0.719405 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:15.325683 139361266523968 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.721055 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522312_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:16.863350 134805799731008 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192482 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:16.863945 134805799731008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.056395 134805799731008 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:17.144144 134805799731008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280199 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.166755 134805799731008 generateRocpd.cpp:583] writing SQL database for process 2522320 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:17.167505 134805799731008 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522320_results.db (UUID=0001fa6e-ebdb-7bdb-8055-5fe4f704ac0e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.245056 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007929 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.246221 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001147 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.248065 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001829 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.258014 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008079 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.599887 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.341859 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.602037 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002134 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.602055 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.610680 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008618 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.610694 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.610700 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.610707 134805799731008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.610814 134805799731008 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.611015 134805799731008 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.444259 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.613941 134805799731008 simple_timer.cpp:55] [rocprofv3] output generation ::     0.468019 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:17.614037 134805799731008 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.469859 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522320_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:19.152698 136941633363776 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188894 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:19.153297 136941633363776 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.348155 136941633363776 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:19.431067 136941633363776 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.277770 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.453143 136941633363776 generateRocpd.cpp:583] writing SQL database for process 2522328 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:19.453918 136941633363776 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522328_results.db (UUID=0001fa6e-f4d0-74d0-a96c-bd30a4fc57e4)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.536576 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008024 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.537680 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001086 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.539673 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001978 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.550040 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008380 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.882141 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.332086 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.884399 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002240 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.884415 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.892979 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008557 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.892995 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.893002 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.893008 136941633363776 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.893140 136941633363776 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.893362 136941633363776 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.440219 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.896282 136941633363776 simple_timer.cpp:55] [rocprofv3] output generation ::     0.463860 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:19.896393 136941633363776 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.465280 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522328_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:21.462134 128355928522560 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.195901 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:21.462740 128355928522560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:21.659266 128355928522560 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:21.741238 128355928522560 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278499 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:21.763479 128355928522560 generateRocpd.cpp:583] writing SQL database for process 2522336 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:21.764255 128355928522560 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522336_results.db (UUID=0001fa6e-fdcf-7dcf-a0ec-bf8518459bef)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:21.848563 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008236 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:21.849764 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001185 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:21.851763 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001984 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:21.862135 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008344 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.378210 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.516060 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.380504 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002274 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.380521 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.389411 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008883 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.389426 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.389432 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.389439 128355928522560 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.389572 128355928522560 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.389834 128355928522560 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.626356 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.392750 128355928522560 simple_timer.cpp:55] [rocprofv3] output generation ::     0.649770 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:22.392873 128355928522560 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.651595 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522336_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/kernel_inv_int/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:23.936089 140553057034048 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.187857 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:23.936669 140553057034048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.128566 140553057034048 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 18:53:24.209920 140553057034048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.273251 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.232226 140553057034048 generateRocpd.cpp:583] writing SQL database for process 2522344 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 18:53:24.233005 140553057034048 generateRocpd.cpp:606] Opened result file: tests/workloads/kernel_inv_int/MI200/out/pmc_1/smc4124-25-mi210-3c48/2522344_results.db (UUID=0001fa6f-0781-7781-bc2e-5cb47119b6bf)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.314572 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008007 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.315745 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001158 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.317323 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001563 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.327729 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008473 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.646206 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.318462 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.648527 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002303 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.648545 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.657357 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008805 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.657373 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.657379 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.657387 140553057034048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.657542 140553057034048 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.657783 140553057034048 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.425557 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.660667 140553057034048 simple_timer.cpp:55] [rocprofv3] output generation ::     0.449330 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 18:53:24.660767 140553057034048 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.450803 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/kernel_inv_int/MI200/out/pmc_1/2522344_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/kernel_inv_int/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
