Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/mem_levels_LDS/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:52.594749 129357291544384 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192741 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:52.595369 129357291544384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:52.787826 129357291544384 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:52.873855 129357291544384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278487 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:52.896233 129357291544384 generateRocpd.cpp:583] writing SQL database for process 2527495 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:13:52.897026 129357291544384 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527495_results.db (UUID=0001fa81-c6ee-76ee-833c-2053ec97c0be)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:52.978550 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007992 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:52.979706 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001140 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:52.981380 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001659 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:52.991623 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008252 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.352049 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.360411 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.366549 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.014472 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.366579 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.382653 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.016060 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.382680 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.382692 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.382704 129357291544384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.382893 129357291544384 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000176 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.383145 129357291544384 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.486912 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.386857 129357291544384 simple_timer.cpp:55] [rocprofv3] output generation ::     0.511417 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:53.386963 129357291544384 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.513056 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527495_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:54.936291 124019417001792 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192802 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:54.936936 124019417001792 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.132275 124019417001792 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:55.234581 124019417001792 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.297645 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.258332 124019417001792 generateRocpd.cpp:583] writing SQL database for process 2527505 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:13:55.259146 124019417001792 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527505_results.db (UUID=0001fa81-d014-7014-a250-4a0fe2e89ac6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.339690 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007941 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.340815 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001106 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.342394 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001564 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.352616 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008299 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.664533 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.311902 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.666707 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002158 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.666725 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.675574 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008843 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.675588 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.675594 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.675601 124019417001792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.675707 124019417001792 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.675943 124019417001792 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.417611 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.679165 124019417001792 simple_timer.cpp:55] [rocprofv3] output generation ::     0.442824 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:55.679262 124019417001792 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.444607 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527505_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 23 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:57.236351 133054127923008 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191955 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:57.236962 133054127923008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.430020 133054127923008 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:57.523256 133054127923008 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286294 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.545262 133054127923008 generateRocpd.cpp:583] writing SQL database for process 2527518 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:13:57.546023 133054127923008 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527518_results.db (UUID=0001fa81-d911-7911-a5bc-da338cebe43c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.629345 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008028 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.630547 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001184 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.632220 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001658 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.642862 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008501 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.942880 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.300003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.945252 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002340 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.945269 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.954411 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009135 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.954425 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.954431 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.954438 133054127923008 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.954586 133054127923008 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.954829 133054127923008 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.409568 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.958072 133054127923008 simple_timer.cpp:55] [rocprofv3] output generation ::     0.433493 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:57.958178 133054127923008 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.434883 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527518_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:59.487864 135076226826048 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190421 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:59.488489 135076226826048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:59.680781 135076226826048 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:13:59.773660 135076226826048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285171 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:59.796417 135076226826048 generateRocpd.cpp:583] writing SQL database for process 2527527 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:13:59.797249 135076226826048 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527527_results.db (UUID=0001fa81-e1de-71de-babf-c8815f63a827)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:59.879696 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007984 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:59.880816 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:59.882807 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001976 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:13:59.893098 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008315 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.178757 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.285643 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.180886 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.180904 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.189643 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008733 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.189660 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.189666 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.189673 135076226826048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.189776 135076226826048 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.189987 135076226826048 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.393571 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.193143 135076226826048 simple_timer.cpp:55] [rocprofv3] output generation ::     0.417728 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:00.193249 135076226826048 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.419546 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527527_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:01.727374 130082237857600 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192762 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:01.728053 130082237857600 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:01.923617 130082237857600 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:02.012209 130082237857600 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284156 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.034359 130082237857600 generateRocpd.cpp:583] writing SQL database for process 2527536 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:02.035173 130082237857600 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527536_results.db (UUID=0001fa81-ea9b-7a9b-a962-8f1cef635e66)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.118059 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007962 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.119264 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001190 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.120843 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001563 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.131176 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008366 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.412802 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.281607 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.415121 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002289 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.415139 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.424360 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009214 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.424374 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.424381 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.424388 130082237857600 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.424525 130082237857600 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000125 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.424781 130082237857600 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.390422 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.427905 130082237857600 simple_timer.cpp:55] [rocprofv3] output generation ::     0.414267 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:02.428016 130082237857600 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.415762 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527536_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:03.924471 125865833508672 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.181814 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:03.925104 125865833508672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.123341 125865833508672 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:04.215433 125865833508672 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290328 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.238314 125865833508672 generateRocpd.cpp:583] writing SQL database for process 2527547 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:04.239123 125865833508672 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527547_results.db (UUID=0001fa81-f33b-733b-9d3d-d80eae9f7cae)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.320210 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007692 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.321290 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001064 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.322874 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001570 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.333221 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008439 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.341632 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008396 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.343554 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001907 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.343572 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.352232 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008653 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.352246 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.352252 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.352259 125865833508672 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.352361 125865833508672 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000094 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.352546 125865833508672 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.114232 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.355502 125865833508672 simple_timer.cpp:55] [rocprofv3] output generation ::     0.138277 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:04.355553 125865833508672 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.140072 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527547_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:05.858796 127930863673152 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188235 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:05.859392 127930863673152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.053202 127930863673152 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:06.141227 127930863673152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281834 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.163335 127930863673152 generateRocpd.cpp:583] writing SQL database for process 2527557 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:06.164133 127930863673152 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527557_results.db (UUID=0001fa81-fac3-7ac3-a44f-72529b0112f6)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.247383 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008089 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.248595 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.250555 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001945 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.261185 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008640 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.670075 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.408875 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.672320 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002221 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.672337 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.682016 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009672 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.682041 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.682051 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.682062 127930863673152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.682196 127930863673152 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000124 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.682450 127930863673152 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.519115 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.685465 127930863673152 simple_timer.cpp:55] [rocprofv3] output generation ::     0.542855 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:06.685586 127930863673152 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.544314 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527557_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:08.201410 123459548487488 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188588 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:08.201990 123459548487488 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:08.396957 123459548487488 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:08.484500 123459548487488 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282511 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:08.507285 123459548487488 generateRocpd.cpp:583] writing SQL database for process 2527566 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:08.508091 123459548487488 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527566_results.db (UUID=0001fa82-03e9-73e9-8143-424186a3c55e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:08.591027 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008013 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:08.592230 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001174 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:08.594161 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001917 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:08.604480 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008327 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.008553 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.404058 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.010886 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002317 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.010904 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.020653 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009742 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.020667 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.020674 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.020680 123459548487488 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.020802 123459548487488 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.021016 123459548487488 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.513732 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.024098 123459548487488 simple_timer.cpp:55] [rocprofv3] output generation ::     0.537828 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:09.024215 123459548487488 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.539664 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527566_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:10.589042 138795272970048 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.195935 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:10.589618 138795272970048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:10.784816 138795272970048 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:10.874762 138795272970048 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285144 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:10.898272 138795272970048 generateRocpd.cpp:583] writing SQL database for process 2527575 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:10.899088 138795272970048 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527575_results.db (UUID=0001fa82-0d35-7d35-8141-4b5e553815de)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:10.982537 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008328 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:10.983712 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001159 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:10.985850 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002123 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:10.996819 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008598 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.580286 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.583451 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.582537 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002212 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.582555 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000005 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.593313 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010750 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.593329 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.593336 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.593342 138795272970048 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.593492 138795272970048 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000139 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.593777 138795272970048 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.695505 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.596867 138795272970048 simple_timer.cpp:55] [rocprofv3] output generation ::     0.720122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:11.597029 138795272970048 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.722221 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527575_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:13.135185 135894395854656 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192349 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:13.135777 135894395854656 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.331620 135894395854656 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:13.419356 135894395854656 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283579 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.441790 135894395854656 generateRocpd.cpp:583] writing SQL database for process 2527583 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:13.442605 135894395854656 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527583_results.db (UUID=0001fa82-172b-772b-a9c0-ae4639db2fd9)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.524361 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007950 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.525478 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.527429 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001935 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.537663 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008257 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.880446 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.342768 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.882601 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002138 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.882619 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.891607 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008981 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.891621 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.891627 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.891634 135894395854656 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.891759 135894395854656 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.891972 135894395854656 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.450183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.894925 135894395854656 simple_timer.cpp:55] [rocprofv3] output generation ::     0.474209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:13.895041 135894395854656 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.475637 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527583_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:15.419972 128893158170432 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191655 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:15.420598 128893158170432 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:15.615690 128893158170432 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:15.719459 128893158170432 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.298861 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:15.742865 128893158170432 generateRocpd.cpp:583] writing SQL database for process 2527592 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:15.743669 128893158170432 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527592_results.db (UUID=0001fa82-2019-7019-9af8-e788bac68d35)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:15.827564 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008125 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:15.828748 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001169 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:15.830728 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001964 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:15.841164 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008466 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.174146 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.332967 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.176448 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002277 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.176465 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.185183 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008711 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.185198 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.185204 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.185211 128893158170432 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.185325 128893158170432 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.185543 128893158170432 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.442679 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.188471 128893158170432 simple_timer.cpp:55] [rocprofv3] output generation ::     0.467166 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:16.188567 128893158170432 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.469064 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527592_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:17.737328 133343738928960 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198832 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:17.737925 133343738928960 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:17.932770 133343738928960 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:18.021446 133343738928960 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283521 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.044425 133343738928960 generateRocpd.cpp:583] writing SQL database for process 2527601 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:18.045257 133343738928960 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527601_results.db (UUID=0001fa82-291f-791f-9724-55c47d567e9d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.128869 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008258 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.130049 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001164 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.132000 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001936 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.142498 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008445 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.660509 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.517996 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.662773 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002245 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.662790 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.671695 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008898 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.671709 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.671715 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.671721 133343738928960 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.671829 133343738928960 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.672053 133343738928960 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.627628 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.675128 133343738928960 simple_timer.cpp:55] [rocprofv3] output generation ::     0.651849 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:18.675249 133343738928960 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.653754 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527601_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_LDS/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:20.227257 136053516754752 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191692 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:20.227819 136053516754752 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.422551 136053516754752 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:14:20.510468 136053516754752 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282650 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.532765 136053516754752 generateRocpd.cpp:583] writing SQL database for process 2527611 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:14:20.533561 136053516754752 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527611_results.db (UUID=0001fa82-32e0-72e0-ab78-3b37b7678a6b)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.616046 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008047 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.617150 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001089 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.618725 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001560 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.629200 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008513 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.952561 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.323347 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.954841 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.954859 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.963600 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008733 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.963615 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.963621 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.963627 136053516754752 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.963739 136053516754752 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.963946 136053516754752 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.431181 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.966925 136053516754752 simple_timer.cpp:55] [rocprofv3] output generation ::     0.454888 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:14:20.967027 136053516754752 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.456512 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_LDS/MI200/out/pmc_1/2527611_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/mem_levels_LDS/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
