Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/mem_levels_L2/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:59.954120 133960644960064 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191584 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:11:59.954717 133960644960064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.148674 133960644960064 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:00.235636 133960644960064 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280919 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.257733 133960644960064 generateRocpd.cpp:583] writing SQL database for process 2526908 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:00.258485 133960644960064 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526908_results.db (UUID=0001fa80-0eef-7eef-a3f2-b1e1ab8683d8)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.341376 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008036 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.342573 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001180 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.344156 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001569 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.354317 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008217 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.704874 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.350541 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.719328 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.014422 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.719347 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.729251 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009897 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.729265 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.729271 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.729278 133960644960064 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.729384 133960644960064 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000099 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.729572 133960644960064 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.471839 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.733652 133960644960064 simple_timer.cpp:55] [rocprofv3] output generation ::     0.496771 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:00.733743 133960644960064 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.498072 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526908_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:02.289693 133302320176960 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191935 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:02.290295 133302320176960 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:02.484298 133302320176960 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:02.576239 133302320176960 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285944 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:02.598624 133302320176960 generateRocpd.cpp:583] writing SQL database for process 2526918 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:02.599437 133302320176960 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526918_results.db (UUID=0001fa80-180e-780e-8fd7-a83f61a543ed)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:02.683084 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008018 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:02.684311 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:02.685963 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:02.696158 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.018920 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.322747 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.021182 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002242 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.021199 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.029859 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008653 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.029874 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.029880 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.029887 133302320176960 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.030011 133302320176960 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000116 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.030258 133302320176960 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.431635 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.033545 133302320176960 simple_timer.cpp:55] [rocprofv3] output generation ::     0.455796 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:03.033648 133302320176960 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.457356 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526918_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 23 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:04.576681 126144870629184 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189864 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:04.577279 126144870629184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:04.770321 126144870629184 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:04.863389 126144870629184 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:04.885690 126144870629184 generateRocpd.cpp:583] writing SQL database for process 2526926 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:04.886489 126144870629184 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526926_results.db (UUID=0001fa80-20ff-70ff-9687-3d2e4a177700)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:04.969160 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007964 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:04.970244 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001067 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:04.971833 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001573 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:04.982108 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008314 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.281044 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.298906 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.283403 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002329 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.283420 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.292542 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.292557 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.292564 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.292571 126144870629184 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.292703 126144870629184 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.292960 126144870629184 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.407270 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.296156 126144870629184 simple_timer.cpp:55] [rocprofv3] output generation ::     0.431311 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:05.296259 126144870629184 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.432820 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526926_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:06.851830 124146734686016 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191883 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:06.852470 124146734686016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.046537 124146734686016 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:07.128251 124146734686016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275781 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.150760 124146734686016 generateRocpd.cpp:583] writing SQL database for process 2526935 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:07.151575 124146734686016 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526935_results.db (UUID=0001fa80-29e0-79e0-a11c-793d509237f7)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.234588 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007948 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.235734 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001128 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.237869 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002120 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.248427 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008384 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.541764 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.293322 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.544012 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002226 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.544036 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000010 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.553399 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009356 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.553414 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.553420 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.553427 124146734686016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.553530 124146734686016 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000096 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.553743 124146734686016 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.402983 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.556783 124146734686016 simple_timer.cpp:55] [rocprofv3] output generation ::     0.426929 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:07.556875 124146734686016 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.428575 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526935_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:09.106515 139044470603584 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189391 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:09.107164 139044470603584 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.301668 139044470603584 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:09.387197 139044470603584 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280033 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.409548 139044470603584 generateRocpd.cpp:583] writing SQL database for process 2526945 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:09.410325 139044470603584 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526945_results.db (UUID=0001fa80-32b2-72b2-a9c5-e993fcfce28c)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.493243 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008065 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.494438 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001177 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.496112 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001659 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.506585 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008319 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.785054 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.278453 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.787543 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002458 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.787561 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.796701 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009133 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.796717 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.796723 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.796731 139044470603584 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.796860 139044470603584 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.797129 139044470603584 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.387581 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.800187 139044470603584 simple_timer.cpp:55] [rocprofv3] output generation ::     0.411429 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:09.800293 139044470603584 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.413055 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526945_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:11.307877 137336669691712 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.182945 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:11.308506 137336669691712 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.502898 137336669691712 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:11.588754 137336669691712 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280249 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.610831 137336669691712 generateRocpd.cpp:583] writing SQL database for process 2526953 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:11.611616 137336669691712 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526953_results.db (UUID=0001fa80-3b52-7b52-9bc8-4b379b8348a0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.694316 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007692 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.695507 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.697198 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001676 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.707816 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008473 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.716425 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008594 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.718523 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002083 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.718540 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.727104 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008557 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.727119 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.727125 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.727132 137336669691712 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.727231 137336669691712 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000093 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.727430 137336669691712 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.116599 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.730239 137336669691712 simple_timer.cpp:55] [rocprofv3] output generation ::     0.140344 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:11.730284 137336669691712 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.141483 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526953_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:13.279313 131655501852480 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191944 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:13.279967 131655501852480 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:13.473489 131655501852480 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:13.558103 131655501852480 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278136 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:13.580607 131655501852480 generateRocpd.cpp:583] writing SQL database for process 2526962 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:13.581410 131655501852480 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526962_results.db (UUID=0001fa80-42fc-72fc-b6c1-c7c43c198a05)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:13.664887 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:13.666043 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001140 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:13.667968 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001910 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:13.678467 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008443 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.088278 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.409796 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.090559 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.090576 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.099021 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008437 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.099043 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.099049 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.099056 131655501852480 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.099183 131655501852480 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.099447 131655501852480 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.518840 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.102432 131655501852480 simple_timer.cpp:55] [rocprofv3] output generation ::     0.542679 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:14.102553 131655501852480 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.544401 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526962_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:15.626221 133990585794368 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192744 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:15.626813 133990585794368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:15.820413 133990585794368 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:15.911396 133990585794368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284583 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:15.933296 133990585794368 generateRocpd.cpp:583] writing SQL database for process 2526972 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:15.934098 133990585794368 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526972_results.db (UUID=0001fa80-4c26-7c26-b0ce-0304f8d975d3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.018087 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008125 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.019309 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.021404 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002081 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.031959 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008372 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.435854 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.403880 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.438126 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002256 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.438144 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.447102 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008950 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.447115 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.447122 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.447129 133990585794368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.447239 133990585794368 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.447456 133990585794368 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.514161 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.450467 133990585794368 simple_timer.cpp:55] [rocprofv3] output generation ::     0.537866 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:16.450583 133990585794368 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.539144 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526972_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 9 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:18.042817 130509853384512 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197354 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:18.043416 130509853384512 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:18.239180 130509853384512 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:18.331165 130509853384512 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287750 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:18.353875 130509853384512 generateRocpd.cpp:583] writing SQL database for process 2526980 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:18.354672 130509853384512 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526980_results.db (UUID=0001fa80-5592-7592-b30f-d36be0d2d468)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:18.437697 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008359 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:18.438881 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001168 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:18.440892 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001996 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:18.451433 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008572 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.037767 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.586318 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.039990 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002194 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.040008 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.049044 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009028 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.049060 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.049066 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.049073 130509853384512 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.049242 130509853384512 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000126 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.049525 130509853384512 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.695650 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.052564 130509853384512 simple_timer.cpp:55] [rocprofv3] output generation ::     0.719744 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:19.052706 130509853384512 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.721492 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526980_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:20.601893 123274359480128 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191116 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:20.602550 123274359480128 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:20.798232 123274359480128 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:20.885020 123274359480128 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282471 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:20.907782 123274359480128 generateRocpd.cpp:583] writing SQL database for process 2526989 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:20.908559 123274359480128 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526989_results.db (UUID=0001fa80-5f97-7f97-8abe-bc3c7d8e3d61)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:20.992072 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008076 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:20.993289 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:20.995365 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002061 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.005738 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008293 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.351619 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.345866 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.353923 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002287 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.353940 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.363568 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009621 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.363583 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.363589 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.363596 123274359480128 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.363718 123274359480128 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.363953 123274359480128 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.456171 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.366948 123274359480128 simple_timer.cpp:55] [rocprofv3] output generation ::     0.480076 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:21.367054 123274359480128 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.481987 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526989_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:22.913815 125662948376384 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189812 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:22.914400 125662948376384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.108306 125662948376384 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:23.198337 125662948376384 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.283937 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.220818 125662948376384 generateRocpd.cpp:583] writing SQL database for process 2526997 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:23.221623 125662948376384 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526997_results.db (UUID=0001fa80-68a0-78a0-abac-7e100a284ff3)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.303447 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007936 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.304615 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001151 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.306536 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001906 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.316994 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008556 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.667739 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.350729 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.669949 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.669967 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.679499 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009525 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.679514 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.679521 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.679527 125662948376384 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.679635 125662948376384 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.679830 125662948376384 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.459012 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.682872 125662948376384 simple_timer.cpp:55] [rocprofv3] output generation ::     0.482988 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:23.682978 125662948376384 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.484590 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2526997_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:25.244140 129777831489344 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.199188 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:25.244724 129777831489344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:25.441930 129777831489344 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:25.527215 129777831489344 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.282492 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:25.549795 129777831489344 generateRocpd.cpp:583] writing SQL database for process 2527005 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:25.550592 129777831489344 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527005_results.db (UUID=0001fa80-71b1-71b1-99da-a19e83c1e41d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:25.634381 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:25.635582 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:25.637681 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002085 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:25.648239 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008407 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.166987 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.518732 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.169225 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.169244 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.179334 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010083 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.179351 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.179357 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.179364 129777831489344 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.179531 129777831489344 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000133 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.179818 129777831489344 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.630023 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.182887 129777831489344 simple_timer.cpp:55] [rocprofv3] output generation ::     0.653995 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:26.183020 129777831489344 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.655754 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2527005_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_L2/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:27.717057 130927387017024 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192859 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:27.717709 130927387017024 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:27.914948 130927387017024 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:12:27.999160 130927387017024 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.281451 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.021607 130927387017024 generateRocpd.cpp:583] writing SQL database for process 2527013 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:12:28.022410 130927387017024 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_L2/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527013_results.db (UUID=0001fa80-7b61-7b61-ba8c-ca74bb4bd158)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.105650 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008019 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.106834 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001167 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.108430 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001581 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.118813 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008411 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.439069 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.320240 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.441380 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002287 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.441397 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.450432 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009027 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.450446 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.450453 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.450459 130927387017024 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.450610 130927387017024 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000117 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.450859 130927387017024 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.429253 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.453880 130927387017024 simple_timer.cpp:55] [rocprofv3] output generation ::     0.453054 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:12:28.453982 130927387017024 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.454773 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_L2/MI200/out/pmc_1/2527013_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/mem_levels_L2/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
