Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/mem_levels_vL1d_LDS/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:45.297798 130297226395456 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192644 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:45.298402 130297226395456 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:45.491536 130297226395456 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:45.578104 130297226395456 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279702 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:45.600709 130297226395456 generateRocpd.cpp:583] writing SQL database for process 2527852 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:45.601514 130297226395456 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527852_results.db (UUID=0001fa83-7f2d-7f2d-be7f-21024398c553)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:45.684413 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008026 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:45.685582 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001152 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:45.687150 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001554 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:45.697497 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008406 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.030120 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.332607 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.032521 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002381 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.032539 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.042546 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.042560 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.042566 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.042573 130297226395456 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.042678 130297226395456 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.042892 130297226395456 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.442183 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.047210 130297226395456 simple_timer.cpp:55] [rocprofv3] output generation ::     0.467342 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:46.047335 130297226395456 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.469183 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527852_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 25 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:47.602044 134238754398016 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192321 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:47.602667 134238754398016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:47.795441 134238754398016 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:47.878331 134238754398016 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.275665 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:47.901625 134238754398016 generateRocpd.cpp:583] writing SQL database for process 2527862 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:47.902440 134238754398016 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527862_results.db (UUID=0001fa83-882e-782e-84e8-f141139d4351)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:47.985245 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008009 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:47.986451 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001184 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:47.988109 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001643 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:47.998769 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008695 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.313575 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.314791 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.315799 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002206 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.315817 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.325177 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009353 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.325192 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.325198 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.325205 134238754398016 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.325326 134238754398016 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000111 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.325578 134238754398016 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.423954 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.328994 134238754398016 simple_timer.cpp:55] [rocprofv3] output generation ::     0.448761 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:48.329129 134238754398016 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.450750 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527862_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:49.858764 138600138497856 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189358 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:49.859390 138600138497856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.052631 138600138497856 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:50.156952 138600138497856 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.297562 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.179081 138600138497856 generateRocpd.cpp:583] writing SQL database for process 2527870 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:50.179884 138600138497856 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527870_results.db (UUID=0001fa83-9102-7102-af42-2054c9b0c78b)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.263644 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007977 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.264762 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.266631 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001855 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.277102 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008337 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.579278 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.302162 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.581564 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002267 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.581581 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.590365 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008776 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.590379 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.590385 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.590392 138600138497856 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.590499 138600138497856 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.590704 138600138497856 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.411623 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.593736 138600138497856 simple_timer.cpp:55] [rocprofv3] output generation ::     0.435422 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:50.593837 138600138497856 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.436836 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527870_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:52.118265 131167171596096 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189816 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:52.118876 131167171596096 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.312311 131167171596096 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:52.409477 131167171596096 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290601 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.431763 131167171596096 generateRocpd.cpp:583] writing SQL database for process 2527880 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:52.432595 131167171596096 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527880_results.db (UUID=0001fa83-99d5-79d5-96fc-194beba478db)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.515579 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007908 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.516687 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001093 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.518662 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001959 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.529012 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008375 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.817234 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.288206 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.819501 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002239 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.819519 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.828433 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008908 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.828449 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.828455 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.828462 131167171596096 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.828587 131167171596096 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000117 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.828839 131167171596096 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.397077 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.831968 131167171596096 simple_timer.cpp:55] [rocprofv3] output generation ::     0.420864 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:52.832082 131167171596096 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.422553 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527880_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:54.348864 134315397177152 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192954 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:54.349502 134315397177152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:54.542281 134315397177152 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:54.637239 134315397177152 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287738 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:54.659946 134315397177152 generateRocpd.cpp:583] writing SQL database for process 2527888 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:54.660742 134315397177152 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527888_results.db (UUID=0001fa83-a288-7288-bff4-a9001f02627e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:54.743060 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007927 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:54.744172 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:54.745745 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001558 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:54.756089 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008332 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.036623 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.280520 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.038795 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002156 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.038812 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.047440 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008620 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.047455 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.047461 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.047468 134315397177152 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.047597 134315397177152 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000121 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.047847 134315397177152 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.387902 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.050897 134315397177152 simple_timer.cpp:55] [rocprofv3] output generation ::     0.411936 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:55.050996 134315397177152 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.413714 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527888_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:56.532564 124205076127552 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.181978 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:56.533157 124205076127552 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.728367 124205076127552 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:56.820795 124205076127552 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.287638 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.844230 124205076127552 generateRocpd.cpp:583] writing SQL database for process 2527898 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:56.845019 124205076127552 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527898_results.db (UUID=0001fa83-ab1b-7b1b-a512-78bf1bf1eecd)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.927604 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007754 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.928807 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.930400 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001578 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.940803 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008419 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.949193 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008374 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.951240 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002032 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.951257 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.960094 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008830 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.960108 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.960114 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.960121 124205076127552 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.960227 124205076127552 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.960423 124205076127552 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.116193 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.963412 124205076127552 simple_timer.cpp:55] [rocprofv3] output generation ::     0.140399 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:56.963461 124205076127552 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.142613 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527898_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:58.458071 127986590818112 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190904 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:58.458641 127986590818112 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:58.650798 127986590818112 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:15:58.742870 127986590818112 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284229 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:58.765369 127986590818112 generateRocpd.cpp:583] writing SQL database for process 2527907 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:15:58.766180 127986590818112 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527907_results.db (UUID=0001fa83-b298-7298-aa91-a5de960c056a)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:58.849944 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008182 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:58.851170 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001209 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:58.853289 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:58.863848 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008540 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.284377 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.420512 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.286657 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002261 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.286674 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.295481 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008800 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.295497 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.295504 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.295511 127986590818112 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.295626 127986590818112 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000105 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.295851 127986590818112 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.530482 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.298958 127986590818112 simple_timer.cpp:55] [rocprofv3] output generation ::     0.554596 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:15:59.299095 127986590818112 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.556177 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527907_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:00.832799 129196542955328 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191615 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:00.833390 129196542955328 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.025892 129196542955328 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:01.113595 129196542955328 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.136195 129196542955328 generateRocpd.cpp:583] writing SQL database for process 2527916 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:16:01.136996 129196542955328 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527916_results.db (UUID=0001fa83-bbde-7bde-be6d-1c87137d0501)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.219378 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008011 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.220488 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001093 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.222421 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001919 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.232791 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008377 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.623227 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.390422 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.625308 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002065 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.625325 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.634277 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008945 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.634291 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.634297 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.634304 129196542955328 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.634407 129196542955328 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.634624 129196542955328 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.498429 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.638330 129196542955328 simple_timer.cpp:55] [rocprofv3] output generation ::     0.523103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:01.638458 129196542955328 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.524813 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527916_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:03.189866 128853387714368 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197381 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:03.190485 128853387714368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:03.384953 128853387714368 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:03.471236 128853387714368 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.280751 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:03.493914 128853387714368 generateRocpd.cpp:583] writing SQL database for process 2527924 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:16:03.494721 128853387714368 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527924_results.db (UUID=0001fa83-c50d-750d-9add-a327acadcca2)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:03.577948 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008247 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:03.579162 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:03.581310 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002131 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:03.591919 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008604 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.185299 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.593362 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.187534 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002203 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.187554 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.196707 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009138 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.196721 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.196728 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.196734 128853387714368 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.196870 128853387714368 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000126 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.197143 128853387714368 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.703230 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.200452 128853387714368 simple_timer.cpp:55] [rocprofv3] output generation ::     0.727427 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:04.200592 128853387714368 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.729298 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527924_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:05.731596 128348834717504 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190058 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:05.732288 128348834717504 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:05.926345 128348834717504 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:06.017090 128348834717504 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.284802 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.040061 128348834717504 generateRocpd.cpp:583] writing SQL database for process 2527935 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:16:06.040858 128348834717504 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527935_results.db (UUID=0001fa83-cf02-7f02-8371-b1bf0645fce8)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.123929 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007997 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.125150 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001205 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.127237 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002072 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.137614 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008365 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.479868 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.342238 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.482056 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002167 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.482074 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.491537 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009456 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.491552 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.491558 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.491565 128348834717504 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.491682 128348834717504 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000108 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.491918 128348834717504 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.451858 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.495026 128348834717504 simple_timer.cpp:55] [rocprofv3] output generation ::     0.476014 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:06.495138 128348834717504 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.477999 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527935_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:08.024686 129098230357824 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189540 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:08.025294 129098230357824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.233257 129098230357824 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:08.317669 129098230357824 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292376 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.339642 129098230357824 generateRocpd.cpp:583] writing SQL database for process 2527944 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:16:08.340444 129098230357824 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527944_results.db (UUID=0001fa83-d7f8-77f8-b6d7-96c18a7ec353)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.422744 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008031 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.423945 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001186 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.426110 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002150 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.436813 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008551 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.767697 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.330869 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.769953 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002239 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.769970 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.779741 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009764 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.779755 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.779761 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.779768 129098230357824 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.779869 129098230357824 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.780090 129098230357824 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.440448 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.783165 129098230357824 simple_timer.cpp:55] [rocprofv3] output generation ::     0.464210 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:08.783311 129098230357824 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.465600 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527944_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:10.354901 130288116449088 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.200048 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:10.355552 130288116449088 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:10.549402 130288116449088 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:10.640724 130288116449088 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.285172 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:10.663632 130288116449088 generateRocpd.cpp:583] writing SQL database for process 2527954 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:16:10.664455 130288116449088 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527954_results.db (UUID=0001fa83-e107-7107-8711-7a6b675334eb)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:10.748672 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008122 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:10.749861 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001146 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:10.751837 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001961 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:10.762021 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008335 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.282700 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.520656 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.285296 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002569 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.285316 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000003 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.301166 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.015841 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.301182 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.301188 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.301194 130288116449088 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.301316 130288116449088 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.301746 130288116449088 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.638114 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.306474 130288116449088 simple_timer.cpp:55] [rocprofv3] output generation ::     0.664196 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:11.306744 130288116449088 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.665969 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527954_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_vL1d_LDS/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:12.876403 127745145790272 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190510 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:12.877046 127745145790272 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.070866 127745145790272 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:16:13.165299 127745145790272 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288254 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.187758 127745145790272 generateRocpd.cpp:583] writing SQL database for process 2527969 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:16:13.188570 127745145790272 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/smc4124-25-mi210-3c48/2527969_results.db (UUID=0001fa83-eaea-7aea-b76a-5e635d579a6e)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.270242 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008061 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.271421 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001162 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.273072 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001637 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.283871 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008437 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.604443 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.320557 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.606766 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002306 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.606784 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.615997 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009206 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.616012 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.616019 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.616026 127745145790272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.616147 127745145790272 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.616489 127745145790272 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.428732 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.619642 127745145790272 simple_timer.cpp:55] [rocprofv3] output generation ::     0.452561 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:16:13.619740 127745145790272 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.454398 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_vL1d_LDS/MI200/out/pmc_1/2527969_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/mem_levels_vL1d_LDS/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
