Rocprofiler-Compute version: 3.7.0
Profiler choice: rocprofiler-sdk
Output directory: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/tests/workloads/mem_levels_HBM/MI200
Target: MI210
Command: ./tests/vcopy -n 1048576 -b 256 -i 3
Kernel Selection: None
Dispatch Selection: None
Filtered sections: All

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Collecting Performance Counters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Generating native tool project using command: cmake -S /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib -B /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
-- Checking for module 'libdw'
--   Package 'libdw', required by 'virtual:world', not found
-- Could NOT find libdw (missing: libdw_LIBRARY libdw_INCLUDE_DIR)
-- {fmt} version: 12.1.0
-- Build type:
-- Configuring done (0.1s)
-- Generating done (0.0s)
-- Build files have been written to: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build
Building native tool using command: cmake --build /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build --parallel
[  0%] Built target gsl_assert
[ 33%] Built target fmt
[100%] Built target rocprofiler-compute-tool
Searching /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src by lib/_build/lib/librocprofiler-compute-tool.so for native collector
Using native collector: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
Using native counter collection tool: /home/xuchen/dev/rocm-systems/projects/rocprofiler-compute/src/lib/_build/lib/librocprofiler-compute-tool.so
[profiling] Iteration multiplexing: Disabled
[Run 1/13][Approximate profiling time left: pending first measurement...]
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_0.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:07.429390 137693620150080 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192135 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:07.430024 137693620150080 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:07.625161 137693620150080 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:07.708506 137693620150080 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278482 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:07.730875 137693620150080 generateRocpd.cpp:583] writing SQL database for process 2526572 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:07.731676 137693620150080 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526572_results.db (UUID=0001fa7e-5762-7762-b704-7e4e34e296f0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:07.812599 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007876 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:07.813660 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001045 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:07.815210 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001535 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:07.825144 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008058 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.113609 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.288451 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.115662 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002030 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.115680 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.125920 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.010233 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.125934 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.125940 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.125947 137693620150080 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.126073 137693620150080 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000118 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.126285 137693620150080 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.395411 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.130112 137693620150080 simple_timer.cpp:55] [rocprofv3] output generation ::     0.420022 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:08.130212 137693620150080 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.421656 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526572_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 2/13][Approximate profiling time left: 24 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_1.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:09.676463 138524028714816 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191028 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:09.677096 138524028714816 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:09.869148 138524028714816 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:09.953636 138524028714816 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.276539 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:09.975842 138524028714816 generateRocpd.cpp:583] writing SQL database for process 2526581 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:09.976626 138524028714816 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526581_results.db (UUID=0001fa7e-602a-702a-9169-b03fd11d8fe0)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.056253 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007931 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.057306 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001034 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.058838 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001518 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.074876 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.014145 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.385508 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.310615 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.387804 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002266 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.387821 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.397642 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009813 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.397657 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.397664 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.397671 138524028714816 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.397788 138524028714816 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000110 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.398054 138524028714816 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.422212 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.401281 138524028714816 simple_timer.cpp:55] [rocprofv3] output generation ::     0.446098 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:10.401392 138524028714816 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.447717 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526581_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 3/13][Approximate profiling time left: 22 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_2.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:11.947482 130763983429440 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190094 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:11.948101 130763983429440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.140010 130763983429440 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:12.222552 130763983429440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.274452 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.244924 130763983429440 generateRocpd.cpp:583] writing SQL database for process 2526590 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:12.245776 130763983429440 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526590_results.db (UUID=0001fa7e-690a-790a-beef-6fbad26ed299)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.326528 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007840 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.327638 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001093 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.329247 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001594 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.339355 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008161 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.638737 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.299365 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.640794 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002040 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.640812 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.649808 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008989 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.649824 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.649830 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.649837 130763983429440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.649986 130763983429440 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000115 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.650238 130763983429440 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.405314 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.653349 130763983429440 simple_timer.cpp:55] [rocprofv3] output generation ::     0.429168 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:12.653442 130763983429440 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.430852 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526590_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 4/13][Approximate profiling time left: 20 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_3.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:14.189061 131342306729792 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190595 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:14.189643 131342306729792 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.382730 131342306729792 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:14.467796 131342306729792 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278153 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.490048 131342306729792 generateRocpd.cpp:583] writing SQL database for process 2526598 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:14.490848 131342306729792 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526598_results.db (UUID=0001fa7e-71cb-71cb-9f5c-48b96ab9e212)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.573272 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007972 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.574391 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.576384 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001975 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.586594 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008200 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.876226 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.289613 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.878525 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002267 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.878545 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.887600 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009044 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.887617 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.887659 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.887673 131342306729792 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.887777 131342306729792 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.887993 131342306729792 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.397945 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.891043 131342306729792 simple_timer.cpp:55] [rocprofv3] output generation ::     0.421746 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:14.891137 131342306729792 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.423286 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526598_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 5/13][Approximate profiling time left: 18 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_4.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:16.418312 138058191150912 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.191253 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:16.418965 138058191150912 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:16.613647 138058191150912 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:16.719807 138058191150912 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.300842 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:16.741868 138058191150912 generateRocpd.cpp:583] writing SQL database for process 2526608 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:16.742646 138058191150912 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526608_results.db (UUID=0001fa7e-7a80-7a80-b976-3e970f257569)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:16.825629 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007960 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:16.826818 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001172 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:16.828558 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001726 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:16.839050 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008337 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.119453 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.280387 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.121809 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002327 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.121827 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.131036 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009202 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.131052 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.131058 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.131065 138058191150912 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.131178 138058191150912 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000104 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.131420 138058191150912 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.389552 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.134505 138058191150912 simple_timer.cpp:55] [rocprofv3] output generation ::     0.413211 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:17.134610 138058191150912 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.414760 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526608_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 6/13][Approximate profiling time left: 15 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_5.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:18.630808 131899173781312 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.184730 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:18.631403 131899173781312 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:18.825274 131899173781312 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:18.910970 131899173781312 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.279567 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:18.933223 131899173781312 generateRocpd.cpp:583] writing SQL database for process 2526617 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:18.934010 131899173781312 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526617_results.db (UUID=0001fa7e-832a-732a-ba52-6843e0d42fce)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.015707 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007696 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.016832 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001106 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.018425 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001576 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.028678 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008265 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.037158 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.008464 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.039094 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.001920 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.039113 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.047831 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008703 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.047846 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.047852 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.047858 131899173781312 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.047958 131899173781312 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000092 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.048152 131899173781312 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.114929 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.051137 131899173781312 simple_timer.cpp:55] [rocprofv3] output generation ::     0.138735 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:19.051193 131899173781312 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.140170 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526617_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 7/13][Approximate profiling time left: 13 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_SQC_DCACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:20.566317 131355642437440 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188525 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:20.566873 131355642437440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:20.759147 131355642437440 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:20.855252 131355642437440 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.288379 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:20.877914 131355642437440 generateRocpd.cpp:583] writing SQL database for process 2526626 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:20.878715 131355642437440 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526626_results.db (UUID=0001fa7e-8ab6-7ab6-aa06-b53cd2388bfe)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:20.961980 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008084 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:20.963112 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001116 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:20.965114 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001988 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:20.975592 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008448 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.384738 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.409130 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.386889 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002127 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.386906 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.395957 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009043 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.395973 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.395979 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.395986 131355642437440 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.396125 131355642437440 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000130 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.396409 131355642437440 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.518495 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.399500 131355642437440 simple_timer.cpp:55] [rocprofv3] output generation ::     0.542520 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:21.399636 131355642437440 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.544336 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526626_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 8/13][Approximate profiling time left: 11 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_SQC_ICACHE_INFLIGHT_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:22.928721 127209772695360 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.188365 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:22.929306 127209772695360 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.123877 127209772695360 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:23.226239 127209772695360 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.296933 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.248774 127209772695360 generateRocpd.cpp:583] writing SQL database for process 2526634 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:23.249562 127209772695360 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526634_results.db (UUID=0001fa7e-93f1-73f1-9202-b39872663251)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.331160 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008052 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.332276 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.334205 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001914 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.344500 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008319 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.758217 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.413702 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.760442 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002206 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.760458 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.769330 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008865 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.769343 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.769349 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.769356 127209772695360 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.769463 127209772695360 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000100 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.769669 127209772695360 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.520895 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.772747 127209772695360 simple_timer.cpp:55] [rocprofv3] output generation ::     0.544740 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:23.772864 127209772695360 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.546573 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526634_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 9/13][Approximate profiling time left: 8 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_SQ_IFETCH_LEVEL_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:25.355867 133547856830272 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.198354 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:25.356438 133547856830272 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:25.551756 133547856830272 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:25.647194 133547856830272 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.290757 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:25.669934 133547856830272 generateRocpd.cpp:583] writing SQL database for process 2526642 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:25.670779 133547856830272 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526642_results.db (UUID=0001fa7e-9d62-7d62-b508-6c9afe3084cd)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:25.753551 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008179 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:25.754666 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:25.756722 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.002039 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:25.767228 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008511 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.356855 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.589609 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.358953 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002073 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.358972 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.368706 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009720 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.368722 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.368729 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.368736 133547856830272 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.368877 133547856830272 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000133 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.369176 133547856830272 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.699243 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.372200 133547856830272 simple_timer.cpp:55] [rocprofv3] output generation ::     0.723202 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:26.372342 133547856830272 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.725092 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526642_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 10/13][Approximate profiling time left: 6 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_LDS_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:27.914541 140353848303424 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.190532 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:27.915131 140353848303424 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.107723 140353848303424 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:28.201453 140353848303424 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.286323 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.223972 140353848303424 generateRocpd.cpp:583] writing SQL database for process 2526650 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:28.224757 140353848303424 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526650_results.db (UUID=0001fa7e-a768-7768-8d76-fb3d4f963d70)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.305884 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007984 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.306979 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001079 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.308905 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001911 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.318951 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008102 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.671851 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.352885 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.673974 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.673991 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.682696 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008697 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.682711 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.682718 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.682725 140353848303424 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.682832 140353848303424 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000097 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.683059 140353848303424 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.459086 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.686037 140353848303424 simple_timer.cpp:55] [rocprofv3] output generation ::     0.482908 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:28.686140 140353848303424 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.484640 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526650_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 11/13][Approximate profiling time left: 4 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_SMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:30.225397 138027025522496 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.192873 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:30.225973 138027025522496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.424232 138027025522496 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:30.518592 138027025522496 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.292619 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.541390 138027025522496 generateRocpd.cpp:583] writing SQL database for process 2526658 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:30.542175 138027025522496 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526658_results.db (UUID=0001fa7e-b06d-706d-ba0e-f9e75db82752)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.624512 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008048 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.625618 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001090 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.627615 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001982 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.637883 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008296 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.967300 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.329400 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.969383 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002066 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.969400 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.979103 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009696 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.979117 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.979123 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.979129 138027025522496 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.979238 138027025522496 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.979443 138027025522496 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.438053 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.982426 138027025522496 simple_timer.cpp:55] [rocprofv3] output generation ::     0.462175 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:30.982517 138027025522496 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.463871 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526658_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 12/13][Approximate profiling time left: 2 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_SQ_INST_LEVEL_VMEM_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:32.536348 133195591147328 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.197078 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:32.536967 133195591147328 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:32.731833 133195591147328 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:32.815530 133195591147328 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.278562 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:32.838055 133195591147328 generateRocpd.cpp:583] writing SQL database for process 2526667 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:32.838849 133195591147328 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526667_results.db (UUID=0001fa7e-b970-7970-b4a1-256b80729a82)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:32.921313 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.008156 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:32.922437 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001107 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:32.924448 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001996 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:32.935046 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008632 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.455705 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.520644 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.457768 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002038 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.457785 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.466809 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.009016 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.466823 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.466829 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.466836 133195591147328 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000002 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.466946 133195591147328 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000103 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.467175 133195591147328 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.629119 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.470169 133195591147328 simple_timer.cpp:55] [rocprofv3] output generation ::     0.652919 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:33.470288 133195591147328 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.654710 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526667_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
[Run 13/13][Approximate profiling time left: 0 seconds]...
[profiling] Current input file: tests/workloads/mem_levels_HBM/MI200/perfmon/pmc_perf_SQ_LEVEL_WAVES_ACCUM.yaml
   |-> [rocprofiler-sdk] [rocprofiler-compute] [rocprofiler_configure] (priority=1) is using rocprofiler-sdk v1.1.0 (1.1.0)
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:35.003187 138129634115392 simple_timer.cpp:55] [rocprofv3] tool initialization ::     0.189590 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool init
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:35.003782 138129634115392 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.210955 138129634115392 tool.cpp:2423] HSA version 8.21.0 initialized (instance=0)
   |-> [rocprofiler-sdk] [mvcopy testing on GCD 0
   |-> [rocprofiler-sdk] Finished allocating vectors on the CPU
   |-> [rocprofiler-sdk] Finished allocating vectors on the GPU
   |-> [rocprofiler-sdk] Finished copying vectors to the GPU
   |-> [rocprofiler-sdk] sw thinks it moved 1.000000 KB per wave
   |-> [rocprofiler-sdk] Total threads: 1048576, Grid Size: 4096 block Size:256, Wavefronts:16384:
   |-> [rocprofiler-sdk] Launching the  kernel on the GPU
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished executing kernel
   |-> [rocprofiler-sdk] Finished copying the output vector from the GPU to the CPU
   |-> [rocprofiler-sdk] Releasing GPU memory
   |-> [rocprofiler-sdk] Releasing CPU memory
   |-> [rocprofiler-sdk] [0;33mW20260526 19:10:35.309568 138129634115392 simple_timer.cpp:55] [rocprofv3] './tests/vcopy -n 1048576 -b 256 -i 3' ::     0.305786 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.332158 138129634115392 generateRocpd.cpp:583] writing SQL database for process 2526675 on node 2976770398
   |-> [rocprofiler-sdk] [m[0;31mE20260526 19:10:35.332952 138129634115392 generateRocpd.cpp:606] Opened result file: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/smc4124-25-mi210-3c48/2526675_results.db (UUID=0001fa7e-c31a-731a-963d-577883fd678d)
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.415075 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_string             ::     0.007924 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.416203 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_node          ::     0.001112 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.417811 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_process       ::     0.001593 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.428066 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_agent         ::     0.008280 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.746761 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_info_pmc           ::     0.318680 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.748880 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd kernel info        ::     0.002101 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.748898 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_region             ::     0.000004 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.757548 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_kernel_dispatch    ::     0.008643 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.757563 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_pmc_event          ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.757569 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_copy        ::     0.000000 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.757575 138129634115392 simple_timer.cpp:55] SQLite3 generation :: rocpd_memory_allocate    ::     0.000001 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.757677 138129634115392 simple_timer.cpp:55] SQLite3 generation :: SQL indexing             ::     0.000095 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.757898 138129634115392 simple_timer.cpp:55] SQLite3 generation :: total                    ::     0.425740 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.760886 138129634115392 simple_timer.cpp:55] [rocprofv3] output generation ::     0.449689 sec
   |-> [rocprofiler-sdk] [m[0;33mW20260526 19:10:35.760986 138129634115392 simple_timer.cpp:55] [rocprofv3] tool finalization ::     0.451371 sec
   |-> [rocprofiler-sdk] [m[rocprofiler-compute] In tool fini
   |-> [rocprofiler-sdk] [rocprofiler-compute] [write_counters] Counter collection data has been written to: tests/workloads/mem_levels_HBM/MI200/out/pmc_1/2526675_native_counter_collection.csv
Intermediate results_*.csv generation from rocpd databases is deprecated and will be replaced with automatic .db file retention in a future release.
PC sampling data collection skipped as block 21 is not specified.
[roofline] Checking for roofline.csv in tests/workloads/mem_levels_HBM/MI200
[roofline] Benchmark execution failed: 'L1'. Skipping roofline.
