Replies: 11 comments 12 replies
-
Hi @meta1360, thanks for reaching out. We’re going to try to get one of the uProf experts to help you.
-
Hi Justin,
I appreciate your help.
-Meysam
-
Hi @jychang48
-
Hi @jychang48, thanks.
-
Hi Justin,
Any luck finding somebody to discuss my uProf issues?
-
Hi @meta1360,
Here, we are collecting memory bandwidth information from all channels using |
-
Hi Gina,
Thanks for your help and reply. I have to attach the command you suggested
to an app, because I want to measure the memory bandwidth while the app is
running. The app is a simple Stream (AMD Spack):
./AMDuProfPcm -r -m memory -C -a -o out2.csv -- ./stream_triad
Stream reports ~700G, but uProf measures it as 500G.
Besides that, I am using the perf command as follows:
perf -e <event>
to get the memory accesses, and I am using this AMD document for the event
numbers:
***@***.***/
As mentioned in this document, I sum all of the
remote(local)_processor_write(read)_data_beats_cs(0-11)
counters to get the total memory accesses, but this time the number of
memory accesses is far higher (~10x to 15x) than the values Stream expects.
Working with these remote(local)_processor_write(read)_data_beats_cs(0-11)
counters is vital for me. Is there a way to get an acceptable estimate out
of them? If not, what do you suggest?
Your prompt reply would be much appreciated.
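The counter-summing step described in this message can be sketched in Python as follows. This is only an illustration under stated assumptions: the event-name pattern is taken from the names quoted above, while the function names and the sample counts are hypothetical. `bytes_per_beat` is deliberately a required argument, because the thread does not say how many bytes one data beat represents; that value has to come from the AMD documentation for the specific processor.

```python
def expected_event_names(channels=12):
    """Build the 4 * channels event names implied by the pattern
    remote(local)_processor_write(read)_data_beats_cs(0-11)."""
    names = []
    for locality in ("local", "remote"):
        for direction in ("read", "write"):
            for cs in range(channels):
                names.append(f"{locality}_processor_{direction}_data_beats_cs{cs}")
    return names


def total_bandwidth_gb_per_s(beat_counts, bytes_per_beat, elapsed_seconds):
    """Convert summed data-beat counts into an average bandwidth in GB/s.

    beat_counts: dict mapping event name -> raw count over the interval.
    bytes_per_beat: ASSUMPTION supplied by the caller, not confirmed here.
    elapsed_seconds: wall-clock length of the counting interval.
    """
    total_beats = sum(beat_counts.values())
    return total_beats * bytes_per_beat / elapsed_seconds / 1e9


# Hypothetical counts: every event reports 1,000,000 beats over 1 second.
sample_counts = {name: 1_000_000 for name in expected_event_names()}
# 64 is a placeholder value for illustration, not a confirmed hardware constant.
estimate = total_bandwidth_gb_per_s(sample_counts, bytes_per_beat=64, elapsed_seconds=1.0)
print(f"{estimate:.3f} GB/s")
```

If the estimate is off by a large constant factor compared with Stream, the bytes-per-beat value, the counting interval, and whether each count covers one core or the whole socket are the first things worth checking.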
On Mon, Jul 22, 2024 at 8:09 PM Gina Sitaraman ***@***.***> wrote:
Hi @meta1360 <https://github.com/meta1360>,
Thank you for reaching out to us and providing the command you used. From
the command, I see that you are trying to collect memory bandwidth from
core 0 only. This invocation will not use all memory channels on the
socket, so the achieved memory bandwidth will be low. Could you try using
all cores for this test, as shown in the command below?
./AMDuProfPcm -m memory -a -d 60 -o /tmp/pcmdata.csv
Here, we are collecting memory bandwidth information from all channels
using -a to specify collection from all cores of the socket for a
duration of 60 seconds.
Apologies for the delay in this response and I hope this clarifies the
issue you faced. For more details, the AMDuProf User Guide
<https://www.amd.com/content/dam/amd/en/documents/developer/version-4-2-documents/uprof/uprof-user-guide-v4.2.pdf>
seems to be a good reference.
-
Thanks for your help, I appreciate it.
-Meysam
On Tue, Jul 23, 2024 at 7:21 AM Gina Sitaraman ***@***.***> wrote:
I have requested help from the uProf team now for further details about
the hardware counters. Let's wait to hear from them.
-
Any update?
-
Hi @meta1360, the AMDuProf team investigated this and determined that running Stream for a longer duration helps AMDuProfPcm report memory bandwidth closer to the expected value. The calculation also includes Stream's setup and teardown phases, during which little bandwidth is consumed, and this lowers the average. This is expected behavior. Please try running Stream with NTIMES = 500 or so, which should give you the expected bandwidth.
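The averaging effect described above can be illustrated with a small Python sketch. All phase durations here are hypothetical, and the 70 GB/s steady-state figure is simply the expected value quoted in the original question; this is not how AMDuProfPcm computes its numbers internally, only a time-weighted average over made-up phases.

```python
def average_bandwidth(phases):
    """Time-weighted average bandwidth over a list of
    (duration_seconds, bandwidth_gb_per_s) phases."""
    total_time = sum(duration for duration, _ in phases)
    total_data = sum(duration * bandwidth for duration, bandwidth in phases)
    return total_data / total_time


# Hypothetical phases: setup, steady-state kernel, teardown.
short_run = [(2.0, 0.0), (1.0, 70.0), (1.0, 0.0)]
long_run = [(2.0, 0.0), (50.0, 70.0), (1.0, 0.0)]

print(f"short run: {average_bandwidth(short_run):.1f} GB/s")
print(f"long run:  {average_bandwidth(long_run):.1f} GB/s")
```

With these made-up numbers the short run averages well below the steady-state rate, while the long run approaches it, which is the reason a larger NTIMES brings the reported figure closer to what Stream prints.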
-
Hi @gsitaram Gina Sitaraman,
-
I am using the uProf tools (AMDuProfSys and AMDuProfPcm) to get the memory bandwidth on a Genoa CPU, which has 12 memory channels. I am running the Spack2 stream application, and while the bandwidth is expected to be ~70 GB/s, these tools report only around 20 GB/s. Can you please help me find out what I am doing wrong? My command is:
./AMDuProfSys.py collect --config df,umc -C 0 -o output taskset -c 0 ./stream