ragemem benchmark results thread !

	Bottom Previous Topic Next Topic
Register To Post

« 1 ... 4 5 6 (7) 8

Severin

Re: ragemem benchmark results thread !

Posted on: 2014/11/24 15:16 #122

Just can't stay away

Just for a laugh and as there isn't a post of the X1000...


RAGEMEM v0.37 - compiled 11/06/2010



CPU: P.A. Semi PWRficient PA6T-1682M B1 @ 1800 Mhz

Caches Sizes: L1: 64 KB - L2: 2048 KB - L3: none

Cache Line: 64



---> CPU <---

MAX MIPS:  3052



---> L1 <---

READ32:  6772 MB/Sec

READ64:  13525 MB/Sec

WRITE32: 6770 MB/Sec

WRITE64: 13520 MB/Sec



---> L2 <---

READ32:  3222 MB/Sec

READ64:  4633 MB/Sec

WRITE32: 2387 MB/Sec

WRITE64: 4047 MB/Sec



---> RAM <---

READ32:  2831 MB/Sec

READ64:  4046 MB/Sec

WRITE32: 2374 MB/Sec

WRITE64: 2343 MB/Sec

WRITE: 340 MB/Sec (Tricky)



---> VIDEO BUS <---

READ:  66 MB/Sec

WRITE: 160 MB/Sec

Amiga user since 1985
AOS4, A-EON, IBrowse & Alinea Betatester

Ps. I hate the new amigans website. <shudder>

noXLar

Re: ragemem benchmark results thread !

Posted on: 2014/11/25 16:56 #123

Not too shy to talk

@328gts

i have an sam460ex :)

i did some benchmarking before and after installing latest RadeonHD drivers. but i was using SysMon ragemem and didn't fine the log, but i have the Mips score.

i also was using RadeonHD 0.55 to start with.

before update: ati drv 0.55

GfxBench2D score: 5903
ragemem: 2291 mips

after installed ati driver 1.2

GfxBench2D score: 5890
ragemem: 2308

after installed ati driver 2.4

GfxBench2D score: 5880
ragemem: 2308

Sam460ex 2GB 120Gb SSD&1Tb HD7750 Envy24HT A-Eon Drv 2.10+Warp3D New Uboot
Apollo v4 Standalone

ddni

Re: ragemem benchmark results thread !

Posted on: 2015/5/28 21:15 #124

Just can't stay away

@Severin

Does anyone know why the RAM write speeds on X1000 are so appalling in ragemem? The x1000 RAM Write speeds are the slowest of all the NG Amigas.

AmigaOne X1000.
Radeon RX550

http://www.tinylife.org.uk/

Antique

Re: ragemem benchmark results thread !

Posted on: 2015/5/28 21:34 #125

Home away from home

@ddni

I'm guessing you're thinking about the tricky write speed? As the other ram speeds are most often better than other machines.

X5000

Hans

Re: ragemem benchmark results thread !

Posted on: 2015/5/28 23:17 #126

Home away from home

@ddni

Quote:

@Severin

Does anyone know why the RAM write speeds on X1000 are so appalling in ragemem? The x1000 RAM Write speeds are the slowest of all the NG Amigas.

Actually, only the "tricky" write speed is "appalling," the others are very good.

I'm guessing that "tricky" means that it's using the dcba cache instruction to prepare a cacheline for a write without wasting bandwidth reading it from RAM. While this boosts performance on 32-bit CPUs, it's an illegal instruction on 64-bit (G5) CPUs like the PA6T in the A1-X1000. Hence, that instruction triggers an exception handler, and that slows things right down.

Hans

Join Kea Campus' Amiga Corner and support Amiga content creation
https://keasigmadelta.com/ - see more of my work

ddni

Re: ragemem benchmark results thread !

Posted on: 2015/5/29 5:22 #127

Just can't stay away

Interesting. Thanks for the explanation.
Do existing programs make that call in real day to day usage?

AmigaOne X1000.
Radeon RX550

http://www.tinylife.org.uk/

Hans

Re: ragemem benchmark results thread !

Posted on: 2015/5/29 22:44 #128

Home away from home

@ddni
Quote:

Interesting. Thanks for the explanation.
Do existing programs make that call in real day to day usage?

AFAIK, the vast majority of programs don't use it. A developer has to be pretty concerned with performance to even consider using such lowlevel techniques. Dcba has to be used carefully, because it operates on an entire cacheline. That means handling special cases so you don't accidentally wipe memory before/after the data that you're working on. Besides, the "dcbz" instruction is similar enough, and is safe to use on G5 CPUs (although you don't get all the speed benefits because of the larger cacheline size).

It's possible that the kernel may use it on hardware that supports it. IIRC, some functions such as the memcopy ones are optimised on a per-CPU basis.

Hans

Join Kea Campus' Amiga Corner and support Amiga content creation
https://keasigmadelta.com/ - see more of my work

angelheart

Re: ragemem benchmark results thread !

Posted on: 2015/6/8 10:57 #129

Just popping in

@Severin

Is it me or the X1000 results are low in MIPS and video Bus ?

no one noticed 7454 3.3 @ 1400 MHZ = 4194 MIPS ?

KimmoK

Re: ragemem benchmark results thread !

Posted on: 2015/6/8 13:10 #130

Not too shy to talk

Went through the thread but did not spot x1000 results.
Someone with better eyes.... please?

- Kimmo
--------------------------PowerPC-Advantage------------------------
"PowerPC Operating Systems can use a microkernel architecture with all it�s advantages yet without the cost of slow context switches." - N. Blachford

ddni

Re: ragemem benchmark results thread !

Posted on: 2015/6/8 16:28 #131

Just can't stay away

@KimmoK

AmigaOne X1000.
Radeon RX550

http://www.tinylife.org.uk/

KimmoK

Re: ragemem benchmark results thread !

Posted on: 2015/6/9 6:59 #132

Not too shy to talk

@ddni
Thanks!

So...
>7454 1400 MHZ = 4194 MIPS ?

= 3 MIPS/Mhz while it should reach 2,3 MIPS/Mhz ??

PA6T 1800 Mhz = 3082 MIPS

= 1,7MIPS/Mhz while it should reach 2,2MIPS/Mhz

So the test code or compiler perhaps are not the same in those tests???

(( I think 7454 should have same core & MIPS/Mhz as these http://www.freescale.com/webapp/sps/s ... _summary.jsp?code=MPC7448 ))

+++++++++++++
@angelheart
VideoRAM speeds depend greatly on what methode is used.
http://www.hdrlab.org.nz/benchmark/gf ... 2d/OS/AmigaOS/Result/1551
So it seems x1000 can reach 1400MB/s currently.

Edited by KimmoK on 2015/6/9 7:22:12
Edited by KimmoK on 2015/6/9 7:23:13

angelheart

Re: ragemem benchmark results thread !

Posted on: 2015/6/9 18:06 #133

Just popping in

@KimmoK

Once can compare different results from different apps all day long.

I was referring to post #122 X1000 result for RAGEMEM v0.37 -Video bus :

---> VIDEO BUS <---
READ: 66 MB/Sec
WRITE: 160 MB/Sec

compare against

post #60 CPU: AMCC PPC460EX 1.2 @ 1166 Mhz

---> VIDEO BUS <---
READ: 72 MB/Sec
WRITE: 261 MB/Sec

Then think about price difference.

surely bus/bandwidth is better on AMCC for this value - so that needs investigating to improve on future systems.

If we can collate all results into CSV would be great, then a small code can test a games usage of resources and offer recommended system / card requirements.

Antique

Re: ragemem benchmark results thread !

Posted on: 2015/6/9 18:32 #134

Home away from home

@angelheart
This is with 4.1 FE

RAGEMEM v0.37 - compiled 11/06/2010

CPU: P.A. Semi PWRficient PA6T-1682M B1 @ 1800 Mhz
Caches Sizes: L1: 64 KB - L2: 2048 KB - L3: none
Cache Line: 64

---> CPU <---
MAX MIPS: 3082

---> L1 <---
READ32: 6845 MB/Sec
READ64: 13670 MB/Sec
WRITE32: 6846 MB/Sec
WRITE64: 13668 MB/Sec

---> L2 <---
READ32: 3339 MB/Sec
READ64: 4979 MB/Sec
WRITE32: 2539 MB/Sec
WRITE64: 4042 MB/Sec

---> RAM <---
READ32: 2958 MB/Sec
READ64: 4150 MB/Sec
WRITE32: 2716 MB/Sec
WRITE64: 3507 MB/Sec
WRITE: 369 MB/Sec (Tricky)

---> VIDEO BUS <---
READ: 177 MB/Sec
WRITE: 161 MB/Sec

X5000

KimmoK

Re: ragemem benchmark results thread !

Posted on: 2015/6/15 7:53 #135

Not too shy to talk

@thread

I wonder if ragemem is (hand) optimized for PA6T?
To my understanding we do not yet have gcc that can do PA6T optimizations.

K-L

Re: ragemem benchmark results thread !

Posted on: 2015/6/15 16:02 #136

Just can't stay away

@Thread

Crisot gave an explanation (in French) on Amiga-NG:

"Sur SAM460, la RAM tourne à la vitesse du L2... Soit le L2 du 460EX est pas terrible, soit j'ai loupé un truc (mais quoi...)

(On Sam460, RAM goes as fast as L2 cache. Either L2 from the Sam460 is rubbish or I missed someting)

Sur X1000, c'est encore pire, l'écriture RAM est supérieure à l'écriture L2, ce qui est matériellement impossible, la RAM passant forcément par le L2... Je me demande si ma loop est pas trop courte pour bencher à ces vitesses là.

(It's even worse on X1000 : writing in RAM is faster than writing in L2, which in impossible since RAM goes through the L2 cache... I'm wondering if my Loop is too short to bench such speeds)

Pour le tricky super lent, c'est parceque mon code n'est pas adapté à la Cache Line du PA6T.

(Regarding the super slow tricky test, it's because my code is not adapted to the PA6T Cache Line)

Pour les MIPS. Je n'ai que des hypothèses mais pas de réponse:

(Regarding MIPS, I have only hypothesis but no answers)

-Déjà, mon bench n'est pas multi-threadé, il n'utilise qu'un seul core, donc il faut multiplier par 2 ce chiffre.

(First, my bench is not multithreaed, it uses only one core so we must multiply x2 this value)

-Dans tous les cas le PA6T@2.0 Ghz est donné constructeur pour 8800 MIPS, ça fait 3960 MIPS par core@1.8 Ghz.

(Anyway, PA6T@2.0Ghz should give, from the manufaturer, 8800MIPS, so 3960 MIPS by core@1.8Ghz.)

-Ma loop n'est pas du tout optimisée pour le PA6T, elle a été écrite avec les datasheet des 604e/G3/G4/460EX. Je pense que le PA6T ne dispatche pas du tout les instructions de la même manière, mais sans datasheet je peux rien écrire.

(My loop is not at all optimized for the PA6T, it was written with 604e/G3/G4/460EX datasheets. I suspect the PA6T to not dispatch instructions the same way than the others but without the datasheets, I cannot be sure nor enhance Ragemem[i]).

-Au dela des MIPS la rapidité des caches et de la ram ont une énorme influence sur les performances CPU. A titre d'exemple, lire un seul octet en mémoire sur un XE (240 mo/sec) équipé d'un G4@1.0Ghz (3000 MIPS), c'est en temps machine l'équivalent de 104 instructions perdues (!!!) pendant lesquelles le CPU attend. En clair, un G4 attend continuellement après le reste de la machine, si on pouvait convertir le temps machine perdu en taches ménagères, la pelouse serait toujours tondue et la maison propre.

([i]Beyond MIPS, caches and RAM speed make a huge difference in CPU performances. For example, reading only on byte in memory on the XE (240MB/s) with a G4@1Ghz (3000MIPS), is equivalent of 104 lost instructions (!!!) while the CPU is waiting. To simplfy : a G4 is always waiting for the the other parts of the comuter. If we could convert the lost time by the system in household tasks, the house would always be clean.)

Je pense pas que core à core (ahah) le PA6T soit beaucoup plus puissant que le G4, mais tout ce qui tourne autour (caches, bus, ram) lui laisse infiniment plus de marge de manoeuvre. Le G4 ira beaucoup moins vite, mais paradoxalement glandera beaucoup plus.

(I don't think that the PA6T core is much more powerful than the G4 core, but everything that makes the X1000 (caches, FSB, RAM) gives him more latitude. G4 will be way slower but, paroxally, will wait a lot more for the rest of the architecture.)."

I hope I have translated everything correctly

Edited by K-L on 2015/6/15 17:10:19

--
AmigaONE X1000 and Radeon RX 560
Sam460 and Radeon RX 560
MiST
FPGA Replay + 060 DB

328gts

Re: ragemem benchmark results thread !

Posted on: 2015/6/16 18:06 #137

Home away from home

wow just realized I never updated my old ragemem thread with results from my X1000 as I started it with my old Samflex@800.

well better late than never

X1000,4GB ram, Radeon HD7750-1GB VRam (low profile, single slot)

[img width=300]

RageMem-June14-15-HD7750 [/url], [/img]

_______________________________
c64-dual sids, A1000, A1200-060@93, A4000-CSMKIII
PiStorm32 & Catweasel MK4+= Amazing
! My Master Miggies-Amiga1000 & AmigaONE X1000 !
mancave-ramblings

328gts

Re: ragemem benchmark results thread !

Posted on: 2015/6/16 19:46 #138

Home away from home

well now, very interesting seeing my much stronger Gigabyte HD7950, 3GB Ram performing worse here compared to my older single slot low profile HD7750 above??

also noticed that SuperTuxkart ran very smooth with my HD7750 and is running very slow & choppy with my HD7950?? more testing later with my 7950

[img]

RageMem-HD7950-Jun16-15 by G S, [/img]

_______________________________
c64-dual sids, A1000, A1200-060@93, A4000-CSMKIII
PiStorm32 & Catweasel MK4+= Amazing
! My Master Miggies-Amiga1000 & AmigaONE X1000 !
mancave-ramblings

Rob

Re: ragemem benchmark results thread !

Posted on: 2015/6/17 19:50 #139

Not too shy to talk

@angelheart

Quote:

Surely bus/bandwidth is better on AMCC for this value - so that needs investigating to improve on future systems.

The newer AMMC PowerPC SOCs only scale to 1.3Ghz.

The LSI Axxia PowerPC SOCs look much more interesting on paper. They scale as high as 1.8Ghz on a 4 core SOC and the 476fp core is rated at 2.7 DMIPs per Mhz. It also has bigger L2 cache than the 465 cores as well as large L3 cache.
It also has a SIMD unit although I'm not sure if this is Altivec/VMX* although it would make sense for IBM to have designed a a new vector unit for the 476fp.
LSI don't specify what version of PCIe is used on there product pages but I did see a product breif that sugested that it was PCIe 3.0

A system based on that faster Axxia SOCs should be able to trounce the X1000 in every aspect.

Both the Axxia ACP3500 and ACP3400 have a 1295 pin/ball count so in theory should be able to use the same board for both. That would give a good range of speeds and cores. The 3500 can have 2 cores at 1.1Ghz, 2 cores at 1.26Ghz, 4 cores at 1.26Ghz and 6 cores at 1.26Ghz while the 3400 can have 2 cores at 1.6Ghz or 4 cores at 1.8Ghhz

*VMX is IBM's name for Altivec.

monomango

Re: ragemem benchmark results thread !

Posted on: 2019/3/22 8:40 #140

Just popping in

@TSK

HI, I have the exact same machine, and G3 CPU.
What voltage do you run your CXe CPU on ?

I'm running mine at 666MHz at 1.64V, had to step it up from 1.59V.

Have you tried any higher ?

Thanks.

AmigaOne G3-SE : G3@667MHz, Radeon 9200 AGP 256MB, 2GB Reg 133MHz RAM, RTL8169 gigabit Ethernet, NEC USB 2.0, Sii0680/Sii3114, PS/2 keyboard and mouse
AmigaOne X5000 : P5040, Radeon RX580, 16GB Reg RAM, ESI Juli@
Both: AmigaOS 4.1 FE Update 2, Linux

monomango

Re: ragemem benchmark results thread !

Posted on: 2019/3/22 8:41 #141

Just popping in

Anyone with the AmigaOne G3-SE 600MHz machine ?

I'm running mine at 666MHz at 1.64V, had to step it up from 1.59V to run stable.
Has anyone tired higher?

Also, what is a good calibration for the CPUTemp docky ? In terms of showing accurate temperature.

Thanks.

Register To Post	« 1 ... 4 5 6 (7) 8
	Top Previous Topic Next Topic

Currently Active Users Viewing This Thread: 1 ( 0 members and 1 Anonymous Users )