Commit 41013cd6 authored by Andy Polyakov's avatar Andy Polyakov
Browse files

PPC assembly pack: correct POWER9 results.



As it turns out originally published results were skewed by "turbo"
mode. VM apparently remains oblivious to dynamic frequency scaling,
and reports that processor operates at "base" frequency at all times.
While actual frequency gets increased under load.

Reviewed-by: default avatarRich Salz <rsalz@openssl.org>
(Merged from https://github.com/openssl/openssl/pull/6406)
parent 9a708bf9
Loading
Loading
Loading
Loading
+2 −1
Original line number Diff line number Diff line
@@ -40,7 +40,8 @@
#		CBC en-/decrypt	CTR	XTS
# POWER8[le]	3.96/0.72	0.74	1.1
# POWER8[be]	3.75/0.65	0.66	1.0
# POWER9[le]	3.05/0.65	0.65	0.80
# POWER9[le]	4.02/0.86	0.84	1.05
# POWER9[be]	3.99/0.78	0.79	0.97

$flavour = shift;

+1 −1
Original line number Diff line number Diff line
@@ -27,7 +27,7 @@
# PPC970/G5		9.29/+160%	?
# POWER7		8.62/+61%	3.38
# POWER8		8.70/+51%	3.36
# POWER9		6.61/+29%	3.30(*)
# POWER9		8.80/+29%	4.50(*)
#
# (*)	this is trade-off result, it's possible to improve it, but
#	then it would negatively affect all others;
+1 −1
Original line number Diff line number Diff line
@@ -30,7 +30,7 @@
# 2x aggregated reduction improves performance by 50% (resulting
# performance on POWER8 is 1 cycle per processed byte), and 4x
# aggregated reduction - by 170% or 2.7x (resulting in 0.55 cpb).
# POWER9 delivers 0.40 cpb.
# POWER9 delivers 0.51 cpb.

$flavour=shift;
$output =shift;
+1 −1
Original line number Diff line number Diff line
@@ -28,7 +28,7 @@
# PPC970		7.00/+114%	3.51/+205%
# POWER7		3.75/+260%	1.93/+100%
# POWER8		-		2.03/+200%
# POWER9		-		1.56/+150%
# POWER9		-		2.00/+150%
#
# Do we need floating-point implementation for PPC? Results presented
# in poly1305_ieee754.c are tricky to compare to, because they are for
+0 −1
Original line number Diff line number Diff line
@@ -26,7 +26,6 @@
# PPC970		6.03/+80%
# POWER7		3.50/+30%
# POWER8		3.75/+10%
# POWER9		2.80/+12%

$flavour = shift;

Loading