Differences between version 37 and previous revision of PgBench.
Newer page: version 37 | Last edited on Monday, June 19, 2006 10:04:07 pm | by GuyThornley
Older page: version 36 | Last edited on Monday, June 19, 2006 4:38:55 pm | by GuyThornley
@@ -1,5 +1,5 @@
-This is a scratch pad for some [PostgreSQL] benchmarks. The contributed utility <tt>pgbench</tt> is used for the testing.
+This is a scratch pad for some [PostgreSQL] 8.0 benchmarks. The contributed utility <tt>pgbench</tt> is used for the testing.
For most of the testing, important parts of the [PostgreSQL] configuration used are:
<verbatim>
shared_buffers = 23987
@@ -18,13 +18,15 @@
default_with_oids = off
stats_start_collector = false
</verbatim>
-Exceptions will be listed as the tests are performed.
+Exceptions are noted as the tests are performed.
-The <tt>pgbench</tt> test database was created with the <tt>-s100</tt> scale factor option. This results in a fresh database of about 1.4GB. Consecutive runs of <tt>pgbench</tt> grow the database, however. All test runs were executed with the <tt>-c100</tt> option for 100 connections. The transactions per connection was adjusted as needed to give a stable test result, without obvious effects of caching. Typical settings were <tt>-t100</tt> to <tt>-t1000</tt>.
+The <tt>pgbench</tt> test database was created with the <tt>-s600</tt> scale factor option. This results in a fresh database of about 8.6GiB, along with 1.3GiB of WAL. The test database was then backed up to a <tt>.tar.gz</tt> file so it could easily be restored between test runs.
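For reference, initializing and snapshotting a database at this scale might look like the following sketch; the data directory path, database name, and use of <tt>pg_ctl</tt> are illustrative assumptions, not details recorded from these tests:
<verbatim>
# Build the pgbench tables at scale factor 600 (roughly 8.6GiB)
pgbench -i -s600 pgbench

# Archive the stopped cluster once, then unpack it before each test run
pg_ctl -D /var/lib/postgresql/data stop
tar -czf pgbench-s600.tar.gz -C /var/lib/postgresql data
tar -xzf pgbench-s600.tar.gz -C /var/lib/postgresql
pg_ctl -D /var/lib/postgresql/data start
</verbatim>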
-The <tt>pgbench</tt> client was actually run over a 100Mbit, full-duplex network connection from a client machine for most of the testing. Running <tt>pgbench</tt> remotely has not measurably degraded the performance. The client machine is a dual 3.06GHz Xeon running Linux 2.4.27.
+Each test was executed 5 times in sequence, and the median result is reported. All tests were executed with the <tt>-c100</tt> option for 100 connections. The transaction count per connection was adjusted as necessary so that each single test would span several minutes. Typical settings were <tt>-t500</tt> to <tt>-t1000</tt>.
+
+The <tt>pgbench</tt> client was actually run over a 100Mbit, full-duplex network connection from a client machine for all of the testing. Running <tt>pgbench</tt> remotely has not measurably degraded the performance. The client machine is a dual 3.06GHz Xeon running Linux 2.4.27. [SSL] encryption was disabled.
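A typical test invocation from the client machine would then be along these lines; the server hostname and database name are placeholders, not recorded from the actual tests:
<verbatim>
# 100 clients, 500 transactions each, run against the remote server
pgbench -h dbserver -c100 -t500 pgbench
</verbatim>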
The base hardware:
* [HP] DL380 G4
@@ -36,9 +38,9 @@
The base software:
* LinuxKernel 2.4.27 from [Debian]’s <tt>kernel-image-2.4.27-2-686-smp</tt> [Package]
-* Using [Ext3] with <tt>ordered</tt> data mode
+* Using [Ext3] with 128MB journal and <tt>ordered</tt> data mode
On with the testing!
!! Results: 4-disk configurations
@@ -59,8 +61,15 @@
number of transactions per client: 1000
number of transactions actually processed: 100000/100000
tps = 114.351628 (including connections establishing)
tps = 114.418688 (excluding connections establishing)
+
+ scaling factor: 600
+ number of clients: 100
+ number of transactions per client: 500
+ number of transactions actually processed: 50000/50000
+ tps = 124.728272 (including connections establishing)
+ tps = 124.885813 (excluding connections establishing)
</pre>
* Data array: RAID5, 4x 72GB 10k RPM%%%
WAL array: On data array%%%
@@ -79,8 +88,15 @@
number of transactions per client: 1000
number of transactions actually processed: 100000/100000
tps = 118.365607 (including connections establishing)
tps = 118.442501 (excluding connections establishing)
+
+ scaling factor: 600
+ number of clients: 100
+ number of transactions per client: 500
+ number of transactions actually processed: 50000/50000
+ tps = 124.696768 (including connections establishing)
+ tps = 124.870136 (excluding connections establishing)
</pre>
* Data array: RAID5, 4x 72GB 10k RPM%%%
WAL array: On data array%%%
@@ -99,8 +115,15 @@
number of transactions per client: 500
number of transactions actually processed: 50000/50000
tps = 103.447886 (including connections establishing)
tps = 103.556659 (excluding connections establishing)
+
+ scaling factor: 600
+ number of clients: 100
+ number of transactions per client: 500
+ number of transactions actually processed: 50000/50000
+ tps = 114.885220 (including connections establishing)
+ tps = 115.020971 (excluding connections establishing)
</pre>
* Data array: RAID5, 4x 72GB 10k RPM%%%
WAL array: On data array%%%
@@ -119,8 +142,15 @@
number of transactions per client: 500
number of transactions actually processed: 50000/50000
tps = 73.665381 (including connections establishing)
tps = 73.725278 (excluding connections establishing)
+
+ scaling factor: 600
+ number of clients: 100
+ number of transactions per client: 500
+ number of transactions actually processed: 50000/50000
+ tps = 80.177806 (including connections establishing)
+ tps = 80.244181 (excluding connections establishing)
</pre>
* Data array: RAID1, 2x 72GB 10k RPM%%%
WAL array: RAID1, 2x 72GB 10k RPM%%%
@@ -239,11 +269,10 @@
</pre>
!!! Other observations
-* The test database started at 1.4GB, and got to at least 14GB during testing. Has this growth affected results?
* The WAL consumes large amounts of [Kernel] page cache. When the WAL is moved between devices and the old files are unlinked, half of the page cache is freed. Since the WAL is written only once and never read back, this is a waste!
* The battery-backed write cache makes write performance very erratic.
* The [HP] ~SmartArray hardware (or perhaps driver) tends to block reads while there are cached writes occurring. Large read latencies (whole seconds) result. I have not yet found a way to tune this.
----
Part of CategoryDiskNotes