
Differences between version 37 and previous revision of PgBench.


Newer page: version 37, last edited on Monday, June 19, 2006 10:04:07 pm by GuyThornley
Older page: version 36, last edited on Monday, June 19, 2006 4:38:55 pm by GuyThornley
@@ -1,5 +1,5 @@
-This is a scratch pad for some [PostgreSQL] benchmarks. The contributed utility <tt>pgbench</tt> is used for the testing. 
+This is a scratch pad for some [PostgreSQL] 8.0 benchmarks. The contributed utility <tt>pgbench</tt> is used for the testing. 
  
 For most of the testing, important parts of the [PostgreSQL] configuration used are: 
  <verbatim> 
  shared_buffers = 23987 
@@ -18,13 +18,15 @@
  default_with_oids = off 
  stats_start_collector = false 
  </verbatim> 
  
-Exceptions will be listed as the tests are performed. 
+Exceptions are noted as the tests are performed. 
  
-The <tt>pgbench</tt> test database was created with the <tt>-s100</tt> scale factor option. This results in a fresh database of about 1.4GB. Consecutive runs of <tt>pgbench</tt> grow the database, however. All test runs were executed with the <tt>-c100</tt> option for 100 connections. The transactions per connection was adjusted as needed to give a stable test result, without obvious effects of caching. Typical settings were <tt>-t100</tt> to <tt>-t1000</tt>. 
+The <tt>pgbench</tt> test database was created with the <tt>-s600</tt> scale factor option. This results in a fresh database of about 8.6GiB, along with 1.3GiB of WAL. The test database was then backed up to a <tt>.tar.gz</tt> file so it could easily be restored between test runs. 
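Roughly, that initialise/backup/restore cycle might look like the following (a sketch only; the database name, data directory and backup paths are assumptions rather than details taken from the tests):
 <verbatim>
 # Initialise the pgbench database at scale factor 600
 # (the database name "pgbench" is an assumption)
 createdb pgbench
 pgbench -i -s 600 pgbench

 # Back up the freshly initialised cluster so it can be restored between runs
 # (data directory and backup path are assumptions)
 pg_ctl -D /var/lib/postgres/data stop
 tar -czf /backup/pgbench-s600.tar.gz -C /var/lib/postgres data
 pg_ctl -D /var/lib/postgres/data start

 # Restore before the next test run
 pg_ctl -D /var/lib/postgres/data stop
 rm -rf /var/lib/postgres/data
 tar -xzf /backup/pgbench-s600.tar.gz -C /var/lib/postgres
 pg_ctl -D /var/lib/postgres/data start
 </verbatim>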
  
-The <tt>pgbench</tt> client was actually run over a 100Mbit, full-duplex network connection from a client machine for most of the testing. Running <tt>pgbench</tt> remotely has not measurably degraded the performance. The client machine is a dual 3.06GHz Xeon running Linux 2.4.27. 
+Each test was executed 5 times in sequence, and the median result is reported. All tests were executed with the <tt>-c100</tt> option for 100 connections. The transaction count per connection was adjusted as necessary so that each single test would span several minutes. Typical settings were <tt>-t500</tt> to <tt>-t1000</tt>.  
+  
+The <tt>pgbench</tt> client was actually run over a 100Mbit, full-duplex network connection from a client machine for all of the testing. Running <tt>pgbench</tt> remotely has not measurably degraded the performance. The client machine is a dual 3.06GHz Xeon running Linux 2.4.27. [SSL] encryption was disabled. 
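A sketch of how each benchmark run might have been invoked from the client machine (the server host name <tt>dbserver</tt> is an assumption; each configuration was run 5 times and the median reported):
 <verbatim>
 # 100 clients, 500 transactions per client, repeated 5 times; take the median
 for run in 1 2 3 4 5; do
     pgbench -h dbserver -c 100 -t 500 pgbench
 done
 </verbatim>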
  
 The base hardware: 
  
 * [HP] DL380 G4 
@@ -36,9 +38,9 @@
  
 The base software: 
  
 * LinuxKernel 2.4.27 from [Debian]’s <tt>kernel-image-2.4.27-2-686-smp</tt> [Package] 
-* Using [Ext3] with <tt>ordered</tt> data mode 
+* Using [Ext3] with 128MB journal and <tt>ordered</tt> data mode 
  
 On with the testing! 
  
 !! Results: 4-disk configurations 
@@ -59,8 +61,15 @@
  number of transactions per client: 1000 
  number of transactions actually processed: 100000/100000 
  tps = 114.351628 (including connections establishing) 
  tps = 114.418688 (excluding connections establishing) 
+  
+ scaling factor: 600  
+ number of clients: 100  
+ number of transactions per client: 500  
+ number of transactions actually processed: 50000/50000  
+ tps = 124.728272 (including connections establishing)  
+ tps = 124.885813 (excluding connections establishing)  
  </pre> 
  
 * Data array: RAID5, 4x 72GB 10k RPM%%% 
  WAL array: On data array%%% 
@@ -79,8 +88,15 @@
  number of transactions per client: 1000 
  number of transactions actually processed: 100000/100000 
  tps = 118.365607 (including connections establishing) 
  tps = 118.442501 (excluding connections establishing) 
+  
+ scaling factor: 600  
+ number of clients: 100  
+ number of transactions per client: 500  
+ number of transactions actually processed: 50000/50000  
+ tps = 124.696768 (including connections establishing)  
+ tps = 124.870136 (excluding connections establishing)  
  </pre> 
  
 * Data array: RAID5, 4x 72GB 10k RPM%%% 
  WAL array: On data array%%% 
@@ -99,8 +115,15 @@
  number of transactions per client: 500 
  number of transactions actually processed: 50000/50000 
  tps = 103.447886 (including connections establishing) 
  tps = 103.556659 (excluding connections establishing) 
+  
+ scaling factor: 600  
+ number of clients: 100  
+ number of transactions per client: 500  
+ number of transactions actually processed: 50000/50000  
+ tps = 114.885220 (including connections establishing)  
+ tps = 115.020971 (excluding connections establishing)  
  </pre> 
  
 * Data array: RAID5, 4x 72GB 10k RPM%%% 
  WAL array: On data array%%% 
@@ -119,8 +142,15 @@
  number of transactions per client: 500 
  number of transactions actually processed: 50000/50000 
  tps = 73.665381 (including connections establishing) 
  tps = 73.725278 (excluding connections establishing) 
+  
+ scaling factor: 600  
+ number of clients: 100  
+ number of transactions per client: 500  
+ number of transactions actually processed: 50000/50000  
+ tps = 80.177806 (including connections establishing)  
+ tps = 80.244181 (excluding connections establishing)  
  </pre> 
  
 * Data array: RAID1, 2x 72GB 10k RPM%%% 
  WAL array: RAID1, 2x 72GB 10k RPM%%% 
@@ -239,11 +269,10 @@
  </pre> 
  
 !!! Other observations 
  
-* The test database started at 1.4GB, and got to at least 14GB during testing. Has this growth affected results?  
 * The WAL consumes large amounts of [Kernel] page cache. When the WAL is moved between devices and the old files are unlinked, half of the page cache is freed. Since the WAL is never read and is written only once, this is a waste! (A sketch of relocating the WAL follows this list.) 
 * The battery-backed write cache makes write performance very erratic. 
 * The [HP] ~SmartArray hardware (or perhaps driver) tends to block reads while there are cached writes occurring. Large read latencies (whole seconds) result. I have not yet found a way to tune this. 
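One common way to relocate the WAL onto a separate device is to symlink the <tt>pg_xlog</tt> directory; a sketch, with the data directory and mount point assumed:
 <verbatim>
 # Move the WAL onto a dedicated array and symlink it back into the cluster
 pg_ctl -D /var/lib/postgres/data stop
 mv /var/lib/postgres/data/pg_xlog /mnt/wal-array/pg_xlog
 ln -s /mnt/wal-array/pg_xlog /var/lib/postgres/data/pg_xlog
 pg_ctl -D /var/lib/postgres/data start
 </verbatim>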
  
 ---- 
 Part of CategoryDiskNotes