Wilkinson Ratings
OK, in an effort to kickstart a standard warrior rating system, I have adopted
the Wilkinson Benchmark to rate all my warriors. This has now been extended
to a more general selection of warriors culled from Planar's archive.
I'm also going to rate warriors which I haven't
released yet so that people can see how they compare here.
After originally creating this page I changed my mind about the inclusion of
self fights in the standard, so this page is being converted to the new rating
system. For comparison you may want to check out the
old page.
Steve Bailey and Beppe Bezzi have also published a load of ratings, presumably
based on the same
benchmark, so I'm going to include those here in brackets. Similarly, if anyone
has any ratings they have compiled then please send them to me. No cheating
though, I will be watching ;-) Send the warriors along with your scores and
they'll definitely be included.
- 180.1 Wind Up Toy v 0.7 - Ian Oversby
- (170.3) Frontwards v2
- (168.7) Naked Dancer
- 165.8 Gem of the Ocean - P. Kline
- 164.9 Wind Up Toy v0.4 - Ian Oversby
- 163.4 Porch Swing - Randy Graham (168.0)
- 164.8 Derision - M. R. Bremer
- 160.3 Endpoint . - M. R. Bremer
- 159.9 Mirage 1.5 - Anton Marsden
- 159.2 Quiz - Schitzo (159.6)
- 157.2 Jack in the Box - Beppe Bezzi (155.8)
- (156.3) Tornado 3.0
- 155.6 La Bomba - Beppe Bezzi
- 151.6 C I A - Anders Ivner
- 151.3 Impfinity v4g1 - Planar (149.4)
- 150.0 Marcia Trionfale 1.3 - Beppe Bezzi (150.4)
- 147.2 Nobody Special - Mike Nonemacher (145.8)
- 147.8 Armoury A5 - J. K. Wilkinson
- 147.1 Timescape 1.0 - J. Pohjalainen (149.4)
- 147.0 Hector 2 - Kurt Franke
- 146.4 Persistence - Kurt Franke
- 143.9 Torch t18 - P. Kline (140.3)
- 143.2 Blizzard - Anton Marsden
- 142.6 Night Train - Karl Lewin
- 142.5 Phq - Maurizio Vittuari
- 142.4 Door Mat v0.1 - K Lewin
- 142.3 Harmony - P. Kline
- 142.0 Seventy Five - Anders Ivner
- 141.8 Blue Funk 3 - Steven Morrell (142.2)
- 141.8 Pretentious v0.2 - Ian Oversby
- 141.6 Memories - Beppe Bezzi (146.9)
- 141.5 Time Lapse v0.1 - David Boeren
- 141.5 Lithium - John K. Wilkinson
- 140.6 Clisson Lite - P. Kline
- 141.7 myConfuser - Magnus Paulsson
- 140.7 SETI - John K. Wilkinson
- 139.9 Koolaid II: Wogg v2.2 - David Boeren
- 138.5 Die Hard - P. Kline
- 137.5 Juliet and Paper - M.R. Bremer & Beppe Bezzi
- 137.3 Paper One - Beppe Bezzi (138.9)
- 137.0 Leprechaun on Speed - Anders Ivner
- 136.8 anything box - schitzo
- 135.8 Aeka - T. Hsu (139.9)
- 135.3 Impfinity v 3i - Planar
- 134.8 Agony II - Stefan Strack
- 134.8 Harmony II - P. Kline
- 134.7 Time Lapse v0.8 - David Boeren
- 134.7 Crow - Karl Lewin
- 134.8 Babbo Natale - Maurizio Vittuari
- 134.4 myVamp v3.7 - Paulsson
- 134.0 Thermite - Robert Macrae (139.5)
- 131.2 Iron Gate - Wayne Sheppard (132.5)
- 129.2 The Mystery - Magnus Paulsson
- 126.8 Uvavu II revisited
- 126.7 Hyakutake Perihelion (Exclusive- Soon to be unleashed!)
- 126.6 Hyakutake C/1996 B2 +
- 125.8 Mason 2.0 - Robert Macrae
- 125.8 Qwiksand - Wayne Sheppard
- 125.5 Hyakutake C/1996 B2
- 124.3 Hyakutake Zenith
- 123.3 Cannonade - Paul Kline (123.5)
- 123.0 Extreme Prejudice
- 121.4 Rave - Stefan Strack (120.1)
- 121.0 You Wouldn't Let It Lie!
- 119.7 Judge Nutmeg
- 119.7 You Wouldn't Let It Lie! 1.01
- 118.4 127 point Imp Spiral - A. MacAulay
- 115.8 Uvavu II
- 113.2 Hyakutake Approaches
- 111.6 Fire Storm v1.1 - W. Mintardjo (107.6)
- 110.8 Hyakutake Rising
- 109.8 Flashpaper - Matt Hastings (110.4)
- 105.7 Uvavu P!
- 103.7 Pensive v0.3 - Ian Oversby
- 101.8 Hint Test V.4 - Beppe Bezzi
- 99.6 Aleph 0 - Jay Han (104.7)
- (90.0) Tonto 3 - Steve Bailey
- 88.5 Cyclone
- (87.2) Tonto 1 - Steve Bailey
- (87.1) Mice
- 85.5 Tornado - Beppe Bezzi (85.4)
- 83.5 Alien Kiss V1.1 - Bjoern Guenzel
- 81.5 Pensive v0.1 - Ian Oversby
- 81.3 3 Point Imp Spiral - MFCWB
- 77.5 Hyakutake Engine
- 72.7 Pensive v0.2 - Ian Oversby
- 71.5 3 point Imp ring - P. Kline or A. Ivner?
- 71.3 Uvavu V1.01
- 60.8 Uvavu
- (60.5) Cleaner
- 58.1 Enlightenment II
- 56.7 Enlightenment
- (56.0) Dr. Frog
- 52.1 Worm - MFCWB
- 50.3 Dwarf - A. K. Dewdney (49.4)
- 49.4 Deathwalker
- 48.7 Cleaver/75 - Wayne Sheppard
- 48.5 Imp!!!!! Yes the Original 1-point imp. - A. K. Dewdney
- 47.1 Eranu v1.03
- 46.6 Eranu V1.02
- (45.4) Chang1 (filename from Planar's archive)
- 44.9 Simple Scanner (test component for Hyakutake)
- 44.4 Deathwalker II
- 44.3 Uvavu 40(TNG)
- 42.8 Uvavu 20(TNG)
- 42.5 Uvavu 60(TNG)
- 39.3 Mutagen
- 38.0 Uvavu 76(TNG)
- 36.9 Uvavu (original)
- (34.5) Cancer
- 32.8 Ape - Calvin Loh
- 1.0 The original Imp Gate - B. Thomsen
- 0.8 Wait - MFCWB
Some warriors are credited to MFCWB, which of course refers to Steven Morrell's
book. Where an author has been named in the book, that name is used
instead. I don't have the details for all the warriors, so if there are
uncredited authors then please contact me.
The benchmark warriors are available from Planar's archive, and the rating
process I have used is simply 200 battles against each benchmark warrior. The
scores are then scaled to 100 battles (i.e. 0-300).
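The scaling described above can be sketched roughly as follows. This is only an illustration, not the script actually used for this page: it assumes the usual hill scoring of 3 points for a win and 1 for a tie (which is what gives the 0-300 range), and it assumes the final rating is the plain average of the per-opponent scores. The function names `score_per_100` and `benchmark_rating` are made up for the example.

```python
def score_per_100(wins, ties, rounds):
    """Scale a win/tie record to a 0-300 score per 100 rounds.

    Assumption: 3 points per win, 1 per tie, 0 per loss.
    """
    if rounds <= 0:
        raise ValueError("rounds must be positive")
    return (3 * wins + ties) * 100.0 / rounds


def benchmark_rating(results):
    """Average the scaled scores over all benchmark opponents.

    `results` is a list of (wins, ties, losses) tuples, one per
    benchmark warrior. Averaging is an assumption of this sketch.
    """
    scores = [score_per_100(w, t, w + t + l) for (w, t, l) in results]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical record: 100 wins, 50 ties, 50 losses against each of 12 opponents.
    print(benchmark_rating([(100, 50, 50)] * 12))
```

With 200 rounds per opponent, a single extra win against one opponent moves that opponent's score by 1.5 points and the 12-warrior average by 0.125.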
Only 2400 battles is probably too few to justify the precision I have used
in the ratings, but when I get time I will run more tests to increase the
accuracy.
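To put a rough number on that, here is a sketch of a standard-error estimate for a rating. It assumes 3 points for a win and 1 for a tie, a rating that is the average of the per-opponent scores scaled to 100 rounds, and rounds that are statistically independent (which is not strictly true for PSpace warriors). The function name `rating_standard_error` and the sample records are hypothetical.

```python
import math


def rating_standard_error(results):
    """Estimate the standard error of a 0-300 rating.

    `results` is a list of (wins, ties, losses) tuples, one per
    benchmark opponent. Rounds are treated as independent draws.
    """
    total_var = 0.0
    for wins, ties, losses in results:
        n = wins + ties + losses
        p_win, p_tie = wins / n, ties / n
        mean = 3 * p_win + p_tie                 # mean points per round
        var = 9 * p_win + p_tie - mean ** 2      # variance of points per round
        total_var += var / n                     # variance of this opponent's mean
    # Scale to 100 rounds and average over the opponents.
    return 100.0 * math.sqrt(total_var) / len(results)


if __name__ == "__main__":
    # Hypothetical record: 80 wins, 40 ties, 80 losses against each of 12 opponents.
    print(round(rating_standard_error([(80, 40, 80)] * 12), 2))
```

Under these assumptions the hypothetical record above gives a standard error of a little under 3 points, so the first decimal place of a rating says very little, and halving the error would take roughly four times as many rounds.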
So now the question is how the benchmark can be improved. There are only
12 warriors, probably too few for an accurate benchmark. Also, should
the number of rounds be altered?
My opinion is that since some PSpace algorithms are designed for the hills,
the number of rounds should reflect the Hills. If more rounds are to be
run for increased precision then the rounds should be split into groups of 200,
but if extra rounds are going to be run then I'd prefer them to be against
other opponents to decrease the dependence on strategy.
Also the standard warriors appear to be pretty poor at dealing with imps.
I didn't think my early programs were so weak; after all, they did make it to
the beginners hill, and most of them even made the top 10. There are probably too
many strong papers and not enough good scanners, so at the moment I'm going
to stop running benchmarks until a more balanced standard appears. I'd welcome any suggested additions.
Scott Manley/spm@star.arm.ac.uk