Consistency of Test batsmen - Part 2

A new tool to analyse which batsmen have been the most consistent in Test cricket

28-Jun-2014

Rohan Kanhai plays the pull while batting for Warwickshire, 5 June 1968

In my previous article I had analysed the consistency of Test batsmen, from the innings point of view. I received a number of good comments and a few excellent ideas were sent by the readers. The one idea which appealed to me most was by Santosh Sequeira who suggested that the Consistency analysis will have much better value if done using a single Test as the basis. Once I got out of my self-created mental block that 100 and 0 in a single Test represented inconsistency, this made a lot of sense.

I could see the following benefits accruing if Test Consistency was measured by Test, and not by innings.

- A Test is the logical unit of delivery for a player since the result is driven by a Test.
- There was a well-justified concern that many batsmen, even the very best, were short-changed in the innings-based analysis. A 100 and 0 represented two innings out of the consistency zone. This will disappear if these two innings have been played within a single Test.
- Top batsmen rarely have double failures in a Test. They make up for failures in one innings with a good innings in the other. The Test-based analysis recognises this characteristic.
- The results bear out this improvement since many top batsmen who were languishing in the lower half of the table of the select group of batsmen, have moved up considerably.
- The upper limit of the consistency zone was fairly low and this meant that many a good innings went out of the consistency zone. This is partly alleviated in the Test-based analysis.
- The impact of not-outs is fairly negligible. Since there are two innings to combine, I could adopt a different approach.

Let me reassure the readers that the following ideas that went into my to-do list are still active candidates for inclusion in future articles.

- Consistency analysis using the Median (Q2) value. This will be an assumption-free analysis.
- Batsman low-score analysis.
- Consistency value at batsman peak.
- Analysis of batsman troughs.

At the end of this article I will check whether there is good correlation between the innings-based analysis and Test-based analysis. If there is good correlation, we could work with either of the methods. If there are many variations, we have to peg our hat on either of the analysis methods. The criteria could be many. Why cross a bridge which is a few kilometres away?

I have devised a simple concept of "Batsman-active Tests". If a batsmen batted in either innings, I consider that Test as an active one for him. Else I do not include it. Don Bradman batted in 50 Tests only, and Sachin Tendulkar in 197 Tests. Just to give a specific example, when South Africa scored 637 for 2 at The Oval and won, AB de Villiers did not bat at all. So this Test is excluded from this analysis. On the other hand Alviro Peterson scored a duck and this is certainly an "active Test" for him. This is eminently fair, simple to understand and easy to work out.

If a batsman is not out at 10 in the only innings he played in a Test, well, these would even out across a career. It is the same policy for all batsmen. I briefly considered, and discarded, the method of taking a fraction of a Test, derived from the scores, in the denominator. Quite confusing, and just not worth it. An analysis of 39.42 Tests? No way.

Test Batsmen Consistency analysis: Top 30 batsmen
No	Batsman	LHB	Ctry	Tests	Runs	RpT	Inactive-Tests	Active-Tests	Real RpT	Cons-Zone Range	Below CZ	Below CZ %	Cons-Zone Tests	Cons-Index
1	Saeed Ahmed		Pak	41	2991	73.0	1	40	74.8	37.4-112.2	10	25.0%	25	62.5%
2	RB Kanhai		Win	79	6227	78.8	0	79	78.8	39.4-118.2	18	22.8%	46	58.2%
3	RC Fredericks	L	Win	59	4334	73.5	0	59	73.5	36.7-110.2	13	22.0%	34	57.6%
4	CH Lloyd	L	Win	110	7515	68.3	1	109	68.9	34.5-103.4	25	22.9%	62	56.9%
5	KD Walters		Aus	74	5357	72.4	0	74	72.4	36.2-108.6	18	24.3%	42	56.8%
6	AH Jones		Nzl	39	2922	74.9	0	39	74.9	37.5-112.4	9	23.1%	22	56.4%
7	GM Turner		Nzl	41	2991	73.0	0	41	73.0	36.5-109.4	11	26.8%	23	56.1%
8	NC O'Neill		Aus	42	2779	66.2	1	41	67.8	33.9-101.7	9	22.0%	23	56.1%
9	SR Watson		Aus	52	3408	65.5	0	52	65.5	32.8- 98.3	13	25.0%	29	55.8%
10	Misbah-ul-Haq		Pak	46	3218	70.0	1	45	71.5	35.8-107.3	10	22.2%	25	55.6%
11	MH Richardson	L	Nzl	38	2776	73.1	0	38	73.1	36.5-109.6	9	23.7%	21	55.3%
12	IJL Trott		Eng	49	3763	76.8	0	49	76.8	38.4-115.2	13	26.5%	27	55.1%
13	FE Woolley	L	Eng	64	3283	51.3	2	62	53.0	26.5- 79.4	18	29.0%	34	54.8%
14	A Ranatunga	L	Slk	93	5105	54.9	2	91	56.1	28.0- 84.1	25	27.5%	49	53.8%
15	ND McKenzie		Saf	58	3253	56.1	2	56	58.1	29.0- 87.1	16	28.6%	30	53.6%
16	AL Hassett		Aus	43	3073	71.5	0	43	71.5	35.7-107.2	10	23.3%	23	53.5%
17	RB Richardson		Win	86	5949	69.2	0	86	69.2	34.6-103.8	21	24.4%	45	52.3%
18	JH Edrich	L	Eng	77	5138	66.7	2	75	68.5	34.3-102.8	20	26.7%	39	52.0%
19	GR Marsh		Aus	50	2854	57.1	0	50	57.1	28.5- 85.6	14	28.0%	26	52.0%
20	L Hutton		Eng	79	6971	88.2	0	79	88.2	44.1-132.4	23	29.1%	41	51.9%
21	BJ Haddin		Aus	57	3033	53.2	1	56	54.2	27.1- 81.2	17	30.4%	29	51.8%
22	DPMD Jayawardene		Slk	144	11392	79.1	1	143	79.7	39.8-119.5	37	25.9%	74	51.7%
23	ER Dexter		Eng	62	4502	72.6	0	62	72.6	36.3-108.9	15	24.2%	32	51.6%
24	WJ Cronje		Saf	68	3714	54.6	2	66	56.3	28.1- 84.4	19	28.8%	34	51.5%
25	ME Trescothick	L	Eng	76	5820	76.6	0	76	76.6	38.3-114.9	23	30.3%	39	51.3%
26	SP Fleming	L	Nzl	111	7172	64.6	3	108	66.4	33.2- 99.6	34	31.5%	55	50.9%
27	Asif Iqbal		Pak	58	3575	61.6	1	57	62.7	31.4- 94.1	15	26.3%	29	50.9%
28	A Flower	L	Zim	63	4794	76.1	0	63	76.1	38.0-114.1	20	31.7%	32	50.8%
29	KJ Hughes		Aus	70	4415	63.1	1	69	64.0	32.0- 96.0	20	29.0%	35	50.7%
30	WR Hammond		Eng	85	7249	85.3	0	85	85.3	42.6-127.9	24	28.2%	43	50.6%
31	AD Nourse		Saf	34	2960	87.1	0	34	87.1	43.5-130.6	10	29.4%	17	50.0%

The top position in the Test-based Consistency table is taken by Saeed Ahmed, the attacking Pakistani batsmen whose Test average of 40.41 belied his value to his team. Out of the 40 Tests he batted in, he was in the ConZone an amazing 25 times, leading to an outstanding index value of 62.5%: That is 5 out of 8 Tests. The other telling statistic is the fact that he failed to reach the ConZone in only ten Tests out of these 40. That is a very low failure rate of 25%. In 1965, he made an unforgettable 172, out of a score of 307 for 8, against New Zealand, saving Pakistan from possible defeat. That was his highest Test score.

Then come four more attacking batsmen: Rohan Kanhai, Roy Fredericks, Clive Lloyd and Doug Walters. There are three West Indians and one Australian. This re-emphasises my belief that the attacking batsmen are as likely to be as consistent as the staid batsmen. With their more aggressive attitude, they are more likely to be able to make up for failures in one innings with good showings in the other. All these batsmen have Consistency indices above 56%.

There are a number of lovely batsmen in the top-20. Norm O'Neill is in the top-10. He might not have been the "next Bradman" but was a terrific batsman. The under-rated Misbah-ul-Haq rounds off the top-10. We will look at Jonathan Trott later on. I am very happy to see John Edrich in 18th place. Frank Woolley was a classical left-hander who is deservedly in 13th position. They are both favourites of mine. The 20th ranked batsman is Len Hutton, possibly the best in this lot. He himself clocks in at 51.9%. There are 36 batsmen who have index values of 50% or higher. The last batsman featured is Dudley Nourse who was placed in first position in the other table.

Test Batsmen Consistency analysis: Bottom 10 batsmen
No	Batsman	LHB	Ctry	Tests	Runs	RpT	Inactive-Tests	Active-Tests	Real RpT	Cons-Zone Range	Below CZ	Below CZ %	Cons-Zone Tests	Cons-Index
191	Shoaib Mohammad		Pak	45	2705	60.1	1	44	61.5	30.7- 92.2	18	40.9%	15	34.1%
192	Aamer Sohail	L	Pak	47	2823	60.1	0	47	60.1	30.0- 90.1	19	40.4%	16	34.0%
193	VT Trumper		Aus	48	3163	65.9	1	47	67.3	33.6-100.9	19	40.4%	16	34.0%
194	DL Amiss		Eng	50	3612	72.2	0	50	72.2	36.1-108.4	20	40.0%	17	34.0%
195	HW Taylor		Saf	42	2936	69.9	0	42	69.9	35.0-104.9	15	35.7%	14	33.3%
196	GA Hick		Eng	65	3383	52.0	0	65	52.0	26.0- 78.1	26	40.0%	21	32.3%
197	SK Warne		Aus	145	3154	21.8	8	137	23.0	11.5- 34.5	59	43.1%	43	31.4%
198	Ijaz Ahmed		Pak	60	3315	55.2	2	58	57.2	28.6- 85.7	24	41.4%	18	31.0%
199	AC Parore		Nzl	78	2865	36.7	3	75	38.2	19.1- 57.3	30	40.0%	23	30.7%
200	MS Atapattu		Slk	90	5502	61.1	2	88	62.5	31.3- 93.8	40	45.5%	26	29.5%

Let us look at the table proppers. Marvan Atapattu takes possession of the 200th position. What makes Atapattu so inconsistent? He batted in 88 Tests. He reached the ConZone mark of 31-94 only 26 times. That is a meagre 29.5%: not even a third of his Tests. This index is less than a half of Saeed Ahmed, the table topper. And Atapattu's inconsistency is emphasised by the 40 Tests below the ConZone. A look at his series of scores indicates that there were 30 Tests in which he scored 20 runs or less, and 11 Tests in which he scored 150 or higher.

Ijaz Ahmad has been a revelation. He was 200th in the innings-based table and 198th in this one: a very firm indicator that he was the embodiment of inconsistency. His index is a very low 31.0%. The inscrutable Graeme Hick is in the last-5 with a low index value of 32.3%. Dennis Amiss also confirms that his inconsistency moves on from the innings level to Test level, with an index of 34%. He was 193rd in the earlier table and he has moved one place below to 194th in this one. Michael Clarke has moved away from 192nd to 167th, still way down the table, but at least some distance away from the bottom.

The graphs are self-explanatory. The first one plots the top five batsmen and three from the bottom-10. I have used a modified box plot to do this visual depiction. The benefit is that I can easily show ten batsmen in one graph. The values for all batsmen are scaled, with 100 being taken to represent the number of Active Tests. This makes comparisons easier. The wider the rectangle, the higher will be the Consistency Index. The more the rectangle is to the right, the more will be the sub-CZ numbers, indicating a greater number of failures.

In this chart, I have picked ten notable batsmen and plotted their numbers. Nourse has been added to this list since he was No. 1 in the earlier table. It can be seen that most of these batsmen now have Consistency index values between 40 and 50%. In the earlier table, they had values between 30 and 40%. This clearly confirms that the top batsmen tend to make up for a failure in one innings with a good innings in the other. Double failures are that much rarer. The first number is also interesting. This indicates the real failures. Garry Sobers and Ricky Ponting lead in this measure with failures in a third of their Tests. The top-of-the-table values for this measure are around 25% and below.

Let us now study the ranks in the two tables. Not one batsman features in the top-10 positions in the two tables. However there is a very close contender for this honour. Trott is in 12th position in both tables. This could very well qualify him to be a contender for the most consistent batsman ever. Shane Watson is very close to Trott with ranks of 14 and 9 respectively in the two tables. Neil McKenzie is in positions 17 and 16. Richie Richardson is in creditable sub-20 positions of 13 and 18. Kanhai is very well placed at 19 and 2. Fredericks is at 16 and 3. Now we come to Dudley Nourse. He was first in the earlier table but has moved down to 31st here. Saeed Ahmed was 57th in the innings-based table and has moved to top position here.

The three batsmen whose ranks zoomed upwards are Walters who moved from 169 to 5, Thilan Samaraweera who moved from 194 to 38 and Bruce Mitchell who saw his position skyrocket from 186 to 35.

There were quite a few batsmen who saw drastic drops in their positions. Not many were top batsmen. The highest drop was also the top-most one: Sobers dropped 133 places from 22 to 155. Aamer Sohail moved from 63 to 192. Vijay Manjrekar dropped like a stone from an exalted 5 to 130. De Villiers drops from a respectable 18 to a mid-table 89. Why did they drop? I do not really have one single explanation. An explanation has to consider many factors including scoring patterns.

Bradman moved 99 places up from 159 to 60. Sunil Gavaskar moved up 79 places from 166 to 87. Tendulkar jumped 71 places from 153 to 82. Kevin Pietersen moved from 113 to 49. Brian Lara improved his position by 63 places: from 127 to 64. Andy Flower, Gordon Greenidge, Mahela Jayawardene, VVS Laxman, Neil Harvey et al., all moved up by around 50 places.

A final point on the ranks secured by the table. I added the two ranks, rather elementary step this is, just to get the batsmen who were top or bottom in both. Fredericks leads this ad-hoc combined table with a combined rank of 19. Kanhai follows with 21. Watson is in third position with 23. Trott, with an even 12 in both, is next with 24. Glenn Turner and Richie Richardson round off the top-5 with a combined rank of 31. Dudley Nourse, McKenzie, Dexter and Hammond round off the top-10.

Ijaz Ahmed is at the bottom with a combined rank of 398. Herbie Taylor is next at 390. Amiss confirms his high level of inconsistency with 386. John Reid and MAK Pataudi clock in at 386. Michael Clarke is tenth from last with a combined rank value of 359.

A few technical details on the comparisons of player ranks derived from the two methods.
- The Pearson's Correlation Coefficient between the two sets of ranks is 0.431, indicating that the two sets of values have virtually no correlation at all.
- The average of the absolute rank difference is 50.1 indicating that there are a number of very significant changes in ranks.
- A mere 28 players have rank differences in single digits. This is only 14% of the population.

Thus it is easy to conclude that the two methods create two entirely different sets of tables. We have to bite the bullet and select one of these two as the final defining table. I am sure the readers would agree with me that the Test-based Consistency analysis has a lot more going for it and that would be my choice. The reasons are summarised below.
- This method allows for players to compensate for one poor innings with a good innings during the course of a single match.
- Many of the top players have achieved this and have moved up in the Test-based table.
- Many of the top scoring and high performance batsmen are in higher positions in the Test-based table.
- The unit of a Test as the contest which produces the result is the final clincher.

What do we conclude? With the Test-based consistency analysis as the guiding factor and the innings-based analysis as a supporting entity, five batsmen stand out. Saeed Ahmed, Nourse, Trott, Kanhai and Fredericks. Considering the number of runs scored, the quality of opposition faced, the very high second position in the important table and the length of the career, I would place Rohan Kanhai at the top of the list.

To me, the type of attacking batsman that he was, the Consistency index values of 58.2% and 40.6% and the very low level of below-ConZone innings/Tests, Kanhai has been absolutely wonderful in the consistency stakes. Kanhai has played only eight Tests in which he scored 20 or fewer runs. He has 46 Tests in the ConZone range of 40-118. Rohan Bholalall Kanhai is an under-rated batsman who lived right through his career under the shadow of Sobers. However, he was a world-class batsman in his own right and this analysis substantiates that.

I have uploaded a single composite Excel chart containing the data for all 200 qualifying players. This contains the Test-based table, Innings-based table and Rank comparisons. To download/view this file, please CLICK HERE.

"A tale of two penultimate balls" could very well be the Dickensian title of the two glorious Tests over the past fortnight. Sitting on the outside it is easy to say that there were no losers and cricket was the winner. But England would feel gutted. They drew a match they should have won and lost a match they should have drawn match. But they could take a lot of positives from the series. The four centurions were all new players: not Cook or Bell. Plunkett's 11 wickets in the series was impressive. The fight England's lower order batsmen showed on the last day was magnificent.

"The private club" is however another thing. The lopsided scheduling in favour of the troika is already being seen. Why could the Sri Lankans not have been given three Tests and India, four Tests? This will happen in the future. This series cried out for a third Test. Next time it could be a two-Test Australia-South Africa series and a five-Test Australia-India series. This sort of imbalance will be the order of the day.

Amazing, but true. If Sangakkara and Jayawardene were to retire tomorrow, they would have shared 22986 career Test runs between them, split right down the middle.

South Africa England Pakistan New Zealand Sri Lanka Australia Bangladesh India West Indies

Anantha Narayanan has written for ESPNcricinfo and CastrolCricket and worked with a number of companies on their cricket performance ratings-related systems