Dr Andrew Scott G7VAV

My photo
Senior Lecturer [D35]
Computing Department
InfoLab 21, South Drive
Lancaster University
Lancaster, LA1 4WA
United Kingdom
 
November 2008
Mo Tu We Th Fr Sa Su
27 28 29 30 31 1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
1 2 3 4 5 6 7


Average Character Frequency
The following shows the average frequency with which each letter appears in English.
Such information is useful in data compression and security applications.
You should also look at the list of English word frequency.
0.056334 0.099662 0.125374 0.139247 0.913523 0.925764 1.598255 1.656386 2.061470 2.097982 2.244094 2.325395 2.399685 2.654289 2.876456 4.249582 4.490411 5.716129 6.184758 6.475467 6.808129 6.865905 7.588697 8.102814 8.982207 12.361982
Z Q J X V K B P G Y F C W M U L D R S H N I O A T E
Note: More complete corpora give slightly differing orders
1 39 Nine Steps Buchan, John 1875 to 1940
2 A Tale of Two Cities Dickens, Charles 1812 to 1870
3 Complete Works Shakespeare, William 1564 to 1616
4 Dracula Stoker, Bram 1847 to 1912
5 Economy of Mach and Manuf Babbage, Charles 1792 to 1871
6 Ivanhoe Scott, Walter 1771 to 1832
7 Jane Eyre Bronte, Charlotte 1816 to 1855
8 Jungle Book Kipling, Rudyard 1865 to 1936
9 Just So Stories Kipling, Rudyard 1865 to 1936
10 Kidnapped Stevenson, Robert 1850 to 1894
11 Kim Kipling, Rudyard 1865 to 1936
12 Micrographia Hooke, Robert 1635 to 1703
13 Poems Blake, William 1757 to 1827
14 Pygmalion Shaw, George Bernard 1856 to 1950
15 Treasure Island Stevenson, Robert 1850 to 1894
16 Wildfell Bronte, Anne 1820 to 1849
17 Wuthering Heights Bronte, Emily 1818 to 1848
Character Frequency (%)
Text A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
1 8.51 1.70 2.37 5.01 12.08 2.30 2.17 6.14 6.82 0.12 1.09 4.11 2.80 6.88 7.74 1.64 0.06 5.27 5.58 9.21 2.78 0.79 2.58 0.09 2.09 0.07
2 8.03 1.41 2.32 4.67 12.48 2.26 2.09 6.49 6.84 0.12 0.80 3.67 2.55 7.07 7.76 1.66 0.11 6.20 6.27 9.01 2.79 0.87 2.35 0.12 2.03 0.04
3 7.63 1.63 2.32 3.94 11.80 2.12 1.80 6.25 6.70 0.13 0.93 4.49 2.94 6.42 8.30 1.54 0.09 6.27 6.57 8.70 3.40 0.99 2.36 0.14 2.49 0.04
4 8.20 1.41 2.12 4.47 12.43 2.19 1.99 6.77 6.68 0.13 0.97 4.09 2.78 6.83 7.89 1.44 0.10 5.48 6.19 9.11 2.82 0.92 2.83 0.12 1.99 0.06
5 7.34 1.58 3.57 3.48 13.04 2.85 1.60 5.49 7.34 0.11 0.49 3.67 2.73 6.84 7.52 2.45 0.20 6.21 6.48 9.61 2.81 1.02 1.69 0.29 1.57 0.04
6 7.83 1.64 2.76 4.48 12.70 2.51 1.79 6.84 6.69 0.17 0.63 3.74 2.46 6.62 7.54 1.76 0.10 6.26 6.37 9.15 2.79 0.94 2.25 0.21 1.73 0.05
7 7.97 1.42 2.38 4.75 12.78 2.14 1.91 5.82 7.11 0.16 0.77 4.12 2.83 6.89 7.69 1.54 0.12 6.01 6.36 8.51 2.98 0.97 2.37 0.16 2.20 0.04
8 8.66 1.77 1.97 4.55 12.28 2.13 2.57 7.37 6.22 0.15 1.64 4.82 2.30 6.70 7.34 1.45 0.07 4.97 5.70 9.13 2.77 0.82 2.78 0.06 1.76 0.05
9 9.27 1.70 1.96 5.46 11.59 2.04 1.93 6.69 6.64 0.14 1.20 4.63 2.58 6.63 6.92 1.63 0.11 4.75 5.96 9.01 3.12 0.83 2.70 0.07 2.38 0.05
10 9.02 1.66 2.09 4.80 11.99 2.29 2.02 6.60 6.66 0.09 0.99 4.19 2.65 7.10 7.48 1.50 0.07 5.31 6.00 8.98 2.81 0.77 2.68 0.09 2.15 0.03
11 8.49 1.80 2.11 4.39 12.36 2.11 2.02 7.09 6.82 0.11 1.20 4.59 2.82 6.37 7.38 1.45 0.08 5.42 6.21 9.05 2.78 0.87 2.48 0.11 1.84 0.06
12 7.61 1.97 2.84 3.65 12.65 2.82 1.74 6.07 7.01 0.09 0.59 3.96 2.48 6.31 7.68 2.00 0.15 6.23 6.68 9.70 2.74 1.11 1.77 0.29 1.84 0.02
13 7.71 1.56 1.70 4.76 13.50 2.02 2.22 7.15 6.53 0.17 0.85 5.06 2.56 6.99 6.63 1.62 0.03 6.01 6.14 8.29 2.11 1.07 2.78 0.04 2.46 0.03
14 7.45 1.38 2.29 3.61 11.44 1.92 3.10 6.27 7.82 0.09 1.07 4.50 2.58 6.93 8.10 1.76 0.10 5.69 6.28 8.68 2.94 0.84 2.15 0.15 2.59 0.27
15 8.59 1.63 2.13 4.95 12.08 2.08 2.03 6.55 6.44 0.15 1.01 4.17 2.39 6.94 7.85 1.63 0.10 5.52 6.12 9.03 2.97 0.84 2.63 0.09 2.04 0.03
16 7.66 1.54 2.22 4.53 12.17 2.23 1.97 5.92 7.23 0.11 0.73 4.29 3.03 7.05 7.84 1.55 0.11 5.70 6.12 8.91 3.33 0.98 2.25 0.16 2.35 0.04
17 7.78 1.38 2.37 4.82 12.81 2.14 2.10 6.56 7.16 0.11 0.78 4.15 2.65 7.18 7.37 1.56 0.09 5.87 6.12 8.63 2.97 0.92 2.12 0.16 2.16 0.04
Average 8.10 1.60 2.33 4.49 12.36 2.24 2.06 6.48 6.87 0.13 0.93 4.25 2.65 6.81 7.59 1.66 0.10 5.72 6.18 8.98 2.88 0.91 2.40 0.14 2.10 0.06
Bigram Frequency (%)
0.884737 0.902697 0.903127 0.921210 0.926120 0.929606 0.934996 0.940888 1.016104 1.040079 1.047432 1.080136 1.082591 1.113441 1.145260 1.146353 1.155351 1.215185 1.282114 1.311331 1.317371 1.432877 1.611445 1.673365 1.707468 2.130870 2.169405 2.219897 3.471259 3.989114
EA LL VE AS ME OF TE SE NG ST ED ES AR IT TO HI IS OR ON AT EN HA OU ND RE ER IN AN HE TH
Note: More complete corpora give slightly differing orders
  A- B- C- D- E- F- G- H- I- J- K- L- M- N- O- P- Q- R- S- T- U- V- W- X- Y- Z-
-A 0.01 0.15 0.38 0.16 0.88 0.23 0.15 1.43 0.13 0.02 0.01 0.46 0.57 0.19 0.07 0.27 0.00 0.48 0.34 0.37 0.06 0.09 0.49 0.01 0.02 0.01
-B 0.17 0.02 0.00 0.00 0.02 0.00 0.00 0.01 0.07 0.00 0.00 0.01 0.08 0.01 0.08 0.00 0.00 0.02 0.02 0.00 0.06 0.00 0.00 0.00 0.00 0.00
-C 0.34 0.00 0.05 0.00 0.26 0.00 0.00 0.01 0.40 0.00 0.00 0.01 0.00 0.33 0.09 0.00 0.00 0.09 0.12 0.05 0.19 0.00 0.00 0.03 0.00 0.00
-D 0.45 0.00 0.00 0.05 1.05 0.00 0.00 0.00 0.34 0.00 0.00 0.38 0.00 1.67 0.21 0.00 0.00 0.28 0.01 0.00 0.07 0.00 0.01 0.00 0.00 0.00
-E 0.02 0.69 0.57 0.64 0.54 0.23 0.32 3.47 0.33 0.04 0.36 0.86 0.93 0.74 0.04 0.40 0.00 1.71 0.94 0.93 0.13 0.90 0.48 0.03 0.15 0.03
-F 0.08 0.00 0.00 0.01 0.15 0.13 0.00 0.00 0.22 0.00 0.00 0.10 0.01 0.04 0.93 0.00 0.00 0.04 0.01 0.01 0.03 0.00 0.00 0.00 0.00 0.00
-G 0.18 0.00 0.00 0.03 0.08 0.00 0.03 0.00 0.27 0.00 0.00 0.00 0.00 1.02 0.04 0.00 0.00 0.07 0.00 0.00 0.17 0.00 0.00 0.00 0.00 0.00
-H 0.03 0.00 0.58 0.00 0.03 0.00 0.36 0.00 0.00 0.00 0.00 0.00 0.00 0.01 0.03 0.05 0.00 0.02 0.52 3.99 0.00 0.00 0.57 0.00 0.00 0.00
-I 0.45 0.07 0.13 0.35 0.18 0.22 0.12 1.15 0.01 0.00 0.18 0.55 0.27 0.23 0.09 0.13 0.00 0.58 0.43 0.65 0.10 0.18 0.59 0.03 0.03 0.00
-J 0.01 0.01 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.01 0.01 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
-K 0.19 0.00 0.19 0.00 0.02 0.00 0.00 0.00 0.08 0.00 0.00 0.04 0.00 0.09 0.12 0.00 0.00 0.07 0.04 0.00 0.02 0.00 0.00 0.00 0.00 0.00
-L 0.76 0.22 0.13 0.05 0.58 0.08 0.09 0.01 0.49 0.00 0.02 0.90 0.01 0.06 0.31 0.19 0.00 0.08 0.07 0.16 0.38 0.00 0.02 0.00 0.01 0.00
-M 0.30 0.00 0.00 0.01 0.31 0.00 0.01 0.01 0.38 0.00 0.00 0.03 0.05 0.01 0.55 0.00 0.00 0.11 0.05 0.01 0.10 0.00 0.00 0.00 0.01 0.00
-N 2.22 0.00 0.00 0.02 1.32 0.00 0.04 0.02 2.17 0.00 0.12 0.00 0.01 0.08 1.28 0.00 0.00 0.15 0.02 0.01 0.39 0.00 0.13 0.00 0.00 0.00
-O 0.00 0.21 0.61 0.33 0.04 0.54 0.22 0.70 0.36 0.05 0.00 0.52 0.35 0.75 0.41 0.27 0.00 0.67 0.46 1.15 0.01 0.05 0.30 0.00 0.55 0.00
-P 0.16 0.00 0.00 0.00 0.16 0.00 0.00 0.00 0.06 0.00 0.00 0.02 0.15 0.00 0.15 0.11 0.00 0.03 0.18 0.00 0.17 0.00 0.00 0.03 0.01 0.00
-Q 0.00 0.00 0.01 0.00 0.02 0.00 0.00 0.00 0.01 0.00 0.00 0.00 0.00 0.01 0.00 0.00 0.00 0.00 0.01 0.00 0.00 0.00 0.00 0.00 0.00 0.00
-R 1.08 0.16 0.12 0.12 2.13 0.22 0.18 0.08 0.37 0.00 0.00 0.01 0.05 0.01 1.22 0.32 0.00 0.13 0.00 0.34 0.68 0.00 0.04 0.00 0.01 0.00
-S 0.92 0.04 0.00 0.16 1.08 0.00 0.06 0.02 1.16 0.00 0.05 0.11 0.07 0.32 0.28 0.04 0.00 0.39 0.37 0.19 0.52 0.00 0.04 0.00 0.08 0.00
-T 1.31 0.02 0.20 0.00 0.47 0.09 0.01 0.24 1.11 0.00 0.00 0.08 0.00 0.75 0.57 0.07 0.00 0.35 1.04 0.21 0.51 0.00 0.00 0.03 0.01 0.00
-U 0.12 0.30 0.11 0.08 0.03 0.09 0.09 0.11 0.04 0.05 0.00 0.10 0.14 0.05 1.61 0.08 0.14 0.15 0.25 0.18 0.00 0.00 0.00 0.00 0.00 0.00
-V 0.33 0.00 0.00 0.02 0.22 0.00 0.00 0.00 0.18 0.00 0.00 0.03 0.00 0.03 0.17 0.00 0.00 0.06 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
-W 0.10 0.00 0.00 0.01 0.11 0.00 0.00 0.00 0.00 0.00 0.00 0.01 0.00 0.01 0.54 0.00 0.00 0.02 0.07 0.08 0.00 0.00 0.00 0.00 0.00 0.00
-X 0.01 0.00 0.00 0.00 0.15 0.00 0.00 0.00 0.02 0.00 0.00 0.00 0.00 0.00 0.01 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
-Y 0.34 0.17 0.03 0.07 0.20 0.01 0.00 0.12 0.00 0.00 0.01 0.37 0.34 0.11 0.05 0.02 0.00 0.23 0.02 0.14 0.00 0.01 0.00 0.00 0.00 0.00
-Z 0.01 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.02 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00


© 2006 - 2008 Andrew Scott