Amino acid dipeptide frequency for Equus caballus (Horse)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
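As a minimal sketch of the idea, the Python snippet below counts overlapping pairs of adjacent residues in a list of protein sequences and reports each pair's share in per mille. The function name dipeptide_frequencies, the one-letter input strings, and the choice to normalize by the total number of pairs are assumptions made for illustration; this page does not describe how the database computes its values.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: sequences are strings of one-letter amino acid codes and
    pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2: "ACD" -> "AC", "CD"
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real horse proteins
freqs = dipeptide_frequencies(["AAL", "LLA"])
for pair in sorted(freqs):
    print(pair, freqs[pair])
```

Counting overlapping windows means a protein of length n contributes n - 1 pairs, so very short sequences contribute little or nothing.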

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with every amino acid equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
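The arithmetic behind this baseline can be sketched in a few lines of Python. The helper names below are hypothetical, and the snippet assumes that "expected" means the uniform expectation of 1/400 (or 1/441 with Xaa); the three example values are taken from the table on this page.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Frequency (per mille) of each dipeptide if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def representation(freq_permille, expected_permille):
    """Label a dipeptide relative to the expected frequency."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "as expected"


expected = uniform_expectation_permille(20)  # 1000 / 400 = 2.5 per mille
observed = {"LeuLeu": 10.891, "AlaAla": 7.196, "TrpTrp": 0.194}
for pair, freq in observed.items():
    print(pair, freq, representation(freq, expected))
```

With 21 residue types the baseline drops slightly, to 1000 / 441, or about 2.27‰.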

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
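As a small worked example of this unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The function names are hypothetical; the input is the AlaAla value from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 7.196  # per mille, from the table
print(permille_to_fraction(ala_ala))  # roughly 0.007196
print(permille_to_percent(ala_ala))   # roughly 0.7196
```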

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Second residue order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.196 ± 0.032
AlaCys: 1.407 ± 0.009
AlaAsp: 2.941 ± 0.011
AlaGlu: 4.885 ± 0.02
AlaPhe: 2.551 ± 0.014
AlaGly: 5.032 ± 0.02
AlaHis: 1.586 ± 0.009
AlaIle: 2.768 ± 0.012
AlaLys: 3.403 ± 0.019
AlaLeu: 7.18 ± 0.027
AlaMet: 1.46 ± 0.009
AlaAsn: 2.028 ± 0.009
AlaPro: 4.466 ± 0.025
AlaGln: 3.251 ± 0.017
AlaArg: 3.936 ± 0.019
AlaSer: 5.895 ± 0.021
AlaThr: 3.508 ± 0.015
AlaVal: 4.718 ± 0.016
AlaTrp: 0.824 ± 0.006
AlaTyr: 1.461 ± 0.008
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.297 ± 0.009
CysCys: 0.618 ± 0.008
CysAsp: 1.001 ± 0.009
CysGlu: 1.278 ± 0.01
CysPhe: 0.814 ± 0.006
CysGly: 1.679 ± 0.016
CysHis: 0.656 ± 0.007
CysIle: 0.917 ± 0.008
CysLys: 1.133 ± 0.009
CysLeu: 2.167 ± 0.014
CysMet: 0.402 ± 0.004
CysAsn: 0.765 ± 0.006
CysPro: 1.404 ± 0.012
CysGln: 1.057 ± 0.009
CysArg: 1.306 ± 0.009
CysSer: 2.036 ± 0.014
CysThr: 1.086 ± 0.008
CysVal: 1.304 ± 0.01
CysTrp: 0.317 ± 0.004
CysTyr: 0.551 ± 0.005
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.925 ± 0.011
AspCys: 1.041 ± 0.009
AspAsp: 2.617 ± 0.014
AspGlu: 3.373 ± 0.015
AspPhe: 2.08 ± 0.009
AspGly: 3.284 ± 0.016
AspHis: 1.129 ± 0.006
AspIle: 2.545 ± 0.011
AspLys: 2.555 ± 0.012
AspLeu: 4.948 ± 0.016
AspMet: 1.093 ± 0.006
AspAsn: 1.644 ± 0.008
AspPro: 2.886 ± 0.012
AspGln: 1.837 ± 0.009
AspArg: 2.414 ± 0.011
AspSer: 4.193 ± 0.019
AspThr: 2.44 ± 0.013
AspVal: 3.019 ± 0.015
AspTrp: 0.608 ± 0.005
AspTyr: 1.402 ± 0.008
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.26 ± 0.022
GluCys: 1.409 ± 0.014
GluAsp: 4.391 ± 0.016
GluGlu: 7.886 ± 0.042
GluPhe: 2.009 ± 0.009
GluGly: 4.244 ± 0.017
GluHis: 1.503 ± 0.008
GluIle: 3.278 ± 0.019
GluLys: 5.546 ± 0.033
GluLeu: 6.518 ± 0.028
GluMet: 1.681 ± 0.009
GluAsn: 3.212 ± 0.015
GluPro: 3.29 ± 0.018
GluGln: 3.165 ± 0.016
GluArg: 4.13 ± 0.02
GluSer: 4.456 ± 0.018
GluThr: 3.484 ± 0.014
GluVal: 4.144 ± 0.016
GluTrp: 0.677 ± 0.005
GluTyr: 1.549 ± 0.013
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.932 ± 0.009
PheCys: 0.904 ± 0.007
PheAsp: 1.641 ± 0.008
PheGlu: 1.947 ± 0.01
PhePhe: 1.582 ± 0.01
PheGly: 2.114 ± 0.011
PheHis: 1.004 ± 0.007
PheIle: 1.822 ± 0.012
PheLys: 1.77 ± 0.01
PheLeu: 4.084 ± 0.018
PheMet: 0.751 ± 0.006
PheAsn: 1.32 ± 0.008
PhePro: 1.979 ± 0.01
PheGln: 1.753 ± 0.01
PheArg: 1.93 ± 0.012
PheSer: 3.403 ± 0.016
PheThr: 1.996 ± 0.01
PheVal: 2.075 ± 0.011
PheTrp: 0.482 ± 0.005
PheTyr: 1.162 ± 0.007
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.827 ± 0.024
GlyCys: 1.315 ± 0.01
GlyAsp: 3.112 ± 0.014
GlyGlu: 4.111 ± 0.02
GlyPhe: 2.342 ± 0.013
GlyGly: 5.271 ± 0.032
GlyHis: 1.665 ± 0.011
GlyIle: 2.686 ± 0.012
GlyLys: 3.691 ± 0.017
GlyLeu: 5.873 ± 0.024
GlyMet: 1.234 ± 0.008
GlyAsn: 2.294 ± 0.012
GlyPro: 4.458 ± 0.038
GlyGln: 2.769 ± 0.013
GlyArg: 4.064 ± 0.016
GlySer: 5.861 ± 0.022
GlyThr: 3.547 ± 0.014
GlyVal: 3.517 ± 0.018
GlyTrp: 0.826 ± 0.007
GlyTyr: 1.65 ± 0.009
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.311 ± 0.009
HisCys: 0.73 ± 0.006
HisAsp: 0.866 ± 0.007
HisGlu: 1.294 ± 0.008
HisPhe: 1.069 ± 0.007
HisGly: 1.535 ± 0.009
HisHis: 0.905 ± 0.008
HisIle: 1.229 ± 0.008
HisLys: 1.234 ± 0.007
HisLeu: 2.938 ± 0.011
HisMet: 0.569 ± 0.005
HisAsn: 0.839 ± 0.006
HisPro: 1.687 ± 0.012
HisGln: 1.304 ± 0.01
HisArg: 1.564 ± 0.009
HisSer: 2.295 ± 0.012
HisThr: 1.458 ± 0.01
HisVal: 1.488 ± 0.009
HisTrp: 0.332 ± 0.003
HisTyr: 0.77 ± 0.006
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.526 ± 0.011
IleCys: 1.055 ± 0.009
IleAsp: 1.99 ± 0.011
IleGlu: 2.645 ± 0.016
IlePhe: 1.853 ± 0.012
IleGly: 2.14 ± 0.011
IleHis: 1.321 ± 0.013
IleIle: 2.41 ± 0.015
IleLys: 2.722 ± 0.017
IleLeu: 4.593 ± 0.019
IleMet: 0.977 ± 0.007
IleAsn: 1.9 ± 0.012
IlePro: 2.644 ± 0.012
IleGln: 2.269 ± 0.011
IleArg: 2.371 ± 0.01
IleSer: 3.763 ± 0.014
IleThr: 2.519 ± 0.014
IleVal: 2.452 ± 0.013
IleTrp: 0.509 ± 0.005
IleTyr: 1.395 ± 0.008
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.081 ± 0.02
LysCys: 1.103 ± 0.008
LysAsp: 3.196 ± 0.017
LysGlu: 5.3 ± 0.03
LysPhe: 1.714 ± 0.01
LysGly: 3.21 ± 0.019
LysHis: 1.364 ± 0.009
LysIle: 2.842 ± 0.015
LysLys: 4.68 ± 0.026
LysLeu: 5.24 ± 0.02
LysMet: 1.463 ± 0.009
LysAsn: 2.431 ± 0.013
LysPro: 3.091 ± 0.019
LysGln: 2.632 ± 0.014
LysArg: 3.299 ± 0.014
LysSer: 3.995 ± 0.018
LysThr: 3.187 ± 0.012
LysVal: 3.373 ± 0.021
LysTrp: 0.625 ± 0.006
LysTyr: 1.517 ± 0.01
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.747 ± 0.024
LeuCys: 2.195 ± 0.014
LeuAsp: 4.68 ± 0.018
LeuGlu: 7.293 ± 0.029
LeuPhe: 3.38 ± 0.016
LeuGly: 5.886 ± 0.025
LeuHis: 2.709 ± 0.013
LeuIle: 3.885 ± 0.016
LeuLys: 5.863 ± 0.022
LeuLeu: 10.891 ± 0.04
LeuMet: 1.971 ± 0.01
LeuAsn: 3.557 ± 0.016
LeuPro: 6.12 ± 0.024
LeuGln: 5.81 ± 0.026
LeuArg: 6.051 ± 0.021
LeuSer: 8.139 ± 0.022
LeuThr: 5.099 ± 0.016
LeuVal: 5.437 ± 0.02
LeuTrp: 1.121 ± 0.008
LeuTyr: 2.467 ± 0.012
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.851 ± 0.009
MetCys: 0.382 ± 0.004
MetAsp: 1.249 ± 0.007
MetGlu: 1.835 ± 0.009
MetPhe: 0.699 ± 0.005
MetGly: 1.254 ± 0.007
MetHis: 0.437 ± 0.004
MetIle: 0.852 ± 0.006
MetLys: 1.433 ± 0.007
MetLeu: 1.929 ± 0.009
MetMet: 0.552 ± 0.005
MetAsn: 0.876 ± 0.006
MetPro: 1.051 ± 0.008
MetGln: 0.921 ± 0.007
MetArg: 1.032 ± 0.006
MetSer: 1.523 ± 0.009
MetThr: 1.103 ± 0.007
MetVal: 1.348 ± 0.008
MetTrp: 0.246 ± 0.003
MetTyr: 0.571 ± 0.005
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.018 ± 0.009
AsnCys: 0.801 ± 0.007
AsnAsp: 1.482 ± 0.009
AsnGlu: 2.277 ± 0.012
AsnPhe: 1.411 ± 0.009
AsnGly: 2.382 ± 0.014
AsnHis: 0.915 ± 0.007
AsnIle: 2.148 ± 0.012
AsnLys: 2.313 ± 0.013
AsnLeu: 3.725 ± 0.015
AsnMet: 0.881 ± 0.005
AsnAsn: 1.522 ± 0.011
AsnPro: 2.1 ± 0.011
AsnGln: 1.654 ± 0.01
AsnArg: 1.84 ± 0.009
AsnSer: 3.077 ± 0.013
AsnThr: 1.939 ± 0.01
AsnVal: 2.161 ± 0.01
AsnTrp: 0.456 ± 0.004
AsnTyr: 1.096 ± 0.008
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.238 ± 0.027
ProCys: 1.159 ± 0.009
ProAsp: 2.766 ± 0.013
ProGlu: 4.487 ± 0.02
ProPhe: 1.94 ± 0.009
ProGly: 5.536 ± 0.049
ProHis: 1.47 ± 0.009
ProIle: 1.867 ± 0.012
ProLys: 2.845 ± 0.018
ProLeu: 5.394 ± 0.022
ProMet: 1.043 ± 0.007
ProAsn: 1.776 ± 0.009
ProPro: 6.303 ± 0.046
ProGln: 2.828 ± 0.015
ProArg: 3.649 ± 0.019
ProSer: 5.845 ± 0.028
ProThr: 3.069 ± 0.015
ProVal: 3.789 ± 0.018
ProTrp: 0.705 ± 0.005
ProTyr: 1.424 ± 0.01
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.501 ± 0.017
GlnCys: 0.915 ± 0.008
GlnAsp: 2.324 ± 0.01
GlnGlu: 3.962 ± 0.018
GlnPhe: 1.324 ± 0.009
GlnGly: 2.858 ± 0.015
GlnHis: 1.292 ± 0.007
GlnIle: 1.984 ± 0.009
GlnLys: 2.979 ± 0.017
GlnLeu: 4.756 ± 0.021
GlnMet: 1.109 ± 0.008
GlnAsn: 1.818 ± 0.009
GlnPro: 2.834 ± 0.018
GlnGln: 3.08 ± 0.024
GlnArg: 2.975 ± 0.015
GlnSer: 3.136 ± 0.015
GlnThr: 2.307 ± 0.011
GlnVal: 2.775 ± 0.01
GlnTrp: 0.56 ± 0.005
GlnTyr: 1.115 ± 0.007
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.168 ± 0.018
ArgCys: 1.241 ± 0.011
ArgAsp: 2.706 ± 0.012
ArgGlu: 4.121 ± 0.018
ArgPhe: 1.811 ± 0.009
ArgGly: 3.884 ± 0.02
ArgHis: 1.532 ± 0.009
ArgIle: 2.459 ± 0.011
ArgLys: 3.636 ± 0.012
ArgLeu: 5.471 ± 0.018
ArgMet: 1.181 ± 0.007
ArgAsn: 2.08 ± 0.01
ArgPro: 3.584 ± 0.017
ArgGln: 2.676 ± 0.013
ArgArg: 4.592 ± 0.022
ArgSer: 4.343 ± 0.021
ArgThr: 2.851 ± 0.012
ArgVal: 3.17 ± 0.015
ArgTrp: 0.718 ± 0.006
ArgTyr: 1.404 ± 0.008
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.486 ± 0.018
SerCys: 1.854 ± 0.013
SerAsp: 3.931 ± 0.02
SerGlu: 5.308 ± 0.02
SerPhe: 3.051 ± 0.013
SerGly: 5.657 ± 0.024
SerHis: 2.11 ± 0.01
SerIle: 3.22 ± 0.014
SerLys: 4.136 ± 0.016
SerLeu: 8.388 ± 0.022
SerMet: 1.553 ± 0.008
SerAsn: 2.62 ± 0.012
SerPro: 6.246 ± 0.032
SerGln: 3.886 ± 0.015
SerArg: 4.618 ± 0.022
SerSer: 9.533 ± 0.046
SerThr: 4.402 ± 0.018
SerVal: 4.901 ± 0.017
SerTrp: 1.095 ± 0.007
SerTyr: 2.033 ± 0.009
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.761 ± 0.015
ThrCys: 1.308 ± 0.011
ThrAsp: 2.41 ± 0.01
ThrGlu: 3.552 ± 0.015
ThrPhe: 2.078 ± 0.01
ThrGly: 3.444 ± 0.016
ThrHis: 1.272 ± 0.008
ThrIle: 2.387 ± 0.013
ThrLys: 2.694 ± 0.015
ThrLeu: 5.319 ± 0.015
ThrMet: 1.078 ± 0.006
ThrAsn: 1.698 ± 0.009
ThrPro: 3.58 ± 0.02
ThrGln: 2.276 ± 0.01
ThrArg: 2.463 ± 0.009
ThrSer: 4.628 ± 0.018
ThrThr: 2.91 ± 0.016
ThrVal: 3.826 ± 0.017
ThrTrp: 0.709 ± 0.007
ThrTyr: 1.37 ± 0.008
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.215 ± 0.014
ValCys: 1.449 ± 0.009
ValAsp: 2.849 ± 0.012
ValGlu: 3.829 ± 0.018
ValPhe: 2.324 ± 0.011
ValGly: 3.346 ± 0.017
ValHis: 1.534 ± 0.009
ValIle: 2.833 ± 0.013
ValLys: 3.41 ± 0.019
ValLeu: 6.12 ± 0.018
ValMet: 1.262 ± 0.008
ValAsn: 2.239 ± 0.012
ValPro: 3.639 ± 0.019
ValGln: 2.695 ± 0.012
ValArg: 3.055 ± 0.013
ValSer: 4.902 ± 0.017
ValThr: 3.746 ± 0.022
ValVal: 3.896 ± 0.015
ValTrp: 0.703 ± 0.005
ValTyr: 1.559 ± 0.007
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.821 ± 0.006
TrpCys: 0.252 ± 0.003
TrpAsp: 0.648 ± 0.006
TrpGlu: 0.815 ± 0.006
TrpPhe: 0.445 ± 0.005
TrpGly: 0.773 ± 0.008
TrpHis: 0.294 ± 0.004
TrpIle: 0.574 ± 0.004
TrpLys: 0.807 ± 0.005
TrpLeu: 1.188 ± 0.009
TrpMet: 0.312 ± 0.004
TrpAsn: 0.531 ± 0.005
TrpPro: 0.534 ± 0.005
TrpGln: 0.521 ± 0.004
TrpArg: 0.793 ± 0.006
TrpSer: 0.857 ± 0.006
TrpThr: 0.666 ± 0.006
TrpVal: 0.681 ± 0.005
TrpTrp: 0.194 ± 0.003
TrpTyr: 0.355 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.335 ± 0.007
TyrCys: 0.647 ± 0.005
TyrAsp: 1.235 ± 0.009
TyrGlu: 1.634 ± 0.011
TyrPhe: 1.175 ± 0.007
TyrGly: 1.583 ± 0.009
TyrHis: 0.719 ± 0.006
TyrIle: 1.348 ± 0.008
TyrLys: 1.495 ± 0.015
TyrLeu: 2.581 ± 0.013
TyrMet: 0.571 ± 0.005
TyrAsn: 1.049 ± 0.006
TyrPro: 1.235 ± 0.008
TyrGln: 1.22 ± 0.007
TyrArg: 1.546 ± 0.009
TyrSer: 2.147 ± 0.01
TyrThr: 1.448 ± 0.01
TyrVal: 1.532 ± 0.007
TyrTrp: 0.353 ± 0.005
TyrTyr: 0.907 ± 0.006
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.038 ± 0.016
Statistics based on 44491 proteins (29288982 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
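One way such an estimate could be obtained is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported. Using the standard deviation as the error measure, the function names, and the fixed random seed are all assumptions made for illustration; the note above only states that bootstrapping was done 100 times at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Std. dev. of the per mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse per round
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not individual residues) with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy sequences in one-letter code, not real horse proteins
toy = ["AAAA", "ALLA", "LLLL", "ALAL", "AALL"]
print(bootstrap_error(toy, "AA"))
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins rather than between individual residue pairs.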

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski