Amino acid dipeptide frequency for Aerosticca soli

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
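As a minimal sketch of what such a calculation could look like, the Python snippet below counts overlapping residue pairs inside each protein and reports them in per mille. The function name dipeptide_frequencies, the toy sequences, and the choice to map every non-standard letter to X (Xaa) are assumptions made for illustration; this is not the Proteome-pI implementation.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs are counted with a sliding window of width 2 inside each
    protein, so no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown letter becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real Aerosticca soli sequences.
toy = ["MAALAG", "MALLB"]
freqs = dipeptide_frequencies(toy)
```

With this toy input the two sequences contribute 5 and 4 pairs, so each observed pair counts for 1/9 of the total, and the letter B is reported as X.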

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
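The arithmetic above can be written out as a short Python sketch. It assumes that "expected" means the uniform value 1/400; the page does not state the exact threshold used for the colouring, and the helper names per_mille_to_fraction and classify are hypothetical. The example values are taken from the table below.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

n_dipeptides = N_STANDARD ** 2       # 400
n_dipeptides_xaa = N_WITH_XAA ** 2   # 441

# Uniform expectation if every one of the 400 dipeptides were equally likely.
expected_fraction = 1 / n_dipeptides            # 0.0025, i.e. 0.25%
expected_per_mille = 1000 * expected_fraction   # 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value_per_mille, expected=expected_per_mille):
    """Label a value relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


ala_ala = classify(20.805)   # AlaAla from the table
cys_cys = classify(0.111)    # CysCys from the table
ala_asn = classify(2.477)    # AlaAsn, just below 2.5 per mille
```

Under this assumption AlaAla is overrepresented, while CysCys and AlaAsn fall below the uniform expectation.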

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.805 ± 0.26
AlaCys: 1.394 ± 0.049
AlaAsp: 7.185 ± 0.091
AlaGlu: 7.758 ± 0.121
AlaPhe: 4.205 ± 0.076
AlaGly: 11.983 ± 0.125
AlaHis: 3.254 ± 0.073
AlaIle: 5.376 ± 0.074
AlaLys: 3.024 ± 0.083
AlaLeu: 17.67 ± 0.234
AlaMet: 3.333 ± 0.075
AlaAsn: 2.477 ± 0.056
AlaPro: 6.737 ± 0.121
AlaGln: 5.588 ± 0.096
AlaArg: 11.543 ± 0.149
AlaSer: 5.839 ± 0.083
AlaThr: 5.864 ± 0.097
AlaVal: 9.29 ± 0.125
AlaTrp: 2.36 ± 0.066
AlaTyr: 2.63 ± 0.065
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.194 ± 0.048
CysCys: 0.111 ± 0.011
CysAsp: 0.422 ± 0.022
CysGlu: 0.448 ± 0.023
CysPhe: 0.252 ± 0.019
CysGly: 0.944 ± 0.035
CysHis: 0.288 ± 0.024
CysIle: 0.315 ± 0.021
CysLys: 0.158 ± 0.013
CysLeu: 0.9 ± 0.035
CysMet: 0.161 ± 0.014
CysAsn: 0.188 ± 0.014
CysPro: 0.482 ± 0.026
CysGln: 0.252 ± 0.016
CysArg: 0.711 ± 0.031
CysSer: 0.368 ± 0.019
CysThr: 0.43 ± 0.025
CysVal: 0.663 ± 0.03
CysTrp: 0.125 ± 0.011
CysTyr: 0.252 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.129 ± 0.118
AspCys: 0.389 ± 0.021
AspAsp: 3.06 ± 0.065
AspGlu: 3.526 ± 0.067
AspPhe: 2.113 ± 0.053
AspGly: 5.086 ± 0.086
AspHis: 1.212 ± 0.038
AspIle: 2.156 ± 0.054
AspLys: 1.3 ± 0.047
AspLeu: 5.619 ± 0.072
AspMet: 1.046 ± 0.039
AspAsn: 1.074 ± 0.036
AspPro: 3.758 ± 0.071
AspGln: 1.505 ± 0.048
AspArg: 4.055 ± 0.081
AspSer: 1.866 ± 0.047
AspThr: 2.59 ± 0.059
AspVal: 3.949 ± 0.077
AspTrp: 1.167 ± 0.044
AspTyr: 1.644 ± 0.047
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.646 ± 0.116
GluCys: 0.348 ± 0.021
GluAsp: 2.673 ± 0.055
GluGlu: 2.766 ± 0.07
GluPhe: 1.481 ± 0.044
GluGly: 4.151 ± 0.073
GluHis: 1.694 ± 0.044
GluIle: 2.395 ± 0.059
GluLys: 1.513 ± 0.045
GluLeu: 6.674 ± 0.101
GluMet: 0.9 ± 0.036
GluAsn: 1.079 ± 0.037
GluPro: 2.717 ± 0.057
GluGln: 2.351 ± 0.052
GluArg: 5.925 ± 0.1
GluSer: 2.126 ± 0.048
GluThr: 2.603 ± 0.056
GluVal: 4.061 ± 0.076
GluTrp: 0.587 ± 0.026
GluTyr: 1.026 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.501 ± 0.083
PheCys: 0.305 ± 0.02
PheAsp: 2.401 ± 0.055
PheGlu: 1.832 ± 0.044
PhePhe: 1.189 ± 0.043
PheGly: 3.192 ± 0.067
PheHis: 0.811 ± 0.028
PheIle: 1.232 ± 0.045
PheLys: 0.838 ± 0.038
PheLeu: 3.138 ± 0.07
PheMet: 0.644 ± 0.026
PheAsn: 0.937 ± 0.037
PhePro: 1.463 ± 0.044
PheGln: 0.859 ± 0.035
PheArg: 2.178 ± 0.054
PheSer: 1.571 ± 0.042
PheThr: 1.617 ± 0.041
PheVal: 2.538 ± 0.052
PheTrp: 0.45 ± 0.025
PheTyr: 0.81 ± 0.033
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.683 ± 0.121
GlyCys: 0.846 ± 0.034
GlyAsp: 4.354 ± 0.075
GlyGlu: 5.118 ± 0.088
GlyPhe: 3.232 ± 0.063
GlyGly: 6.909 ± 0.111
GlyHis: 2.297 ± 0.053
GlyIle: 3.972 ± 0.074
GlyLys: 2.858 ± 0.074
GlyLeu: 9.187 ± 0.126
GlyMet: 2.321 ± 0.058
GlyAsn: 1.963 ± 0.065
GlyPro: 3.374 ± 0.063
GlyGln: 3.143 ± 0.067
GlyArg: 6.928 ± 0.095
GlySer: 4.104 ± 0.069
GlyThr: 4.248 ± 0.086
GlyVal: 6.419 ± 0.089
GlyTrp: 1.634 ± 0.04
GlyTyr: 2.531 ± 0.053
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.705 ± 0.074
HisCys: 0.317 ± 0.021
HisAsp: 1.309 ± 0.037
HisGlu: 1.246 ± 0.037
HisPhe: 0.887 ± 0.032
HisGly: 2.779 ± 0.058
HisHis: 0.66 ± 0.028
HisIle: 0.79 ± 0.034
HisLys: 0.502 ± 0.023
HisLeu: 2.485 ± 0.052
HisMet: 0.52 ± 0.026
HisAsn: 0.452 ± 0.023
HisPro: 1.79 ± 0.047
HisGln: 0.689 ± 0.028
HisArg: 1.886 ± 0.051
HisSer: 0.854 ± 0.033
HisThr: 1.089 ± 0.04
HisVal: 1.894 ± 0.048
HisTrp: 0.503 ± 0.023
HisTyr: 0.759 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.228 ± 0.086
IleCys: 0.304 ± 0.019
IleAsp: 3.205 ± 0.064
IleGlu: 3.06 ± 0.063
IlePhe: 1.083 ± 0.041
IleGly: 4.169 ± 0.078
IleHis: 0.896 ± 0.033
IleIle: 1.233 ± 0.046
IleLys: 1.109 ± 0.047
IleLeu: 3.208 ± 0.071
IleMet: 0.498 ± 0.028
IleAsn: 1.155 ± 0.043
IlePro: 1.886 ± 0.05
IleGln: 1.023 ± 0.043
IleArg: 2.781 ± 0.062
IleSer: 1.655 ± 0.052
IleThr: 1.97 ± 0.05
IleVal: 3.275 ± 0.066
IleTrp: 0.399 ± 0.022
IleTyr: 0.889 ± 0.039
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.314 ± 0.085
LysCys: 0.136 ± 0.013
LysAsp: 1.327 ± 0.044
LysGlu: 1.096 ± 0.039
LysPhe: 0.634 ± 0.032
LysGly: 1.918 ± 0.055
LysHis: 0.606 ± 0.031
LysIle: 0.957 ± 0.042
LysLys: 0.861 ± 0.053
LysLeu: 2.889 ± 0.067
LysMet: 0.509 ± 0.028
LysAsn: 0.585 ± 0.029
LysPro: 1.78 ± 0.045
LysGln: 0.928 ± 0.038
LysArg: 2.003 ± 0.052
LysSer: 1.29 ± 0.046
LysThr: 1.44 ± 0.046
LysVal: 2.106 ± 0.06
LysTrp: 0.257 ± 0.017
LysTyr: 0.562 ± 0.029
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 18.276 ± 0.233
LeuCys: 1.012 ± 0.036
LeuAsp: 7.2 ± 0.111
LeuGlu: 6.232 ± 0.097
LeuPhe: 3.445 ± 0.072
LeuGly: 9.669 ± 0.127
LeuHis: 2.604 ± 0.061
LeuIle: 4.402 ± 0.077
LeuLys: 3.359 ± 0.068
LeuLeu: 12.823 ± 0.199
LeuMet: 2.032 ± 0.052
LeuAsn: 2.23 ± 0.058
LeuPro: 7.076 ± 0.102
LeuGln: 3.118 ± 0.072
LeuArg: 9.417 ± 0.128
LeuSer: 5.063 ± 0.087
LeuThr: 5.212 ± 0.074
LeuVal: 7.987 ± 0.123
LeuTrp: 1.507 ± 0.05
LeuTyr: 2.409 ± 0.053
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.527 ± 0.067
MetCys: 0.138 ± 0.013
MetAsp: 1.099 ± 0.043
MetGlu: 0.827 ± 0.03
MetPhe: 0.55 ± 0.025
MetGly: 1.399 ± 0.044
MetHis: 0.51 ± 0.023
MetIle: 0.788 ± 0.031
MetLys: 0.661 ± 0.028
MetLeu: 2.499 ± 0.061
MetMet: 0.36 ± 0.022
MetAsn: 0.716 ± 0.032
MetPro: 1.527 ± 0.045
MetGln: 0.894 ± 0.036
MetArg: 1.747 ± 0.046
MetSer: 1.335 ± 0.038
MetThr: 1.148 ± 0.037
MetVal: 1.373 ± 0.044
MetTrp: 0.162 ± 0.014
MetTyr: 0.331 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.908 ± 0.066
AsnCys: 0.167 ± 0.017
AsnAsp: 1.167 ± 0.038
AsnGlu: 1.035 ± 0.039
AsnPhe: 0.71 ± 0.033
AsnGly: 1.951 ± 0.061
AsnHis: 0.492 ± 0.023
AsnIle: 0.888 ± 0.038
AsnLys: 0.494 ± 0.028
AsnLeu: 2.413 ± 0.061
AsnMet: 0.357 ± 0.021
AsnAsn: 0.521 ± 0.029
AsnPro: 1.558 ± 0.053
AsnGln: 0.711 ± 0.035
AsnArg: 1.545 ± 0.041
AsnSer: 0.782 ± 0.034
AsnThr: 1.098 ± 0.041
AsnVal: 1.624 ± 0.052
AsnTrp: 0.319 ± 0.019
AsnTyr: 0.646 ± 0.034
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.612 ± 0.126
ProCys: 0.409 ± 0.024
ProAsp: 3.298 ± 0.055
ProGlu: 3.213 ± 0.072
ProPhe: 1.825 ± 0.047
ProGly: 5.197 ± 0.091
ProHis: 1.33 ± 0.039
ProIle: 1.867 ± 0.048
ProLys: 1.268 ± 0.041
ProLeu: 5.895 ± 0.086
ProMet: 1.332 ± 0.045
ProAsn: 1.05 ± 0.039
ProPro: 3.64 ± 0.1
ProGln: 2.049 ± 0.045
ProArg: 4.008 ± 0.081
ProSer: 2.609 ± 0.055
ProThr: 2.371 ± 0.058
ProVal: 4.286 ± 0.075
ProTrp: 1.049 ± 0.038
ProTyr: 1.314 ± 0.044
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.62 ± 0.088
GlnCys: 0.259 ± 0.018
GlnAsp: 1.464 ± 0.036
GlnGlu: 1.434 ± 0.049
GlnPhe: 0.948 ± 0.033
GlnGly: 2.833 ± 0.058
GlnHis: 0.824 ± 0.033
GlnIle: 1.362 ± 0.047
GlnLys: 0.723 ± 0.028
GlnLeu: 3.803 ± 0.075
GlnMet: 0.712 ± 0.032
GlnAsn: 0.651 ± 0.032
GlnPro: 2.13 ± 0.05
GlnGln: 1.645 ± 0.052
GlnArg: 3.363 ± 0.072
GlnSer: 1.59 ± 0.049
GlnThr: 1.627 ± 0.044
GlnVal: 2.779 ± 0.053
GlnTrp: 0.614 ± 0.028
GlnTyr: 0.766 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.927 ± 0.149
ArgCys: 0.645 ± 0.029
ArgAsp: 4.213 ± 0.068
ArgGlu: 5.286 ± 0.093
ArgPhe: 2.916 ± 0.061
ArgGly: 5.742 ± 0.087
ArgHis: 2.55 ± 0.057
ArgIle: 3.856 ± 0.071
ArgLys: 1.936 ± 0.046
ArgLeu: 10.519 ± 0.143
ArgMet: 1.869 ± 0.046
ArgAsn: 1.613 ± 0.041
ArgPro: 4.286 ± 0.073
ArgGln: 3.228 ± 0.062
ArgArg: 7.981 ± 0.129
ArgSer: 3.275 ± 0.067
ArgThr: 3.353 ± 0.062
ArgVal: 5.677 ± 0.086
ArgTrp: 1.577 ± 0.048
ArgTyr: 2.332 ± 0.054
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.575 ± 0.081
SerCys: 0.365 ± 0.022
SerAsp: 2.103 ± 0.055
SerGlu: 1.992 ± 0.055
SerPhe: 1.647 ± 0.048
SerGly: 4.373 ± 0.073
SerHis: 1.082 ± 0.039
SerIle: 1.884 ± 0.05
SerLys: 0.998 ± 0.039
SerLeu: 4.88 ± 0.074
SerMet: 1.006 ± 0.033
SerAsn: 0.986 ± 0.037
SerPro: 2.586 ± 0.062
SerGln: 1.464 ± 0.043
SerArg: 3.362 ± 0.06
SerSer: 2.33 ± 0.064
SerThr: 2.249 ± 0.057
SerVal: 3.068 ± 0.06
SerTrp: 0.673 ± 0.028
SerTyr: 1.203 ± 0.044
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.868 ± 0.084
ThrCys: 0.422 ± 0.025
ThrAsp: 2.362 ± 0.062
ThrGlu: 1.97 ± 0.055
ThrPhe: 1.457 ± 0.044
ThrGly: 4.346 ± 0.064
ThrHis: 1.189 ± 0.037
ThrIle: 1.792 ± 0.054
ThrLys: 0.829 ± 0.038
ThrLeu: 6.375 ± 0.098
ThrMet: 0.814 ± 0.032
ThrAsn: 0.873 ± 0.037
ThrPro: 3.493 ± 0.072
ThrGln: 1.587 ± 0.045
ThrArg: 3.607 ± 0.065
ThrSer: 2.029 ± 0.054
ThrThr: 2.253 ± 0.057
ThrVal: 3.858 ± 0.072
ThrTrp: 0.614 ± 0.026
ThrTyr: 1.018 ± 0.043
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.31 ± 0.115
ValCys: 0.739 ± 0.03
ValAsp: 4.353 ± 0.071
ValGlu: 4.177 ± 0.071
ValPhe: 2.493 ± 0.054
ValGly: 5.281 ± 0.079
ValHis: 1.757 ± 0.047
ValIle: 3.404 ± 0.067
ValLys: 1.839 ± 0.062
ValLeu: 8.97 ± 0.115
ValMet: 1.454 ± 0.045
ValAsn: 1.803 ± 0.048
ValPro: 4.178 ± 0.072
ValGln: 2.378 ± 0.051
ValArg: 5.742 ± 0.091
ValSer: 3.426 ± 0.073
ValThr: 3.646 ± 0.071
ValVal: 5.826 ± 0.095
ValTrp: 0.918 ± 0.036
ValTyr: 1.704 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.292 ± 0.04
TrpCys: 0.162 ± 0.014
TrpAsp: 0.667 ± 0.029
TrpGlu: 0.518 ± 0.023
TrpPhe: 0.555 ± 0.026
TrpGly: 0.905 ± 0.041
TrpHis: 0.469 ± 0.027
TrpIle: 0.654 ± 0.028
TrpLys: 0.4 ± 0.022
TrpLeu: 2.484 ± 0.064
TrpMet: 0.377 ± 0.023
TrpAsn: 0.444 ± 0.023
TrpPro: 0.946 ± 0.039
TrpGln: 0.908 ± 0.032
TrpArg: 1.677 ± 0.047
TrpSer: 0.768 ± 0.029
TrpThr: 0.801 ± 0.031
TrpVal: 0.913 ± 0.035
TrpTrp: 0.299 ± 0.017
TrpTyr: 0.377 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.195 ± 0.061
TyrCys: 0.197 ± 0.015
TyrAsp: 1.381 ± 0.039
TyrGlu: 1.182 ± 0.038
TyrPhe: 0.881 ± 0.034
TyrGly: 2.291 ± 0.055
TyrHis: 0.589 ± 0.025
TyrIle: 0.705 ± 0.028
TyrLys: 0.526 ± 0.028
TyrLeu: 2.644 ± 0.046
TyrMet: 0.346 ± 0.022
TyrAsn: 0.588 ± 0.031
TyrPro: 1.215 ± 0.037
TyrGln: 0.829 ± 0.03
TyrArg: 2.321 ± 0.052
TyrSer: 0.952 ± 0.038
TyrThr: 1.149 ± 0.037
TyrVal: 1.813 ± 0.046
TyrTrp: 0.402 ± 0.022
TyrTyr: 0.704 ± 0.032
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2599 proteins (846755 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
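A protein-level bootstrap of this kind could be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. Whether the reported ± value is the standard deviation of the replicates is an assumption here, as the page does not say; the function names, the fixed seed, and the toy sequences are likewise hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide frequency in
    per mille, estimated by resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # One replicate: draw as many proteins as in the original set.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real Aerosticca soli sequences.
toy = ["MAALAG", "MALLK", "MKKAA"]
mean_aa, err_aa = bootstrap_dipeptide_error(toy, "AA")
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, which is what "at the protein level" suggests.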

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski