Amino acid dipeptide frequency for Mycoplasma sp. CAG:776

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
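To make the counting concrete, here is a minimal Python sketch of how dipeptide frequencies could be computed and compared with the uniform expectation. It assumes overlapping dipeptides are counted within each protein and that any non-standard letter is mapped to X (Xaa); the exact procedure behind the table is not stated on this page, and the names `dipeptide_per_mille`, `classify` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"                # the 20 standard amino acids
UNIFORM_PER_MILLE = 1000 / len(STANDARD) ** 2    # 1/400 = 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are taken within one protein only, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000 * n / total for dp, n in counts.items()}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > UNIFORM_PER_MILLE:
        return "over"    # would be marked red
    if freq < UNIFORM_PER_MILLE:
        return "under"   # would be marked blue
    return "expected"


toy = ["MKKLLEE", "MKEIL"]   # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy)
```

With only two short toy sequences every observed dipeptide is far above 2.5‰; on a real proteome of several hundred thousand residues the values spread around the expectation as in the table.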

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
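As a small worked example of the unit conversion, the sketch below takes the IleIle value from the table (10.521‰) and expresses it as a fraction, as a percentage, and relative to the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is made up for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ile_ile = 10.521                              # IleIle from the table, in per mille
fraction = per_mille_to_fraction(ile_ile)     # about 0.010521
percent = fraction * 100                      # about 1.0521 %
fold_vs_uniform = ile_ile / 2.5               # about 4.2 times the uniform expectation
```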

For more information, see the sequence space article on Wikipedia.

Residues (row = first residue, column = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.008 ± 0.083
AlaCys: 0.567 ± 0.04
AlaAsp: 2.341 ± 0.091
AlaGlu: 2.367 ± 0.09
AlaPhe: 2.11 ± 0.08
AlaGly: 2.978 ± 0.108
AlaHis: 0.739 ± 0.042
AlaIle: 4.781 ± 0.129
AlaLys: 4.357 ± 0.125
AlaLeu: 4.718 ± 0.121
AlaMet: 1.184 ± 0.065
AlaAsn: 2.913 ± 0.087
AlaPro: 1.079 ± 0.06
AlaGln: 1.059 ± 0.053
AlaArg: 1.748 ± 0.077
AlaSer: 3.381 ± 0.103
AlaThr: 2.695 ± 0.09
AlaVal: 2.526 ± 0.093
AlaTrp: 0.239 ± 0.027
AlaTyr: 2.247 ± 0.078
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.481 ± 0.036
CysCys: 0.13 ± 0.018
CysAsp: 0.635 ± 0.08
CysGlu: 0.661 ± 0.041
CysPhe: 0.531 ± 0.044
CysGly: 0.84 ± 0.051
CysHis: 0.2 ± 0.025
CysIle: 0.791 ± 0.049
CysLys: 0.786 ± 0.051
CysLeu: 0.97 ± 0.056
CysMet: 0.192 ± 0.022
CysAsn: 0.684 ± 0.042
CysPro: 0.341 ± 0.034
CysGln: 0.213 ± 0.023
CysArg: 0.304 ± 0.031
CysSer: 0.749 ± 0.047
CysThr: 0.507 ± 0.039
CysVal: 0.57 ± 0.039
CysTrp: 0.049 ± 0.012
CysTyr: 0.601 ± 0.038
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.541 ± 0.092
AspCys: 0.502 ± 0.038
AspAsp: 2.874 ± 0.103
AspGlu: 4.7 ± 0.106
AspPhe: 2.929 ± 0.097
AspGly: 3.095 ± 0.191
AspHis: 0.809 ± 0.053
AspIle: 5.871 ± 0.134
AspLys: 4.596 ± 0.118
AspLeu: 6.139 ± 0.131
AspMet: 1.316 ± 0.066
AspAsn: 3.569 ± 0.099
AspPro: 1.314 ± 0.06
AspGln: 1.212 ± 0.063
AspArg: 1.571 ± 0.067
AspSer: 3.137 ± 0.086
AspThr: 3.004 ± 0.12
AspVal: 3.246 ± 0.09
AspTrp: 0.424 ± 0.032
AspTyr: 3.66 ± 0.118
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.948 ± 0.132
GluCys: 0.609 ± 0.035
GluAsp: 4.198 ± 0.117
GluGlu: 9.106 ± 0.196
GluPhe: 3.077 ± 0.094
GluGly: 3.22 ± 0.094
GluHis: 1.118 ± 0.058
GluIle: 8.183 ± 0.165
GluLys: 7.863 ± 0.172
GluLeu: 7.9 ± 0.193
GluMet: 1.92 ± 0.066
GluAsn: 5.772 ± 0.136
GluPro: 1.384 ± 0.085
GluGln: 2.086 ± 0.074
GluArg: 2.515 ± 0.087
GluSer: 3.376 ± 0.111
GluThr: 3.363 ± 0.103
GluVal: 4.802 ± 0.119
GluTrp: 0.427 ± 0.031
GluTyr: 4.053 ± 0.118
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.091 ± 0.075
PheCys: 0.645 ± 0.039
PheAsp: 2.661 ± 0.088
PheGlu: 2.531 ± 0.082
PhePhe: 2.011 ± 0.093
PheGly: 2.318 ± 0.08
PheHis: 0.658 ± 0.046
PheIle: 4.705 ± 0.151
PheLys: 3.73 ± 0.103
PheLeu: 5.044 ± 0.156
PheMet: 1.014 ± 0.057
PheAsn: 2.882 ± 0.098
PhePro: 1.066 ± 0.06
PheGln: 1.207 ± 0.061
PheArg: 1.275 ± 0.059
PheSer: 3.028 ± 0.101
PheThr: 2.484 ± 0.079
PheVal: 2.549 ± 0.095
PheTrp: 0.26 ± 0.028
PheTyr: 2.409 ± 0.086
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.929 ± 0.107
GlyCys: 0.515 ± 0.037
GlyAsp: 2.921 ± 0.105
GlyGlu: 3.592 ± 0.098
GlyPhe: 2.559 ± 0.085
GlyGly: 3.205 ± 0.15
GlyHis: 0.851 ± 0.051
GlyIle: 5.577 ± 0.134
GlyLys: 4.534 ± 0.127
GlyLeu: 4.9 ± 0.122
GlyMet: 1.355 ± 0.066
GlyAsn: 3.329 ± 0.127
GlyPro: 1.004 ± 0.13
GlyGln: 0.853 ± 0.058
GlyArg: 1.795 ± 0.073
GlySer: 3.462 ± 0.117
GlyThr: 3.454 ± 0.132
GlyVal: 3.649 ± 0.119
GlyTrp: 0.336 ± 0.031
GlyTyr: 3.101 ± 0.11
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.687 ± 0.048
HisCys: 0.148 ± 0.021
HisAsp: 0.767 ± 0.039
HisGlu: 1.072 ± 0.059
HisPhe: 0.773 ± 0.05
HisGly: 0.869 ± 0.049
HisHis: 0.398 ± 0.032
HisIle: 1.386 ± 0.058
HisLys: 1.1 ± 0.053
HisLeu: 1.545 ± 0.072
HisMet: 0.333 ± 0.03
HisAsn: 1.022 ± 0.049
HisPro: 0.609 ± 0.038
HisGln: 0.458 ± 0.035
HisArg: 0.455 ± 0.038
HisSer: 0.814 ± 0.045
HisThr: 0.76 ± 0.049
HisVal: 0.877 ± 0.05
HisTrp: 0.068 ± 0.014
HisTyr: 0.861 ± 0.047
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.155 ± 0.115
IleCys: 1.246 ± 0.063
IleAsp: 5.954 ± 0.153
IleGlu: 7.033 ± 0.138
IlePhe: 4.305 ± 0.126
IleGly: 5.168 ± 0.139
IleHis: 1.467 ± 0.066
IleIle: 10.521 ± 0.231
IleLys: 9.14 ± 0.19
IleLeu: 10.04 ± 0.22
IleMet: 2.177 ± 0.077
IleAsn: 6.768 ± 0.156
IlePro: 3.098 ± 0.092
IleGln: 1.732 ± 0.073
IleArg: 3.34 ± 0.104
IleSer: 6.474 ± 0.141
IleThr: 5.647 ± 0.116
IleVal: 5.834 ± 0.14
IleTrp: 0.507 ± 0.041
IleTyr: 4.627 ± 0.108
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.561 ± 0.096
LysCys: 0.934 ± 0.05
LysAsp: 5.66 ± 0.121
LysGlu: 10.771 ± 0.192
LysPhe: 2.877 ± 0.093
LysGly: 4.131 ± 0.104
LysHis: 1.077 ± 0.056
LysIle: 9.062 ± 0.185
LysLys: 10.293 ± 0.207
LysLeu: 7.572 ± 0.152
LysMet: 2.481 ± 0.095
LysAsn: 7.057 ± 0.134
LysPro: 1.808 ± 0.087
LysGln: 1.878 ± 0.071
LysArg: 3.449 ± 0.102
LysSer: 4.445 ± 0.099
LysThr: 4.518 ± 0.118
LysVal: 5.275 ± 0.114
LysTrp: 0.559 ± 0.037
LysTyr: 4.953 ± 0.131
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.776 ± 0.106
LeuCys: 0.936 ± 0.048
LeuAsp: 5.816 ± 0.126
LeuGlu: 8.18 ± 0.198
LeuPhe: 4.544 ± 0.161
LeuGly: 5.145 ± 0.14
LeuHis: 1.197 ± 0.067
LeuIle: 9.289 ± 0.179
LeuLys: 10.389 ± 0.182
LeuLeu: 8.685 ± 0.199
LeuMet: 2.011 ± 0.076
LeuAsn: 7.044 ± 0.146
LeuPro: 2.476 ± 0.087
LeuGln: 2.003 ± 0.082
LeuArg: 2.747 ± 0.095
LeuSer: 6.266 ± 0.148
LeuThr: 5.085 ± 0.11
LeuVal: 5.79 ± 0.113
LeuTrp: 0.489 ± 0.038
LeuTyr: 4.37 ± 0.114
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.254 ± 0.063
MetCys: 0.195 ± 0.024
MetAsp: 1.275 ± 0.057
MetGlu: 1.8 ± 0.065
MetPhe: 0.952 ± 0.054
MetGly: 1.301 ± 0.075
MetHis: 0.304 ± 0.032
MetIle: 2.292 ± 0.074
MetLys: 2.531 ± 0.064
MetLeu: 2.102 ± 0.077
MetMet: 0.536 ± 0.044
MetAsn: 1.891 ± 0.07
MetPro: 0.694 ± 0.042
MetGln: 0.663 ± 0.036
MetArg: 0.773 ± 0.043
MetSer: 1.412 ± 0.055
MetThr: 1.022 ± 0.056
MetVal: 1.293 ± 0.063
MetTrp: 0.107 ± 0.017
MetTyr: 1.072 ± 0.052
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.874 ± 0.093
AsnCys: 0.663 ± 0.054
AsnAsp: 3.564 ± 0.09
AsnGlu: 4.966 ± 0.11
AsnPhe: 2.913 ± 0.066
AsnGly: 3.92 ± 0.129
AsnHis: 1.202 ± 0.061
AsnIle: 7.408 ± 0.154
AsnLys: 6.62 ± 0.133
AsnLeu: 6.464 ± 0.144
AsnMet: 1.782 ± 0.065
AsnAsn: 5.434 ± 0.193
AsnPro: 2.185 ± 0.086
AsnGln: 1.803 ± 0.088
AsnArg: 2.044 ± 0.073
AsnSer: 4.188 ± 0.13
AsnThr: 3.6 ± 0.111
AsnVal: 3.597 ± 0.095
AsnTrp: 0.468 ± 0.039
AsnTyr: 4.094 ± 0.134
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.001 ± 0.048
ProCys: 0.245 ± 0.026
ProAsp: 1.519 ± 0.102
ProGlu: 1.894 ± 0.069
ProPhe: 1.34 ± 0.067
ProGly: 1.15 ± 0.068
ProHis: 0.393 ± 0.032
ProIle: 2.518 ± 0.09
ProLys: 2.224 ± 0.085
ProLeu: 2.195 ± 0.075
ProMet: 0.564 ± 0.032
ProAsn: 1.829 ± 0.068
ProPro: 0.393 ± 0.046
ProGln: 0.515 ± 0.047
ProArg: 0.622 ± 0.038
ProSer: 1.55 ± 0.065
ProThr: 1.691 ± 0.105
ProVal: 1.756 ± 0.091
ProTrp: 0.185 ± 0.027
ProTyr: 1.363 ± 0.068
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.173 ± 0.047
GlnCys: 0.13 ± 0.019
GlnAsp: 1.553 ± 0.059
GlnGlu: 2.221 ± 0.078
GlnPhe: 0.939 ± 0.052
GlnGly: 1.134 ± 0.099
GlnHis: 0.302 ± 0.029
GlnIle: 2.367 ± 0.075
GlnLys: 2.276 ± 0.077
GlnLeu: 1.61 ± 0.067
GlnMet: 0.603 ± 0.036
GlnAsn: 2.044 ± 0.069
GlnPro: 0.375 ± 0.035
GlnGln: 0.494 ± 0.036
GlnArg: 0.661 ± 0.043
GlnSer: 1.157 ± 0.055
GlnThr: 1.053 ± 0.052
GlnVal: 1.615 ± 0.063
GlnTrp: 0.086 ± 0.016
GlnTyr: 1.098 ± 0.054
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.256 ± 0.058
ArgCys: 0.289 ± 0.028
ArgAsp: 1.792 ± 0.075
ArgGlu: 2.731 ± 0.098
ArgPhe: 1.353 ± 0.056
ArgGly: 1.61 ± 0.065
ArgHis: 0.421 ± 0.036
ArgIle: 3.21 ± 0.103
ArgLys: 3.205 ± 0.095
ArgLeu: 3.103 ± 0.104
ArgMet: 0.973 ± 0.047
ArgAsn: 2.133 ± 0.071
ArgPro: 0.83 ± 0.049
ArgGln: 0.702 ± 0.048
ArgArg: 1.233 ± 0.071
ArgSer: 1.751 ± 0.072
ArgThr: 1.425 ± 0.057
ArgVal: 1.935 ± 0.076
ArgTrp: 0.164 ± 0.023
ArgTyr: 1.535 ± 0.065
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.453 ± 0.075
SerCys: 0.557 ± 0.045
SerAsp: 3.121 ± 0.11
SerGlu: 3.896 ± 0.118
SerPhe: 3.275 ± 0.103
SerGly: 3.904 ± 0.109
SerHis: 0.957 ± 0.052
SerIle: 5.894 ± 0.129
SerLys: 5.54 ± 0.128
SerLeu: 6.433 ± 0.129
SerMet: 1.496 ± 0.067
SerAsn: 4.352 ± 0.138
SerPro: 1.256 ± 0.056
SerGln: 1.204 ± 0.065
SerArg: 1.686 ± 0.061
SerSer: 4.664 ± 0.163
SerThr: 3.311 ± 0.107
SerVal: 3.223 ± 0.09
SerTrp: 0.481 ± 0.043
SerTyr: 3.293 ± 0.106
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.216 ± 0.092
ThrCys: 0.536 ± 0.035
ThrAsp: 2.869 ± 0.111
ThrGlu: 3.069 ± 0.095
ThrPhe: 2.505 ± 0.086
ThrGly: 3.527 ± 0.158
ThrHis: 0.838 ± 0.048
ThrIle: 5.603 ± 0.121
ThrLys: 4.404 ± 0.102
ThrLeu: 5.317 ± 0.131
ThrMet: 1.072 ± 0.05
ThrAsn: 3.709 ± 0.108
ThrPro: 1.901 ± 0.077
ThrGln: 1.199 ± 0.067
ThrArg: 1.516 ± 0.062
ThrSer: 3.868 ± 0.128
ThrThr: 3.506 ± 0.123
ThrVal: 2.908 ± 0.101
ThrTrp: 0.31 ± 0.031
ThrTyr: 2.796 ± 0.101
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.973 ± 0.092
ValCys: 0.702 ± 0.047
ValAsp: 3.441 ± 0.09
ValGlu: 3.917 ± 0.139
ValPhe: 2.479 ± 0.093
ValGly: 3.483 ± 0.101
ValHis: 0.892 ± 0.053
ValIle: 6.055 ± 0.12
ValLys: 4.703 ± 0.115
ValLeu: 5.977 ± 0.128
ValMet: 1.23 ± 0.053
ValAsn: 3.527 ± 0.098
ValPro: 1.701 ± 0.072
ValGln: 1.144 ± 0.051
ValArg: 2.008 ± 0.06
ValSer: 4.021 ± 0.112
ValThr: 3.389 ± 0.103
ValVal: 4.094 ± 0.115
ValTrp: 0.33 ± 0.033
ValTyr: 2.671 ± 0.08
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.258 ± 0.027
TrpCys: 0.091 ± 0.016
TrpAsp: 0.338 ± 0.035
TrpGlu: 0.403 ± 0.032
TrpPhe: 0.231 ± 0.025
TrpGly: 0.382 ± 0.033
TrpHis: 0.107 ± 0.017
TrpIle: 0.497 ± 0.033
TrpLys: 0.416 ± 0.032
TrpLeu: 0.585 ± 0.038
TrpMet: 0.12 ± 0.018
TrpAsn: 0.484 ± 0.039
TrpPro: 0.096 ± 0.018
TrpGln: 0.172 ± 0.022
TrpArg: 0.265 ± 0.028
TrpSer: 0.393 ± 0.034
TrpThr: 0.32 ± 0.032
TrpVal: 0.291 ± 0.027
TrpTrp: 0.049 ± 0.012
TrpTyr: 0.492 ± 0.033
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.401 ± 0.092
TyrCys: 0.596 ± 0.042
TyrAsp: 3.21 ± 0.105
TyrGlu: 4.011 ± 0.107
TyrPhe: 2.866 ± 0.09
TyrGly: 2.541 ± 0.085
TyrHis: 1.072 ± 0.052
TyrIle: 4.227 ± 0.124
TyrLys: 3.655 ± 0.109
TyrLeu: 5.938 ± 0.128
TyrMet: 1.105 ± 0.059
TyrAsn: 3.345 ± 0.113
TyrPro: 1.334 ± 0.051
TyrGln: 2.294 ± 0.084
TyrArg: 1.644 ± 0.068
TyrSer: 2.859 ± 0.092
TyrThr: 2.833 ± 0.11
TyrVal: 2.911 ± 0.096
TyrTrp: 0.453 ± 0.042
TyrTyr: 3.543 ± 0.12
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1349 proteins (384452 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
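A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is taken as the error. This is a minimal illustration only. It assumes the reported ± value is the standard deviation across replicates, which the page does not state, and the function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)                      # fixed seed for reproducibility
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)   # Counter gives 0 if absent
        estimates.append(1000 * hits / total if total else 0.0)
    # Assumption: the error is the sample standard deviation across replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MKKLLEE", "MKEIL", "MKKKA", "AILKE"]   # hypothetical sequences
mean, err = bootstrap_error(toy, "MK")
```

Resampling proteins rather than individual residues keeps each protein's dipeptides together, so the error reflects variation between proteins.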

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski