Amino acid dipeptide frequency for Mycoplasma neurolyticum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
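The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the database: the function names, the one-letter residue alphabet, and the toy sequences are assumptions made for the example, and it assumes dipeptides are counted as overlapping residue pairs within each protein.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code (the table uses three-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # A protein of length n contributes n - 1 overlapping dipeptides.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    # Pairs with non-standard letters stay in the total but are not reported.
    total = sum(counts.values())
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }

# Uniform expectation: 1 / 400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(AMINO_ACIDS) ** 2

def classify(freq_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # assumed to correspond to red in the table
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # assumed to correspond to blue in the table
    return "expected"

# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKLI", "MKK"]
freqs = dipeptide_permille(toy)
print(freqs["KK"], classify(freqs["KK"]))
```

Applied to values from the table, `classify(15.016)` (LysLys) returns "over" and `classify(2.372)` (AlaAla) returns "under".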

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
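As a small worked example of the unit conversion, the sketch below turns two values copied from the table (AlaAla and LysLys) into plain fractions and percentages. The helper name `permille_to_fraction` is made up for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

# Values copied from the table: AlaAla and LysLys.
for name, permille in [("AlaAla", 2.372), ("LysLys", 15.016)]:
    fraction = permille_to_fraction(permille)
    print(f"{name}: {permille} per mille = {fraction:.6f} = {fraction * 100:.4f}%")
```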

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.372AlaAla: 2.372 ± 0.124
0.159AlaCys: 0.159 ± 0.022
1.53AlaAsp: 1.53 ± 0.075
2.216AlaGlu: 2.216 ± 0.101
2.677AlaPhe: 2.677 ± 0.102
2.173AlaGly: 2.173 ± 0.113
0.623AlaHis: 0.623 ± 0.048
5.363AlaIle: 5.363 ± 0.153
5.217AlaLys: 5.217 ± 0.16
4.111AlaLeu: 4.111 ± 0.145
0.884AlaMet: 0.884 ± 0.064
3.303AlaAsn: 3.303 ± 0.163
1.07AlaPro: 1.07 ± 0.06
1.332AlaGln: 1.332 ± 0.066
1.305AlaArg: 1.305 ± 0.079
2.72AlaSer: 2.72 ± 0.092
2.491AlaThr: 2.491 ± 0.128
1.885AlaVal: 1.885 ± 0.102
0.407AlaTrp: 0.407 ± 0.04
1.517AlaTyr: 1.517 ± 0.077
0.0AlaXaa: 0.0 ± 0.0
Cys
0.149CysAla: 0.149 ± 0.02
0.023CysCys: 0.023 ± 0.008
0.229CysAsp: 0.229 ± 0.028
0.268CysGlu: 0.268 ± 0.035
0.351CysPhe: 0.351 ± 0.034
0.229CysGly: 0.229 ± 0.028
0.083CysHis: 0.083 ± 0.018
0.318CysIle: 0.318 ± 0.035
0.235CysLys: 0.235 ± 0.03
0.308CysLeu: 0.308 ± 0.035
0.05CysMet: 0.05 ± 0.013
0.219CysAsn: 0.219 ± 0.028
0.119CysPro: 0.119 ± 0.019
0.129CysGln: 0.129 ± 0.022
0.093CysArg: 0.093 ± 0.017
0.242CysSer: 0.242 ± 0.031
0.136CysThr: 0.136 ± 0.021
0.136CysVal: 0.136 ± 0.021
0.05CysTrp: 0.05 ± 0.014
0.126CysTyr: 0.126 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
2.166AspAla: 2.166 ± 0.137
0.113AspCys: 0.113 ± 0.02
2.892AspAsp: 2.892 ± 0.145
4.505AspGlu: 4.505 ± 0.158
3.866AspPhe: 3.866 ± 0.117
2.107AspGly: 2.107 ± 0.11
0.676AspHis: 0.676 ± 0.052
4.82AspIle: 4.82 ± 0.16
6.022AspLys: 6.022 ± 0.171
5.211AspLeu: 5.211 ± 0.162
0.467AspMet: 0.467 ± 0.037
3.892AspAsn: 3.892 ± 0.182
1.229AspPro: 1.229 ± 0.073
1.375AspGln: 1.375 ± 0.071
1.087AspArg: 1.087 ± 0.057
3.018AspSer: 3.018 ± 0.135
2.031AspThr: 2.031 ± 0.107
2.918AspVal: 2.918 ± 0.111
0.553AspTrp: 0.553 ± 0.048
2.405AspTyr: 2.405 ± 0.096
0.0AspXaa: 0.0 ± 0.0
Glu
2.849GluAla: 2.849 ± 0.104
0.176GluCys: 0.176 ± 0.034
2.723GluAsp: 2.723 ± 0.117
4.969GluGlu: 4.969 ± 0.203
3.962GluPhe: 3.962 ± 0.128
2.014GluGly: 2.014 ± 0.093
0.861GluHis: 0.861 ± 0.049
9.282GluIle: 9.282 ± 0.194
10.309GluLys: 10.309 ± 0.218
6.34GluLeu: 6.34 ± 0.164
1.116GluMet: 1.116 ± 0.069
7.888GluAsn: 7.888 ± 0.195
1.212GluPro: 1.212 ± 0.06
2.276GluGln: 2.276 ± 0.083
1.567GluArg: 1.567 ± 0.082
2.753GluSer: 2.753 ± 0.096
3.316GluThr: 3.316 ± 0.126
3.054GluVal: 3.054 ± 0.141
0.629GluTrp: 0.629 ± 0.039
2.955GluTyr: 2.955 ± 0.106
0.0GluXaa: 0.0 ± 0.0
Phe
2.869PheAla: 2.869 ± 0.12
0.298PheCys: 0.298 ± 0.033
3.72PheAsp: 3.72 ± 0.134
4.432PheGlu: 4.432 ± 0.124
4.181PhePhe: 4.181 ± 0.186
2.554PheGly: 2.554 ± 0.104
0.759PheHis: 0.759 ± 0.053
6.258PheIle: 6.258 ± 0.221
6.281PheLys: 6.281 ± 0.159
7.182PheLeu: 7.182 ± 0.278
0.795PheMet: 0.795 ± 0.058
5.022PheAsn: 5.022 ± 0.161
1.232PhePro: 1.232 ± 0.082
1.696PheGln: 1.696 ± 0.08
1.203PheArg: 1.203 ± 0.056
5.27PheSer: 5.27 ± 0.147
2.511PheThr: 2.511 ± 0.112
3.525PheVal: 3.525 ± 0.099
0.788PheTrp: 0.788 ± 0.06
2.849PheTyr: 2.849 ± 0.124
0.0PheXaa: 0.0 ± 0.0
Gly
2.253GlyAla: 2.253 ± 0.1
0.186GlyCys: 0.186 ± 0.027
1.938GlyAsp: 1.938 ± 0.09
2.541GlyGlu: 2.541 ± 0.105
2.541GlyPhe: 2.541 ± 0.12
2.342GlyGly: 2.342 ± 0.111
0.775GlyHis: 0.775 ± 0.05
4.207GlyIle: 4.207 ± 0.163
3.965GlyLys: 3.965 ± 0.133
3.405GlyLeu: 3.405 ± 0.127
0.742GlyMet: 0.742 ± 0.052
2.1GlyAsn: 2.1 ± 0.103
0.858GlyPro: 0.858 ± 0.058
1.282GlyGln: 1.282 ± 0.078
1.272GlyArg: 1.272 ± 0.074
2.687GlySer: 2.687 ± 0.101
2.269GlyThr: 2.269 ± 0.086
2.441GlyVal: 2.441 ± 0.101
0.394GlyTrp: 0.394 ± 0.043
1.805GlyTyr: 1.805 ± 0.089
0.0GlyXaa: 0.0 ± 0.0
His
0.54HisAla: 0.54 ± 0.042
0.04HisCys: 0.04 ± 0.013
0.785HisAsp: 0.785 ± 0.054
0.931HisGlu: 0.931 ± 0.05
1.09HisPhe: 1.09 ± 0.065
0.669HisGly: 0.669 ± 0.048
0.282HisHis: 0.282 ± 0.033
1.335HisIle: 1.335 ± 0.065
1.378HisLys: 1.378 ± 0.075
1.421HisLeu: 1.421 ± 0.059
0.202HisMet: 0.202 ± 0.024
1.017HisAsn: 1.017 ± 0.061
0.398HisPro: 0.398 ± 0.041
0.321HisGln: 0.321 ± 0.03
0.401HisArg: 0.401 ± 0.037
0.954HisSer: 0.954 ± 0.065
0.563HisThr: 0.563 ± 0.045
0.553HisVal: 0.553 ± 0.042
0.109HisTrp: 0.109 ± 0.018
0.573HisTyr: 0.573 ± 0.044
0.0HisXaa: 0.0 ± 0.0
Ile
5.138IleAla: 5.138 ± 0.159
0.464IleCys: 0.464 ± 0.039
6.062IleAsp: 6.062 ± 0.14
7.48IleGlu: 7.48 ± 0.179
7.609IlePhe: 7.609 ± 0.243
3.962IleGly: 3.962 ± 0.152
1.289IleHis: 1.289 ± 0.077
10.995IleIle: 10.995 ± 0.293
11.843IleLys: 11.843 ± 0.226
10.273IleLeu: 10.273 ± 0.3
1.464IleMet: 1.464 ± 0.088
8.957IleAsn: 8.957 ± 0.238
2.852IlePro: 2.852 ± 0.097
3.256IleGln: 3.256 ± 0.108
2.077IleArg: 2.077 ± 0.09
7.897IleSer: 7.897 ± 0.178
5.012IleThr: 5.012 ± 0.15
5.628IleVal: 5.628 ± 0.144
0.944IleTrp: 0.944 ± 0.067
4.273IleTyr: 4.273 ± 0.16
0.0IleXaa: 0.0 ± 0.0
Lys
4.485LysAla: 4.485 ± 0.162
0.285LysCys: 0.285 ± 0.035
5.81LysAsp: 5.81 ± 0.199
9.037LysGlu: 9.037 ± 0.221
5.708LysPhe: 5.708 ± 0.171
3.568LysGly: 3.568 ± 0.117
1.689LysHis: 1.689 ± 0.081
14.963LysIle: 14.963 ± 0.27
15.016LysLys: 15.016 ± 0.364
9.481LysLeu: 9.481 ± 0.158
2.544LysMet: 2.544 ± 0.102
14.317LysAsn: 14.317 ± 0.312
2.653LysPro: 2.653 ± 0.132
3.952LysGln: 3.952 ± 0.132
2.687LysArg: 2.687 ± 0.102
5.304LysSer: 5.304 ± 0.127
6.917LysThr: 6.917 ± 0.182
4.929LysVal: 4.929 ± 0.145
1.265LysTrp: 1.265 ± 0.068
5.353LysTyr: 5.353 ± 0.17
0.0LysXaa: 0.0 ± 0.0
Leu
4.267LeuAla: 4.267 ± 0.151
0.331LeuCys: 0.331 ± 0.035
4.966LeuAsp: 4.966 ± 0.167
7.059LeuGlu: 7.059 ± 0.182
5.724LeuPhe: 5.724 ± 0.215
3.982LeuGly: 3.982 ± 0.171
0.99LeuHis: 0.99 ± 0.048
9.703LeuIle: 9.703 ± 0.271
11.876LeuLys: 11.876 ± 0.191
8.275LeuLeu: 8.275 ± 0.252
1.504LeuMet: 1.504 ± 0.088
8.749LeuAsn: 8.749 ± 0.202
2.345LeuPro: 2.345 ± 0.094
2.273LeuGln: 2.273 ± 0.09
2.153LeuArg: 2.153 ± 0.086
6.377LeuSer: 6.377 ± 0.137
4.787LeuThr: 4.787 ± 0.141
4.618LeuVal: 4.618 ± 0.137
0.858LeuTrp: 0.858 ± 0.06
2.922LeuTyr: 2.922 ± 0.097
0.0LeuXaa: 0.0 ± 0.0
Met
0.851MetAla: 0.851 ± 0.058
0.05MetCys: 0.05 ± 0.013
0.699MetAsp: 0.699 ± 0.049
0.805MetGlu: 0.805 ± 0.066
1.067MetPhe: 1.067 ± 0.072
0.646MetGly: 0.646 ± 0.045
0.272MetHis: 0.272 ± 0.028
1.381MetIle: 1.381 ± 0.078
1.802MetLys: 1.802 ± 0.08
1.53MetLeu: 1.53 ± 0.079
0.275MetMet: 0.275 ± 0.03
1.239MetAsn: 1.239 ± 0.07
0.537MetPro: 0.537 ± 0.037
0.603MetGln: 0.603 ± 0.053
0.345MetArg: 0.345 ± 0.037
1.06MetSer: 1.06 ± 0.057
0.716MetThr: 0.716 ± 0.051
0.812MetVal: 0.812 ± 0.066
0.136MetTrp: 0.136 ± 0.022
0.451MetTyr: 0.451 ± 0.039
0.0MetXaa: 0.0 ± 0.0
Asn
3.276AsnAla: 3.276 ± 0.11
0.258AsnCys: 0.258 ± 0.028
5.327AsnAsp: 5.327 ± 0.192
6.615AsnGlu: 6.615 ± 0.16
5.86AsnPhe: 5.86 ± 0.197
2.895AsnGly: 2.895 ± 0.103
1.067AsnHis: 1.067 ± 0.062
9.372AsnIle: 9.372 ± 0.231
11.75AsnLys: 11.75 ± 0.257
8.626AsnLeu: 8.626 ± 0.194
1.037AsnMet: 1.037 ± 0.063
8.984AsnAsn: 8.984 ± 0.331
2.104AsnPro: 2.104 ± 0.11
2.832AsnGln: 2.832 ± 0.125
1.607AsnArg: 1.607 ± 0.088
5.887AsnSer: 5.887 ± 0.16
3.511AsnThr: 3.511 ± 0.178
3.786AsnVal: 3.786 ± 0.134
1.01AsnTrp: 1.01 ± 0.061
3.882AsnTyr: 3.882 ± 0.131
0.0AsnXaa: 0.0 ± 0.0
Pro
0.858ProAla: 0.858 ± 0.063
0.089ProCys: 0.089 ± 0.016
1.15ProAsp: 1.15 ± 0.058
1.799ProGlu: 1.799 ± 0.083
1.521ProPhe: 1.521 ± 0.07
1.183ProGly: 1.183 ± 0.074
0.444ProHis: 0.444 ± 0.034
2.508ProIle: 2.508 ± 0.088
2.508ProLys: 2.508 ± 0.1
2.239ProLeu: 2.239 ± 0.09
0.351ProMet: 0.351 ± 0.033
1.958ProAsn: 1.958 ± 0.109
0.5ProPro: 0.5 ± 0.042
0.663ProGln: 0.663 ± 0.052
0.603ProArg: 0.603 ± 0.041
1.59ProSer: 1.59 ± 0.084
1.421ProThr: 1.421 ± 0.071
1.05ProVal: 1.05 ± 0.071
0.258ProTrp: 0.258 ± 0.031
0.928ProTyr: 0.928 ± 0.051
0.0ProXaa: 0.0 ± 0.0
Gln
1.259GlnAla: 1.259 ± 0.062
0.053GlnCys: 0.053 ± 0.012
1.289GlnAsp: 1.289 ± 0.087
2.087GlnGlu: 2.087 ± 0.074
1.428GlnPhe: 1.428 ± 0.074
1.13GlnGly: 1.13 ± 0.063
0.454GlnHis: 0.454 ± 0.037
3.223GlnIle: 3.223 ± 0.102
4.694GlnLys: 4.694 ± 0.157
2.478GlnLeu: 2.478 ± 0.092
0.523GlnMet: 0.523 ± 0.041
3.654GlnAsn: 3.654 ± 0.127
0.639GlnPro: 0.639 ± 0.044
1.09GlnGln: 1.09 ± 0.071
0.808GlnArg: 0.808 ± 0.066
1.481GlnSer: 1.481 ± 0.078
1.517GlnThr: 1.517 ± 0.08
1.269GlnVal: 1.269 ± 0.059
0.325GlnTrp: 0.325 ± 0.034
1.103GlnTyr: 1.103 ± 0.062
0.0GlnXaa: 0.0 ± 0.0
Arg
1.047ArgAla: 1.047 ± 0.07
0.053ArgCys: 0.053 ± 0.015
1.12ArgAsp: 1.12 ± 0.064
1.54ArgGlu: 1.54 ± 0.09
1.292ArgPhe: 1.292 ± 0.069
1.014ArgGly: 1.014 ± 0.058
0.338ArgHis: 0.338 ± 0.032
2.521ArgIle: 2.521 ± 0.098
2.793ArgLys: 2.793 ± 0.113
1.898ArgLeu: 1.898 ± 0.092
0.52ArgMet: 0.52 ± 0.042
1.835ArgAsn: 1.835 ± 0.088
0.633ArgPro: 0.633 ± 0.048
0.904ArgGln: 0.904 ± 0.056
0.778ArgArg: 0.778 ± 0.056
1.338ArgSer: 1.338 ± 0.066
1.183ArgThr: 1.183 ± 0.063
1.338ArgVal: 1.338 ± 0.08
0.189ArgTrp: 0.189 ± 0.024
0.891ArgTyr: 0.891 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
2.644SerAla: 2.644 ± 0.102
0.345SerCys: 0.345 ± 0.035
3.342SerAsp: 3.342 ± 0.142
4.306SerGlu: 4.306 ± 0.126
4.27SerPhe: 4.27 ± 0.127
2.912SerGly: 2.912 ± 0.112
1.05SerHis: 1.05 ± 0.059
6.357SerIle: 6.357 ± 0.152
7.314SerLys: 7.314 ± 0.148
6.261SerLeu: 6.261 ± 0.153
0.769SerMet: 0.769 ± 0.05
4.78SerAsn: 4.78 ± 0.155
1.381SerPro: 1.381 ± 0.061
2.113SerGln: 2.113 ± 0.082
1.713SerArg: 1.713 ± 0.086
4.625SerSer: 4.625 ± 0.142
3.22SerThr: 3.22 ± 0.096
2.776SerVal: 2.776 ± 0.1
0.663SerTrp: 0.663 ± 0.045
2.402SerTyr: 2.402 ± 0.088
0.0SerXaa: 0.0 ± 0.0
Thr
1.686ThrAla: 1.686 ± 0.078
0.133ThrCys: 0.133 ± 0.026
1.822ThrAsp: 1.822 ± 0.099
2.617ThrGlu: 2.617 ± 0.119
3.339ThrPhe: 3.339 ± 0.109
2.117ThrGly: 2.117 ± 0.106
0.659ThrHis: 0.659 ± 0.046
5.592ThrIle: 5.592 ± 0.124
5.996ThrLys: 5.996 ± 0.181
5.052ThrLeu: 5.052 ± 0.138
0.699ThrMet: 0.699 ± 0.045
4.333ThrAsn: 4.333 ± 0.185
1.434ThrPro: 1.434 ± 0.078
1.504ThrGln: 1.504 ± 0.079
1.279ThrArg: 1.279 ± 0.059
3.472ThrSer: 3.472 ± 0.105
3.137ThrThr: 3.137 ± 0.139
1.988ThrVal: 1.988 ± 0.093
0.487ThrTrp: 0.487 ± 0.05
1.6ThrTyr: 1.6 ± 0.08
0.0ThrXaa: 0.0 ± 0.0
Val
2.521ValAla: 2.521 ± 0.099
0.192ValCys: 0.192 ± 0.026
3.097ValAsp: 3.097 ± 0.143
3.624ValGlu: 3.624 ± 0.146
3.296ValPhe: 3.296 ± 0.13
2.398ValGly: 2.398 ± 0.116
0.629ValHis: 0.629 ± 0.051
4.853ValIle: 4.853 ± 0.144
4.651ValLys: 4.651 ± 0.16
4.548ValLeu: 4.548 ± 0.12
0.729ValMet: 0.729 ± 0.048
3.376ValAsn: 3.376 ± 0.092
1.368ValPro: 1.368 ± 0.067
1.183ValGln: 1.183 ± 0.063
1.024ValArg: 1.024 ± 0.067
3.197ValSer: 3.197 ± 0.116
2.157ValThr: 2.157 ± 0.1
3.147ValVal: 3.147 ± 0.124
0.494ValTrp: 0.494 ± 0.04
1.978ValTyr: 1.978 ± 0.089
0.0ValXaa: 0.0 ± 0.0
Trp
0.384TrpAla: 0.384 ± 0.034
0.05TrpCys: 0.05 ± 0.014
0.497TrpAsp: 0.497 ± 0.041
0.702TrpGlu: 0.702 ± 0.049
0.656TrpPhe: 0.656 ± 0.05
0.351TrpGly: 0.351 ± 0.033
0.123TrpHis: 0.123 ± 0.019
1.159TrpIle: 1.159 ± 0.076
1.282TrpLys: 1.282 ± 0.066
0.921TrpLeu: 0.921 ± 0.052
0.189TrpMet: 0.189 ± 0.026
0.977TrpAsn: 0.977 ± 0.066
0.229TrpPro: 0.229 ± 0.023
0.318TrpGln: 0.318 ± 0.035
0.252TrpArg: 0.252 ± 0.027
0.566TrpSer: 0.566 ± 0.044
0.467TrpThr: 0.467 ± 0.046
0.51TrpVal: 0.51 ± 0.043
0.156TrpTrp: 0.156 ± 0.023
0.437TrpTyr: 0.437 ± 0.046
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.742TyrAla: 1.742 ± 0.077
0.195TyrCys: 0.195 ± 0.029
2.355TyrAsp: 2.355 ± 0.08
2.716TyrGlu: 2.716 ± 0.089
2.988TyrPhe: 2.988 ± 0.118
1.759TyrGly: 1.759 ± 0.09
0.5TyrHis: 0.5 ± 0.034
3.266TyrIle: 3.266 ± 0.104
5.032TyrLys: 5.032 ± 0.144
4.197TyrLeu: 4.197 ± 0.131
0.411TyrMet: 0.411 ± 0.04
3.087TyrAsn: 3.087 ± 0.119
0.825TyrPro: 0.825 ± 0.045
1.401TyrGln: 1.401 ± 0.075
0.974TyrArg: 0.974 ± 0.056
2.806TyrSer: 2.806 ± 0.104
1.587TyrThr: 1.587 ± 0.084
2.097TyrVal: 2.097 ± 0.079
0.51TyrTrp: 0.51 ± 0.044
1.719TyrTyr: 1.719 ± 0.094
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 836 proteins (301871 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
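One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The page does not say which statistic is used, so the choice of the standard deviation here is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency
    of `dipeptide` over `n_rounds` protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKLI", "MKKAA", "AAKKL", "MLIKK"]
mean, err = bootstrap_error(toy, "KK")
print(f"KK: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps each sequence intact, so the estimate reflects variation between proteins.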

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski