Amino acid dipeptide frequency for Mycoplasma sp. CAG:472

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
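As an illustration of the calculation described above, here is a minimal Python sketch. It assumes that sequences are given in one-letter code, that dipeptides are counted as overlapping adjacent pairs within each protein, and that any non-standard letter is mapped to X (Xaa). The function names `dipeptide_permille` and `classify` are hypothetical, and this is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumption: sequence
# input uses one-letter codes, while the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Residues outside the 20 standard letters are mapped to 'X' (Xaa).
    Pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency as over- or under-represented versus `expected`."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not the real proteome.
toy = ["MKKLIIK", "MKIX"]
freqs = dipeptide_permille(toy)
print(freqs["KK"], classify(freqs["KK"]))
```

With this convention, a table value such as 8.575‰ (GluLys) would be labelled "over" and 0.17‰ (AlaTrp) "under", matching the red and blue marking described above.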

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue. Second-residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.643 ± 0.088
AlaCys: 0.565 ± 0.039
AlaAsp: 1.967 ± 0.089
AlaGlu: 1.867 ± 0.091
AlaPhe: 1.905 ± 0.08
AlaGly: 2.451 ± 0.103
AlaHis: 0.589 ± 0.045
AlaIle: 4.024 ± 0.125
AlaLys: 4.313 ± 0.122
AlaLeu: 4.305 ± 0.113
AlaMet: 1.0 ± 0.052
AlaAsn: 2.805 ± 0.093
AlaPro: 0.93 ± 0.054
AlaGln: 0.897 ± 0.055
AlaArg: 1.438 ± 0.066
AlaSer: 3.232 ± 0.115
AlaThr: 2.135 ± 0.096
AlaVal: 2.486 ± 0.093
AlaTrp: 0.17 ± 0.022
AlaTyr: 2.197 ± 0.077
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.605 ± 0.04
CysCys: 0.143 ± 0.022
CysAsp: 0.857 ± 0.05
CysGlu: 0.759 ± 0.048
CysPhe: 0.492 ± 0.041
CysGly: 0.859 ± 0.053
CysHis: 0.181 ± 0.025
CysIle: 1.0 ± 0.053
CysLys: 1.192 ± 0.072
CysLeu: 0.954 ± 0.05
CysMet: 0.273 ± 0.029
CysAsn: 0.838 ± 0.051
CysPro: 0.465 ± 0.051
CysGln: 0.235 ± 0.025
CysArg: 0.23 ± 0.025
CysSer: 0.784 ± 0.049
CysThr: 0.627 ± 0.047
CysVal: 0.551 ± 0.038
CysTrp: 0.043 ± 0.012
CysTyr: 0.711 ± 0.057
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.635 ± 0.092
AspCys: 0.511 ± 0.043
AspAsp: 3.486 ± 0.111
AspGlu: 5.151 ± 0.109
AspPhe: 3.351 ± 0.094
AspGly: 3.243 ± 0.108
AspHis: 0.527 ± 0.041
AspIle: 6.692 ± 0.158
AspLys: 6.338 ± 0.133
AspLeu: 5.384 ± 0.131
AspMet: 1.289 ± 0.055
AspAsn: 5.143 ± 0.13
AspPro: 1.13 ± 0.056
AspGln: 0.605 ± 0.042
AspArg: 1.467 ± 0.067
AspSer: 3.094 ± 0.08
AspThr: 2.916 ± 0.09
AspVal: 3.454 ± 0.08
AspTrp: 0.278 ± 0.031
AspTyr: 4.04 ± 0.12
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.586 ± 0.099
GluCys: 0.784 ± 0.057
GluAsp: 4.148 ± 0.111
GluGlu: 5.505 ± 0.166
GluPhe: 2.808 ± 0.094
GluGly: 2.951 ± 0.106
GluHis: 0.689 ± 0.047
GluIle: 7.048 ± 0.157
GluLys: 8.575 ± 0.163
GluLeu: 5.889 ± 0.132
GluMet: 1.673 ± 0.061
GluAsn: 7.016 ± 0.148
GluPro: 1.124 ± 0.055
GluGln: 1.178 ± 0.063
GluArg: 2.151 ± 0.066
GluSer: 3.681 ± 0.1
GluThr: 3.248 ± 0.094
GluVal: 4.189 ± 0.119
GluTrp: 0.276 ± 0.025
GluTyr: 4.094 ± 0.111
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.127 ± 0.081
PheCys: 0.532 ± 0.04
PheAsp: 3.243 ± 0.104
PheGlu: 3.016 ± 0.097
PhePhe: 1.949 ± 0.093
PheGly: 2.365 ± 0.092
PheHis: 0.524 ± 0.039
PheIle: 5.094 ± 0.153
PheLys: 4.157 ± 0.117
PheLeu: 4.557 ± 0.149
PheMet: 1.013 ± 0.056
PheAsn: 3.835 ± 0.105
PhePro: 1.051 ± 0.059
PheGln: 0.697 ± 0.039
PheArg: 1.046 ± 0.058
PheSer: 3.181 ± 0.116
PheThr: 2.605 ± 0.09
PheVal: 2.473 ± 0.074
PheTrp: 0.208 ± 0.023
PheTyr: 2.538 ± 0.089
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.632 ± 0.11
GlyCys: 0.665 ± 0.051
GlyAsp: 2.53 ± 0.101
GlyGlu: 2.816 ± 0.099
GlyPhe: 2.527 ± 0.091
GlyGly: 3.078 ± 0.161
GlyHis: 0.786 ± 0.049
GlyIle: 5.138 ± 0.131
GlyLys: 5.481 ± 0.14
GlyLeu: 4.416 ± 0.12
GlyMet: 1.197 ± 0.057
GlyAsn: 3.373 ± 0.109
GlyPro: 0.843 ± 0.059
GlyGln: 1.111 ± 0.065
GlyArg: 1.495 ± 0.076
GlySer: 3.376 ± 0.111
GlyThr: 3.189 ± 0.113
GlyVal: 3.389 ± 0.096
GlyTrp: 0.281 ± 0.034
GlyTyr: 3.124 ± 0.104
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.546 ± 0.04
HisCys: 0.157 ± 0.022
HisAsp: 0.622 ± 0.044
HisGlu: 0.73 ± 0.045
HisPhe: 0.659 ± 0.044
HisGly: 0.722 ± 0.05
HisHis: 0.251 ± 0.029
HisIle: 1.235 ± 0.06
HisLys: 1.008 ± 0.056
HisLeu: 1.195 ± 0.055
HisMet: 0.241 ± 0.026
HisAsn: 1.049 ± 0.057
HisPro: 0.592 ± 0.041
HisGln: 0.278 ± 0.029
HisArg: 0.376 ± 0.034
HisSer: 0.749 ± 0.046
HisThr: 0.662 ± 0.046
HisVal: 0.538 ± 0.036
HisTrp: 0.038 ± 0.01
HisTyr: 0.676 ± 0.049
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.189 ± 0.11
IleCys: 1.473 ± 0.071
IleAsp: 6.627 ± 0.141
IleGlu: 6.316 ± 0.125
IlePhe: 4.505 ± 0.145
IleGly: 4.789 ± 0.132
IleHis: 1.057 ± 0.055
IleIle: 10.645 ± 0.209
IleLys: 10.299 ± 0.187
IleLeu: 9.437 ± 0.21
IleMet: 2.07 ± 0.071
IleAsn: 9.235 ± 0.213
IlePro: 2.638 ± 0.099
IleGln: 1.522 ± 0.065
IleArg: 2.57 ± 0.082
IleSer: 6.997 ± 0.148
IleThr: 5.527 ± 0.134
IleVal: 5.773 ± 0.144
IleTrp: 0.416 ± 0.035
IleTyr: 5.216 ± 0.133
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.448 ± 0.105
LysCys: 1.173 ± 0.072
LysAsp: 7.132 ± 0.163
LysGlu: 9.721 ± 0.193
LysPhe: 3.551 ± 0.1
LysGly: 4.713 ± 0.131
LysHis: 1.051 ± 0.062
LysIle: 10.124 ± 0.156
LysLys: 10.864 ± 0.19
LysLeu: 8.289 ± 0.168
LysMet: 3.013 ± 0.087
LysAsn: 9.091 ± 0.191
LysPro: 2.04 ± 0.077
LysGln: 1.849 ± 0.081
LysArg: 2.889 ± 0.089
LysSer: 5.411 ± 0.116
LysThr: 5.073 ± 0.103
LysVal: 6.578 ± 0.119
LysTrp: 0.47 ± 0.035
LysTyr: 6.47 ± 0.154
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.794 ± 0.112
LeuCys: 1.197 ± 0.048
LeuAsp: 5.486 ± 0.127
LeuGlu: 5.902 ± 0.148
LeuPhe: 4.578 ± 0.131
LeuGly: 4.767 ± 0.14
LeuHis: 1.149 ± 0.06
LeuIle: 9.056 ± 0.213
LeuLys: 9.456 ± 0.175
LeuLeu: 8.656 ± 0.193
LeuMet: 1.943 ± 0.079
LeuAsn: 7.835 ± 0.176
LeuPro: 2.527 ± 0.085
LeuGln: 1.605 ± 0.07
LeuArg: 2.313 ± 0.078
LeuSer: 6.789 ± 0.125
LeuThr: 5.132 ± 0.127
LeuVal: 4.978 ± 0.127
LeuTrp: 0.47 ± 0.04
LeuTyr: 4.513 ± 0.124
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.189 ± 0.066
MetCys: 0.273 ± 0.029
MetAsp: 1.427 ± 0.062
MetGlu: 1.597 ± 0.065
MetPhe: 1.04 ± 0.055
MetGly: 1.238 ± 0.071
MetHis: 0.384 ± 0.034
MetIle: 2.2 ± 0.094
MetLys: 2.573 ± 0.083
MetLeu: 2.159 ± 0.067
MetMet: 0.535 ± 0.037
MetAsn: 1.711 ± 0.072
MetPro: 0.759 ± 0.045
MetGln: 0.649 ± 0.041
MetArg: 0.597 ± 0.034
MetSer: 1.297 ± 0.054
MetThr: 1.022 ± 0.051
MetVal: 1.281 ± 0.057
MetTrp: 0.105 ± 0.017
MetTyr: 1.084 ± 0.051
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.224 ± 0.092
AsnCys: 0.749 ± 0.049
AsnAsp: 5.094 ± 0.127
AsnGlu: 7.008 ± 0.158
AsnPhe: 3.465 ± 0.108
AsnGly: 4.316 ± 0.104
AsnHis: 0.908 ± 0.047
AsnIle: 9.359 ± 0.182
AsnLys: 8.508 ± 0.193
AsnLeu: 7.073 ± 0.158
AsnMet: 1.954 ± 0.076
AsnAsn: 7.8 ± 0.181
AsnPro: 2.138 ± 0.08
AsnGln: 1.451 ± 0.066
AsnArg: 1.854 ± 0.073
AsnSer: 4.74 ± 0.118
AsnThr: 3.759 ± 0.092
AsnVal: 4.721 ± 0.106
AsnTrp: 0.397 ± 0.036
AsnTyr: 5.102 ± 0.119
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.8 ± 0.046
ProCys: 0.249 ± 0.028
ProAsp: 1.427 ± 0.06
ProGlu: 1.713 ± 0.072
ProPhe: 1.357 ± 0.066
ProGly: 1.149 ± 0.06
ProHis: 0.357 ± 0.035
ProIle: 2.249 ± 0.075
ProLys: 2.246 ± 0.084
ProLeu: 2.1 ± 0.081
ProMet: 0.519 ± 0.041
ProAsn: 1.908 ± 0.066
ProPro: 0.378 ± 0.036
ProGln: 0.427 ± 0.036
ProArg: 0.565 ± 0.037
ProSer: 1.765 ± 0.086
ProThr: 1.324 ± 0.063
ProVal: 1.635 ± 0.076
ProTrp: 0.13 ± 0.02
ProTyr: 1.257 ± 0.059
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.727 ± 0.054
GlnCys: 0.122 ± 0.019
GlnAsp: 1.043 ± 0.056
GlnGlu: 1.159 ± 0.048
GlnPhe: 0.673 ± 0.038
GlnGly: 0.846 ± 0.055
GlnHis: 0.232 ± 0.026
GlnIle: 2.1 ± 0.081
GlnLys: 2.427 ± 0.092
GlnLeu: 1.268 ± 0.061
GlnMet: 0.565 ± 0.042
GlnAsn: 1.767 ± 0.066
GlnPro: 0.335 ± 0.034
GlnGln: 0.297 ± 0.032
GlnArg: 0.773 ± 0.053
GlnSer: 1.208 ± 0.059
GlnThr: 1.081 ± 0.055
GlnVal: 0.938 ± 0.055
GlnTrp: 0.095 ± 0.018
GlnTyr: 0.67 ± 0.043
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.105 ± 0.059
ArgCys: 0.438 ± 0.034
ArgAsp: 1.33 ± 0.062
ArgGlu: 1.813 ± 0.076
ArgPhe: 1.308 ± 0.066
ArgGly: 1.265 ± 0.07
ArgHis: 0.405 ± 0.036
ArgIle: 2.432 ± 0.098
ArgLys: 3.086 ± 0.096
ArgLeu: 2.473 ± 0.086
ArgMet: 0.897 ± 0.05
ArgAsn: 1.938 ± 0.073
ArgPro: 0.741 ± 0.051
ArgGln: 0.576 ± 0.039
ArgArg: 0.878 ± 0.052
ArgSer: 1.335 ± 0.062
ArgThr: 1.286 ± 0.072
ArgVal: 1.64 ± 0.065
ArgTrp: 0.143 ± 0.019
ArgTyr: 1.451 ± 0.064
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.492 ± 0.089
SerCys: 0.692 ± 0.052
SerAsp: 3.897 ± 0.105
SerGlu: 3.884 ± 0.114
SerPhe: 3.646 ± 0.123
SerGly: 3.497 ± 0.125
SerHis: 0.811 ± 0.047
SerIle: 6.327 ± 0.15
SerLys: 6.513 ± 0.12
SerLeu: 6.635 ± 0.152
SerMet: 1.413 ± 0.06
SerAsn: 4.862 ± 0.118
SerPro: 1.284 ± 0.059
SerGln: 1.446 ± 0.066
SerArg: 1.524 ± 0.068
SerSer: 4.957 ± 0.16
SerThr: 3.057 ± 0.116
SerVal: 3.273 ± 0.101
SerTrp: 0.408 ± 0.032
SerTyr: 3.9 ± 0.113
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.965 ± 0.085
ThrCys: 0.684 ± 0.055
ThrAsp: 2.967 ± 0.085
ThrGlu: 2.749 ± 0.097
ThrPhe: 2.74 ± 0.094
ThrGly: 3.14 ± 0.105
ThrHis: 0.751 ± 0.044
ThrIle: 5.181 ± 0.128
ThrLys: 4.854 ± 0.136
ThrLeu: 5.438 ± 0.129
ThrMet: 0.976 ± 0.051
ThrAsn: 3.721 ± 0.116
ThrPro: 1.495 ± 0.068
ThrGln: 1.059 ± 0.057
ThrArg: 1.3 ± 0.058
ThrSer: 3.692 ± 0.115
ThrThr: 2.786 ± 0.114
ThrVal: 2.892 ± 0.096
ThrTrp: 0.303 ± 0.025
ThrTyr: 3.03 ± 0.097
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.4 ± 0.091
ValCys: 0.741 ± 0.048
ValAsp: 3.305 ± 0.098
ValGlu: 3.205 ± 0.109
ValPhe: 2.57 ± 0.085
ValGly: 3.119 ± 0.1
ValHis: 0.762 ± 0.045
ValIle: 6.081 ± 0.137
ValLys: 5.592 ± 0.128
ValLeu: 5.475 ± 0.114
ValMet: 1.349 ± 0.063
ValAsn: 4.462 ± 0.106
ValPro: 1.667 ± 0.068
ValGln: 1.035 ± 0.053
ValArg: 1.576 ± 0.073
ValSer: 4.265 ± 0.102
ValThr: 3.335 ± 0.105
ValVal: 3.543 ± 0.118
ValTrp: 0.265 ± 0.026
ValTyr: 2.843 ± 0.088
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.238 ± 0.028
TrpCys: 0.089 ± 0.016
TrpAsp: 0.3 ± 0.031
TrpGlu: 0.281 ± 0.027
TrpPhe: 0.284 ± 0.028
TrpGly: 0.23 ± 0.032
TrpHis: 0.095 ± 0.015
TrpIle: 0.395 ± 0.029
TrpLys: 0.335 ± 0.032
TrpLeu: 0.5 ± 0.039
TrpMet: 0.097 ± 0.018
TrpAsn: 0.351 ± 0.032
TrpPro: 0.103 ± 0.014
TrpGln: 0.189 ± 0.026
TrpArg: 0.143 ± 0.018
TrpSer: 0.303 ± 0.03
TrpThr: 0.246 ± 0.027
TrpVal: 0.284 ± 0.026
TrpTrp: 0.049 ± 0.013
TrpTyr: 0.313 ± 0.028
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.576 ± 0.085
TyrCys: 0.565 ± 0.051
TyrAsp: 3.846 ± 0.107
TyrGlu: 4.421 ± 0.12
TyrPhe: 2.803 ± 0.086
TyrGly: 2.708 ± 0.079
TyrHis: 0.811 ± 0.052
TyrIle: 4.921 ± 0.11
TyrLys: 5.138 ± 0.148
TyrLeu: 5.948 ± 0.116
TyrMet: 1.1 ± 0.053
TyrAsn: 4.881 ± 0.146
TyrPro: 1.313 ± 0.058
TyrGln: 1.219 ± 0.064
TyrArg: 1.357 ± 0.071
TyrSer: 3.662 ± 0.121
TyrThr: 2.711 ± 0.095
TyrVal: 2.973 ± 0.092
TyrTrp: 0.278 ± 0.028
TyrTyr: 3.505 ± 0.106
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1281 proteins (370019 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
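A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed in each round, and the reported error is taken to be the standard deviation over rounds. The source does not state which spread measure was used, and the function names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Each round resamples whole proteins with replacement and recomputes
    the frequency; the standard deviation over all rounds is returned
    (assumption: the original error may be defined differently).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # computed once, reused
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy sequences, not the real proteome.
toy = ["MKKLIIK", "MKIX", "KKKK", "LLIK"]
print(bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects protein-to-protein variation.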

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski