Amino acid dipepetide frequency for Pseudomonas phage phiKZ

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.359AlaAla: 4.359 ± 0.306
0.646AlaCys: 0.646 ± 0.087
3.713AlaAsp: 3.713 ± 0.213
3.736AlaGlu: 3.736 ± 0.327
2.303AlaPhe: 2.303 ± 0.144
3.478AlaGly: 3.478 ± 0.29
1.222AlaHis: 1.222 ± 0.135
4.605AlaIle: 4.605 ± 0.21
4.03AlaLys: 4.03 ± 0.261
4.864AlaLeu: 4.864 ± 0.208
1.962AlaMet: 1.962 ± 0.154
3.748AlaAsn: 3.748 ± 0.223
2.009AlaPro: 2.009 ± 0.167
2.479AlaGln: 2.479 ± 0.19
2.773AlaArg: 2.773 ± 0.219
3.536AlaSer: 3.536 ± 0.227
4.183AlaThr: 4.183 ± 0.299
4.476AlaVal: 4.476 ± 0.288
0.717AlaTrp: 0.717 ± 0.095
2.549AlaTyr: 2.549 ± 0.154
0.0AlaXaa: 0.0 ± 0.0
Cys
0.552CysAla: 0.552 ± 0.078
0.141CysCys: 0.141 ± 0.044
0.587CysAsp: 0.587 ± 0.078
0.576CysGlu: 0.576 ± 0.092
0.352CysPhe: 0.352 ± 0.055
0.517CysGly: 0.517 ± 0.077
0.223CysHis: 0.223 ± 0.051
0.74CysIle: 0.74 ± 0.084
0.576CysLys: 0.576 ± 0.08
0.74CysLeu: 0.74 ± 0.101
0.247CysMet: 0.247 ± 0.051
0.705CysAsn: 0.705 ± 0.088
0.399CysPro: 0.399 ± 0.072
0.305CysGln: 0.305 ± 0.047
0.576CysArg: 0.576 ± 0.085
0.717CysSer: 0.717 ± 0.103
0.564CysThr: 0.564 ± 0.095
0.799CysVal: 0.799 ± 0.103
0.153CysTrp: 0.153 ± 0.043
0.446CysTyr: 0.446 ± 0.071
0.0CysXaa: 0.0 ± 0.0
Asp
3.454AspAla: 3.454 ± 0.215
0.493AspCys: 0.493 ± 0.078
4.523AspAsp: 4.523 ± 0.256
4.476AspGlu: 4.476 ± 0.241
2.585AspPhe: 2.585 ± 0.152
4.253AspGly: 4.253 ± 0.253
1.21AspHis: 1.21 ± 0.138
5.522AspIle: 5.522 ± 0.257
4.206AspLys: 4.206 ± 0.242
5.968AspLeu: 5.968 ± 0.272
1.692AspMet: 1.692 ± 0.15
3.701AspAsn: 3.701 ± 0.19
3.372AspPro: 3.372 ± 0.251
2.162AspGln: 2.162 ± 0.164
2.761AspArg: 2.761 ± 0.19
3.971AspSer: 3.971 ± 0.195
3.854AspThr: 3.854 ± 0.201
4.652AspVal: 4.652 ± 0.237
0.916AspTrp: 0.916 ± 0.105
3.031AspTyr: 3.031 ± 0.206
0.0AspXaa: 0.0 ± 0.0
Glu
4.077GluAla: 4.077 ± 0.274
0.67GluCys: 0.67 ± 0.091
3.83GluAsp: 3.83 ± 0.243
4.418GluGlu: 4.418 ± 0.44
2.82GluPhe: 2.82 ± 0.211
2.867GluGly: 2.867 ± 0.227
1.41GluHis: 1.41 ± 0.141
4.147GluIle: 4.147 ± 0.197
3.666GluLys: 3.666 ± 0.231
6.943GluLeu: 6.943 ± 0.272
1.903GluMet: 1.903 ± 0.122
3.254GluAsn: 3.254 ± 0.182
2.056GluPro: 2.056 ± 0.199
2.561GluGln: 2.561 ± 0.171
3.184GluArg: 3.184 ± 0.182
3.478GluSer: 3.478 ± 0.201
3.572GluThr: 3.572 ± 0.235
4.688GluVal: 4.688 ± 0.275
1.046GluTrp: 1.046 ± 0.106
2.996GluTyr: 2.996 ± 0.206
0.0GluXaa: 0.0 ± 0.0
Phe
1.986PheAla: 1.986 ± 0.155
0.352PheCys: 0.352 ± 0.059
3.102PheAsp: 3.102 ± 0.176
2.279PheGlu: 2.279 ± 0.145
1.633PhePhe: 1.633 ± 0.134
2.303PheGly: 2.303 ± 0.185
0.905PheHis: 0.905 ± 0.11
3.689PheIle: 3.689 ± 0.234
3.36PheLys: 3.36 ± 0.183
2.714PheLeu: 2.714 ± 0.201
1.069PheMet: 1.069 ± 0.12
3.466PheAsn: 3.466 ± 0.215
1.351PhePro: 1.351 ± 0.106
1.163PheGln: 1.163 ± 0.118
1.563PheArg: 1.563 ± 0.141
2.103PheSer: 2.103 ± 0.145
2.502PheThr: 2.502 ± 0.188
2.538PheVal: 2.538 ± 0.151
0.411PheTrp: 0.411 ± 0.076
1.657PheTyr: 1.657 ± 0.134
0.0PheXaa: 0.0 ± 0.0
Gly
2.984GlyAla: 2.984 ± 0.258
0.564GlyCys: 0.564 ± 0.075
3.701GlyAsp: 3.701 ± 0.24
3.595GlyGlu: 3.595 ± 0.25
2.526GlyPhe: 2.526 ± 0.154
3.266GlyGly: 3.266 ± 0.232
0.963GlyHis: 0.963 ± 0.125
4.1GlyIle: 4.1 ± 0.178
4.371GlyLys: 4.371 ± 0.247
4.418GlyLeu: 4.418 ± 0.227
1.527GlyMet: 1.527 ± 0.149
3.419GlyAsn: 3.419 ± 0.23
1.833GlyPro: 1.833 ± 0.376
1.868GlyGln: 1.868 ± 0.174
2.878GlyArg: 2.878 ± 0.215
3.078GlySer: 3.078 ± 0.184
3.536GlyThr: 3.536 ± 0.273
4.042GlyVal: 4.042 ± 0.248
0.858GlyTrp: 0.858 ± 0.097
2.82GlyTyr: 2.82 ± 0.159
0.0GlyXaa: 0.0 ± 0.0
His
1.046HisAla: 1.046 ± 0.108
0.235HisCys: 0.235 ± 0.051
1.187HisAsp: 1.187 ± 0.122
1.245HisGlu: 1.245 ± 0.148
0.928HisPhe: 0.928 ± 0.135
1.175HisGly: 1.175 ± 0.129
0.623HisHis: 0.623 ± 0.08
1.539HisIle: 1.539 ± 0.139
1.304HisLys: 1.304 ± 0.138
1.774HisLeu: 1.774 ± 0.136
0.482HisMet: 0.482 ± 0.083
0.987HisAsn: 0.987 ± 0.103
0.999HisPro: 0.999 ± 0.106
0.764HisGln: 0.764 ± 0.08
1.187HisArg: 1.187 ± 0.135
1.151HisSer: 1.151 ± 0.125
1.057HisThr: 1.057 ± 0.092
1.339HisVal: 1.339 ± 0.11
0.399HisTrp: 0.399 ± 0.073
0.916HisTyr: 0.916 ± 0.109
0.0HisXaa: 0.0 ± 0.0
Ile
4.558IleAla: 4.558 ± 0.228
0.811IleCys: 0.811 ± 0.101
5.957IleAsp: 5.957 ± 0.244
5.334IleGlu: 5.334 ± 0.246
1.927IlePhe: 1.927 ± 0.158
4.124IleGly: 4.124 ± 0.205
1.551IleHis: 1.551 ± 0.114
4.934IleIle: 4.934 ± 0.26
5.357IleLys: 5.357 ± 0.27
5.404IleLeu: 5.404 ± 0.241
1.222IleMet: 1.222 ± 0.113
5.299IleAsn: 5.299 ± 0.274
3.63IlePro: 3.63 ± 0.219
2.432IleGln: 2.432 ± 0.17
3.701IleArg: 3.701 ± 0.219
5.064IleSer: 5.064 ± 0.28
4.876IleThr: 4.876 ± 0.231
4.183IleVal: 4.183 ± 0.207
0.799IleTrp: 0.799 ± 0.084
2.632IleTyr: 2.632 ± 0.201
0.0IleXaa: 0.0 ± 0.0
Lys
4.57LysAla: 4.57 ± 0.291
0.54LysCys: 0.54 ± 0.067
4.359LysAsp: 4.359 ± 0.203
5.216LysGlu: 5.216 ± 0.313
2.573LysPhe: 2.573 ± 0.171
2.925LysGly: 2.925 ± 0.211
1.422LysHis: 1.422 ± 0.144
4.147LysIle: 4.147 ± 0.254
3.689LysLys: 3.689 ± 0.234
6.755LysLeu: 6.755 ± 0.303
2.232LysMet: 2.232 ± 0.168
3.407LysAsn: 3.407 ± 0.206
2.679LysPro: 2.679 ± 0.183
2.267LysGln: 2.267 ± 0.165
2.667LysArg: 2.667 ± 0.175
3.196LysSer: 3.196 ± 0.177
3.654LysThr: 3.654 ± 0.249
4.453LysVal: 4.453 ± 0.248
0.987LysTrp: 0.987 ± 0.12
3.078LysTyr: 3.078 ± 0.17
0.0LysXaa: 0.0 ± 0.0
Leu
5.569LeuAla: 5.569 ± 0.195
0.752LeuCys: 0.752 ± 0.092
5.463LeuAsp: 5.463 ± 0.258
5.134LeuGlu: 5.134 ± 0.284
3.419LeuPhe: 3.419 ± 0.192
4.5LeuGly: 4.5 ± 0.26
1.762LeuHis: 1.762 ± 0.144
6.192LeuIle: 6.192 ± 0.255
5.698LeuLys: 5.698 ± 0.249
6.591LeuLeu: 6.591 ± 0.267
2.256LeuMet: 2.256 ± 0.166
5.545LeuAsn: 5.545 ± 0.261
3.595LeuPro: 3.595 ± 0.19
2.432LeuGln: 2.432 ± 0.159
4.03LeuArg: 4.03 ± 0.18
5.745LeuSer: 5.745 ± 0.283
6.004LeuThr: 6.004 ± 0.227
5.91LeuVal: 5.91 ± 0.238
0.822LeuTrp: 0.822 ± 0.108
3.172LeuTyr: 3.172 ± 0.214
0.0LeuXaa: 0.0 ± 0.0
Met
2.209MetAla: 2.209 ± 0.174
0.399MetCys: 0.399 ± 0.068
1.727MetAsp: 1.727 ± 0.145
1.48MetGlu: 1.48 ± 0.117
1.328MetPhe: 1.328 ± 0.129
1.539MetGly: 1.539 ± 0.148
0.693MetHis: 0.693 ± 0.084
1.586MetIle: 1.586 ± 0.124
1.269MetLys: 1.269 ± 0.108
2.22MetLeu: 2.22 ± 0.163
0.893MetMet: 0.893 ± 0.132
1.527MetAsn: 1.527 ± 0.106
0.858MetPro: 0.858 ± 0.106
0.987MetGln: 0.987 ± 0.123
1.351MetArg: 1.351 ± 0.128
2.08MetSer: 2.08 ± 0.139
1.516MetThr: 1.516 ± 0.137
1.962MetVal: 1.962 ± 0.149
0.223MetTrp: 0.223 ± 0.038
1.245MetTyr: 1.245 ± 0.11
0.0MetXaa: 0.0 ± 0.0
Asn
3.395AsnAla: 3.395 ± 0.163
0.482AsnCys: 0.482 ± 0.073
3.83AsnAsp: 3.83 ± 0.231
3.701AsnGlu: 3.701 ± 0.187
2.068AsnPhe: 2.068 ± 0.168
4.406AsnGly: 4.406 ± 0.211
1.281AsnHis: 1.281 ± 0.132
4.84AsnIle: 4.84 ± 0.203
4.312AsnLys: 4.312 ± 0.216
4.359AsnLeu: 4.359 ± 0.25
1.727AsnMet: 1.727 ± 0.128
4.03AsnAsn: 4.03 ± 0.237
3.196AsnPro: 3.196 ± 0.206
2.162AsnGln: 2.162 ± 0.139
2.914AsnArg: 2.914 ± 0.164
3.689AsnSer: 3.689 ± 0.198
3.936AsnThr: 3.936 ± 0.222
3.525AsnVal: 3.525 ± 0.193
0.94AsnTrp: 0.94 ± 0.121
2.667AsnTyr: 2.667 ± 0.184
0.0AsnXaa: 0.0 ± 0.0
Pro
2.561ProAla: 2.561 ± 0.179
0.341ProCys: 0.341 ± 0.06
3.019ProAsp: 3.019 ± 0.228
2.82ProGlu: 2.82 ± 0.225
1.974ProPhe: 1.974 ± 0.144
2.502ProGly: 2.502 ± 0.213
0.928ProHis: 0.928 ± 0.108
2.608ProIle: 2.608 ± 0.173
2.209ProLys: 2.209 ± 0.156
3.102ProLeu: 3.102 ± 0.217
0.94ProMet: 0.94 ± 0.112
2.42ProAsn: 2.42 ± 0.156
1.198ProPro: 1.198 ± 0.134
1.175ProGln: 1.175 ± 0.174
1.586ProArg: 1.586 ± 0.139
2.479ProSer: 2.479 ± 0.155
3.066ProThr: 3.066 ± 0.207
3.313ProVal: 3.313 ± 0.212
0.364ProTrp: 0.364 ± 0.076
1.586ProTyr: 1.586 ± 0.122
0.0ProXaa: 0.0 ± 0.0
Gln
2.35GlnAla: 2.35 ± 0.187
0.376GlnCys: 0.376 ± 0.065
1.739GlnAsp: 1.739 ± 0.12
1.809GlnGlu: 1.809 ± 0.148
1.68GlnPhe: 1.68 ± 0.141
1.786GlnGly: 1.786 ± 0.319
0.693GlnHis: 0.693 ± 0.095
2.479GlnIle: 2.479 ± 0.178
1.974GlnLys: 1.974 ± 0.163
3.642GlnLeu: 3.642 ± 0.205
0.916GlnMet: 0.916 ± 0.099
1.633GlnAsn: 1.633 ± 0.138
1.375GlnPro: 1.375 ± 0.175
1.668GlnGln: 1.668 ± 0.148
1.939GlnArg: 1.939 ± 0.155
1.809GlnSer: 1.809 ± 0.161
1.692GlnThr: 1.692 ± 0.123
2.373GlnVal: 2.373 ± 0.159
0.623GlnTrp: 0.623 ± 0.085
1.504GlnTyr: 1.504 ± 0.129
0.0GlnXaa: 0.0 ± 0.0
Arg
2.925ArgAla: 2.925 ± 0.234
0.482ArgCys: 0.482 ± 0.072
3.172ArgAsp: 3.172 ± 0.23
2.749ArgGlu: 2.749 ± 0.184
1.915ArgPhe: 1.915 ± 0.159
2.632ArgGly: 2.632 ± 0.188
0.858ArgHis: 0.858 ± 0.112
3.395ArgIle: 3.395 ± 0.214
3.09ArgLys: 3.09 ± 0.191
4.852ArgLeu: 4.852 ± 0.235
1.281ArgMet: 1.281 ± 0.126
2.925ArgAsn: 2.925 ± 0.184
1.727ArgPro: 1.727 ± 0.128
1.621ArgGln: 1.621 ± 0.135
2.432ArgArg: 2.432 ± 0.176
2.632ArgSer: 2.632 ± 0.164
2.314ArgThr: 2.314 ± 0.146
3.313ArgVal: 3.313 ± 0.228
0.634ArgTrp: 0.634 ± 0.102
2.22ArgTyr: 2.22 ± 0.168
0.0ArgXaa: 0.0 ± 0.0
Ser
3.654SerAla: 3.654 ± 0.255
0.623SerCys: 0.623 ± 0.077
3.842SerAsp: 3.842 ± 0.208
3.536SerGlu: 3.536 ± 0.227
2.737SerPhe: 2.737 ± 0.173
3.442SerGly: 3.442 ± 0.207
1.057SerHis: 1.057 ± 0.107
5.017SerIle: 5.017 ± 0.261
3.701SerLys: 3.701 ± 0.21
5.299SerLeu: 5.299 ± 0.261
1.692SerMet: 1.692 ± 0.123
3.948SerAsn: 3.948 ± 0.229
2.185SerPro: 2.185 ± 0.151
1.997SerGln: 1.997 ± 0.165
2.749SerArg: 2.749 ± 0.175
3.959SerSer: 3.959 ± 0.235
3.83SerThr: 3.83 ± 0.21
4.218SerVal: 4.218 ± 0.201
0.811SerTrp: 0.811 ± 0.109
2.455SerTyr: 2.455 ± 0.182
0.0SerXaa: 0.0 ± 0.0
Thr
4.136ThrAla: 4.136 ± 0.226
0.67ThrCys: 0.67 ± 0.096
4.136ThrAsp: 4.136 ± 0.216
4.124ThrGlu: 4.124 ± 0.244
2.444ThrPhe: 2.444 ± 0.187
4.359ThrGly: 4.359 ± 0.271
1.14ThrHis: 1.14 ± 0.103
4.535ThrIle: 4.535 ± 0.203
3.771ThrLys: 3.771 ± 0.216
4.817ThrLeu: 4.817 ± 0.216
1.433ThrMet: 1.433 ± 0.117
3.536ThrAsn: 3.536 ± 0.232
2.902ThrPro: 2.902 ± 0.195
2.009ThrGln: 2.009 ± 0.152
2.655ThrArg: 2.655 ± 0.175
3.783ThrSer: 3.783 ± 0.208
4.124ThrThr: 4.124 ± 0.242
4.5ThrVal: 4.5 ± 0.25
0.822ThrTrp: 0.822 ± 0.096
2.643ThrTyr: 2.643 ± 0.155
0.0ThrXaa: 0.0 ± 0.0
Val
3.854ValAla: 3.854 ± 0.232
0.705ValCys: 0.705 ± 0.098
5.663ValAsp: 5.663 ± 0.294
4.429ValGlu: 4.429 ± 0.29
2.549ValPhe: 2.549 ± 0.19
3.313ValGly: 3.313 ± 0.187
1.069ValHis: 1.069 ± 0.116
5.545ValIle: 5.545 ± 0.254
4.723ValLys: 4.723 ± 0.19
5.075ValLeu: 5.075 ± 0.226
2.091ValMet: 2.091 ± 0.156
4.453ValAsn: 4.453 ± 0.227
2.408ValPro: 2.408 ± 0.183
1.997ValGln: 1.997 ± 0.171
3.16ValArg: 3.16 ± 0.197
4.582ValSer: 4.582 ± 0.235
4.711ValThr: 4.711 ± 0.238
4.547ValVal: 4.547 ± 0.288
0.693ValTrp: 0.693 ± 0.085
3.149ValTyr: 3.149 ± 0.192
0.0ValXaa: 0.0 ± 0.0
Trp
0.681TrpAla: 0.681 ± 0.083
0.176TrpCys: 0.176 ± 0.042
0.717TrpAsp: 0.717 ± 0.112
0.846TrpGlu: 0.846 ± 0.106
0.658TrpPhe: 0.658 ± 0.073
0.634TrpGly: 0.634 ± 0.084
0.317TrpHis: 0.317 ± 0.056
0.975TrpIle: 0.975 ± 0.088
0.822TrpLys: 0.822 ± 0.096
1.151TrpLeu: 1.151 ± 0.103
0.376TrpMet: 0.376 ± 0.068
0.764TrpAsn: 0.764 ± 0.102
0.446TrpPro: 0.446 ± 0.069
0.376TrpGln: 0.376 ± 0.063
0.587TrpArg: 0.587 ± 0.07
0.881TrpSer: 0.881 ± 0.097
0.717TrpThr: 0.717 ± 0.105
1.128TrpVal: 1.128 ± 0.153
0.176TrpTrp: 0.176 ± 0.053
0.564TrpTyr: 0.564 ± 0.079
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.467TyrAla: 2.467 ± 0.205
0.458TyrCys: 0.458 ± 0.072
2.549TyrAsp: 2.549 ± 0.179
2.08TyrGlu: 2.08 ± 0.179
1.727TyrPhe: 1.727 ± 0.165
2.326TyrGly: 2.326 ± 0.168
1.01TyrHis: 1.01 ± 0.111
3.196TyrIle: 3.196 ± 0.206
2.69TyrLys: 2.69 ± 0.179
3.901TyrLeu: 3.901 ± 0.243
1.14TyrMet: 1.14 ± 0.121
2.69TyrAsn: 2.69 ± 0.184
1.974TyrPro: 1.974 ± 0.148
1.633TyrGln: 1.633 ± 0.162
2.479TyrArg: 2.479 ± 0.189
2.82TyrSer: 2.82 ± 0.188
2.831TyrThr: 2.831 ± 0.199
2.761TyrVal: 2.761 ± 0.178
0.599TyrTrp: 0.599 ± 0.079
1.974TyrTyr: 1.974 ± 0.178
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 369 proteins (85117 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski