Amino acid dipepetide frequency for Shewanella phage Thanatos-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.695AlaAla: 3.695 ± 0.338
0.363AlaCys: 0.363 ± 0.096
2.948AlaAsp: 2.948 ± 0.263
3.614AlaGlu: 3.614 ± 0.371
2.08AlaPhe: 2.08 ± 0.25
3.089AlaGly: 3.089 ± 0.444
0.828AlaHis: 0.828 ± 0.151
4.765AlaIle: 4.765 ± 0.294
4.907AlaLys: 4.907 ± 0.344
6.078AlaLeu: 6.078 ± 0.354
1.373AlaMet: 1.373 ± 0.198
3.15AlaAsn: 3.15 ± 0.359
2.039AlaPro: 2.039 ± 0.21
1.757AlaGln: 1.757 ± 0.208
2.342AlaArg: 2.342 ± 0.238
4.099AlaSer: 4.099 ± 0.246
3.635AlaThr: 3.635 ± 0.425
3.493AlaVal: 3.493 ± 0.292
0.646AlaTrp: 0.646 ± 0.101
2.625AlaTyr: 2.625 ± 0.234
0.0AlaXaa: 0.0 ± 0.0
Cys
0.586CysAla: 0.586 ± 0.109
0.141CysCys: 0.141 ± 0.053
0.464CysAsp: 0.464 ± 0.094
0.747CysGlu: 0.747 ± 0.116
0.444CysPhe: 0.444 ± 0.109
0.828CysGly: 0.828 ± 0.146
0.343CysHis: 0.343 ± 0.078
0.828CysIle: 0.828 ± 0.126
0.848CysLys: 0.848 ± 0.163
1.01CysLeu: 1.01 ± 0.143
0.202CysMet: 0.202 ± 0.055
0.747CysAsn: 0.747 ± 0.136
0.485CysPro: 0.485 ± 0.096
0.283CysGln: 0.283 ± 0.071
0.485CysArg: 0.485 ± 0.107
0.989CysSer: 0.989 ± 0.141
0.646CysThr: 0.646 ± 0.132
0.808CysVal: 0.808 ± 0.137
0.101CysTrp: 0.101 ± 0.046
0.727CysTyr: 0.727 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
2.847AspAla: 2.847 ± 0.268
0.666AspCys: 0.666 ± 0.11
3.231AspAsp: 3.231 ± 0.251
3.837AspGlu: 3.837 ± 0.299
3.352AspPhe: 3.352 ± 0.27
3.938AspGly: 3.938 ± 0.308
0.808AspHis: 0.808 ± 0.128
4.806AspIle: 4.806 ± 0.268
4.483AspLys: 4.483 ± 0.339
5.371AspLeu: 5.371 ± 0.265
1.757AspMet: 1.757 ± 0.179
3.029AspAsn: 3.029 ± 0.235
2.403AspPro: 2.403 ± 0.244
1.393AspGln: 1.393 ± 0.142
2.262AspArg: 2.262 ± 0.213
4.988AspSer: 4.988 ± 0.325
2.787AspThr: 2.787 ± 0.215
3.695AspVal: 3.695 ± 0.248
1.01AspTrp: 1.01 ± 0.144
3.009AspTyr: 3.009 ± 0.25
0.0AspXaa: 0.0 ± 0.0
Glu
3.776GluAla: 3.776 ± 0.274
0.929GluCys: 0.929 ± 0.147
3.372GluAsp: 3.372 ± 0.259
5.089GluGlu: 5.089 ± 0.368
3.17GluPhe: 3.17 ± 0.255
3.776GluGly: 3.776 ± 0.291
1.252GluHis: 1.252 ± 0.175
5.553GluIle: 5.553 ± 0.344
4.624GluLys: 4.624 ± 0.359
7.815GluLeu: 7.815 ± 0.452
2.019GluMet: 2.019 ± 0.248
3.877GluAsn: 3.877 ± 0.268
1.817GluPro: 1.817 ± 0.177
2.14GluGln: 2.14 ± 0.259
2.363GluArg: 2.363 ± 0.202
5.593GluSer: 5.593 ± 0.369
3.453GluThr: 3.453 ± 0.35
4.139GluVal: 4.139 ± 0.259
0.788GluTrp: 0.788 ± 0.129
3.312GluTyr: 3.312 ± 0.28
0.0GluXaa: 0.0 ± 0.0
Phe
1.777PheAla: 1.777 ± 0.184
0.404PheCys: 0.404 ± 0.092
3.332PheAsp: 3.332 ± 0.318
2.787PheGlu: 2.787 ± 0.228
1.333PhePhe: 1.333 ± 0.178
2.383PheGly: 2.383 ± 0.302
0.485PheHis: 0.485 ± 0.104
2.989PheIle: 2.989 ± 0.293
3.877PheLys: 3.877 ± 0.339
3.089PheLeu: 3.089 ± 0.277
1.373PheMet: 1.373 ± 0.176
3.15PheAsn: 3.15 ± 0.222
1.191PhePro: 1.191 ± 0.148
1.131PheGln: 1.131 ± 0.141
1.898PheArg: 1.898 ± 0.193
3.796PheSer: 3.796 ± 0.219
2.544PheThr: 2.544 ± 0.21
2.464PheVal: 2.464 ± 0.213
0.384PheTrp: 0.384 ± 0.078
1.737PheTyr: 1.737 ± 0.176
0.0PheXaa: 0.0 ± 0.0
Gly
3.433GlyAla: 3.433 ± 0.342
0.565GlyCys: 0.565 ± 0.108
3.271GlyAsp: 3.271 ± 0.263
3.514GlyGlu: 3.514 ± 0.291
2.605GlyPhe: 2.605 ± 0.289
2.908GlyGly: 2.908 ± 0.395
1.171GlyHis: 1.171 ± 0.143
4.806GlyIle: 4.806 ± 0.274
4.119GlyLys: 4.119 ± 0.287
5.654GlyLeu: 5.654 ± 0.308
1.615GlyMet: 1.615 ± 0.216
3.372GlyAsn: 3.372 ± 0.257
1.393GlyPro: 1.393 ± 0.161
2.221GlyGln: 2.221 ± 0.27
2.342GlyArg: 2.342 ± 0.198
4.22GlySer: 4.22 ± 0.452
3.776GlyThr: 3.776 ± 0.348
3.574GlyVal: 3.574 ± 0.35
0.565GlyTrp: 0.565 ± 0.124
2.464GlyTyr: 2.464 ± 0.219
0.0GlyXaa: 0.0 ± 0.0
His
0.747HisAla: 0.747 ± 0.115
0.263HisCys: 0.263 ± 0.075
0.888HisAsp: 0.888 ± 0.147
0.989HisGlu: 0.989 ± 0.128
0.828HisPhe: 0.828 ± 0.134
0.788HisGly: 0.788 ± 0.132
0.303HisHis: 0.303 ± 0.078
1.555HisIle: 1.555 ± 0.18
1.514HisLys: 1.514 ± 0.176
1.737HisLeu: 1.737 ± 0.184
0.545HisMet: 0.545 ± 0.102
0.969HisAsn: 0.969 ± 0.143
0.687HisPro: 0.687 ± 0.109
0.626HisGln: 0.626 ± 0.087
0.808HisArg: 0.808 ± 0.121
1.595HisSer: 1.595 ± 0.161
1.03HisThr: 1.03 ± 0.128
1.292HisVal: 1.292 ± 0.161
0.242HisTrp: 0.242 ± 0.073
0.747HisTyr: 0.747 ± 0.141
0.0HisXaa: 0.0 ± 0.0
Ile
4.422IleAla: 4.422 ± 0.386
1.333IleCys: 1.333 ± 0.175
5.19IleAsp: 5.19 ± 0.332
5.371IleGlu: 5.371 ± 0.354
2.585IlePhe: 2.585 ± 0.207
3.857IleGly: 3.857 ± 0.325
1.191IleHis: 1.191 ± 0.16
4.705IleIle: 4.705 ± 0.319
6.24IleLys: 6.24 ± 0.379
6.118IleLeu: 6.118 ± 0.389
1.474IleMet: 1.474 ± 0.155
4.664IleAsn: 4.664 ± 0.321
2.908IlePro: 2.908 ± 0.214
2.504IleGln: 2.504 ± 0.226
3.089IleArg: 3.089 ± 0.225
5.977IleSer: 5.977 ± 0.36
4.382IleThr: 4.382 ± 0.299
4.523IleVal: 4.523 ± 0.299
0.707IleTrp: 0.707 ± 0.123
3.089IleTyr: 3.089 ± 0.252
0.0IleXaa: 0.0 ± 0.0
Lys
5.513LysAla: 5.513 ± 0.352
0.828LysCys: 0.828 ± 0.128
4.967LysAsp: 4.967 ± 0.316
5.977LysGlu: 5.977 ± 0.452
3.857LysPhe: 3.857 ± 0.305
3.816LysGly: 3.816 ± 0.347
1.555LysHis: 1.555 ± 0.171
5.27LysIle: 5.27 ± 0.354
4.786LysLys: 4.786 ± 0.372
6.603LysLeu: 6.603 ± 0.436
2.443LysMet: 2.443 ± 0.255
4.018LysAsn: 4.018 ± 0.289
2.948LysPro: 2.948 ± 0.224
2.221LysGln: 2.221 ± 0.254
2.686LysArg: 2.686 ± 0.229
5.331LysSer: 5.331 ± 0.355
4.927LysThr: 4.927 ± 0.308
5.109LysVal: 5.109 ± 0.324
1.191LysTrp: 1.191 ± 0.161
3.231LysTyr: 3.231 ± 0.32
0.0LysXaa: 0.0 ± 0.0
Leu
5.614LeuAla: 5.614 ± 0.368
1.111LeuCys: 1.111 ± 0.161
6.482LeuAsp: 6.482 ± 0.426
7.269LeuGlu: 7.269 ± 0.463
3.453LeuPhe: 3.453 ± 0.263
4.765LeuGly: 4.765 ± 0.306
1.535LeuHis: 1.535 ± 0.186
5.452LeuIle: 5.452 ± 0.291
7.148LeuLys: 7.148 ± 0.395
6.845LeuLeu: 6.845 ± 0.427
2.181LeuMet: 2.181 ± 0.206
5.129LeuAsn: 5.129 ± 0.295
3.312LeuPro: 3.312 ± 0.311
2.766LeuGln: 2.766 ± 0.248
3.958LeuArg: 3.958 ± 0.257
6.744LeuSer: 6.744 ± 0.42
4.866LeuThr: 4.866 ± 0.334
5.048LeuVal: 5.048 ± 0.296
0.949LeuTrp: 0.949 ± 0.141
3.534LeuTyr: 3.534 ± 0.251
0.0LeuXaa: 0.0 ± 0.0
Met
1.918MetAla: 1.918 ± 0.183
0.303MetCys: 0.303 ± 0.089
1.636MetAsp: 1.636 ± 0.182
1.777MetGlu: 1.777 ± 0.171
1.171MetPhe: 1.171 ± 0.146
1.252MetGly: 1.252 ± 0.151
0.424MetHis: 0.424 ± 0.093
1.979MetIle: 1.979 ± 0.192
1.555MetLys: 1.555 ± 0.199
2.181MetLeu: 2.181 ± 0.233
0.485MetMet: 0.485 ± 0.103
1.575MetAsn: 1.575 ± 0.169
1.131MetPro: 1.131 ± 0.166
0.929MetGln: 0.929 ± 0.123
0.828MetArg: 0.828 ± 0.134
2.08MetSer: 2.08 ± 0.235
1.676MetThr: 1.676 ± 0.194
1.676MetVal: 1.676 ± 0.182
0.283MetTrp: 0.283 ± 0.068
1.09MetTyr: 1.09 ± 0.16
0.0MetXaa: 0.0 ± 0.0
Asn
3.17AsnAla: 3.17 ± 0.296
0.808AsnCys: 0.808 ± 0.133
3.19AsnAsp: 3.19 ± 0.262
2.928AsnGlu: 2.928 ± 0.263
2.403AsnPhe: 2.403 ± 0.24
3.473AsnGly: 3.473 ± 0.318
1.333AsnHis: 1.333 ± 0.15
4.745AsnIle: 4.745 ± 0.306
4.624AsnLys: 4.624 ± 0.343
4.907AsnLeu: 4.907 ± 0.345
1.474AsnMet: 1.474 ± 0.167
3.614AsnAsn: 3.614 ± 0.316
2.524AsnPro: 2.524 ± 0.2
1.656AsnGln: 1.656 ± 0.196
2.706AsnArg: 2.706 ± 0.194
4.745AsnSer: 4.745 ± 0.304
3.675AsnThr: 3.675 ± 0.312
3.069AsnVal: 3.069 ± 0.248
0.666AsnTrp: 0.666 ± 0.102
2.342AsnTyr: 2.342 ± 0.219
0.0AsnXaa: 0.0 ± 0.0
Pro
1.737ProAla: 1.737 ± 0.192
0.384ProCys: 0.384 ± 0.087
2.181ProAsp: 2.181 ± 0.166
3.049ProGlu: 3.049 ± 0.258
1.272ProPhe: 1.272 ± 0.158
2.585ProGly: 2.585 ± 0.25
0.586ProHis: 0.586 ± 0.105
2.181ProIle: 2.181 ± 0.228
3.231ProLys: 3.231 ± 0.309
2.847ProLeu: 2.847 ± 0.249
0.606ProMet: 0.606 ± 0.103
1.999ProAsn: 1.999 ± 0.218
1.131ProPro: 1.131 ± 0.173
0.989ProGln: 0.989 ± 0.139
1.353ProArg: 1.353 ± 0.18
2.888ProSer: 2.888 ± 0.257
2.645ProThr: 2.645 ± 0.23
2.807ProVal: 2.807 ± 0.256
0.505ProTrp: 0.505 ± 0.114
1.656ProTyr: 1.656 ± 0.203
0.0ProXaa: 0.0 ± 0.0
Gln
1.979GlnAla: 1.979 ± 0.208
0.263GlnCys: 0.263 ± 0.081
1.817GlnAsp: 1.817 ± 0.144
2.262GlnGlu: 2.262 ± 0.26
1.434GlnPhe: 1.434 ± 0.157
1.615GlnGly: 1.615 ± 0.188
0.586GlnHis: 0.586 ± 0.123
2.686GlnIle: 2.686 ± 0.238
1.898GlnLys: 1.898 ± 0.221
2.867GlnLeu: 2.867 ± 0.235
0.909GlnMet: 0.909 ± 0.164
1.434GlnAsn: 1.434 ± 0.138
0.909GlnPro: 0.909 ± 0.104
1.131GlnGln: 1.131 ± 0.147
1.09GlnArg: 1.09 ± 0.137
2.181GlnSer: 2.181 ± 0.217
2.221GlnThr: 2.221 ± 0.207
2.14GlnVal: 2.14 ± 0.244
0.505GlnTrp: 0.505 ± 0.1
1.434GlnTyr: 1.434 ± 0.161
0.0GlnXaa: 0.0 ± 0.0
Arg
2.766ArgAla: 2.766 ± 0.237
0.444ArgCys: 0.444 ± 0.1
2.282ArgAsp: 2.282 ± 0.217
2.605ArgGlu: 2.605 ± 0.214
1.898ArgPhe: 1.898 ± 0.187
2.363ArgGly: 2.363 ± 0.232
0.828ArgHis: 0.828 ± 0.148
3.332ArgIle: 3.332 ± 0.243
3.17ArgLys: 3.17 ± 0.244
3.493ArgLeu: 3.493 ± 0.255
1.252ArgMet: 1.252 ± 0.157
2.342ArgAsn: 2.342 ± 0.206
1.656ArgPro: 1.656 ± 0.198
1.333ArgGln: 1.333 ± 0.157
1.716ArgArg: 1.716 ± 0.183
2.201ArgSer: 2.201 ± 0.199
2.221ArgThr: 2.221 ± 0.17
2.504ArgVal: 2.504 ± 0.231
0.464ArgTrp: 0.464 ± 0.097
1.656ArgTyr: 1.656 ± 0.177
0.0ArgXaa: 0.0 ± 0.0
Ser
3.655SerAla: 3.655 ± 0.295
0.646SerCys: 0.646 ± 0.132
3.695SerAsp: 3.695 ± 0.245
5.089SerGlu: 5.089 ± 0.377
2.968SerPhe: 2.968 ± 0.23
5.432SerGly: 5.432 ± 0.365
1.353SerHis: 1.353 ± 0.161
6.28SerIle: 6.28 ± 0.457
6.038SerLys: 6.038 ± 0.409
6.583SerLeu: 6.583 ± 0.364
1.636SerMet: 1.636 ± 0.187
4.463SerAsn: 4.463 ± 0.29
2.504SerPro: 2.504 ± 0.186
2.342SerGln: 2.342 ± 0.212
3.453SerArg: 3.453 ± 0.306
6.401SerSer: 6.401 ± 0.519
5.149SerThr: 5.149 ± 0.343
5.008SerVal: 5.008 ± 0.314
0.767SerTrp: 0.767 ± 0.115
2.867SerTyr: 2.867 ± 0.283
0.0SerXaa: 0.0 ± 0.0
Thr
3.413ThrAla: 3.413 ± 0.375
0.444ThrCys: 0.444 ± 0.101
3.13ThrAsp: 3.13 ± 0.252
4.321ThrGlu: 4.321 ± 0.329
2.161ThrPhe: 2.161 ± 0.211
3.938ThrGly: 3.938 ± 0.328
1.252ThrHis: 1.252 ± 0.146
4.483ThrIle: 4.483 ± 0.325
4.644ThrLys: 4.644 ± 0.307
5.815ThrLeu: 5.815 ± 0.387
1.636ThrMet: 1.636 ± 0.163
3.17ThrAsn: 3.17 ± 0.3
2.484ThrPro: 2.484 ± 0.277
2.221ThrGln: 2.221 ± 0.212
2.686ThrArg: 2.686 ± 0.247
3.514ThrSer: 3.514 ± 0.253
2.968ThrThr: 2.968 ± 0.304
4.039ThrVal: 4.039 ± 0.339
0.687ThrTrp: 0.687 ± 0.125
1.898ThrTyr: 1.898 ± 0.193
0.0ThrXaa: 0.0 ± 0.0
Val
3.473ValAla: 3.473 ± 0.269
0.848ValCys: 0.848 ± 0.135
4.18ValAsp: 4.18 ± 0.276
4.382ValGlu: 4.382 ± 0.272
2.746ValPhe: 2.746 ± 0.214
3.413ValGly: 3.413 ± 0.269
0.989ValHis: 0.989 ± 0.143
4.059ValIle: 4.059 ± 0.298
5.755ValLys: 5.755 ± 0.351
5.048ValLeu: 5.048 ± 0.367
1.494ValMet: 1.494 ± 0.175
3.756ValAsn: 3.756 ± 0.343
2.322ValPro: 2.322 ± 0.194
2.039ValGln: 2.039 ± 0.201
2.544ValArg: 2.544 ± 0.221
4.362ValSer: 4.362 ± 0.222
3.291ValThr: 3.291 ± 0.292
3.917ValVal: 3.917 ± 0.351
0.545ValTrp: 0.545 ± 0.097
3.11ValTyr: 3.11 ± 0.256
0.0ValXaa: 0.0 ± 0.0
Trp
0.666TrpAla: 0.666 ± 0.099
0.202TrpCys: 0.202 ± 0.063
0.929TrpAsp: 0.929 ± 0.131
0.888TrpGlu: 0.888 ± 0.123
0.565TrpPhe: 0.565 ± 0.109
0.323TrpGly: 0.323 ± 0.092
0.202TrpHis: 0.202 ± 0.066
0.828TrpIle: 0.828 ± 0.148
0.888TrpLys: 0.888 ± 0.147
0.929TrpLeu: 0.929 ± 0.132
0.384TrpMet: 0.384 ± 0.074
0.707TrpAsn: 0.707 ± 0.105
0.545TrpPro: 0.545 ± 0.103
0.323TrpGln: 0.323 ± 0.074
0.444TrpArg: 0.444 ± 0.081
0.788TrpSer: 0.788 ± 0.112
0.767TrpThr: 0.767 ± 0.121
0.767TrpVal: 0.767 ± 0.109
0.121TrpTrp: 0.121 ± 0.05
0.444TrpTyr: 0.444 ± 0.087
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.342TyrAla: 2.342 ± 0.202
0.767TyrCys: 0.767 ± 0.146
2.484TyrAsp: 2.484 ± 0.236
2.363TyrGlu: 2.363 ± 0.238
1.797TyrPhe: 1.797 ± 0.184
3.049TyrGly: 3.049 ± 0.274
1.131TyrHis: 1.131 ± 0.147
3.029TyrIle: 3.029 ± 0.287
2.928TyrLys: 2.928 ± 0.265
3.17TyrLeu: 3.17 ± 0.298
1.07TyrMet: 1.07 ± 0.162
2.888TyrAsn: 2.888 ± 0.248
2.241TyrPro: 2.241 ± 0.244
1.353TyrGln: 1.353 ± 0.156
1.474TyrArg: 1.474 ± 0.185
3.837TyrSer: 3.837 ± 0.302
2.262TyrThr: 2.262 ± 0.232
2.201TyrVal: 2.201 ± 0.2
0.586TyrTrp: 0.586 ± 0.112
1.898TyrTyr: 1.898 ± 0.257
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 193 proteins (49524 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski