Amino acid dipepetide frequency for Pelagibacter phage HTVC008M

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.016AlaAla: 4.016 ± 0.315
0.489AlaCys: 0.489 ± 0.097
4.739AlaAsp: 4.739 ± 0.437
3.443AlaGlu: 3.443 ± 0.327
2.316AlaPhe: 2.316 ± 0.246
4.973AlaGly: 4.973 ± 0.436
0.808AlaHis: 0.808 ± 0.127
4.208AlaIle: 4.208 ± 0.292
4.526AlaLys: 4.526 ± 0.363
4.356AlaLeu: 4.356 ± 0.326
1.594AlaMet: 1.594 ± 0.165
3.91AlaAsn: 3.91 ± 0.295
1.53AlaPro: 1.53 ± 0.202
1.955AlaGln: 1.955 ± 0.185
2.146AlaArg: 2.146 ± 0.221
4.293AlaSer: 4.293 ± 0.339
5.546AlaThr: 5.546 ± 0.641
3.421AlaVal: 3.421 ± 0.242
0.616AlaTrp: 0.616 ± 0.14
1.785AlaTyr: 1.785 ± 0.179
0.0AlaXaa: 0.0 ± 0.0
Cys
0.404CysAla: 0.404 ± 0.088
0.085CysCys: 0.085 ± 0.046
0.616CysAsp: 0.616 ± 0.111
0.425CysGlu: 0.425 ± 0.096
0.298CysPhe: 0.298 ± 0.074
0.51CysGly: 0.51 ± 0.118
0.276CysHis: 0.276 ± 0.074
0.638CysIle: 0.638 ± 0.12
0.659CysLys: 0.659 ± 0.118
0.446CysLeu: 0.446 ± 0.094
0.128CysMet: 0.128 ± 0.048
0.34CysAsn: 0.34 ± 0.084
0.276CysPro: 0.276 ± 0.072
0.276CysGln: 0.276 ± 0.076
0.234CysArg: 0.234 ± 0.083
0.446CysSer: 0.446 ± 0.085
0.638CysThr: 0.638 ± 0.111
0.489CysVal: 0.489 ± 0.104
0.043CysTrp: 0.043 ± 0.028
0.361CysTyr: 0.361 ± 0.085
0.0CysXaa: 0.0 ± 0.0
Asp
3.889AspAla: 3.889 ± 0.383
0.531AspCys: 0.531 ± 0.113
3.953AspAsp: 3.953 ± 0.273
4.739AspGlu: 4.739 ± 0.379
3.379AspPhe: 3.379 ± 0.279
5.228AspGly: 5.228 ± 0.428
0.978AspHis: 0.978 ± 0.159
5.993AspIle: 5.993 ± 0.428
5.376AspLys: 5.376 ± 0.413
4.973AspLeu: 4.973 ± 0.324
2.529AspMet: 2.529 ± 0.279
3.889AspAsn: 3.889 ± 0.287
2.656AspPro: 2.656 ± 0.226
1.849AspGln: 1.849 ± 0.21
2.571AspArg: 2.571 ± 0.216
3.676AspSer: 3.676 ± 0.336
4.654AspThr: 4.654 ± 0.417
4.038AspVal: 4.038 ± 0.275
0.786AspTrp: 0.786 ± 0.146
3.145AspTyr: 3.145 ± 0.283
0.0AspXaa: 0.0 ± 0.0
Glu
3.358GluAla: 3.358 ± 0.298
0.553GluCys: 0.553 ± 0.104
4.399GluAsp: 4.399 ± 0.389
4.25GluGlu: 4.25 ± 0.396
2.933GluPhe: 2.933 ± 0.217
3.825GluGly: 3.825 ± 0.27
1.594GluHis: 1.594 ± 0.215
4.781GluIle: 4.781 ± 0.38
5.631GluLys: 5.631 ± 0.456
5.164GluLeu: 5.164 ± 0.303
1.743GluMet: 1.743 ± 0.22
3.591GluAsn: 3.591 ± 0.284
1.509GluPro: 1.509 ± 0.171
2.996GluGln: 2.996 ± 0.26
2.614GluArg: 2.614 ± 0.309
2.401GluSer: 2.401 ± 0.226
4.888GluThr: 4.888 ± 0.339
3.294GluVal: 3.294 ± 0.26
0.978GluTrp: 0.978 ± 0.152
2.848GluTyr: 2.848 ± 0.246
0.0GluXaa: 0.0 ± 0.0
Phe
2.168PheAla: 2.168 ± 0.186
0.468PheCys: 0.468 ± 0.095
3.485PheAsp: 3.485 ± 0.246
3.081PheGlu: 3.081 ± 0.34
1.424PhePhe: 1.424 ± 0.173
2.699PheGly: 2.699 ± 0.253
0.723PheHis: 0.723 ± 0.112
2.784PheIle: 2.784 ± 0.235
3.591PheLys: 3.591 ± 0.291
2.933PheLeu: 2.933 ± 0.299
1.02PheMet: 1.02 ± 0.123
2.848PheAsn: 2.848 ± 0.253
1.488PhePro: 1.488 ± 0.185
1.275PheGln: 1.275 ± 0.162
1.53PheArg: 1.53 ± 0.185
2.933PheSer: 2.933 ± 0.261
3.294PheThr: 3.294 ± 0.291
2.678PheVal: 2.678 ± 0.273
0.404PheTrp: 0.404 ± 0.106
1.594PheTyr: 1.594 ± 0.167
0.0PheXaa: 0.0 ± 0.0
Gly
4.781GlyAla: 4.781 ± 0.438
0.531GlyCys: 0.531 ± 0.115
4.696GlyAsp: 4.696 ± 0.295
3.91GlyGlu: 3.91 ± 0.356
2.253GlyPhe: 2.253 ± 0.255
5.78GlyGly: 5.78 ± 0.965
1.211GlyHis: 1.211 ± 0.227
3.485GlyIle: 3.485 ± 0.271
4.696GlyLys: 4.696 ± 0.427
4.781GlyLeu: 4.781 ± 0.359
1.403GlyMet: 1.403 ± 0.148
4.101GlyAsn: 4.101 ± 0.352
1.211GlyPro: 1.211 ± 0.159
2.444GlyGln: 2.444 ± 0.239
2.21GlyArg: 2.21 ± 0.222
5.143GlySer: 5.143 ± 0.61
6.715GlyThr: 6.715 ± 0.644
3.783GlyVal: 3.783 ± 0.329
0.723GlyTrp: 0.723 ± 0.121
2.571GlyTyr: 2.571 ± 0.22
0.0GlyXaa: 0.0 ± 0.0
His
0.85HisAla: 0.85 ± 0.143
0.191HisCys: 0.191 ± 0.075
1.148HisAsp: 1.148 ± 0.191
1.19HisGlu: 1.19 ± 0.19
0.935HisPhe: 0.935 ± 0.159
1.169HisGly: 1.169 ± 0.168
0.616HisHis: 0.616 ± 0.155
1.36HisIle: 1.36 ± 0.16
1.636HisLys: 1.636 ± 0.195
1.551HisLeu: 1.551 ± 0.167
0.574HisMet: 0.574 ± 0.117
1.19HisAsn: 1.19 ± 0.169
0.808HisPro: 0.808 ± 0.142
0.51HisGln: 0.51 ± 0.114
0.893HisArg: 0.893 ± 0.146
0.999HisSer: 0.999 ± 0.135
1.063HisThr: 1.063 ± 0.17
1.063HisVal: 1.063 ± 0.168
0.361HisTrp: 0.361 ± 0.093
1.318HisTyr: 1.318 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
4.186IleAla: 4.186 ± 0.378
0.446IleCys: 0.446 ± 0.108
5.44IleAsp: 5.44 ± 0.396
4.781IleGlu: 4.781 ± 0.342
2.21IlePhe: 2.21 ± 0.188
3.761IleGly: 3.761 ± 0.295
1.509IleHis: 1.509 ± 0.213
4.803IleIle: 4.803 ± 0.345
6.588IleLys: 6.588 ± 0.401
5.313IleLeu: 5.313 ± 0.344
1.658IleMet: 1.658 ± 0.231
4.25IleAsn: 4.25 ± 0.283
2.231IlePro: 2.231 ± 0.193
2.571IleGln: 2.571 ± 0.209
2.699IleArg: 2.699 ± 0.284
4.038IleSer: 4.038 ± 0.257
5.674IleThr: 5.674 ± 0.53
4.548IleVal: 4.548 ± 0.283
0.468IleTrp: 0.468 ± 0.092
2.486IleTyr: 2.486 ± 0.243
0.0IleXaa: 0.0 ± 0.0
Lys
4.314LysAla: 4.314 ± 0.366
0.446LysCys: 0.446 ± 0.09
5.716LysAsp: 5.716 ± 0.429
7.034LysGlu: 7.034 ± 0.495
3.358LysPhe: 3.358 ± 0.34
3.761LysGly: 3.761 ± 0.309
1.849LysHis: 1.849 ± 0.238
5.185LysIle: 5.185 ± 0.345
7.565LysLys: 7.565 ± 0.696
6.481LysLeu: 6.481 ± 0.565
2.465LysMet: 2.465 ± 0.292
4.484LysAsn: 4.484 ± 0.339
2.316LysPro: 2.316 ± 0.252
3.464LysGln: 3.464 ± 0.305
3.124LysArg: 3.124 ± 0.365
5.398LysSer: 5.398 ± 0.358
4.739LysThr: 4.739 ± 0.314
5.185LysVal: 5.185 ± 0.383
1.084LysTrp: 1.084 ± 0.15
3.655LysTyr: 3.655 ± 0.35
0.0LysXaa: 0.0 ± 0.0
Leu
4.335LeuAla: 4.335 ± 0.311
0.616LeuCys: 0.616 ± 0.111
5.206LeuAsp: 5.206 ± 0.339
4.76LeuGlu: 4.76 ± 0.327
2.826LeuPhe: 2.826 ± 0.251
4.186LeuGly: 4.186 ± 0.29
1.36LeuHis: 1.36 ± 0.207
4.994LeuIle: 4.994 ± 0.317
6.673LeuLys: 6.673 ± 0.487
4.781LeuLeu: 4.781 ± 0.385
1.679LeuMet: 1.679 ± 0.237
4.718LeuAsn: 4.718 ± 0.334
3.698LeuPro: 3.698 ± 0.222
2.614LeuGln: 2.614 ± 0.228
2.975LeuArg: 2.975 ± 0.223
4.654LeuSer: 4.654 ± 0.388
6.099LeuThr: 6.099 ± 0.666
4.781LeuVal: 4.781 ± 0.444
0.616LeuTrp: 0.616 ± 0.115
2.72LeuTyr: 2.72 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
2.231MetAla: 2.231 ± 0.232
0.213MetCys: 0.213 ± 0.059
0.829MetAsp: 0.829 ± 0.142
1.403MetGlu: 1.403 ± 0.191
1.488MetPhe: 1.488 ± 0.193
1.233MetGly: 1.233 ± 0.184
0.255MetHis: 0.255 ± 0.081
2.083MetIle: 2.083 ± 0.262
2.869MetLys: 2.869 ± 0.296
2.019MetLeu: 2.019 ± 0.2
0.638MetMet: 0.638 ± 0.111
1.913MetAsn: 1.913 ± 0.188
1.254MetPro: 1.254 ± 0.15
0.595MetGln: 0.595 ± 0.113
1.296MetArg: 1.296 ± 0.133
1.785MetSer: 1.785 ± 0.184
1.828MetThr: 1.828 ± 0.207
1.084MetVal: 1.084 ± 0.141
0.276MetTrp: 0.276 ± 0.08
0.956MetTyr: 0.956 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
3.846AsnAla: 3.846 ± 0.32
0.659AsnCys: 0.659 ± 0.127
3.91AsnAsp: 3.91 ± 0.276
3.868AsnGlu: 3.868 ± 0.265
3.018AsnPhe: 3.018 ± 0.289
3.251AsnGly: 3.251 ± 0.256
1.211AsnHis: 1.211 ± 0.22
4.484AsnIle: 4.484 ± 0.32
4.675AsnLys: 4.675 ± 0.329
4.845AsnLeu: 4.845 ± 0.294
1.976AsnMet: 1.976 ± 0.2
3.931AsnAsn: 3.931 ± 0.259
2.444AsnPro: 2.444 ± 0.217
1.891AsnGln: 1.891 ± 0.219
1.636AsnArg: 1.636 ± 0.192
3.698AsnSer: 3.698 ± 0.264
3.953AsnThr: 3.953 ± 0.303
4.611AsnVal: 4.611 ± 0.403
0.659AsnTrp: 0.659 ± 0.112
2.529AsnTyr: 2.529 ± 0.213
0.0AsnXaa: 0.0 ± 0.0
Pro
1.785ProAla: 1.785 ± 0.184
0.106ProCys: 0.106 ± 0.052
2.061ProAsp: 2.061 ± 0.207
2.593ProGlu: 2.593 ± 0.267
1.381ProPhe: 1.381 ± 0.165
2.529ProGly: 2.529 ± 0.288
0.744ProHis: 0.744 ± 0.136
2.274ProIle: 2.274 ± 0.23
2.444ProLys: 2.444 ± 0.258
2.083ProLeu: 2.083 ± 0.221
0.659ProMet: 0.659 ± 0.137
1.934ProAsn: 1.934 ± 0.191
0.893ProPro: 0.893 ± 0.128
1.19ProGln: 1.19 ± 0.161
1.275ProArg: 1.275 ± 0.171
2.508ProSer: 2.508 ± 0.215
3.039ProThr: 3.039 ± 0.227
2.104ProVal: 2.104 ± 0.181
0.319ProTrp: 0.319 ± 0.069
1.615ProTyr: 1.615 ± 0.21
0.0ProXaa: 0.0 ± 0.0
Gln
1.658GlnAla: 1.658 ± 0.194
0.106GlnCys: 0.106 ± 0.047
1.849GlnAsp: 1.849 ± 0.178
2.274GlnGlu: 2.274 ± 0.236
1.211GlnPhe: 1.211 ± 0.167
2.571GlnGly: 2.571 ± 0.248
0.638GlnHis: 0.638 ± 0.124
2.826GlnIle: 2.826 ± 0.278
2.954GlnLys: 2.954 ± 0.308
2.975GlnLeu: 2.975 ± 0.281
1.041GlnMet: 1.041 ± 0.141
1.934GlnAsn: 1.934 ± 0.173
1.403GlnPro: 1.403 ± 0.172
1.806GlnGln: 1.806 ± 0.239
1.296GlnArg: 1.296 ± 0.175
2.125GlnSer: 2.125 ± 0.265
2.253GlnThr: 2.253 ± 0.27
2.274GlnVal: 2.274 ± 0.214
0.446GlnTrp: 0.446 ± 0.101
1.891GlnTyr: 1.891 ± 0.257
0.0GlnXaa: 0.0 ± 0.0
Arg
2.231ArgAla: 2.231 ± 0.216
0.234ArgCys: 0.234 ± 0.069
2.274ArgAsp: 2.274 ± 0.25
2.04ArgGlu: 2.04 ± 0.23
1.743ArgPhe: 1.743 ± 0.236
2.168ArgGly: 2.168 ± 0.242
0.744ArgHis: 0.744 ± 0.121
2.401ArgIle: 2.401 ± 0.226
3.209ArgLys: 3.209 ± 0.34
2.954ArgLeu: 2.954 ± 0.222
1.424ArgMet: 1.424 ± 0.182
1.913ArgAsn: 1.913 ± 0.247
1.296ArgPro: 1.296 ± 0.195
1.296ArgGln: 1.296 ± 0.165
1.806ArgArg: 1.806 ± 0.257
1.976ArgSer: 1.976 ± 0.19
2.04ArgThr: 2.04 ± 0.227
2.338ArgVal: 2.338 ± 0.244
0.553ArgTrp: 0.553 ± 0.108
1.658ArgTyr: 1.658 ± 0.212
0.0ArgXaa: 0.0 ± 0.0
Ser
4.293SerAla: 4.293 ± 0.376
0.51SerCys: 0.51 ± 0.118
3.613SerAsp: 3.613 ± 0.279
3.166SerGlu: 3.166 ± 0.282
3.251SerPhe: 3.251 ± 0.267
6.439SerGly: 6.439 ± 0.69
1.063SerHis: 1.063 ± 0.162
3.974SerIle: 3.974 ± 0.298
4.271SerLys: 4.271 ± 0.381
4.314SerLeu: 4.314 ± 0.325
1.233SerMet: 1.233 ± 0.143
3.825SerAsn: 3.825 ± 0.351
2.04SerPro: 2.04 ± 0.183
1.806SerGln: 1.806 ± 0.197
1.828SerArg: 1.828 ± 0.207
5.206SerSer: 5.206 ± 0.653
5.61SerThr: 5.61 ± 0.542
3.698SerVal: 3.698 ± 0.359
0.765SerTrp: 0.765 ± 0.138
2.444SerTyr: 2.444 ± 0.243
0.0SerXaa: 0.0 ± 0.0
Thr
5.61ThrAla: 5.61 ± 0.632
0.34ThrCys: 0.34 ± 0.083
5.759ThrAsp: 5.759 ± 0.469
3.57ThrGlu: 3.57 ± 0.325
3.676ThrPhe: 3.676 ± 0.337
5.929ThrGly: 5.929 ± 0.569
1.36ThrHis: 1.36 ± 0.17
6.141ThrIle: 6.141 ± 0.429
5.228ThrLys: 5.228 ± 0.374
6.609ThrLeu: 6.609 ± 0.819
1.211ThrMet: 1.211 ± 0.152
4.824ThrAsn: 4.824 ± 0.39
3.039ThrPro: 3.039 ± 0.229
2.38ThrGln: 2.38 ± 0.261
2.21ThrArg: 2.21 ± 0.23
4.718ThrSer: 4.718 ± 0.477
6.609ThrThr: 6.609 ± 0.901
4.994ThrVal: 4.994 ± 0.528
0.553ThrTrp: 0.553 ± 0.108
2.21ThrTyr: 2.21 ± 0.223
0.0ThrXaa: 0.0 ± 0.0
Val
4.038ValAla: 4.038 ± 0.268
0.425ValCys: 0.425 ± 0.09
5.058ValAsp: 5.058 ± 0.381
3.655ValGlu: 3.655 ± 0.293
2.444ValPhe: 2.444 ± 0.218
3.74ValGly: 3.74 ± 0.258
1.254ValHis: 1.254 ± 0.193
4.08ValIle: 4.08 ± 0.303
4.25ValLys: 4.25 ± 0.324
3.868ValLeu: 3.868 ± 0.264
1.87ValMet: 1.87 ± 0.174
4.059ValAsn: 4.059 ± 0.309
2.04ValPro: 2.04 ± 0.199
2.55ValGln: 2.55 ± 0.204
1.955ValArg: 1.955 ± 0.233
4.25ValSer: 4.25 ± 0.337
5.249ValThr: 5.249 ± 0.473
3.315ValVal: 3.315 ± 0.335
0.638ValTrp: 0.638 ± 0.115
2.274ValTyr: 2.274 ± 0.235
0.0ValXaa: 0.0 ± 0.0
Trp
0.531TrpAla: 0.531 ± 0.101
0.064TrpCys: 0.064 ± 0.036
0.723TrpAsp: 0.723 ± 0.117
0.574TrpGlu: 0.574 ± 0.123
0.383TrpPhe: 0.383 ± 0.102
0.383TrpGly: 0.383 ± 0.101
0.383TrpHis: 0.383 ± 0.107
0.723TrpIle: 0.723 ± 0.14
0.999TrpLys: 0.999 ± 0.145
0.914TrpLeu: 0.914 ± 0.136
0.234TrpMet: 0.234 ± 0.073
0.701TrpAsn: 0.701 ± 0.119
0.34TrpPro: 0.34 ± 0.092
0.51TrpGln: 0.51 ± 0.098
0.51TrpArg: 0.51 ± 0.105
0.893TrpSer: 0.893 ± 0.131
0.595TrpThr: 0.595 ± 0.121
0.85TrpVal: 0.85 ± 0.143
0.17TrpTrp: 0.17 ± 0.078
0.659TrpTyr: 0.659 ± 0.134
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.146TyrAla: 2.146 ± 0.213
0.574TyrCys: 0.574 ± 0.111
3.358TyrAsp: 3.358 ± 0.313
2.593TyrGlu: 2.593 ± 0.245
1.891TyrPhe: 1.891 ± 0.189
2.295TyrGly: 2.295 ± 0.246
0.85TyrHis: 0.85 ± 0.143
2.465TyrIle: 2.465 ± 0.271
3.783TyrLys: 3.783 ± 0.293
2.933TyrLeu: 2.933 ± 0.259
1.148TyrMet: 1.148 ± 0.145
2.741TyrAsn: 2.741 ± 0.278
1.063TyrPro: 1.063 ± 0.146
1.551TyrGln: 1.551 ± 0.202
1.488TyrArg: 1.488 ± 0.178
2.21TyrSer: 2.21 ± 0.187
2.486TyrThr: 2.486 ± 0.256
2.465TyrVal: 2.465 ± 0.217
0.68TyrTrp: 0.68 ± 0.124
1.849TyrTyr: 1.849 ± 0.22
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 198 proteins (47059 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski