Amino acid dipepetide frequency for Vibrio phage VD1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.629AlaAla: 7.629 ± 0.688
0.775AlaCys: 0.775 ± 0.162
5.548AlaAsp: 5.548 ± 0.408
5.507AlaGlu: 5.507 ± 0.49
3.1AlaPhe: 3.1 ± 0.299
5.752AlaGly: 5.752 ± 0.694
0.857AlaHis: 0.857 ± 0.195
6.038AlaIle: 6.038 ± 0.532
4.732AlaLys: 4.732 ± 0.412
8.2AlaLeu: 8.2 ± 0.557
3.427AlaMet: 3.427 ± 0.4
4.039AlaAsn: 4.039 ± 0.406
1.958AlaPro: 1.958 ± 0.244
2.733AlaGln: 2.733 ± 0.302
2.937AlaArg: 2.937 ± 0.399
4.936AlaSer: 4.936 ± 0.516
4.406AlaThr: 4.406 ± 0.541
5.997AlaVal: 5.997 ± 0.586
1.142AlaTrp: 1.142 ± 0.189
3.06AlaTyr: 3.06 ± 0.369
0.0AlaXaa: 0.0 ± 0.0
Cys
0.326CysAla: 0.326 ± 0.093
0.286CysCys: 0.286 ± 0.171
0.53CysAsp: 0.53 ± 0.127
0.979CysGlu: 0.979 ± 0.182
0.53CysPhe: 0.53 ± 0.135
0.857CysGly: 0.857 ± 0.238
0.367CysHis: 0.367 ± 0.124
0.53CysIle: 0.53 ± 0.124
0.897CysLys: 0.897 ± 0.201
0.816CysLeu: 0.816 ± 0.159
0.408CysMet: 0.408 ± 0.148
0.408CysAsn: 0.408 ± 0.127
0.694CysPro: 0.694 ± 0.158
0.49CysGln: 0.49 ± 0.157
0.53CysArg: 0.53 ± 0.148
0.857CysSer: 0.857 ± 0.172
0.49CysThr: 0.49 ± 0.136
0.612CysVal: 0.612 ± 0.132
0.326CysTrp: 0.326 ± 0.124
0.571CysTyr: 0.571 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
4.039AspAla: 4.039 ± 0.413
0.979AspCys: 0.979 ± 0.165
3.345AspAsp: 3.345 ± 0.335
4.691AspGlu: 4.691 ± 0.447
2.407AspPhe: 2.407 ± 0.261
4.12AspGly: 4.12 ± 0.451
1.183AspHis: 1.183 ± 0.247
3.672AspIle: 3.672 ± 0.447
3.223AspLys: 3.223 ± 0.314
4.691AspLeu: 4.691 ± 0.359
1.958AspMet: 1.958 ± 0.259
2.325AspAsn: 2.325 ± 0.308
2.162AspPro: 2.162 ± 0.392
1.836AspGln: 1.836 ± 0.305
2.203AspArg: 2.203 ± 0.29
3.468AspSer: 3.468 ± 0.306
3.264AspThr: 3.264 ± 0.349
4.651AspVal: 4.651 ± 0.397
1.428AspTrp: 1.428 ± 0.221
2.488AspTyr: 2.488 ± 0.392
0.0AspXaa: 0.0 ± 0.0
Glu
5.14GluAla: 5.14 ± 0.423
0.816GluCys: 0.816 ± 0.192
2.733GluAsp: 2.733 ± 0.32
4.528GluGlu: 4.528 ± 0.483
2.652GluPhe: 2.652 ± 0.361
3.835GluGly: 3.835 ± 0.352
1.509GluHis: 1.509 ± 0.225
3.957GluIle: 3.957 ± 0.338
4.079GluLys: 4.079 ± 0.394
8.2GluLeu: 8.2 ± 0.586
1.877GluMet: 1.877 ± 0.386
3.223GluAsn: 3.223 ± 0.29
2.488GluPro: 2.488 ± 0.276
5.14GluGln: 5.14 ± 0.589
3.916GluArg: 3.916 ± 0.427
4.651GluSer: 4.651 ± 0.444
3.672GluThr: 3.672 ± 0.422
4.936GluVal: 4.936 ± 0.461
0.938GluTrp: 0.938 ± 0.237
2.774GluTyr: 2.774 ± 0.333
0.0GluXaa: 0.0 ± 0.0
Phe
2.611PheAla: 2.611 ± 0.325
0.367PheCys: 0.367 ± 0.132
3.1PheAsp: 3.1 ± 0.363
2.611PheGlu: 2.611 ± 0.295
1.346PhePhe: 1.346 ± 0.242
2.325PheGly: 2.325 ± 0.337
0.571PheHis: 0.571 ± 0.126
2.285PheIle: 2.285 ± 0.271
2.774PheLys: 2.774 ± 0.38
2.733PheLeu: 2.733 ± 0.319
0.979PheMet: 0.979 ± 0.202
2.04PheAsn: 2.04 ± 0.278
1.142PhePro: 1.142 ± 0.199
1.183PheGln: 1.183 ± 0.206
2.203PheArg: 2.203 ± 0.287
3.549PheSer: 3.549 ± 0.445
2.815PheThr: 2.815 ± 0.324
2.937PheVal: 2.937 ± 0.377
0.694PheTrp: 0.694 ± 0.182
0.897PheTyr: 0.897 ± 0.185
0.0PheXaa: 0.0 ± 0.0
Gly
4.936GlyAla: 4.936 ± 0.503
0.775GlyCys: 0.775 ± 0.187
3.794GlyAsp: 3.794 ± 0.425
4.365GlyGlu: 4.365 ± 0.489
2.856GlyPhe: 2.856 ± 0.298
4.895GlyGly: 4.895 ± 0.504
1.713GlyHis: 1.713 ± 0.235
4.243GlyIle: 4.243 ± 0.421
4.977GlyLys: 4.977 ± 0.52
5.14GlyLeu: 5.14 ± 0.528
2.121GlyMet: 2.121 ± 0.315
3.631GlyAsn: 3.631 ± 0.374
0.694GlyPro: 0.694 ± 0.17
2.121GlyGln: 2.121 ± 0.316
3.304GlyArg: 3.304 ± 0.402
4.406GlySer: 4.406 ± 0.492
4.039GlyThr: 4.039 ± 0.4
4.691GlyVal: 4.691 ± 0.385
0.979GlyTrp: 0.979 ± 0.191
1.877GlyTyr: 1.877 ± 0.33
0.0GlyXaa: 0.0 ± 0.0
His
1.265HisAla: 1.265 ± 0.282
0.449HisCys: 0.449 ± 0.123
1.183HisAsp: 1.183 ± 0.21
1.02HisGlu: 1.02 ± 0.215
1.346HisPhe: 1.346 ± 0.233
1.632HisGly: 1.632 ± 0.237
0.571HisHis: 0.571 ± 0.16
0.775HisIle: 0.775 ± 0.159
1.591HisLys: 1.591 ± 0.243
2.203HisLeu: 2.203 ± 0.31
0.49HisMet: 0.49 ± 0.136
0.612HisAsn: 0.612 ± 0.133
0.694HisPro: 0.694 ± 0.173
1.305HisGln: 1.305 ± 0.191
0.734HisArg: 0.734 ± 0.208
0.938HisSer: 0.938 ± 0.235
0.979HisThr: 0.979 ± 0.216
1.101HisVal: 1.101 ± 0.237
0.694HisTrp: 0.694 ± 0.147
0.694HisTyr: 0.694 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
5.181IleAla: 5.181 ± 0.566
0.53IleCys: 0.53 ± 0.151
4.447IleAsp: 4.447 ± 0.403
4.773IleGlu: 4.773 ± 0.361
1.836IlePhe: 1.836 ± 0.289
4.079IleGly: 4.079 ± 0.414
0.857IleHis: 0.857 ± 0.171
1.917IleIle: 1.917 ± 0.331
4.12IleLys: 4.12 ± 0.374
3.631IleLeu: 3.631 ± 0.37
1.265IleMet: 1.265 ± 0.245
2.978IleAsn: 2.978 ± 0.341
2.856IlePro: 2.856 ± 0.287
2.203IleGln: 2.203 ± 0.295
3.1IleArg: 3.1 ± 0.372
4.324IleSer: 4.324 ± 0.344
4.202IleThr: 4.202 ± 0.412
3.386IleVal: 3.386 ± 0.329
0.979IleTrp: 0.979 ± 0.219
1.958IleTyr: 1.958 ± 0.23
0.0IleXaa: 0.0 ± 0.0
Lys
6.282LysAla: 6.282 ± 0.455
0.449LysCys: 0.449 ± 0.12
3.182LysAsp: 3.182 ± 0.401
4.447LysGlu: 4.447 ± 0.415
1.754LysPhe: 1.754 ± 0.325
3.59LysGly: 3.59 ± 0.393
1.713LysHis: 1.713 ± 0.307
3.223LysIle: 3.223 ± 0.399
4.079LysLys: 4.079 ± 0.494
6.038LysLeu: 6.038 ± 0.547
2.04LysMet: 2.04 ± 0.271
2.611LysAsn: 2.611 ± 0.371
2.692LysPro: 2.692 ± 0.318
3.631LysGln: 3.631 ± 0.338
3.631LysArg: 3.631 ± 0.487
4.039LysSer: 4.039 ± 0.427
3.386LysThr: 3.386 ± 0.422
3.835LysVal: 3.835 ± 0.377
0.653LysTrp: 0.653 ± 0.188
1.917LysTyr: 1.917 ± 0.287
0.0LysXaa: 0.0 ± 0.0
Leu
7.261LeuAla: 7.261 ± 0.528
1.346LeuCys: 1.346 ± 0.237
5.14LeuAsp: 5.14 ± 0.487
6.446LeuGlu: 6.446 ± 0.606
3.223LeuPhe: 3.223 ± 0.336
5.915LeuGly: 5.915 ± 0.431
1.55LeuHis: 1.55 ± 0.282
4.855LeuIle: 4.855 ± 0.564
6.405LeuLys: 6.405 ± 0.511
5.874LeuLeu: 5.874 ± 0.53
2.448LeuMet: 2.448 ± 0.352
3.957LeuAsn: 3.957 ± 0.409
3.468LeuPro: 3.468 ± 0.35
2.815LeuGln: 2.815 ± 0.326
4.365LeuArg: 4.365 ± 0.337
6.976LeuSer: 6.976 ± 0.548
5.385LeuThr: 5.385 ± 0.494
4.814LeuVal: 4.814 ± 0.432
0.857LeuTrp: 0.857 ± 0.197
2.081LeuTyr: 2.081 ± 0.299
0.0LeuXaa: 0.0 ± 0.0
Met
3.386MetAla: 3.386 ± 0.311
0.286MetCys: 0.286 ± 0.113
1.387MetAsp: 1.387 ± 0.225
1.428MetGlu: 1.428 ± 0.189
1.061MetPhe: 1.061 ± 0.168
1.632MetGly: 1.632 ± 0.283
0.612MetHis: 0.612 ± 0.142
1.469MetIle: 1.469 ± 0.2
1.632MetLys: 1.632 ± 0.255
2.815MetLeu: 2.815 ± 0.387
1.183MetMet: 1.183 ± 0.232
1.387MetAsn: 1.387 ± 0.195
1.265MetPro: 1.265 ± 0.274
1.101MetGln: 1.101 ± 0.221
1.469MetArg: 1.469 ± 0.209
3.141MetSer: 3.141 ± 0.36
1.795MetThr: 1.795 ± 0.316
1.265MetVal: 1.265 ± 0.192
0.286MetTrp: 0.286 ± 0.095
0.571MetTyr: 0.571 ± 0.147
0.0MetXaa: 0.0 ± 0.0
Asn
4.406AsnAla: 4.406 ± 0.505
0.367AsnCys: 0.367 ± 0.123
2.244AsnAsp: 2.244 ± 0.263
3.345AsnGlu: 3.345 ± 0.35
1.02AsnPhe: 1.02 ± 0.245
3.916AsnGly: 3.916 ± 0.444
0.775AsnHis: 0.775 ± 0.146
2.937AsnIle: 2.937 ± 0.366
2.325AsnLys: 2.325 ± 0.272
3.386AsnLeu: 3.386 ± 0.293
1.387AsnMet: 1.387 ± 0.261
1.591AsnAsn: 1.591 ± 0.222
2.488AsnPro: 2.488 ± 0.286
2.611AsnGln: 2.611 ± 0.377
2.407AsnArg: 2.407 ± 0.293
2.652AsnSer: 2.652 ± 0.276
2.448AsnThr: 2.448 ± 0.341
2.57AsnVal: 2.57 ± 0.338
0.734AsnTrp: 0.734 ± 0.168
1.55AsnTyr: 1.55 ± 0.273
0.0AsnXaa: 0.0 ± 0.0
Pro
2.815ProAla: 2.815 ± 0.416
0.286ProCys: 0.286 ± 0.136
3.1ProAsp: 3.1 ± 0.353
3.59ProGlu: 3.59 ± 0.355
1.387ProPhe: 1.387 ± 0.189
1.101ProGly: 1.101 ± 0.191
0.857ProHis: 0.857 ± 0.257
1.877ProIle: 1.877 ± 0.268
2.488ProLys: 2.488 ± 0.304
2.325ProLeu: 2.325 ± 0.302
1.101ProMet: 1.101 ± 0.186
1.305ProAsn: 1.305 ± 0.217
1.305ProPro: 1.305 ± 0.229
1.673ProGln: 1.673 ± 0.254
1.387ProArg: 1.387 ± 0.225
2.366ProSer: 2.366 ± 0.347
2.611ProThr: 2.611 ± 0.237
3.223ProVal: 3.223 ± 0.384
0.571ProTrp: 0.571 ± 0.153
1.101ProTyr: 1.101 ± 0.218
0.0ProXaa: 0.0 ± 0.0
Gln
3.794GlnAla: 3.794 ± 0.378
0.449GlnCys: 0.449 ± 0.132
1.632GlnAsp: 1.632 ± 0.208
2.04GlnGlu: 2.04 ± 0.365
1.999GlnPhe: 1.999 ± 0.267
2.285GlnGly: 2.285 ± 0.259
0.857GlnHis: 0.857 ± 0.149
2.896GlnIle: 2.896 ± 0.34
2.366GlnLys: 2.366 ± 0.325
4.447GlnLeu: 4.447 ± 0.398
1.224GlnMet: 1.224 ± 0.227
2.04GlnAsn: 2.04 ± 0.277
1.754GlnPro: 1.754 ± 0.294
2.448GlnGln: 2.448 ± 0.344
2.244GlnArg: 2.244 ± 0.297
2.733GlnSer: 2.733 ± 0.336
2.774GlnThr: 2.774 ± 0.405
2.692GlnVal: 2.692 ± 0.367
0.653GlnTrp: 0.653 ± 0.142
1.591GlnTyr: 1.591 ± 0.244
0.0GlnXaa: 0.0 ± 0.0
Arg
3.835ArgAla: 3.835 ± 0.441
0.408ArgCys: 0.408 ± 0.123
2.366ArgAsp: 2.366 ± 0.362
3.631ArgGlu: 3.631 ± 0.341
2.162ArgPhe: 2.162 ± 0.282
2.611ArgGly: 2.611 ± 0.315
1.101ArgHis: 1.101 ± 0.209
3.875ArgIle: 3.875 ± 0.429
3.386ArgLys: 3.386 ± 0.324
4.569ArgLeu: 4.569 ± 0.4
1.142ArgMet: 1.142 ± 0.239
1.836ArgAsn: 1.836 ± 0.283
1.305ArgPro: 1.305 ± 0.244
1.999ArgGln: 1.999 ± 0.304
2.366ArgArg: 2.366 ± 0.358
3.223ArgSer: 3.223 ± 0.401
2.529ArgThr: 2.529 ± 0.284
2.733ArgVal: 2.733 ± 0.369
0.857ArgTrp: 0.857 ± 0.177
1.469ArgTyr: 1.469 ± 0.236
0.0ArgXaa: 0.0 ± 0.0
Ser
5.507SerAla: 5.507 ± 0.521
0.612SerCys: 0.612 ± 0.174
4.12SerAsp: 4.12 ± 0.397
5.181SerGlu: 5.181 ± 0.473
2.978SerPhe: 2.978 ± 0.368
5.385SerGly: 5.385 ± 0.608
1.305SerHis: 1.305 ± 0.214
3.998SerIle: 3.998 ± 0.379
4.039SerLys: 4.039 ± 0.478
5.344SerLeu: 5.344 ± 0.453
1.877SerMet: 1.877 ± 0.242
3.304SerAsn: 3.304 ± 0.371
2.774SerPro: 2.774 ± 0.315
2.325SerGln: 2.325 ± 0.314
2.407SerArg: 2.407 ± 0.277
4.732SerSer: 4.732 ± 0.438
4.079SerThr: 4.079 ± 0.419
4.936SerVal: 4.936 ± 0.432
0.979SerTrp: 0.979 ± 0.194
2.774SerTyr: 2.774 ± 0.253
0.0SerXaa: 0.0 ± 0.0
Thr
4.814ThrAla: 4.814 ± 0.415
0.653ThrCys: 0.653 ± 0.181
3.345ThrAsp: 3.345 ± 0.374
3.916ThrGlu: 3.916 ± 0.424
2.733ThrPhe: 2.733 ± 0.331
4.161ThrGly: 4.161 ± 0.38
1.142ThrHis: 1.142 ± 0.2
3.672ThrIle: 3.672 ± 0.37
3.712ThrLys: 3.712 ± 0.341
4.039ThrLeu: 4.039 ± 0.397
1.387ThrMet: 1.387 ± 0.222
2.366ThrAsn: 2.366 ± 0.333
2.448ThrPro: 2.448 ± 0.3
2.57ThrGln: 2.57 ± 0.296
2.896ThrArg: 2.896 ± 0.374
3.59ThrSer: 3.59 ± 0.443
3.712ThrThr: 3.712 ± 0.408
4.691ThrVal: 4.691 ± 0.476
1.101ThrTrp: 1.101 ± 0.221
2.244ThrTyr: 2.244 ± 0.305
0.0ThrXaa: 0.0 ± 0.0
Val
5.874ValAla: 5.874 ± 0.521
0.734ValCys: 0.734 ± 0.192
4.12ValAsp: 4.12 ± 0.401
5.018ValGlu: 5.018 ± 0.44
3.1ValPhe: 3.1 ± 0.313
4.691ValGly: 4.691 ± 0.474
1.469ValHis: 1.469 ± 0.276
4.447ValIle: 4.447 ± 0.466
3.753ValLys: 3.753 ± 0.324
5.466ValLeu: 5.466 ± 0.449
2.04ValMet: 2.04 ± 0.305
2.978ValAsn: 2.978 ± 0.353
2.244ValPro: 2.244 ± 0.283
2.04ValGln: 2.04 ± 0.306
2.611ValArg: 2.611 ± 0.327
4.895ValSer: 4.895 ± 0.424
4.243ValThr: 4.243 ± 0.374
4.651ValVal: 4.651 ± 0.421
0.694ValTrp: 0.694 ± 0.159
1.877ValTyr: 1.877 ± 0.281
0.0ValXaa: 0.0 ± 0.0
Trp
1.101TrpAla: 1.101 ± 0.249
0.286TrpCys: 0.286 ± 0.105
0.979TrpAsp: 0.979 ± 0.226
1.305TrpGlu: 1.305 ± 0.195
0.408TrpPhe: 0.408 ± 0.132
0.938TrpGly: 0.938 ± 0.166
0.449TrpHis: 0.449 ± 0.111
0.49TrpIle: 0.49 ± 0.144
0.612TrpLys: 0.612 ± 0.144
1.713TrpLeu: 1.713 ± 0.248
0.245TrpMet: 0.245 ± 0.103
0.816TrpAsn: 0.816 ± 0.191
0.694TrpPro: 0.694 ± 0.149
1.02TrpGln: 1.02 ± 0.198
1.183TrpArg: 1.183 ± 0.287
0.653TrpSer: 0.653 ± 0.165
0.653TrpThr: 0.653 ± 0.167
1.387TrpVal: 1.387 ± 0.23
0.408TrpTrp: 0.408 ± 0.127
0.326TrpTyr: 0.326 ± 0.097
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.57TyrAla: 2.57 ± 0.312
0.653TyrCys: 0.653 ± 0.133
1.877TyrAsp: 1.877 ± 0.316
2.244TyrGlu: 2.244 ± 0.298
1.265TyrPhe: 1.265 ± 0.203
1.836TyrGly: 1.836 ± 0.311
0.816TyrHis: 0.816 ± 0.212
1.469TyrIle: 1.469 ± 0.267
1.999TyrLys: 1.999 ± 0.308
3.549TyrLeu: 3.549 ± 0.461
0.53TyrMet: 0.53 ± 0.128
1.713TyrAsn: 1.713 ± 0.233
1.346TyrPro: 1.346 ± 0.178
1.673TyrGln: 1.673 ± 0.247
1.509TyrArg: 1.509 ± 0.232
2.529TyrSer: 2.529 ± 0.318
1.713TyrThr: 1.713 ± 0.244
1.836TyrVal: 1.836 ± 0.268
0.694TyrTrp: 0.694 ± 0.147
1.142TyrTyr: 1.142 ± 0.194
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 117 proteins (24514 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski