Amino acid dipeptide frequency for Freshwater phage uvFW-CGR-AMD-COM-C449

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
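
As a rough illustration of how such frequencies can be computed, here is a minimal Python sketch. It is not the code used to build this page. The function name `dipeptide_per_mille` is hypothetical, sequences are assumed to use one-letter codes, any letter outside the 20 standard residues is mapped to X (Xaa), and dipeptides are counted only within each protein (how protein boundaries are handled for the table below is not stated here).

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not data from this proteome.
freqs = dipeptide_per_mille(["MAAG", "AAK"])
```

With the toy input there are five dipeptides in total (MA, AA, AG, AA, AK), so `freqs["AA"]` is 2 out of 5, i.e. 400‰.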

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
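
The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names `expected_per_mille` and `classify` are hypothetical, and the plain greater-than/less-than comparison is an assumption: the exact rule used to color the cells on this page is not stated.

```python
def expected_per_mille(n_residues=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_per_mille, n_residues=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(n_residues)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


uniform_20 = expected_per_mille(20)  # 1000 / 400 = 2.5 per mille (0.25%)
uniform_21 = expected_per_mille(21)  # 1000 / 441, about 2.27 per mille
ala_ala = classify(11.657)           # AlaAla value from the table
cys_cys = classify(0.19)             # CysCys value from the table
```

Under this assumed rule, AlaAla (11.657‰) is well above the 2.5‰ expectation and CysCys (0.19‰) is well below it.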

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
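
For readers who want the values as fractions or percentages, the conversion is a one-liner. The helper names below are hypothetical and only illustrate the unit change.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 11.657 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(11.657)
ala_ala_percent = per_mille_to_percent(11.657)
```

So the AlaAla entry of 11.657‰ corresponds to a fraction of about 0.011657, or about 1.1657%.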

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.657 ± 1.196
AlaCys: 0.663 ± 0.28
AlaAsp: 4.928 ± 0.534
AlaGlu: 6.729 ± 0.987
AlaPhe: 2.369 ± 0.559
AlaGly: 9.666 ± 1.461
AlaHis: 1.232 ± 0.407
AlaIle: 4.928 ± 0.665
AlaLys: 7.771 ± 1.1
AlaLeu: 7.013 ± 0.869
AlaMet: 2.938 ± 0.549
AlaAsn: 4.17 ± 0.61
AlaPro: 3.317 ± 0.542
AlaGln: 4.833 ± 0.711
AlaArg: 3.412 ± 0.664
AlaSer: 6.444 ± 0.77
AlaThr: 9.382 ± 1.522
AlaVal: 7.582 ± 0.722
AlaTrp: 1.516 ± 0.415
AlaTyr: 2.938 ± 0.514
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.853 ± 0.395
CysCys: 0.19 ± 0.133
CysAsp: 0.569 ± 0.324
CysGlu: 0.19 ± 0.135
CysPhe: 0.095 ± 0.115
CysGly: 1.042 ± 0.431
CysHis: 0.095 ± 0.105
CysIle: 0.379 ± 0.202
CysLys: 0.474 ± 0.264
CysLeu: 1.042 ± 0.452
CysMet: 0.0 ± 0.0
CysAsn: 0.474 ± 0.256
CysPro: 0.284 ± 0.171
CysGln: 0.0 ± 0.0
CysArg: 0.379 ± 0.267
CysSer: 0.379 ± 0.307
CysThr: 0.379 ± 0.168
CysVal: 0.474 ± 0.234
CysTrp: 0.095 ± 0.09
CysTyr: 0.284 ± 0.234
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.497 ± 0.649
AspCys: 0.19 ± 0.135
AspAsp: 2.464 ± 0.338
AspGlu: 2.559 ± 0.486
AspPhe: 1.801 ± 0.35
AspGly: 4.359 ± 0.647
AspHis: 0.758 ± 0.316
AspIle: 3.412 ± 0.54
AspLys: 3.696 ± 0.637
AspLeu: 4.549 ± 0.574
AspMet: 1.516 ± 0.361
AspAsn: 1.895 ± 0.516
AspPro: 3.127 ± 0.772
AspGln: 1.801 ± 0.425
AspArg: 2.654 ± 0.526
AspSer: 2.464 ± 0.479
AspThr: 3.98 ± 0.615
AspVal: 3.696 ± 0.762
AspTrp: 1.327 ± 0.304
AspTyr: 3.222 ± 0.504
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.876 ± 0.923
GluCys: 0.569 ± 0.293
GluAsp: 2.654 ± 0.641
GluGlu: 2.748 ± 0.433
GluPhe: 1.611 ± 0.417
GluGly: 4.17 ± 0.534
GluHis: 0.853 ± 0.352
GluIle: 3.886 ± 0.657
GluLys: 2.559 ± 0.584
GluLeu: 5.497 ± 0.87
GluMet: 1.895 ± 0.55
GluAsn: 1.516 ± 0.393
GluPro: 2.938 ± 0.498
GluGln: 3.127 ± 0.605
GluArg: 3.696 ± 0.802
GluSer: 2.369 ± 0.469
GluThr: 4.17 ± 0.845
GluVal: 3.127 ± 0.622
GluTrp: 0.758 ± 0.246
GluTyr: 1.232 ± 0.304
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.506 ± 0.539
PheCys: 0.095 ± 0.097
PheAsp: 1.801 ± 0.519
PheGlu: 1.422 ± 0.396
PhePhe: 1.042 ± 0.367
PheGly: 2.843 ± 0.512
PheHis: 0.095 ± 0.09
PheIle: 1.232 ± 0.387
PheLys: 1.327 ± 0.459
PheLeu: 2.369 ± 0.464
PheMet: 0.663 ± 0.213
PheAsn: 1.516 ± 0.317
PhePro: 1.422 ± 0.324
PheGln: 0.758 ± 0.243
PheArg: 1.422 ± 0.38
PheSer: 1.801 ± 0.416
PheThr: 2.843 ± 0.57
PheVal: 2.085 ± 0.761
PheTrp: 0.19 ± 0.12
PheTyr: 0.948 ± 0.417
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.823 ± 1.11
GlyCys: 0.663 ± 0.38
GlyAsp: 3.601 ± 0.468
GlyGlu: 3.601 ± 0.495
GlyPhe: 2.748 ± 0.524
GlyGly: 10.614 ± 4.819
GlyHis: 1.137 ± 0.358
GlyIle: 4.359 ± 0.873
GlyLys: 4.928 ± 0.706
GlyLeu: 6.255 ± 1.094
GlyMet: 2.843 ± 0.505
GlyAsn: 4.265 ± 0.592
GlyPro: 1.611 ± 0.387
GlyGln: 3.601 ± 0.535
GlyArg: 3.412 ± 0.595
GlySer: 7.108 ± 1.102
GlyThr: 9.098 ± 2.42
GlyVal: 5.781 ± 0.688
GlyTrp: 1.895 ± 0.434
GlyTyr: 3.127 ± 0.487
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.948 ± 0.286
HisCys: 0.379 ± 0.199
HisAsp: 1.516 ± 0.498
HisGlu: 0.663 ± 0.365
HisPhe: 0.284 ± 0.187
HisGly: 0.948 ± 0.317
HisHis: 0.284 ± 0.163
HisIle: 1.422 ± 0.358
HisLys: 0.474 ± 0.192
HisLeu: 0.663 ± 0.24
HisMet: 0.095 ± 0.105
HisAsn: 0.663 ± 0.276
HisPro: 1.042 ± 0.388
HisGln: 0.569 ± 0.281
HisArg: 1.042 ± 0.48
HisSer: 0.758 ± 0.254
HisThr: 0.853 ± 0.293
HisVal: 0.758 ± 0.277
HisTrp: 0.19 ± 0.129
HisTyr: 0.19 ± 0.129
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.023 ± 0.902
IleCys: 0.474 ± 0.248
IleAsp: 3.791 ± 0.66
IleGlu: 3.696 ± 0.615
IlePhe: 1.137 ± 0.447
IleGly: 4.17 ± 0.622
IleHis: 0.853 ± 0.228
IleIle: 2.938 ± 0.464
IleLys: 3.317 ± 0.929
IleLeu: 2.274 ± 0.644
IleMet: 1.706 ± 0.449
IleAsn: 2.464 ± 0.511
IlePro: 2.18 ± 0.569
IleGln: 2.559 ± 0.577
IleArg: 2.274 ± 0.386
IleSer: 3.222 ± 0.7
IleThr: 4.549 ± 0.895
IleVal: 3.696 ± 0.453
IleTrp: 0.569 ± 0.18
IleTyr: 1.516 ± 0.306
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.97 ± 1.0
LysCys: 0.284 ± 0.176
LysAsp: 2.18 ± 0.515
LysGlu: 4.359 ± 0.859
LysPhe: 1.422 ± 0.493
LysGly: 4.075 ± 0.713
LysHis: 0.948 ± 0.422
LysIle: 2.843 ± 0.465
LysLys: 4.075 ± 1.402
LysLeu: 4.833 ± 0.623
LysMet: 2.085 ± 0.459
LysAsn: 2.843 ± 0.665
LysPro: 3.601 ± 0.839
LysGln: 3.412 ± 0.476
LysArg: 2.464 ± 0.537
LysSer: 3.412 ± 0.639
LysThr: 3.791 ± 0.611
LysVal: 3.98 ± 0.574
LysTrp: 1.042 ± 0.356
LysTyr: 1.801 ± 0.453
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.866 ± 0.784
LeuCys: 0.853 ± 0.364
LeuAsp: 5.212 ± 0.7
LeuGlu: 3.98 ± 0.694
LeuPhe: 2.369 ± 0.471
LeuGly: 5.591 ± 0.775
LeuHis: 1.327 ± 0.404
LeuIle: 3.222 ± 0.508
LeuLys: 3.696 ± 0.803
LeuLeu: 5.591 ± 0.704
LeuMet: 1.99 ± 0.449
LeuAsn: 3.886 ± 0.666
LeuPro: 3.98 ± 0.584
LeuGln: 3.033 ± 0.563
LeuArg: 5.118 ± 0.721
LeuSer: 5.402 ± 0.574
LeuThr: 5.591 ± 0.738
LeuVal: 3.506 ± 0.46
LeuTrp: 1.042 ± 0.363
LeuTyr: 1.895 ± 0.427
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.696 ± 0.515
MetCys: 0.19 ± 0.125
MetAsp: 1.801 ± 0.367
MetGlu: 1.232 ± 0.352
MetPhe: 0.758 ± 0.33
MetGly: 1.422 ± 0.334
MetHis: 0.284 ± 0.148
MetIle: 0.853 ± 0.233
MetLys: 2.18 ± 0.447
MetLeu: 1.327 ± 0.367
MetMet: 1.042 ± 0.515
MetAsn: 1.137 ± 0.403
MetPro: 1.611 ± 0.369
MetGln: 0.758 ± 0.26
MetArg: 1.706 ± 0.379
MetSer: 1.801 ± 0.478
MetThr: 1.611 ± 0.358
MetVal: 1.99 ± 0.426
MetTrp: 0.474 ± 0.205
MetTyr: 0.379 ± 0.164
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.454 ± 0.736
AsnCys: 0.284 ± 0.191
AsnAsp: 1.706 ± 0.444
AsnGlu: 2.559 ± 0.676
AsnPhe: 1.327 ± 0.458
AsnGly: 4.075 ± 0.53
AsnHis: 0.474 ± 0.235
AsnIle: 2.559 ± 0.492
AsnLys: 2.748 ± 0.461
AsnLeu: 3.506 ± 0.555
AsnMet: 0.663 ± 0.244
AsnAsn: 1.327 ± 0.313
AsnPro: 2.464 ± 0.55
AsnGln: 1.611 ± 0.415
AsnArg: 3.317 ± 0.541
AsnSer: 1.99 ± 0.506
AsnThr: 2.938 ± 0.51
AsnVal: 3.317 ± 0.661
AsnTrp: 1.706 ± 0.431
AsnTyr: 1.042 ± 0.305
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.118 ± 0.861
ProCys: 0.379 ± 0.21
ProAsp: 2.559 ± 0.46
ProGlu: 3.506 ± 0.686
ProPhe: 1.611 ± 0.663
ProGly: 3.033 ± 0.778
ProHis: 0.474 ± 0.269
ProIle: 1.99 ± 0.327
ProLys: 2.18 ± 0.67
ProLeu: 2.748 ± 0.515
ProMet: 1.516 ± 0.454
ProAsn: 2.274 ± 0.542
ProPro: 2.18 ± 0.488
ProGln: 1.801 ± 0.381
ProArg: 2.085 ± 0.403
ProSer: 3.033 ± 0.651
ProThr: 3.791 ± 0.638
ProVal: 3.412 ± 0.522
ProTrp: 0.758 ± 0.263
ProTyr: 0.853 ± 0.231
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.686 ± 0.713
GlnCys: 0.284 ± 0.219
GlnAsp: 2.18 ± 0.473
GlnGlu: 3.033 ± 0.647
GlnPhe: 1.422 ± 0.471
GlnGly: 3.412 ± 0.636
GlnHis: 0.569 ± 0.246
GlnIle: 3.222 ± 0.558
GlnLys: 1.422 ± 0.311
GlnLeu: 3.127 ± 0.653
GlnMet: 1.042 ± 0.304
GlnAsn: 1.232 ± 0.329
GlnPro: 1.706 ± 0.362
GlnGln: 2.18 ± 0.638
GlnArg: 2.18 ± 0.566
GlnSer: 1.422 ± 0.473
GlnThr: 2.369 ± 0.43
GlnVal: 2.748 ± 0.528
GlnTrp: 0.948 ± 0.515
GlnTyr: 1.422 ± 0.423
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.118 ± 0.956
ArgCys: 0.474 ± 0.271
ArgAsp: 2.843 ± 0.651
ArgGlu: 2.843 ± 0.674
ArgPhe: 2.274 ± 0.367
ArgGly: 4.17 ± 0.648
ArgHis: 0.663 ± 0.259
ArgIle: 3.317 ± 0.515
ArgLys: 2.938 ± 0.475
ArgLeu: 4.454 ± 0.879
ArgMet: 1.137 ± 0.258
ArgAsn: 2.938 ± 0.504
ArgPro: 1.516 ± 0.371
ArgGln: 2.369 ± 0.428
ArgArg: 2.369 ± 0.755
ArgSer: 2.18 ± 0.446
ArgThr: 3.886 ± 1.116
ArgVal: 2.559 ± 0.659
ArgTrp: 0.663 ± 0.286
ArgTyr: 2.274 ± 0.544
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.402 ± 1.013
SerCys: 0.474 ± 0.305
SerAsp: 3.222 ± 0.483
SerGlu: 2.369 ± 0.433
SerPhe: 1.422 ± 0.364
SerGly: 5.97 ± 1.287
SerHis: 0.663 ± 0.265
SerIle: 3.127 ± 0.481
SerLys: 4.928 ± 0.905
SerLeu: 4.928 ± 0.835
SerMet: 1.232 ± 0.256
SerAsn: 2.843 ± 0.617
SerPro: 2.464 ± 0.534
SerGln: 1.706 ± 0.425
SerArg: 3.127 ± 0.458
SerSer: 4.549 ± 1.061
SerThr: 3.886 ± 0.657
SerVal: 3.791 ± 0.744
SerTrp: 1.232 ± 0.402
SerTyr: 2.274 ± 0.282
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.813 ± 2.083
ThrCys: 0.284 ± 0.177
ThrAsp: 4.359 ± 0.594
ThrGlu: 3.317 ± 0.574
ThrPhe: 1.99 ± 0.388
ThrGly: 8.719 ± 2.177
ThrHis: 1.042 ± 0.361
ThrIle: 3.317 ± 0.834
ThrLys: 3.696 ± 0.64
ThrLeu: 5.781 ± 0.795
ThrMet: 1.516 ± 0.397
ThrAsn: 3.886 ± 0.52
ThrPro: 4.359 ± 0.731
ThrGln: 3.127 ± 0.687
ThrArg: 3.033 ± 0.604
ThrSer: 5.781 ± 1.124
ThrThr: 6.634 ± 1.182
ThrVal: 5.497 ± 0.731
ThrTrp: 1.232 ± 0.399
ThrTyr: 2.748 ± 0.558
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.918 ± 1.197
ValCys: 0.663 ± 0.342
ValAsp: 3.886 ± 0.653
ValGlu: 3.317 ± 0.792
ValPhe: 2.18 ± 0.542
ValGly: 4.928 ± 0.689
ValHis: 0.853 ± 0.235
ValIle: 2.938 ± 0.53
ValLys: 4.265 ± 0.736
ValLeu: 4.738 ± 0.665
ValMet: 1.042 ± 0.316
ValAsn: 2.274 ± 0.485
ValPro: 3.601 ± 0.685
ValGln: 1.895 ± 0.389
ValArg: 4.549 ± 0.629
ValSer: 2.938 ± 0.402
ValThr: 6.16 ± 1.007
ValVal: 5.212 ± 1.044
ValTrp: 0.853 ± 0.276
ValTyr: 1.99 ± 0.404
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.895 ± 0.372
TrpCys: 0.095 ± 0.09
TrpAsp: 1.137 ± 0.288
TrpGlu: 1.327 ± 0.349
TrpPhe: 0.284 ± 0.132
TrpGly: 1.422 ± 0.342
TrpHis: 0.758 ± 0.308
TrpIle: 0.474 ± 0.181
TrpLys: 0.853 ± 0.25
TrpLeu: 1.706 ± 0.444
TrpMet: 0.663 ± 0.214
TrpAsn: 0.853 ± 0.333
TrpPro: 0.0 ± 0.0
TrpGln: 0.853 ± 0.263
TrpArg: 1.327 ± 0.469
TrpSer: 0.853 ± 0.376
TrpThr: 1.327 ± 0.353
TrpVal: 0.758 ± 0.233
TrpTrp: 0.379 ± 0.144
TrpTyr: 0.569 ± 0.206
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.464 ± 0.509
TyrCys: 0.19 ± 0.148
TyrAsp: 2.559 ± 0.727
TyrGlu: 1.801 ± 0.468
TyrPhe: 1.232 ± 0.41
TyrGly: 2.843 ± 0.52
TyrHis: 0.379 ± 0.207
TyrIle: 1.99 ± 0.381
TyrLys: 1.895 ± 0.477
TyrLeu: 2.748 ± 0.489
TyrMet: 0.569 ± 0.276
TyrAsn: 1.611 ± 0.344
TyrPro: 1.895 ± 0.419
TyrGln: 1.611 ± 0.506
TyrArg: 1.422 ± 0.395
TyrSer: 1.706 ± 0.4
TyrThr: 1.895 ± 0.562
TyrVal: 1.232 ± 0.388
TyrTrp: 0.663 ± 0.252
TyrTyr: 0.663 ± 0.286
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (10553 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
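
A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported. This is only a minimal sketch under assumptions. The function names are hypothetical, and the use of the sample standard deviation as the ± value is an assumption, since the exact error statistic is not stated here.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, counted within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, same sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: report the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


# Toy sequences (one-letter codes), not data from this proteome.
toy_proteins = ["MAAG", "AAK", "MKV"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.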

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski