Amino acid dipepetide frequency for Marbled eel polyomavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.871AlaAla: 6.871 ± 2.802
0.644AlaCys: 0.644 ± 0.371
3.006AlaAsp: 3.006 ± 1.082
4.724AlaGlu: 4.724 ± 1.498
3.006AlaPhe: 3.006 ± 0.627
6.871AlaGly: 6.871 ± 1.456
0.859AlaHis: 0.859 ± 0.355
3.006AlaIle: 3.006 ± 0.649
1.933AlaLys: 1.933 ± 0.601
5.154AlaLeu: 5.154 ± 1.079
1.503AlaMet: 1.503 ± 0.492
3.865AlaAsn: 3.865 ± 0.993
6.442AlaPro: 6.442 ± 2.363
2.791AlaGln: 2.791 ± 0.711
4.08AlaArg: 4.08 ± 1.139
6.227AlaSer: 6.227 ± 0.868
2.791AlaThr: 2.791 ± 0.662
4.939AlaVal: 4.939 ± 1.22
0.429AlaTrp: 0.429 ± 0.244
2.147AlaTyr: 2.147 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
1.718CysAla: 1.718 ± 0.701
1.074CysCys: 1.074 ± 0.403
0.859CysAsp: 0.859 ± 0.421
0.215CysGlu: 0.215 ± 0.223
0.644CysPhe: 0.644 ± 0.256
0.859CysGly: 0.859 ± 0.389
0.859CysHis: 0.859 ± 0.533
1.074CysIle: 1.074 ± 0.469
1.288CysLys: 1.288 ± 0.499
1.074CysLeu: 1.074 ± 0.468
0.0CysMet: 0.0 ± 0.0
0.859CysAsn: 0.859 ± 0.414
1.503CysPro: 1.503 ± 0.459
0.429CysGln: 0.429 ± 0.263
0.644CysArg: 0.644 ± 0.396
2.362CysSer: 2.362 ± 0.906
1.074CysThr: 1.074 ± 0.531
1.933CysVal: 1.933 ± 0.952
0.215CysTrp: 0.215 ± 0.226
0.215CysTyr: 0.215 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
4.509AspAla: 4.509 ± 1.023
1.718AspCys: 1.718 ± 0.41
4.509AspAsp: 4.509 ± 1.17
2.791AspGlu: 2.791 ± 0.698
2.147AspPhe: 2.147 ± 0.591
3.865AspGly: 3.865 ± 1.253
1.503AspHis: 1.503 ± 0.463
1.503AspIle: 1.503 ± 0.524
1.074AspLys: 1.074 ± 0.714
3.65AspLeu: 3.65 ± 0.856
2.362AspMet: 2.362 ± 0.747
2.791AspAsn: 2.791 ± 0.957
2.791AspPro: 2.791 ± 0.808
1.933AspGln: 1.933 ± 0.562
2.791AspArg: 2.791 ± 0.609
4.939AspSer: 4.939 ± 1.164
2.577AspThr: 2.577 ± 0.961
3.436AspVal: 3.436 ± 1.015
0.429AspTrp: 0.429 ± 0.263
1.718AspTyr: 1.718 ± 0.589
0.0AspXaa: 0.0 ± 0.0
Glu
2.791GluAla: 2.791 ± 0.86
0.429GluCys: 0.429 ± 0.394
3.006GluAsp: 3.006 ± 1.098
6.442GluGlu: 6.442 ± 1.713
1.074GluPhe: 1.074 ± 0.37
4.939GluGly: 4.939 ± 1.013
1.718GluHis: 1.718 ± 0.473
1.074GluIle: 1.074 ± 0.44
1.074GluLys: 1.074 ± 0.577
3.436GluLeu: 3.436 ± 0.865
1.503GluMet: 1.503 ± 0.506
2.577GluAsn: 2.577 ± 0.711
4.509GluPro: 4.509 ± 1.272
2.577GluGln: 2.577 ± 0.682
3.221GluArg: 3.221 ± 0.917
4.295GluSer: 4.295 ± 0.784
3.65GluThr: 3.65 ± 0.812
2.577GluVal: 2.577 ± 0.881
0.0GluTrp: 0.0 ± 0.0
0.859GluTyr: 0.859 ± 0.281
0.0GluXaa: 0.0 ± 0.0
Phe
1.288PheAla: 1.288 ± 0.598
1.074PheCys: 1.074 ± 0.634
1.933PheAsp: 1.933 ± 0.51
1.503PheGlu: 1.503 ± 0.63
1.288PhePhe: 1.288 ± 0.669
1.718PheGly: 1.718 ± 0.505
0.429PheHis: 0.429 ± 0.292
0.644PheIle: 0.644 ± 0.349
1.933PheLys: 1.933 ± 0.468
4.724PheLeu: 4.724 ± 1.097
0.644PheMet: 0.644 ± 0.31
2.147PheAsn: 2.147 ± 0.548
1.288PhePro: 1.288 ± 0.43
2.791PheGln: 2.791 ± 0.667
1.933PheArg: 1.933 ± 0.838
2.147PheSer: 2.147 ± 0.614
1.933PheThr: 1.933 ± 0.629
2.362PheVal: 2.362 ± 0.812
0.429PheTrp: 0.429 ± 0.216
1.718PheTyr: 1.718 ± 0.559
0.0PheXaa: 0.0 ± 0.0
Gly
3.221GlyAla: 3.221 ± 1.178
0.859GlyCys: 0.859 ± 0.367
3.65GlyAsp: 3.65 ± 0.895
4.724GlyGlu: 4.724 ± 1.245
2.147GlyPhe: 2.147 ± 0.55
9.019GlyGly: 9.019 ± 3.157
1.933GlyHis: 1.933 ± 0.978
2.791GlyIle: 2.791 ± 0.67
2.362GlyLys: 2.362 ± 0.544
6.657GlyLeu: 6.657 ± 1.252
1.933GlyMet: 1.933 ± 0.883
2.362GlyAsn: 2.362 ± 0.479
6.227GlyPro: 6.227 ± 0.735
2.791GlyGln: 2.791 ± 0.885
5.154GlyArg: 5.154 ± 0.928
5.154GlySer: 5.154 ± 0.965
3.006GlyThr: 3.006 ± 0.688
3.006GlyVal: 3.006 ± 0.867
1.288GlyTrp: 1.288 ± 0.427
1.288GlyTyr: 1.288 ± 0.548
0.0GlyXaa: 0.0 ± 0.0
His
0.859HisAla: 0.859 ± 0.563
1.718HisCys: 1.718 ± 0.725
0.429HisAsp: 0.429 ± 0.234
1.074HisGlu: 1.074 ± 0.462
1.933HisPhe: 1.933 ± 0.582
0.644HisGly: 0.644 ± 0.281
0.215HisHis: 0.215 ± 0.261
2.362HisIle: 2.362 ± 0.736
1.074HisLys: 1.074 ± 0.562
2.147HisLeu: 2.147 ± 1.091
1.288HisMet: 1.288 ± 0.419
0.859HisAsn: 0.859 ± 0.55
0.644HisPro: 0.644 ± 0.427
1.933HisGln: 1.933 ± 0.609
2.577HisArg: 2.577 ± 0.674
3.006HisSer: 3.006 ± 0.567
1.718HisThr: 1.718 ± 0.675
1.933HisVal: 1.933 ± 0.74
0.429HisTrp: 0.429 ± 0.255
0.644HisTyr: 0.644 ± 0.449
0.0HisXaa: 0.0 ± 0.0
Ile
1.718IleAla: 1.718 ± 0.698
0.429IleCys: 0.429 ± 0.358
2.362IleAsp: 2.362 ± 0.657
1.503IleGlu: 1.503 ± 0.649
1.288IlePhe: 1.288 ± 0.455
3.436IleGly: 3.436 ± 1.154
0.644IleHis: 0.644 ± 0.34
2.791IleIle: 2.791 ± 0.483
1.503IleLys: 1.503 ± 0.415
4.295IleLeu: 4.295 ± 1.192
0.429IleMet: 0.429 ± 0.313
0.644IleAsn: 0.644 ± 0.436
3.436IlePro: 3.436 ± 0.822
1.503IleGln: 1.503 ± 0.493
3.865IleArg: 3.865 ± 0.735
3.436IleSer: 3.436 ± 0.697
2.362IleThr: 2.362 ± 0.705
4.295IleVal: 4.295 ± 0.762
0.215IleTrp: 0.215 ± 0.261
1.503IleTyr: 1.503 ± 0.737
0.0IleXaa: 0.0 ± 0.0
Lys
3.221LysAla: 3.221 ± 0.802
0.215LysCys: 0.215 ± 0.185
1.933LysAsp: 1.933 ± 0.678
0.859LysGlu: 0.859 ± 0.485
1.074LysPhe: 1.074 ± 0.518
1.074LysGly: 1.074 ± 0.519
1.933LysHis: 1.933 ± 0.714
1.074LysIle: 1.074 ± 0.681
1.718LysLys: 1.718 ± 0.733
2.791LysLeu: 2.791 ± 0.979
0.429LysMet: 0.429 ± 0.313
1.074LysAsn: 1.074 ± 0.558
2.577LysPro: 2.577 ± 0.662
2.147LysGln: 2.147 ± 1.029
2.791LysArg: 2.791 ± 0.666
1.933LysSer: 1.933 ± 0.843
2.147LysThr: 2.147 ± 0.755
1.933LysVal: 1.933 ± 0.783
0.644LysTrp: 0.644 ± 0.268
0.859LysTyr: 0.859 ± 0.528
0.0LysXaa: 0.0 ± 0.0
Leu
4.939LeuAla: 4.939 ± 1.13
1.933LeuCys: 1.933 ± 0.887
3.436LeuAsp: 3.436 ± 1.164
2.791LeuGlu: 2.791 ± 0.671
4.724LeuPhe: 4.724 ± 0.991
6.442LeuGly: 6.442 ± 1.507
2.791LeuHis: 2.791 ± 0.681
2.147LeuIle: 2.147 ± 0.691
2.362LeuLys: 2.362 ± 0.855
7.086LeuLeu: 7.086 ± 1.374
2.577LeuMet: 2.577 ± 0.864
4.509LeuAsn: 4.509 ± 1.205
9.019LeuPro: 9.019 ± 1.862
3.006LeuGln: 3.006 ± 1.0
6.442LeuArg: 6.442 ± 0.804
7.086LeuSer: 7.086 ± 0.864
7.516LeuThr: 7.516 ± 1.076
3.436LeuVal: 3.436 ± 0.756
0.644LeuTrp: 0.644 ± 0.273
1.503LeuTyr: 1.503 ± 0.784
0.0LeuXaa: 0.0 ± 0.0
Met
2.147MetAla: 2.147 ± 0.45
0.429MetCys: 0.429 ± 0.317
0.429MetAsp: 0.429 ± 0.244
1.074MetGlu: 1.074 ± 0.332
1.718MetPhe: 1.718 ± 0.768
1.074MetGly: 1.074 ± 0.503
0.429MetHis: 0.429 ± 0.29
0.429MetIle: 0.429 ± 0.251
0.429MetLys: 0.429 ± 0.378
3.221MetLeu: 3.221 ± 0.926
0.215MetMet: 0.215 ± 0.301
1.288MetAsn: 1.288 ± 0.373
1.718MetPro: 1.718 ± 0.649
1.503MetGln: 1.503 ± 0.505
1.718MetArg: 1.718 ± 0.523
4.295MetSer: 4.295 ± 1.394
0.644MetThr: 0.644 ± 0.246
1.503MetVal: 1.503 ± 0.454
0.644MetTrp: 0.644 ± 0.378
0.429MetTyr: 0.429 ± 0.31
0.0MetXaa: 0.0 ± 0.0
Asn
4.08AsnAla: 4.08 ± 1.413
0.215AsnCys: 0.215 ± 0.301
2.147AsnAsp: 2.147 ± 0.625
1.074AsnGlu: 1.074 ± 0.573
0.429AsnPhe: 0.429 ± 0.317
2.577AsnGly: 2.577 ± 0.697
1.074AsnHis: 1.074 ± 0.347
2.147AsnIle: 2.147 ± 0.608
1.074AsnLys: 1.074 ± 0.402
3.65AsnLeu: 3.65 ± 0.754
0.429AsnMet: 0.429 ± 0.216
3.865AsnAsn: 3.865 ± 0.622
4.08AsnPro: 4.08 ± 0.894
2.577AsnGln: 2.577 ± 0.876
1.933AsnArg: 1.933 ± 0.578
2.362AsnSer: 2.362 ± 0.576
2.147AsnThr: 2.147 ± 0.806
3.865AsnVal: 3.865 ± 0.914
0.429AsnTrp: 0.429 ± 0.292
1.074AsnTyr: 1.074 ± 0.531
0.0AsnXaa: 0.0 ± 0.0
Pro
6.871ProAla: 6.871 ± 2.602
0.0ProCys: 0.0 ± 0.0
4.08ProAsp: 4.08 ± 1.087
4.509ProGlu: 4.509 ± 0.925
3.006ProPhe: 3.006 ± 0.505
5.368ProGly: 5.368 ± 1.164
3.436ProHis: 3.436 ± 0.775
3.436ProIle: 3.436 ± 0.862
1.718ProLys: 1.718 ± 0.497
8.374ProLeu: 8.374 ± 1.815
2.577ProMet: 2.577 ± 0.638
1.718ProAsn: 1.718 ± 0.541
17.608ProPro: 17.608 ± 6.999
3.436ProGln: 3.436 ± 0.767
5.798ProArg: 5.798 ± 0.848
7.086ProSer: 7.086 ± 1.265
4.08ProThr: 4.08 ± 1.247
4.724ProVal: 4.724 ± 1.385
0.644ProTrp: 0.644 ± 0.255
2.791ProTyr: 2.791 ± 0.813
0.0ProXaa: 0.0 ± 0.0
Gln
5.583GlnAla: 5.583 ± 1.115
1.288GlnCys: 1.288 ± 0.61
3.865GlnAsp: 3.865 ± 0.714
2.362GlnGlu: 2.362 ± 0.599
1.074GlnPhe: 1.074 ± 0.437
2.791GlnGly: 2.791 ± 0.881
0.859GlnHis: 0.859 ± 0.322
1.933GlnIle: 1.933 ± 0.814
2.147GlnLys: 2.147 ± 0.791
3.65GlnLeu: 3.65 ± 0.728
0.644GlnMet: 0.644 ± 0.433
1.288GlnAsn: 1.288 ± 0.443
4.08GlnPro: 4.08 ± 1.132
3.006GlnGln: 3.006 ± 0.863
3.436GlnArg: 3.436 ± 0.864
3.65GlnSer: 3.65 ± 0.792
2.577GlnThr: 2.577 ± 0.619
2.791GlnVal: 2.791 ± 0.99
0.0GlnTrp: 0.0 ± 0.0
2.362GlnTyr: 2.362 ± 0.783
0.0GlnXaa: 0.0 ± 0.0
Arg
3.65ArgAla: 3.65 ± 0.604
2.577ArgCys: 2.577 ± 0.921
2.147ArgAsp: 2.147 ± 0.594
3.006ArgGlu: 3.006 ± 0.552
1.718ArgPhe: 1.718 ± 0.473
4.724ArgGly: 4.724 ± 1.867
1.288ArgHis: 1.288 ± 0.643
5.154ArgIle: 5.154 ± 1.324
2.577ArgLys: 2.577 ± 0.909
5.798ArgLeu: 5.798 ± 1.568
1.718ArgMet: 1.718 ± 0.491
2.362ArgAsn: 2.362 ± 0.585
6.012ArgPro: 6.012 ± 1.386
2.791ArgGln: 2.791 ± 0.727
9.448ArgArg: 9.448 ± 1.844
5.798ArgSer: 5.798 ± 1.764
3.006ArgThr: 3.006 ± 1.189
4.08ArgVal: 4.08 ± 0.882
0.644ArgTrp: 0.644 ± 0.57
1.288ArgTyr: 1.288 ± 0.437
0.0ArgXaa: 0.0 ± 0.0
Ser
7.945SerAla: 7.945 ± 1.174
1.288SerCys: 1.288 ± 0.559
4.295SerAsp: 4.295 ± 1.277
5.583SerGlu: 5.583 ± 1.107
2.791SerPhe: 2.791 ± 0.753
5.368SerGly: 5.368 ± 1.128
2.362SerHis: 2.362 ± 0.791
3.65SerIle: 3.65 ± 0.82
3.006SerLys: 3.006 ± 0.88
8.589SerLeu: 8.589 ± 2.105
1.718SerMet: 1.718 ± 0.518
1.718SerAsn: 1.718 ± 0.509
8.374SerPro: 8.374 ± 1.46
3.65SerGln: 3.65 ± 0.92
4.939SerArg: 4.939 ± 0.905
6.442SerSer: 6.442 ± 0.986
4.08SerThr: 4.08 ± 0.811
6.012SerVal: 6.012 ± 1.744
0.215SerTrp: 0.215 ± 0.217
2.147SerTyr: 2.147 ± 0.676
0.0SerXaa: 0.0 ± 0.0
Thr
3.436ThrAla: 3.436 ± 0.926
0.644ThrCys: 0.644 ± 0.368
4.939ThrAsp: 4.939 ± 1.144
1.933ThrGlu: 1.933 ± 0.855
0.859ThrPhe: 0.859 ± 0.342
4.295ThrGly: 4.295 ± 0.8
2.362ThrHis: 2.362 ± 0.975
2.577ThrIle: 2.577 ± 0.887
1.288ThrLys: 1.288 ± 0.558
3.006ThrLeu: 3.006 ± 0.54
2.147ThrMet: 2.147 ± 0.958
1.718ThrAsn: 1.718 ± 0.543
4.295ThrPro: 4.295 ± 1.196
3.006ThrGln: 3.006 ± 1.211
2.577ThrArg: 2.577 ± 1.02
5.154ThrSer: 5.154 ± 0.826
4.724ThrThr: 4.724 ± 1.079
5.583ThrVal: 5.583 ± 0.827
0.429ThrTrp: 0.429 ± 0.319
0.429ThrTyr: 0.429 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
3.865ValAla: 3.865 ± 1.022
1.718ValCys: 1.718 ± 0.887
3.221ValAsp: 3.221 ± 0.849
4.08ValGlu: 4.08 ± 0.864
2.147ValPhe: 2.147 ± 0.568
3.006ValGly: 3.006 ± 0.664
1.933ValHis: 1.933 ± 0.636
3.006ValIle: 3.006 ± 0.913
2.362ValLys: 2.362 ± 0.747
5.154ValLeu: 5.154 ± 1.145
1.718ValMet: 1.718 ± 0.555
3.865ValAsn: 3.865 ± 0.983
4.295ValPro: 4.295 ± 0.964
5.154ValGln: 5.154 ± 0.985
4.724ValArg: 4.724 ± 0.873
5.798ValSer: 5.798 ± 1.122
3.65ValThr: 3.65 ± 1.103
4.295ValVal: 4.295 ± 0.787
0.215ValTrp: 0.215 ± 0.301
1.074ValTyr: 1.074 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
1.074TrpAla: 1.074 ± 0.517
0.0TrpCys: 0.0 ± 0.0
0.429TrpAsp: 0.429 ± 0.321
0.215TrpGlu: 0.215 ± 0.268
0.0TrpPhe: 0.0 ± 0.0
0.644TrpGly: 0.644 ± 0.323
0.215TrpHis: 0.215 ± 0.174
0.0TrpIle: 0.0 ± 0.0
0.644TrpLys: 0.644 ± 0.36
1.074TrpLeu: 1.074 ± 0.481
0.429TrpMet: 0.429 ± 0.252
0.429TrpAsn: 0.429 ± 0.363
0.859TrpPro: 0.859 ± 0.492
1.074TrpGln: 1.074 ± 0.343
0.0TrpArg: 0.0 ± 0.0
0.215TrpSer: 0.215 ± 0.301
0.0TrpThr: 0.0 ± 0.0
0.429TrpVal: 0.429 ± 0.359
0.429TrpTrp: 0.429 ± 0.347
0.429TrpTyr: 0.429 ± 0.275
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.503TyrAla: 1.503 ± 0.548
0.644TyrCys: 0.644 ± 0.268
2.362TyrAsp: 2.362 ± 0.593
1.288TyrGlu: 1.288 ± 0.679
0.859TyrPhe: 0.859 ± 0.404
1.074TyrGly: 1.074 ± 0.531
0.644TyrHis: 0.644 ± 0.359
0.859TyrIle: 0.859 ± 0.305
0.859TyrLys: 0.859 ± 0.38
0.859TyrLeu: 0.859 ± 0.395
0.859TyrMet: 0.859 ± 0.433
1.288TyrAsn: 1.288 ± 0.416
1.503TyrPro: 1.503 ± 0.536
1.288TyrGln: 1.288 ± 0.384
1.718TyrArg: 1.718 ± 0.763
2.791TyrSer: 2.791 ± 0.676
1.718TyrThr: 1.718 ± 0.706
2.147TyrVal: 2.147 ± 0.622
0.215TyrTrp: 0.215 ± 0.211
0.644TyrTyr: 0.644 ± 0.353
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (4658 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski