Amino acid dipepetide frequency for Drosophila melanogaster sigmavirus HAP23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.273AlaAla: 3.273 ± 0.892
0.504AlaCys: 0.504 ± 0.279
3.525AlaAsp: 3.525 ± 1.377
2.769AlaGlu: 2.769 ± 0.547
1.007AlaPhe: 1.007 ± 0.37
2.769AlaGly: 2.769 ± 1.269
1.007AlaHis: 1.007 ± 0.424
2.769AlaIle: 2.769 ± 1.303
1.259AlaLys: 1.259 ± 0.591
5.287AlaLeu: 5.287 ± 1.069
1.007AlaMet: 1.007 ± 0.418
1.762AlaAsn: 1.762 ± 0.613
1.762AlaPro: 1.762 ± 0.499
2.014AlaGln: 2.014 ± 0.967
2.266AlaArg: 2.266 ± 0.744
2.769AlaSer: 2.769 ± 0.637
3.273AlaThr: 3.273 ± 0.589
2.518AlaVal: 2.518 ± 0.515
0.504AlaTrp: 0.504 ± 0.316
2.769AlaTyr: 2.769 ± 1.272
0.0AlaXaa: 0.0 ± 0.0
Cys
0.504CysAla: 0.504 ± 0.307
0.0CysCys: 0.0 ± 0.0
0.504CysAsp: 0.504 ± 0.279
0.0CysGlu: 0.0 ± 0.0
0.504CysPhe: 0.504 ± 0.307
1.007CysGly: 1.007 ± 0.37
0.504CysHis: 0.504 ± 0.516
0.252CysIle: 0.252 ± 0.152
0.755CysLys: 0.755 ± 0.615
1.259CysLeu: 1.259 ± 0.408
0.0CysMet: 0.0 ± 0.0
0.252CysAsn: 0.252 ± 0.152
1.259CysPro: 1.259 ± 1.001
0.252CysGln: 0.252 ± 0.344
1.007CysArg: 1.007 ± 0.558
3.525CysSer: 3.525 ± 0.749
1.259CysThr: 1.259 ± 0.762
1.259CysVal: 1.259 ± 0.762
0.252CysTrp: 0.252 ± 0.152
0.252CysTyr: 0.252 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
2.518AspAla: 2.518 ± 0.337
1.007AspCys: 1.007 ± 0.289
1.511AspAsp: 1.511 ± 0.388
4.28AspGlu: 4.28 ± 1.045
2.769AspPhe: 2.769 ± 0.814
3.273AspGly: 3.273 ± 0.767
2.014AspHis: 2.014 ± 0.425
4.028AspIle: 4.028 ± 0.812
1.259AspLys: 1.259 ± 0.362
7.301AspLeu: 7.301 ± 0.791
2.266AspMet: 2.266 ± 0.683
1.511AspAsn: 1.511 ± 0.687
4.028AspPro: 4.028 ± 1.194
3.021AspGln: 3.021 ± 0.907
2.769AspArg: 2.769 ± 0.65
2.769AspSer: 2.769 ± 0.755
3.525AspThr: 3.525 ± 1.239
3.021AspVal: 3.021 ± 0.861
0.252AspTrp: 0.252 ± 0.402
2.014AspTyr: 2.014 ± 0.692
0.0AspXaa: 0.0 ± 0.0
Glu
2.266GluAla: 2.266 ± 0.788
0.755GluCys: 0.755 ± 0.607
4.28GluAsp: 4.28 ± 1.029
3.273GluGlu: 3.273 ± 0.842
2.266GluPhe: 2.266 ± 1.342
4.783GluGly: 4.783 ± 1.188
0.504GluHis: 0.504 ± 0.516
5.035GluIle: 5.035 ± 0.591
2.769GluLys: 2.769 ± 0.875
4.783GluLeu: 4.783 ± 0.656
1.762GluMet: 1.762 ± 0.613
2.014GluAsn: 2.014 ± 0.976
2.014GluPro: 2.014 ± 1.156
1.762GluGln: 1.762 ± 0.29
2.518GluArg: 2.518 ± 0.897
4.28GluSer: 4.28 ± 1.075
4.028GluThr: 4.028 ± 1.735
3.273GluVal: 3.273 ± 0.957
1.511GluTrp: 1.511 ± 0.614
2.769GluTyr: 2.769 ± 0.742
0.0GluXaa: 0.0 ± 0.0
Phe
1.259PheAla: 1.259 ± 0.63
0.504PheCys: 0.504 ± 0.451
1.511PheAsp: 1.511 ± 0.456
1.007PheGlu: 1.007 ± 0.351
1.007PhePhe: 1.007 ± 0.754
2.769PheGly: 2.769 ± 0.956
0.504PheHis: 0.504 ± 0.516
3.021PheIle: 3.021 ± 0.963
3.273PheLys: 3.273 ± 0.657
4.28PheLeu: 4.28 ± 1.61
1.007PheMet: 1.007 ± 0.327
1.259PheAsn: 1.259 ± 0.578
4.28PhePro: 4.28 ± 0.835
1.511PheGln: 1.511 ± 0.973
2.266PheArg: 2.266 ± 1.371
3.525PheSer: 3.525 ± 0.981
3.021PheThr: 3.021 ± 1.377
3.021PheVal: 3.021 ± 1.101
0.755PheTrp: 0.755 ± 0.29
0.504PheTyr: 0.504 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
2.266GlyAla: 2.266 ± 1.002
0.755GlyCys: 0.755 ± 0.578
3.525GlyAsp: 3.525 ± 1.508
4.783GlyGlu: 4.783 ± 1.878
2.266GlyPhe: 2.266 ± 0.444
3.273GlyGly: 3.273 ± 0.749
1.511GlyHis: 1.511 ± 0.691
4.783GlyIle: 4.783 ± 0.943
2.518GlyLys: 2.518 ± 0.776
7.301GlyLeu: 7.301 ± 2.074
1.511GlyMet: 1.511 ± 0.864
2.266GlyAsn: 2.266 ± 0.647
2.266GlyPro: 2.266 ± 0.649
2.518GlyGln: 2.518 ± 0.747
2.014GlyArg: 2.014 ± 1.087
4.28GlySer: 4.28 ± 0.94
3.021GlyThr: 3.021 ± 0.868
3.776GlyVal: 3.776 ± 0.67
2.014GlyTrp: 2.014 ± 0.634
3.273GlyTyr: 3.273 ± 1.301
0.0GlyXaa: 0.0 ± 0.0
His
0.755HisAla: 0.755 ± 0.498
0.252HisCys: 0.252 ± 0.365
1.259HisAsp: 1.259 ± 0.545
1.511HisGlu: 1.511 ± 0.728
0.504HisPhe: 0.504 ± 0.307
0.755HisGly: 0.755 ± 0.326
0.252HisHis: 0.252 ± 0.35
1.762HisIle: 1.762 ± 0.511
1.007HisLys: 1.007 ± 0.545
3.021HisLeu: 3.021 ± 0.332
0.252HisMet: 0.252 ± 0.365
0.755HisAsn: 0.755 ± 0.318
3.273HisPro: 3.273 ± 0.916
1.762HisGln: 1.762 ± 0.487
2.518HisArg: 2.518 ± 0.801
1.511HisSer: 1.511 ± 0.272
1.762HisThr: 1.762 ± 0.524
2.518HisVal: 2.518 ± 0.809
0.504HisTrp: 0.504 ± 0.305
2.014HisTyr: 2.014 ± 0.856
0.0HisXaa: 0.0 ± 0.0
Ile
2.518IleAla: 2.518 ± 0.927
1.007IleCys: 1.007 ± 0.37
3.021IleAsp: 3.021 ± 0.938
3.525IleGlu: 3.525 ± 0.826
2.266IlePhe: 2.266 ± 0.581
5.287IleGly: 5.287 ± 0.997
3.021IleHis: 3.021 ± 1.004
1.762IleIle: 1.762 ± 0.547
4.783IleLys: 4.783 ± 0.845
7.553IleLeu: 7.553 ± 1.547
1.007IleMet: 1.007 ± 0.622
4.28IleAsn: 4.28 ± 0.68
4.028IlePro: 4.028 ± 1.092
3.525IleGln: 3.525 ± 1.052
4.28IleArg: 4.28 ± 1.18
5.791IleSer: 5.791 ± 2.161
4.028IleThr: 4.028 ± 0.951
5.035IleVal: 5.035 ± 1.424
0.755IleTrp: 0.755 ± 0.457
1.259IleTyr: 1.259 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
2.266LysAla: 2.266 ± 0.5
1.259LysCys: 1.259 ± 0.62
3.021LysAsp: 3.021 ± 0.704
2.014LysGlu: 2.014 ± 0.487
0.755LysPhe: 0.755 ± 0.437
3.273LysGly: 3.273 ± 0.802
0.755LysHis: 0.755 ± 0.615
3.776LysIle: 3.776 ± 1.713
1.762LysLys: 1.762 ± 0.465
4.783LysLeu: 4.783 ± 1.182
1.259LysMet: 1.259 ± 0.399
1.259LysAsn: 1.259 ± 0.384
2.518LysPro: 2.518 ± 0.806
1.007LysGln: 1.007 ± 0.424
3.021LysArg: 3.021 ± 0.603
4.028LysSer: 4.028 ± 1.005
3.525LysThr: 3.525 ± 1.192
3.021LysVal: 3.021 ± 1.468
1.511LysTrp: 1.511 ± 0.683
1.511LysTyr: 1.511 ± 0.864
0.0LysXaa: 0.0 ± 0.0
Leu
7.301LeuAla: 7.301 ± 0.881
1.259LeuCys: 1.259 ± 0.954
4.783LeuAsp: 4.783 ± 0.83
5.287LeuGlu: 5.287 ± 0.787
3.273LeuPhe: 3.273 ± 0.626
4.783LeuGly: 4.783 ± 0.516
2.518LeuHis: 2.518 ± 0.592
7.553LeuIle: 7.553 ± 1.412
4.028LeuLys: 4.028 ± 0.689
6.798LeuLeu: 6.798 ± 1.5
4.028LeuMet: 4.028 ± 1.16
5.035LeuAsn: 5.035 ± 0.956
4.028LeuPro: 4.028 ± 0.923
2.518LeuGln: 2.518 ± 0.63
6.798LeuArg: 6.798 ± 1.586
7.301LeuSer: 7.301 ± 1.903
8.56LeuThr: 8.56 ± 1.132
5.035LeuVal: 5.035 ± 1.419
0.504LeuTrp: 0.504 ± 0.279
5.287LeuTyr: 5.287 ± 1.652
0.0LeuXaa: 0.0 ± 0.0
Met
2.266MetAla: 2.266 ± 0.837
0.755MetCys: 0.755 ± 0.466
2.014MetAsp: 2.014 ± 0.891
1.511MetGlu: 1.511 ± 0.429
0.504MetPhe: 0.504 ± 0.282
2.014MetGly: 2.014 ± 0.579
0.252MetHis: 0.252 ± 0.152
2.518MetIle: 2.518 ± 1.182
0.755MetLys: 0.755 ± 0.29
1.762MetLeu: 1.762 ± 0.833
1.007MetMet: 1.007 ± 0.63
2.518MetAsn: 2.518 ± 1.056
0.252MetPro: 0.252 ± 0.344
0.755MetGln: 0.755 ± 0.326
0.504MetArg: 0.504 ± 0.305
1.511MetSer: 1.511 ± 0.451
2.518MetThr: 2.518 ± 0.685
1.511MetVal: 1.511 ± 0.669
1.007MetTrp: 1.007 ± 0.682
1.762MetTyr: 1.762 ± 0.543
0.0MetXaa: 0.0 ± 0.0
Asn
2.266AsnAla: 2.266 ± 0.732
1.259AsnCys: 1.259 ± 0.505
1.259AsnAsp: 1.259 ± 0.486
1.762AsnGlu: 1.762 ± 0.685
3.021AsnPhe: 3.021 ± 0.828
2.266AsnGly: 2.266 ± 0.674
1.762AsnHis: 1.762 ± 0.355
2.518AsnIle: 2.518 ± 1.227
2.518AsnLys: 2.518 ± 1.092
5.791AsnLeu: 5.791 ± 1.025
0.755AsnMet: 0.755 ± 0.367
2.014AsnAsn: 2.014 ± 0.634
3.525AsnPro: 3.525 ± 0.413
1.511AsnGln: 1.511 ± 0.703
2.769AsnArg: 2.769 ± 1.092
3.525AsnSer: 3.525 ± 1.071
1.762AsnThr: 1.762 ± 0.833
1.762AsnVal: 1.762 ± 0.771
0.504AsnTrp: 0.504 ± 0.282
2.518AsnTyr: 2.518 ± 0.568
0.0AsnXaa: 0.0 ± 0.0
Pro
2.014ProAla: 2.014 ± 0.502
0.504ProCys: 0.504 ± 0.279
3.776ProAsp: 3.776 ± 0.936
4.28ProGlu: 4.28 ± 1.795
2.266ProPhe: 2.266 ± 1.387
3.525ProGly: 3.525 ± 1.282
1.259ProHis: 1.259 ± 0.762
3.525ProIle: 3.525 ± 0.956
1.511ProLys: 1.511 ± 0.499
6.042ProLeu: 6.042 ± 1.104
1.259ProMet: 1.259 ± 0.486
1.762ProAsn: 1.762 ± 0.949
4.028ProPro: 4.028 ± 1.059
1.762ProGln: 1.762 ± 0.767
1.762ProArg: 1.762 ± 0.867
5.539ProSer: 5.539 ± 0.874
3.525ProThr: 3.525 ± 1.508
3.525ProVal: 3.525 ± 0.414
0.504ProTrp: 0.504 ± 0.279
1.259ProTyr: 1.259 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
1.007GlnAla: 1.007 ± 0.424
0.504GlnCys: 0.504 ± 0.307
2.769GlnAsp: 2.769 ± 0.831
3.273GlnGlu: 3.273 ± 1.221
1.259GlnPhe: 1.259 ± 0.578
2.014GlnGly: 2.014 ± 0.644
0.755GlnHis: 0.755 ± 0.466
1.762GlnIle: 1.762 ± 0.484
2.266GlnLys: 2.266 ± 0.85
4.028GlnLeu: 4.028 ± 1.145
1.259GlnMet: 1.259 ± 0.716
1.762GlnAsn: 1.762 ± 0.824
0.755GlnPro: 0.755 ± 0.751
1.007GlnGln: 1.007 ± 0.329
1.259GlnArg: 1.259 ± 0.468
4.28GlnSer: 4.28 ± 0.872
2.266GlnThr: 2.266 ± 0.773
3.273GlnVal: 3.273 ± 0.81
0.0GlnTrp: 0.0 ± 0.0
0.504GlnTyr: 0.504 ± 0.533
0.0GlnXaa: 0.0 ± 0.0
Arg
3.776ArgAla: 3.776 ± 1.188
0.252ArgCys: 0.252 ± 0.152
2.769ArgAsp: 2.769 ± 0.607
2.769ArgGlu: 2.769 ± 1.445
3.525ArgPhe: 3.525 ± 0.851
3.273ArgGly: 3.273 ± 0.661
1.007ArgHis: 1.007 ± 0.394
4.028ArgIle: 4.028 ± 1.458
2.769ArgLys: 2.769 ± 1.236
3.776ArgLeu: 3.776 ± 0.724
1.007ArgMet: 1.007 ± 0.609
3.273ArgAsn: 3.273 ± 1.362
2.014ArgPro: 2.014 ± 1.167
2.266ArgGln: 2.266 ± 0.766
3.021ArgArg: 3.021 ± 0.709
4.783ArgSer: 4.783 ± 1.494
4.028ArgThr: 4.028 ± 1.314
3.273ArgVal: 3.273 ± 0.709
1.259ArgTrp: 1.259 ± 0.762
2.266ArgTyr: 2.266 ± 0.825
0.0ArgXaa: 0.0 ± 0.0
Ser
3.021SerAla: 3.021 ± 0.757
1.259SerCys: 1.259 ± 0.762
4.28SerAsp: 4.28 ± 0.884
3.776SerGlu: 3.776 ± 0.976
4.28SerPhe: 4.28 ± 0.982
4.783SerGly: 4.783 ± 1.438
4.028SerHis: 4.028 ± 0.514
6.546SerIle: 6.546 ± 0.775
3.776SerLys: 3.776 ± 0.698
6.798SerLeu: 6.798 ± 0.795
2.518SerMet: 2.518 ± 0.616
3.525SerAsn: 3.525 ± 0.895
5.035SerPro: 5.035 ± 1.397
1.762SerGln: 1.762 ± 0.299
4.532SerArg: 4.532 ± 1.448
5.791SerSer: 5.791 ± 1.133
5.287SerThr: 5.287 ± 2.11
6.294SerVal: 6.294 ± 1.953
2.266SerTrp: 2.266 ± 0.85
3.021SerTyr: 3.021 ± 0.665
0.0SerXaa: 0.0 ± 0.0
Thr
2.769ThrAla: 2.769 ± 0.74
1.511ThrCys: 1.511 ± 0.428
4.28ThrAsp: 4.28 ± 1.089
3.525ThrGlu: 3.525 ± 0.886
2.266ThrPhe: 2.266 ± 0.602
3.273ThrGly: 3.273 ± 1.087
1.762ThrHis: 1.762 ± 0.62
4.783ThrIle: 4.783 ± 1.345
4.532ThrLys: 4.532 ± 0.766
5.539ThrLeu: 5.539 ± 1.294
0.755ThrMet: 0.755 ± 0.378
3.021ThrAsn: 3.021 ± 0.849
3.273ThrPro: 3.273 ± 1.142
3.021ThrGln: 3.021 ± 1.145
5.035ThrArg: 5.035 ± 0.483
6.294ThrSer: 6.294 ± 1.836
6.294ThrThr: 6.294 ± 1.512
4.532ThrVal: 4.532 ± 0.632
1.259ThrTrp: 1.259 ± 0.399
1.762ThrTyr: 1.762 ± 0.572
0.0ThrXaa: 0.0 ± 0.0
Val
2.266ValAla: 2.266 ± 0.732
1.259ValCys: 1.259 ± 0.505
3.776ValAsp: 3.776 ± 1.083
3.273ValGlu: 3.273 ± 0.891
3.776ValPhe: 3.776 ± 0.726
2.518ValGly: 2.518 ± 0.465
1.762ValHis: 1.762 ± 0.71
4.783ValIle: 4.783 ± 1.176
3.273ValLys: 3.273 ± 1.27
4.028ValLeu: 4.028 ± 1.07
2.769ValMet: 2.769 ± 1.283
3.273ValAsn: 3.273 ± 0.877
2.518ValPro: 2.518 ± 0.495
1.762ValGln: 1.762 ± 0.643
5.035ValArg: 5.035 ± 1.204
5.539ValSer: 5.539 ± 2.046
5.791ValThr: 5.791 ± 0.459
3.525ValVal: 3.525 ± 0.88
0.755ValTrp: 0.755 ± 0.441
2.266ValTyr: 2.266 ± 1.019
0.0ValXaa: 0.0 ± 0.0
Trp
0.252TrpAla: 0.252 ± 0.152
0.0TrpCys: 0.0 ± 0.0
0.755TrpAsp: 0.755 ± 0.498
1.259TrpGlu: 1.259 ± 0.448
1.007TrpPhe: 1.007 ± 0.424
1.259TrpGly: 1.259 ± 0.486
0.504TrpHis: 0.504 ± 0.367
1.007TrpIle: 1.007 ± 0.609
0.504TrpLys: 0.504 ± 0.305
1.259TrpLeu: 1.259 ± 0.703
1.007TrpMet: 1.007 ± 0.558
1.511TrpAsn: 1.511 ± 0.581
1.007TrpPro: 1.007 ± 0.609
0.755TrpGln: 0.755 ± 0.29
0.0TrpArg: 0.0 ± 0.0
2.266TrpSer: 2.266 ± 0.642
0.252TrpThr: 0.252 ± 0.344
1.007TrpVal: 1.007 ± 0.289
0.0TrpTrp: 0.0 ± 0.0
0.504TrpTyr: 0.504 ± 0.279
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.252TyrAla: 0.252 ± 0.152
0.0TyrCys: 0.0 ± 0.0
2.518TyrAsp: 2.518 ± 0.604
2.769TyrGlu: 2.769 ± 1.054
1.762TyrPhe: 1.762 ± 0.355
3.021TyrGly: 3.021 ± 0.749
2.266TyrHis: 2.266 ± 0.871
2.518TyrIle: 2.518 ± 0.499
1.007TyrLys: 1.007 ± 0.289
5.287TyrLeu: 5.287 ± 0.743
1.007TyrMet: 1.007 ± 0.549
2.266TyrAsn: 2.266 ± 0.832
1.762TyrPro: 1.762 ± 0.91
1.511TyrGln: 1.511 ± 0.833
1.762TyrArg: 1.762 ± 0.503
3.273TyrSer: 3.273 ± 0.977
1.762TyrThr: 1.762 ± 0.487
2.769TyrVal: 2.769 ± 1.041
0.0TyrTrp: 0.0 ± 0.0
1.762TyrTyr: 1.762 ± 0.69
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3973 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski