Amino acid dipepetide frequency for Koolpinyah virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.045AlaAla: 1.045 ± 0.363
0.836AlaCys: 0.836 ± 0.309
1.045AlaAsp: 1.045 ± 0.489
0.836AlaGlu: 0.836 ± 0.354
1.045AlaPhe: 1.045 ± 0.776
2.299AlaGly: 2.299 ± 1.212
0.627AlaHis: 0.627 ± 0.288
3.762AlaIle: 3.762 ± 0.476
2.299AlaLys: 2.299 ± 0.545
2.717AlaLeu: 2.717 ± 0.609
1.045AlaMet: 1.045 ± 0.444
1.045AlaAsn: 1.045 ± 0.476
0.627AlaPro: 0.627 ± 0.324
1.672AlaGln: 1.672 ± 0.52
1.254AlaArg: 1.254 ± 0.613
2.09AlaSer: 2.09 ± 0.42
0.627AlaThr: 0.627 ± 0.284
1.045AlaVal: 1.045 ± 0.513
0.627AlaTrp: 0.627 ± 0.313
1.672AlaTyr: 1.672 ± 0.688
0.0AlaXaa: 0.0 ± 0.0
Cys
0.627CysAla: 0.627 ± 0.229
0.0CysCys: 0.0 ± 0.0
1.045CysAsp: 1.045 ± 0.567
1.672CysGlu: 1.672 ± 1.182
0.627CysPhe: 0.627 ± 0.336
1.045CysGly: 1.045 ± 0.384
0.627CysHis: 0.627 ± 0.347
1.463CysIle: 1.463 ± 0.577
2.09CysLys: 2.09 ± 1.075
2.299CysLeu: 2.299 ± 1.092
0.209CysMet: 0.209 ± 0.217
1.254CysAsn: 1.254 ± 0.377
1.045CysPro: 1.045 ± 0.64
0.418CysGln: 0.418 ± 0.211
0.418CysArg: 0.418 ± 0.192
1.881CysSer: 1.881 ± 0.384
1.045CysThr: 1.045 ± 0.263
0.209CysVal: 0.209 ± 0.135
0.418CysTrp: 0.418 ± 0.302
0.836CysTyr: 0.836 ± 0.658
0.0CysXaa: 0.0 ± 0.0
Asp
2.09AspAla: 2.09 ± 0.299
1.881AspCys: 1.881 ± 0.856
5.225AspAsp: 5.225 ± 1.741
5.016AspGlu: 5.016 ± 1.154
2.508AspPhe: 2.508 ± 0.722
3.553AspGly: 3.553 ± 0.893
1.672AspHis: 1.672 ± 0.99
5.434AspIle: 5.434 ± 1.523
3.971AspLys: 3.971 ± 0.689
4.807AspLeu: 4.807 ± 1.067
2.299AspMet: 2.299 ± 0.615
4.389AspAsn: 4.389 ± 0.706
1.881AspPro: 1.881 ± 0.292
1.254AspGln: 1.254 ± 0.412
2.09AspArg: 2.09 ± 0.511
2.508AspSer: 2.508 ± 0.714
2.09AspThr: 2.09 ± 0.533
3.344AspVal: 3.344 ± 0.609
1.254AspTrp: 1.254 ± 0.616
2.717AspTyr: 2.717 ± 0.52
0.0AspXaa: 0.0 ± 0.0
Glu
1.881GluAla: 1.881 ± 0.908
0.627GluCys: 0.627 ± 0.461
5.016GluAsp: 5.016 ± 1.165
6.061GluGlu: 6.061 ± 0.931
3.135GluPhe: 3.135 ± 0.852
4.807GluGly: 4.807 ± 0.777
1.881GluHis: 1.881 ± 0.514
5.434GluIle: 5.434 ± 1.518
4.389GluLys: 4.389 ± 0.936
6.479GluLeu: 6.479 ± 1.288
1.881GluMet: 1.881 ± 0.543
5.225GluAsn: 5.225 ± 0.693
1.881GluPro: 1.881 ± 1.232
0.836GluGln: 0.836 ± 0.246
2.508GluArg: 2.508 ± 0.677
5.852GluSer: 5.852 ± 1.465
5.016GluThr: 5.016 ± 0.59
6.061GluVal: 6.061 ± 1.3
1.672GluTrp: 1.672 ± 0.683
2.926GluTyr: 2.926 ± 0.886
0.0GluXaa: 0.0 ± 0.0
Phe
1.045PheAla: 1.045 ± 0.476
1.463PheCys: 1.463 ± 0.536
2.926PheAsp: 2.926 ± 0.699
2.717PheGlu: 2.717 ± 0.969
1.672PhePhe: 1.672 ± 0.376
3.553PheGly: 3.553 ± 0.85
1.463PheHis: 1.463 ± 0.551
2.508PheIle: 2.508 ± 0.626
2.926PheLys: 2.926 ± 0.494
3.344PheLeu: 3.344 ± 0.441
0.209PheMet: 0.209 ± 0.135
2.299PheAsn: 2.299 ± 0.524
2.09PhePro: 2.09 ± 0.492
0.836PheGln: 0.836 ± 0.423
1.254PheArg: 1.254 ± 0.416
3.135PheSer: 3.135 ± 0.907
1.672PheThr: 1.672 ± 0.427
1.881PheVal: 1.881 ± 0.669
0.209PheTrp: 0.209 ± 0.135
1.672PheTyr: 1.672 ± 0.831
0.0PheXaa: 0.0 ± 0.0
Gly
1.881GlyAla: 1.881 ± 0.627
0.627GlyCys: 0.627 ± 0.336
2.09GlyAsp: 2.09 ± 0.638
3.344GlyGlu: 3.344 ± 0.517
1.672GlyPhe: 1.672 ± 0.748
2.09GlyGly: 2.09 ± 0.758
0.836GlyHis: 0.836 ± 0.407
5.643GlyIle: 5.643 ± 0.664
6.061GlyLys: 6.061 ± 1.698
6.479GlyLeu: 6.479 ± 2.076
1.045GlyMet: 1.045 ± 0.514
4.18GlyAsn: 4.18 ± 0.869
0.418GlyPro: 0.418 ± 0.302
2.508GlyGln: 2.508 ± 0.622
1.254GlyArg: 1.254 ± 0.581
5.016GlySer: 5.016 ± 1.138
3.762GlyThr: 3.762 ± 1.182
5.852GlyVal: 5.852 ± 0.932
1.254GlyTrp: 1.254 ± 0.577
3.553GlyTyr: 3.553 ± 0.65
0.0GlyXaa: 0.0 ± 0.0
His
0.627HisAla: 0.627 ± 0.284
0.418HisCys: 0.418 ± 0.421
0.418HisAsp: 0.418 ± 0.368
1.881HisGlu: 1.881 ± 0.558
0.836HisPhe: 0.836 ± 0.298
1.045HisGly: 1.045 ± 0.497
0.418HisHis: 0.418 ± 0.373
1.881HisIle: 1.881 ± 0.573
2.299HisLys: 2.299 ± 0.524
1.045HisLeu: 1.045 ± 0.351
0.627HisMet: 0.627 ± 0.515
1.045HisAsn: 1.045 ± 0.386
0.836HisPro: 0.836 ± 0.384
0.209HisGln: 0.209 ± 0.135
2.09HisArg: 2.09 ± 0.726
0.627HisSer: 0.627 ± 0.299
0.836HisThr: 0.836 ± 0.302
0.627HisVal: 0.627 ± 0.508
0.418HisTrp: 0.418 ± 0.192
1.463HisTyr: 1.463 ± 0.521
0.0HisXaa: 0.0 ± 0.0
Ile
1.881IleAla: 1.881 ± 0.523
1.672IleCys: 1.672 ± 0.558
5.643IleAsp: 5.643 ± 0.698
5.852IleGlu: 5.852 ± 1.469
3.135IlePhe: 3.135 ± 0.729
6.688IleGly: 6.688 ± 0.949
0.627IleHis: 0.627 ± 0.299
8.568IleIle: 8.568 ± 1.556
11.285IleLys: 11.285 ± 1.794
6.688IleLeu: 6.688 ± 1.324
2.508IleMet: 2.508 ± 0.982
6.897IleAsn: 6.897 ± 1.493
4.598IlePro: 4.598 ± 0.6
1.463IleGln: 1.463 ± 0.451
5.852IleArg: 5.852 ± 0.878
7.315IleSer: 7.315 ± 0.886
3.971IleThr: 3.971 ± 1.025
2.717IleVal: 2.717 ± 0.743
0.418IleTrp: 0.418 ± 0.327
3.135IleTyr: 3.135 ± 1.088
0.0IleXaa: 0.0 ± 0.0
Lys
2.09LysAla: 2.09 ± 0.95
2.09LysCys: 2.09 ± 0.605
4.389LysAsp: 4.389 ± 0.964
7.106LysGlu: 7.106 ± 1.515
2.508LysPhe: 2.508 ± 0.841
7.315LysGly: 7.315 ± 1.219
1.045LysHis: 1.045 ± 0.783
8.777LysIle: 8.777 ± 1.152
7.732LysLys: 7.732 ± 1.383
10.449LysLeu: 10.449 ± 1.534
2.299LysMet: 2.299 ± 0.683
5.225LysAsn: 5.225 ± 0.926
3.553LysPro: 3.553 ± 0.654
1.672LysGln: 1.672 ± 0.427
4.18LysArg: 4.18 ± 0.707
6.061LysSer: 6.061 ± 0.653
3.971LysThr: 3.971 ± 1.091
3.762LysVal: 3.762 ± 1.148
1.254LysTrp: 1.254 ± 0.626
1.881LysTyr: 1.881 ± 0.529
0.0LysXaa: 0.0 ± 0.0
Leu
2.926LeuAla: 2.926 ± 1.433
2.09LeuCys: 2.09 ± 0.543
6.688LeuAsp: 6.688 ± 1.266
7.524LeuGlu: 7.524 ± 0.709
2.926LeuPhe: 2.926 ± 0.532
4.389LeuGly: 4.389 ± 1.046
1.672LeuHis: 1.672 ± 0.441
9.404LeuIle: 9.404 ± 2.282
8.568LeuLys: 8.568 ± 1.555
8.568LeuLeu: 8.568 ± 1.661
3.344LeuMet: 3.344 ± 1.05
3.971LeuAsn: 3.971 ± 1.198
3.344LeuPro: 3.344 ± 0.642
2.926LeuGln: 2.926 ± 0.529
4.18LeuArg: 4.18 ± 1.127
7.732LeuSer: 7.732 ± 1.293
4.807LeuThr: 4.807 ± 0.818
5.225LeuVal: 5.225 ± 1.533
0.836LeuTrp: 0.836 ± 0.617
3.344LeuTyr: 3.344 ± 0.602
0.0LeuXaa: 0.0 ± 0.0
Met
1.045MetAla: 1.045 ± 0.401
0.627MetCys: 0.627 ± 0.706
2.09MetAsp: 2.09 ± 0.705
2.09MetGlu: 2.09 ± 0.464
1.463MetPhe: 1.463 ± 0.541
1.881MetGly: 1.881 ± 0.71
0.209MetHis: 0.209 ± 0.212
3.971MetIle: 3.971 ± 0.709
2.09MetLys: 2.09 ± 0.856
2.299MetLeu: 2.299 ± 1.15
0.418MetMet: 0.418 ± 0.517
1.881MetAsn: 1.881 ± 0.864
0.627MetPro: 0.627 ± 0.68
1.254MetGln: 1.254 ± 0.459
1.463MetArg: 1.463 ± 1.036
2.299MetSer: 2.299 ± 0.808
0.418MetThr: 0.418 ± 0.26
0.836MetVal: 0.836 ± 0.42
0.418MetTrp: 0.418 ± 0.192
2.926MetTyr: 2.926 ± 0.669
0.0MetXaa: 0.0 ± 0.0
Asn
2.508AsnAla: 2.508 ± 0.491
1.463AsnCys: 1.463 ± 0.382
2.508AsnAsp: 2.508 ± 0.766
4.18AsnGlu: 4.18 ± 0.721
2.717AsnPhe: 2.717 ± 0.601
2.926AsnGly: 2.926 ± 1.194
1.463AsnHis: 1.463 ± 0.573
6.688AsnIle: 6.688 ± 1.098
3.971AsnLys: 3.971 ± 0.929
6.897AsnLeu: 6.897 ± 1.853
2.508AsnMet: 2.508 ± 0.74
6.061AsnAsn: 6.061 ± 0.946
2.299AsnPro: 2.299 ± 0.774
2.717AsnGln: 2.717 ± 0.525
1.463AsnArg: 1.463 ± 0.464
2.926AsnSer: 2.926 ± 0.681
3.971AsnThr: 3.971 ± 0.79
2.717AsnVal: 2.717 ± 1.328
1.254AsnTrp: 1.254 ± 0.403
3.135AsnTyr: 3.135 ± 1.035
0.0AsnXaa: 0.0 ± 0.0
Pro
0.836ProAla: 0.836 ± 0.246
0.0ProCys: 0.0 ± 0.0
1.672ProAsp: 1.672 ± 0.354
2.717ProGlu: 2.717 ± 0.419
1.672ProPhe: 1.672 ± 0.977
1.672ProGly: 1.672 ± 0.653
0.627ProHis: 0.627 ± 0.252
3.135ProIle: 3.135 ± 1.124
2.299ProLys: 2.299 ± 0.566
3.553ProLeu: 3.553 ± 1.153
1.045ProMet: 1.045 ± 0.783
1.881ProAsn: 1.881 ± 0.675
1.254ProPro: 1.254 ± 0.51
1.045ProGln: 1.045 ± 0.574
1.254ProArg: 1.254 ± 0.44
2.926ProSer: 2.926 ± 0.432
2.508ProThr: 2.508 ± 0.603
1.881ProVal: 1.881 ± 1.0
0.418ProTrp: 0.418 ± 0.211
1.463ProTyr: 1.463 ± 0.482
0.0ProXaa: 0.0 ± 0.0
Gln
0.836GlnAla: 0.836 ± 0.556
0.836GlnCys: 0.836 ± 0.368
1.672GlnAsp: 1.672 ± 0.32
1.463GlnGlu: 1.463 ± 0.412
2.09GlnPhe: 2.09 ± 0.61
1.254GlnGly: 1.254 ± 0.613
0.209GlnHis: 0.209 ± 0.212
1.881GlnIle: 1.881 ± 0.393
1.672GlnLys: 1.672 ± 0.535
1.254GlnLeu: 1.254 ± 0.383
1.254GlnMet: 1.254 ± 0.547
2.09GlnAsn: 2.09 ± 0.425
0.209GlnPro: 0.209 ± 0.135
0.627GlnGln: 0.627 ± 0.229
0.418GlnArg: 0.418 ± 0.271
2.299GlnSer: 2.299 ± 0.689
2.508GlnThr: 2.508 ± 0.554
0.836GlnVal: 0.836 ± 0.423
0.418GlnTrp: 0.418 ± 0.271
0.209GlnTyr: 0.209 ± 0.259
0.0GlnXaa: 0.0 ± 0.0
Arg
1.463ArgAla: 1.463 ± 0.525
0.627ArgCys: 0.627 ± 0.397
1.672ArgAsp: 1.672 ± 0.623
3.344ArgGlu: 3.344 ± 0.789
2.299ArgPhe: 2.299 ± 0.442
1.881ArgGly: 1.881 ± 0.746
0.836ArgHis: 0.836 ± 0.451
3.762ArgIle: 3.762 ± 0.887
2.508ArgLys: 2.508 ± 0.464
3.344ArgLeu: 3.344 ± 0.794
1.045ArgMet: 1.045 ± 0.433
2.926ArgAsn: 2.926 ± 0.7
1.672ArgPro: 1.672 ± 0.986
0.418ArgGln: 0.418 ± 0.302
1.045ArgArg: 1.045 ± 0.451
3.344ArgSer: 3.344 ± 0.702
3.135ArgThr: 3.135 ± 0.941
2.09ArgVal: 2.09 ± 0.801
0.836ArgTrp: 0.836 ± 0.356
2.299ArgTyr: 2.299 ± 0.875
0.0ArgXaa: 0.0 ± 0.0
Ser
1.463SerAla: 1.463 ± 0.439
1.672SerCys: 1.672 ± 0.63
4.598SerAsp: 4.598 ± 0.565
5.225SerGlu: 5.225 ± 0.704
2.508SerPhe: 2.508 ± 0.637
3.344SerGly: 3.344 ± 0.752
1.463SerHis: 1.463 ± 0.481
5.434SerIle: 5.434 ± 0.793
6.897SerLys: 6.897 ± 1.442
7.524SerLeu: 7.524 ± 1.098
2.508SerMet: 2.508 ± 0.44
4.807SerAsn: 4.807 ± 0.763
3.135SerPro: 3.135 ± 0.921
1.045SerGln: 1.045 ± 0.476
3.553SerArg: 3.553 ± 0.538
5.016SerSer: 5.016 ± 0.793
3.344SerThr: 3.344 ± 0.583
2.926SerVal: 2.926 ± 0.592
2.09SerTrp: 2.09 ± 0.538
3.344SerTyr: 3.344 ± 1.528
0.0SerXaa: 0.0 ± 0.0
Thr
1.463ThrAla: 1.463 ± 0.584
0.209ThrCys: 0.209 ± 0.299
2.926ThrAsp: 2.926 ± 0.896
4.389ThrGlu: 4.389 ± 1.169
2.299ThrPhe: 2.299 ± 0.552
2.299ThrGly: 2.299 ± 0.538
2.09ThrHis: 2.09 ± 0.486
4.598ThrIle: 4.598 ± 0.944
3.135ThrLys: 3.135 ± 0.5
5.643ThrLeu: 5.643 ± 0.706
1.672ThrMet: 1.672 ± 0.687
1.254ThrAsn: 1.254 ± 0.431
1.045ThrPro: 1.045 ± 0.674
1.254ThrGln: 1.254 ± 0.492
2.926ThrArg: 2.926 ± 0.684
4.18ThrSer: 4.18 ± 0.832
3.762ThrThr: 3.762 ± 0.86
3.553ThrVal: 3.553 ± 0.612
1.672ThrTrp: 1.672 ± 0.54
2.717ThrTyr: 2.717 ± 0.748
0.0ThrXaa: 0.0 ± 0.0
Val
1.254ValAla: 1.254 ± 0.459
1.254ValCys: 1.254 ± 0.643
3.762ValAsp: 3.762 ± 1.074
2.508ValGlu: 2.508 ± 1.073
0.627ValPhe: 0.627 ± 0.252
3.762ValGly: 3.762 ± 0.742
0.418ValHis: 0.418 ± 0.422
4.389ValIle: 4.389 ± 0.823
5.643ValLys: 5.643 ± 1.168
5.643ValLeu: 5.643 ± 1.182
2.508ValMet: 2.508 ± 0.328
3.971ValAsn: 3.971 ± 0.941
1.254ValPro: 1.254 ± 0.297
0.209ValGln: 0.209 ± 0.299
1.254ValArg: 1.254 ± 0.337
3.135ValSer: 3.135 ± 0.805
3.135ValThr: 3.135 ± 0.935
1.881ValVal: 1.881 ± 0.542
1.254ValTrp: 1.254 ± 0.44
1.881ValTyr: 1.881 ± 0.525
0.0ValXaa: 0.0 ± 0.0
Trp
0.209TrpAla: 0.209 ± 0.259
0.209TrpCys: 0.209 ± 0.135
1.254TrpAsp: 1.254 ± 0.545
2.299TrpGlu: 2.299 ± 0.41
1.672TrpPhe: 1.672 ± 0.55
1.463TrpGly: 1.463 ± 0.324
0.209TrpHis: 0.209 ± 0.135
1.463TrpIle: 1.463 ± 0.464
1.463TrpLys: 1.463 ± 0.537
1.045TrpLeu: 1.045 ± 0.457
0.418TrpMet: 0.418 ± 0.339
0.627TrpAsn: 0.627 ± 0.387
0.836TrpPro: 0.836 ± 0.372
0.209TrpGln: 0.209 ± 0.217
0.418TrpArg: 0.418 ± 0.271
1.672TrpSer: 1.672 ± 0.376
0.627TrpThr: 0.627 ± 0.313
1.045TrpVal: 1.045 ± 0.57
0.418TrpTrp: 0.418 ± 0.286
0.209TrpTyr: 0.209 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.254TyrAla: 1.254 ± 0.387
0.627TyrCys: 0.627 ± 0.284
3.344TyrAsp: 3.344 ± 0.925
3.344TyrGlu: 3.344 ± 0.833
1.463TyrPhe: 1.463 ± 0.519
2.299TyrGly: 2.299 ± 1.039
1.254TyrHis: 1.254 ± 0.374
2.717TyrIle: 2.717 ± 0.656
6.061TyrLys: 6.061 ± 1.467
4.389TyrLeu: 4.389 ± 1.401
1.254TyrMet: 1.254 ± 0.497
3.344TyrAsn: 3.344 ± 1.277
1.254TyrPro: 1.254 ± 0.396
1.463TyrGln: 1.463 ± 0.334
1.672TyrArg: 1.672 ± 0.411
1.881TyrSer: 1.881 ± 0.525
1.881TyrThr: 1.881 ± 0.528
1.254TyrVal: 1.254 ± 0.402
0.627TyrTrp: 0.627 ± 0.229
1.254TyrTyr: 1.254 ± 0.485
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (4786 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski