Amino acid dipeptide frequency for Spider monkey simian foamy virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
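As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the mapping of non-standard letters to `X`, and the choice not to count pairs across protein boundaries are assumptions made for illustration; the page does not state how Proteome-pI performs the counting.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: pairs are counted with overlap inside each protein and never
    across protein boundaries; any non-standard letter is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(ch if ch in STANDARD else "X" for ch in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not real viral proteins.
    for pair, value in sorted(dipeptide_permille(["MAAL", "LAK"]).items()):
        print(pair, round(value, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to a value of 0.0 in the table.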

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
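The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names are hypothetical, and the plain greater-than or less-than comparison is an assumption: the page does not say which threshold is used for the red and blue marking.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def representation(observed_permille, expected_permille):
    """Classify a value relative to the expectation (simple comparison, an assumption)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


if __name__ == "__main__":
    expected = uniform_expectation_permille()  # 1000 / 400 = 2.5 per mille
    print(expected)
    # Two values taken from the table on this page.
    for name, value in {"LeuLeu": 11.649, "CysCys": 0.299}.items():
        print(name, representation(value, expected))
```

With Xaa included, `uniform_expectation_permille(21)` gives roughly 2.27‰ instead of 2.5‰.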

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
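For readers who prefer fractions or percentages, the unit conversion is a one-liner; the function names below are hypothetical and serve only to illustrate it.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    # LeuLeu value from the table on this page.
    print(permille_to_fraction(11.649), permille_to_percent(11.649))
```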

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.883AlaAla: 3.883 ± 2.027
1.493AlaCys: 1.493 ± 0.495
1.792AlaAsp: 1.792 ± 0.805
5.078AlaGlu: 5.078 ± 0.939
2.389AlaPhe: 2.389 ± 0.87
3.584AlaGly: 3.584 ± 1.129
2.389AlaHis: 2.389 ± 1.142
3.286AlaIle: 3.286 ± 0.883
1.792AlaLys: 1.792 ± 0.503
5.974AlaLeu: 5.974 ± 1.214
1.493AlaMet: 1.493 ± 0.711
1.792AlaAsn: 1.792 ± 0.55
3.584AlaPro: 3.584 ± 2.235
3.584AlaGln: 3.584 ± 0.951
4.48AlaArg: 4.48 ± 1.104
5.675AlaSer: 5.675 ± 1.207
3.883AlaThr: 3.883 ± 1.233
3.883AlaVal: 3.883 ± 0.894
0.597AlaTrp: 0.597 ± 0.429
2.091AlaTyr: 2.091 ± 1.074
0.0AlaXaa: 0.0 ± 0.0
Cys
1.493CysAla: 1.493 ± 0.92
0.299CysCys: 0.299 ± 0.456
0.299CysAsp: 0.299 ± 0.242
0.597CysGlu: 0.597 ± 0.227
0.896CysPhe: 0.896 ± 0.571
1.493CysGly: 1.493 ± 0.663
0.0CysHis: 0.0 ± 0.0
1.195CysIle: 1.195 ± 0.44
1.195CysLys: 1.195 ± 0.455
1.792CysLeu: 1.792 ± 0.611
0.896CysMet: 0.896 ± 0.571
0.299CysAsn: 0.299 ± 0.242
1.493CysPro: 1.493 ± 0.781
0.597CysGln: 0.597 ± 0.457
0.896CysArg: 0.896 ± 0.516
0.896CysSer: 0.896 ± 0.571
0.896CysThr: 0.896 ± 0.954
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.597CysTyr: 0.597 ± 0.483
0.0CysXaa: 0.0 ± 0.0
Asp
1.195AspAla: 1.195 ± 0.481
1.493AspCys: 1.493 ± 0.955
2.091AspAsp: 2.091 ± 0.714
2.091AspGlu: 2.091 ± 0.884
1.792AspPhe: 1.792 ± 0.782
1.792AspGly: 1.792 ± 0.503
0.896AspHis: 0.896 ± 0.403
2.688AspIle: 2.688 ± 1.093
2.389AspLys: 2.389 ± 1.913
5.675AspLeu: 5.675 ± 1.202
0.896AspMet: 0.896 ± 0.406
2.389AspAsn: 2.389 ± 0.666
3.286AspPro: 3.286 ± 2.04
1.493AspGln: 1.493 ± 0.482
1.493AspArg: 1.493 ± 0.602
4.779AspSer: 4.779 ± 1.058
2.091AspThr: 2.091 ± 0.806
3.286AspVal: 3.286 ± 0.821
1.195AspTrp: 1.195 ± 0.752
2.688AspTyr: 2.688 ± 0.898
0.0AspXaa: 0.0 ± 0.0
Glu
3.286GluAla: 3.286 ± 0.608
0.896GluCys: 0.896 ± 0.519
4.48GluAsp: 4.48 ± 1.344
3.584GluGlu: 3.584 ± 1.179
1.792GluPhe: 1.792 ± 0.812
3.883GluGly: 3.883 ± 1.175
1.195GluHis: 1.195 ± 0.306
3.584GluIle: 3.584 ± 0.865
2.091GluLys: 2.091 ± 1.517
5.376GluLeu: 5.376 ± 1.93
0.597GluMet: 0.597 ± 0.429
1.792GluAsn: 1.792 ± 0.864
3.584GluPro: 3.584 ± 1.937
4.48GluGln: 4.48 ± 1.403
2.091GluArg: 2.091 ± 0.723
1.493GluSer: 1.493 ± 0.955
4.779GluThr: 4.779 ± 1.029
1.792GluVal: 1.792 ± 0.457
2.091GluTrp: 2.091 ± 0.562
2.389GluTyr: 2.389 ± 0.918
0.0GluXaa: 0.0 ± 0.0
Phe
1.493PheAla: 1.493 ± 0.958
0.597PheCys: 0.597 ± 0.457
1.493PheAsp: 1.493 ± 0.704
0.896PheGlu: 0.896 ± 0.471
0.896PhePhe: 0.896 ± 0.791
1.792PheGly: 1.792 ± 0.819
0.896PheHis: 0.896 ± 0.571
2.091PheIle: 2.091 ± 0.776
0.597PheLys: 0.597 ± 0.227
3.883PheLeu: 3.883 ± 1.225
0.597PheMet: 0.597 ± 0.429
0.896PheAsn: 0.896 ± 0.823
1.493PhePro: 1.493 ± 0.915
1.493PheGln: 1.493 ± 0.391
1.195PheArg: 1.195 ± 0.306
1.493PheSer: 1.493 ± 0.846
2.389PheThr: 2.389 ± 0.908
1.195PheVal: 1.195 ± 0.453
1.195PheTrp: 1.195 ± 0.617
0.597PheTyr: 0.597 ± 0.478
0.0PheXaa: 0.0 ± 0.0
Gly
1.493GlyAla: 1.493 ± 1.131
0.597GlyCys: 0.597 ± 0.534
3.286GlyAsp: 3.286 ± 0.908
2.987GlyGlu: 2.987 ± 1.521
3.286GlyPhe: 3.286 ± 1.626
2.688GlyGly: 2.688 ± 1.232
1.195GlyHis: 1.195 ± 0.322
3.883GlyIle: 3.883 ± 1.014
2.987GlyLys: 2.987 ± 0.63
3.883GlyLeu: 3.883 ± 1.335
1.792GlyMet: 1.792 ± 0.805
4.779GlyAsn: 4.779 ± 1.187
4.182GlyPro: 4.182 ± 2.241
2.389GlyGln: 2.389 ± 1.234
2.987GlyArg: 2.987 ± 0.853
4.182GlySer: 4.182 ± 0.901
4.182GlyThr: 4.182 ± 0.974
1.792GlyVal: 1.792 ± 0.466
1.195GlyTrp: 1.195 ± 0.579
2.987GlyTyr: 2.987 ± 0.459
0.0GlyXaa: 0.0 ± 0.0
His
0.896HisAla: 0.896 ± 0.487
0.0HisCys: 0.0 ± 0.0
1.195HisAsp: 1.195 ± 0.579
2.091HisGlu: 2.091 ± 1.687
0.597HisPhe: 0.597 ± 0.227
2.389HisGly: 2.389 ± 0.343
0.299HisHis: 0.299 ± 0.409
1.493HisIle: 1.493 ± 0.347
1.493HisLys: 1.493 ± 0.442
1.792HisLeu: 1.792 ± 0.782
0.299HisMet: 0.299 ± 0.274
0.597HisAsn: 0.597 ± 0.417
2.389HisPro: 2.389 ± 0.655
0.896HisGln: 0.896 ± 0.578
0.896HisArg: 0.896 ± 0.354
1.493HisSer: 1.493 ± 0.505
2.688HisThr: 2.688 ± 1.116
2.091HisVal: 2.091 ± 0.586
0.597HisTrp: 0.597 ± 0.227
0.299HisTyr: 0.299 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
4.182IleAla: 4.182 ± 0.855
1.792IleCys: 1.792 ± 1.239
3.584IleAsp: 3.584 ± 0.885
2.688IleGlu: 2.688 ± 1.026
0.299IlePhe: 0.299 ± 0.239
2.688IleGly: 2.688 ± 0.886
1.195IleHis: 1.195 ± 0.64
5.078IleIle: 5.078 ± 1.102
4.182IleLys: 4.182 ± 1.0
3.286IleLeu: 3.286 ± 0.786
0.0IleMet: 0.0 ± 0.0
3.883IleAsn: 3.883 ± 0.872
5.974IlePro: 5.974 ± 1.922
3.584IleGln: 3.584 ± 0.79
3.584IleArg: 3.584 ± 1.019
2.688IleSer: 2.688 ± 1.429
7.467IleThr: 7.467 ± 1.739
6.571IleVal: 6.571 ± 1.702
1.195IleTrp: 1.195 ± 0.914
2.688IleTyr: 2.688 ± 1.106
0.0IleXaa: 0.0 ± 0.0
Lys
2.987LysAla: 2.987 ± 1.009
2.091LysCys: 2.091 ± 1.211
2.389LysAsp: 2.389 ± 0.655
2.688LysGlu: 2.688 ± 0.771
1.792LysPhe: 1.792 ± 1.036
1.493LysGly: 1.493 ± 0.614
1.792LysHis: 1.792 ± 0.514
3.584LysIle: 3.584 ± 1.256
2.091LysLys: 2.091 ± 1.03
6.272LysLeu: 6.272 ± 2.282
0.0LysMet: 0.0 ± 0.0
2.688LysAsn: 2.688 ± 0.833
4.182LysPro: 4.182 ± 1.448
3.883LysGln: 3.883 ± 1.485
3.286LysArg: 3.286 ± 1.193
1.493LysSer: 1.493 ± 0.602
4.182LysThr: 4.182 ± 1.071
2.688LysVal: 2.688 ± 0.881
2.091LysTrp: 2.091 ± 0.818
2.987LysTyr: 2.987 ± 0.778
0.0LysXaa: 0.0 ± 0.0
Leu
6.272LeuAla: 6.272 ± 1.28
0.299LeuCys: 0.299 ± 0.242
5.376LeuAsp: 5.376 ± 1.361
4.779LeuGlu: 4.779 ± 1.803
2.389LeuPhe: 2.389 ± 0.634
6.272LeuGly: 6.272 ± 0.403
2.389LeuHis: 2.389 ± 0.991
4.48LeuIle: 4.48 ± 1.491
8.065LeuLys: 8.065 ± 2.786
11.649LeuLeu: 11.649 ± 2.175
1.195LeuMet: 1.195 ± 0.752
4.779LeuAsn: 4.779 ± 1.436
7.766LeuPro: 7.766 ± 1.264
4.779LeuGln: 4.779 ± 1.408
5.675LeuArg: 5.675 ± 0.845
3.883LeuSer: 3.883 ± 1.036
6.571LeuThr: 6.571 ± 0.616
5.078LeuVal: 5.078 ± 1.05
2.091LeuTrp: 2.091 ± 0.56
2.688LeuTyr: 2.688 ± 0.566
0.0LeuXaa: 0.0 ± 0.0
Met
2.389MetAla: 2.389 ± 0.915
0.299MetCys: 0.299 ± 0.242
0.896MetAsp: 0.896 ± 0.578
2.389MetGlu: 2.389 ± 1.09
0.299MetPhe: 0.299 ± 0.409
0.896MetGly: 0.896 ± 0.216
0.299MetHis: 0.299 ± 0.239
0.0MetIle: 0.0 ± 0.0
0.597MetLys: 0.597 ± 0.483
1.792MetLeu: 1.792 ± 0.954
0.299MetMet: 0.299 ± 0.409
0.597MetAsn: 0.597 ± 0.227
1.493MetPro: 1.493 ± 0.891
0.896MetGln: 0.896 ± 0.403
0.896MetArg: 0.896 ± 0.354
0.299MetSer: 0.299 ± 0.242
1.792MetThr: 1.792 ± 1.601
0.896MetVal: 0.896 ± 0.717
0.0MetTrp: 0.0 ± 0.0
0.299MetTyr: 0.299 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
2.389AsnAla: 2.389 ± 0.743
0.299AsnCys: 0.299 ± 0.242
1.792AsnAsp: 1.792 ± 0.466
2.389AsnGlu: 2.389 ± 0.36
2.389AsnPhe: 2.389 ± 0.996
2.987AsnGly: 2.987 ± 0.739
0.299AsnHis: 0.299 ± 0.239
3.883AsnIle: 3.883 ± 1.111
3.286AsnLys: 3.286 ± 1.261
5.078AsnLeu: 5.078 ± 0.787
1.792AsnMet: 1.792 ± 0.653
3.584AsnAsn: 3.584 ± 0.865
2.389AsnPro: 2.389 ± 1.604
4.182AsnGln: 4.182 ± 1.191
3.584AsnArg: 3.584 ± 1.769
3.584AsnSer: 3.584 ± 0.748
2.688AsnThr: 2.688 ± 0.608
1.792AsnVal: 1.792 ± 0.819
1.195AsnTrp: 1.195 ± 0.749
0.896AsnTyr: 0.896 ± 0.516
0.0AsnXaa: 0.0 ± 0.0
Pro
4.779ProAla: 4.779 ± 2.034
0.597ProCys: 0.597 ± 0.227
2.389ProAsp: 2.389 ± 1.152
3.286ProGlu: 3.286 ± 1.009
0.896ProPhe: 0.896 ± 0.717
2.688ProGly: 2.688 ± 1.073
3.286ProHis: 3.286 ± 0.659
5.675ProIle: 5.675 ± 0.985
6.272ProLys: 6.272 ± 1.472
8.363ProLeu: 8.363 ± 1.702
0.597ProMet: 0.597 ± 0.463
4.182ProAsn: 4.182 ± 1.51
7.766ProPro: 7.766 ± 2.165
3.883ProGln: 3.883 ± 0.478
4.182ProArg: 4.182 ± 1.077
7.467ProSer: 7.467 ± 1.364
4.48ProThr: 4.48 ± 0.825
5.675ProVal: 5.675 ± 1.086
0.896ProTrp: 0.896 ± 0.388
2.091ProTyr: 2.091 ± 0.663
0.0ProXaa: 0.0 ± 0.0
Gln
3.584GlnAla: 3.584 ± 1.526
0.299GlnCys: 0.299 ± 0.239
1.792GlnAsp: 1.792 ± 0.335
4.48GlnGlu: 4.48 ± 1.032
0.299GlnPhe: 0.299 ± 0.409
3.883GlnGly: 3.883 ± 1.309
1.792GlnHis: 1.792 ± 0.335
2.987GlnIle: 2.987 ± 0.759
2.987GlnLys: 2.987 ± 0.991
4.48GlnLeu: 4.48 ± 1.137
1.195GlnMet: 1.195 ± 0.705
1.792GlnAsn: 1.792 ± 1.093
4.48GlnPro: 4.48 ± 1.501
4.48GlnGln: 4.48 ± 1.807
3.286GlnArg: 3.286 ± 1.23
2.389GlnSer: 2.389 ± 1.078
3.286GlnThr: 3.286 ± 0.736
2.091GlnVal: 2.091 ± 0.724
2.389GlnTrp: 2.389 ± 0.641
1.493GlnTyr: 1.493 ± 0.637
0.0GlnXaa: 0.0 ± 0.0
Arg
4.779ArgAla: 4.779 ± 1.345
0.597ArgCys: 0.597 ± 0.338
1.792ArgAsp: 1.792 ± 1.039
2.987ArgGlu: 2.987 ± 0.686
1.195ArgPhe: 1.195 ± 0.579
2.688ArgGly: 2.688 ± 1.188
1.195ArgHis: 1.195 ± 0.484
2.091ArgIle: 2.091 ± 0.775
2.688ArgLys: 2.688 ± 1.354
3.584ArgLeu: 3.584 ± 0.571
1.493ArgMet: 1.493 ± 0.391
2.987ArgAsn: 2.987 ± 1.163
4.779ArgPro: 4.779 ± 1.482
2.091ArgGln: 2.091 ± 0.622
2.688ArgArg: 2.688 ± 0.95
4.779ArgSer: 4.779 ± 1.56
2.987ArgThr: 2.987 ± 0.663
1.792ArgVal: 1.792 ± 0.902
0.896ArgTrp: 0.896 ± 0.791
0.896ArgTyr: 0.896 ± 0.428
0.0ArgXaa: 0.0 ± 0.0
Ser
6.87SerAla: 6.87 ± 1.497
0.597SerCys: 0.597 ± 0.457
2.389SerAsp: 2.389 ± 0.538
2.987SerGlu: 2.987 ± 0.662
0.896SerPhe: 0.896 ± 0.428
4.182SerGly: 4.182 ± 1.542
1.792SerHis: 1.792 ± 0.457
5.078SerIle: 5.078 ± 0.819
2.091SerLys: 2.091 ± 0.724
5.675SerLeu: 5.675 ± 1.551
0.597SerMet: 0.597 ± 0.227
2.688SerAsn: 2.688 ± 1.181
5.078SerPro: 5.078 ± 0.806
3.286SerGln: 3.286 ± 0.555
1.792SerArg: 1.792 ± 1.137
7.467SerSer: 7.467 ± 1.921
5.078SerThr: 5.078 ± 1.971
2.987SerVal: 2.987 ± 0.739
0.896SerTrp: 0.896 ± 0.354
1.792SerTyr: 1.792 ± 0.68
0.0SerXaa: 0.0 ± 0.0
Thr
4.779ThrAla: 4.779 ± 1.738
1.493ThrCys: 1.493 ± 0.495
3.584ThrAsp: 3.584 ± 0.608
3.584ThrGlu: 3.584 ± 0.74
3.883ThrPhe: 3.883 ± 1.438
5.675ThrGly: 5.675 ± 2.033
2.091ThrHis: 2.091 ± 0.738
3.883ThrIle: 3.883 ± 1.111
3.883ThrLys: 3.883 ± 0.962
5.376ThrLeu: 5.376 ± 1.399
1.195ThrMet: 1.195 ± 0.306
5.675ThrAsn: 5.675 ± 1.259
7.168ThrPro: 7.168 ± 1.963
2.389ThrGln: 2.389 ± 0.95
2.389ThrArg: 2.389 ± 0.424
4.48ThrSer: 4.48 ± 0.278
2.987ThrThr: 2.987 ± 0.739
3.286ThrVal: 3.286 ± 0.446
1.792ThrTrp: 1.792 ± 0.466
1.195ThrTyr: 1.195 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
3.286ValAla: 3.286 ± 1.964
1.493ValCys: 1.493 ± 0.971
2.091ValAsp: 2.091 ± 1.01
2.091ValGlu: 2.091 ± 0.655
1.195ValPhe: 1.195 ± 0.426
2.688ValGly: 2.688 ± 0.898
0.299ValHis: 0.299 ± 0.239
6.87ValIle: 6.87 ± 1.412
2.688ValLys: 2.688 ± 1.208
6.87ValLeu: 6.87 ± 0.56
1.195ValMet: 1.195 ± 0.306
1.493ValAsn: 1.493 ± 0.846
4.182ValPro: 4.182 ± 0.204
2.688ValGln: 2.688 ± 1.143
1.493ValArg: 1.493 ± 1.089
2.389ValSer: 2.389 ± 0.921
4.182ValThr: 4.182 ± 1.479
2.389ValVal: 2.389 ± 0.611
0.597ValTrp: 0.597 ± 0.268
2.389ValTyr: 2.389 ± 0.641
0.0ValXaa: 0.0 ± 0.0
Trp
0.896TrpAla: 0.896 ± 0.578
0.0TrpCys: 0.0 ± 0.0
2.091TrpAsp: 2.091 ± 1.044
1.792TrpGlu: 1.792 ± 0.576
0.0TrpPhe: 0.0 ± 0.0
0.896TrpGly: 0.896 ± 0.659
0.299TrpHis: 0.299 ± 0.242
1.195TrpIle: 1.195 ± 0.624
1.195TrpLys: 1.195 ± 0.426
2.091TrpLeu: 2.091 ± 0.431
0.597TrpMet: 0.597 ± 0.457
1.792TrpAsn: 1.792 ± 1.19
1.493TrpPro: 1.493 ± 0.616
0.299TrpGln: 0.299 ± 0.239
1.195TrpArg: 1.195 ± 0.617
0.896TrpSer: 0.896 ± 0.216
1.792TrpThr: 1.792 ± 0.431
1.195TrpVal: 1.195 ± 0.384
1.195TrpTrp: 1.195 ± 0.583
1.195TrpTyr: 1.195 ± 0.426
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.792TyrAla: 1.792 ± 0.409
0.597TyrCys: 0.597 ± 0.457
1.195TyrAsp: 1.195 ± 0.64
1.792TyrGlu: 1.792 ± 0.554
0.0TyrPhe: 0.0 ± 0.0
2.091TyrGly: 2.091 ± 1.096
0.896TyrHis: 0.896 ± 0.578
3.584TyrIle: 3.584 ± 0.654
2.091TyrLys: 2.091 ± 1.141
3.584TyrLeu: 3.584 ± 1.638
0.299TyrMet: 0.299 ± 0.242
1.792TyrAsn: 1.792 ± 0.819
2.688TyrPro: 2.688 ± 0.878
2.091TyrGln: 2.091 ± 0.791
0.896TyrArg: 0.896 ± 0.48
2.389TyrSer: 2.389 ± 0.723
2.389TyrThr: 2.389 ± 0.628
2.091TyrVal: 2.091 ± 1.01
0.0TyrTrp: 0.0 ± 0.0
2.688TyrTyr: 2.688 ± 0.597
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3349 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
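A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the replicate values is taken as the error. The function names, the fixed random seed, and the use of the population standard deviation as the error measure are assumptions for illustration; the note above only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Per mille frequency of one dipeptide, counting overlapping pairs within proteins."""
    counts = Counter(seq[i:i + 2] for seq in sequences for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the standard deviation over the replicate estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAALLK", "LLAAK", "MKLLA"]  # toy sequences, not real data
    print(pair_permille(toy_proteome, "LL"), bootstrap_error(toy_proteome, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins the estimated errors can be large relative to the values themselves.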

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski