Amino acid dipeptide frequency for Miniopterus schreibersii polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
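
The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names `dipeptide_permille` and `classify` are hypothetical, pairs are counted within each protein (the database's exact normalization is not stated here and may differ), and the over/under threshold is simply the uniform expectation of 1/400, i.e. 2.5‰.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X (Xaa) stands for any non-standard/unknown residue


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every character outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs, counted within this protein only (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille by default)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

Calling `dipeptide_permille` on a list of protein sequences (one-letter codes) gives a dictionary keyed by pairs such as `"LL"`; passing each value to `classify` reproduces the red/blue split under the assumptions stated above.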

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each block below is labelled with the first residue; entries are dipeptide: frequency ± error (‰).
Ala
AlaAla: 5.35 ± 1.418
AlaCys: 2.14 ± 0.574
AlaAsp: 4.815 ± 1.405
AlaGlu: 5.35 ± 1.53
AlaPhe: 6.421 ± 3.05
AlaGly: 2.675 ± 0.894
AlaHis: 1.605 ± 0.672
AlaIle: 1.605 ± 0.853
AlaLys: 5.886 ± 1.643
AlaLeu: 5.35 ± 1.874
AlaMet: 1.605 ± 0.707
AlaAsn: 0.535 ± 0.456
AlaPro: 3.21 ± 1.705
AlaGln: 4.28 ± 1.205
AlaArg: 2.14 ± 1.074
AlaSer: 6.956 ± 1.689
AlaThr: 3.745 ± 0.946
AlaVal: 4.815 ± 1.485
AlaTrp: 1.07 ± 0.713
AlaTyr: 1.07 ± 0.448
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.21 ± 0.722
CysCys: 0.535 ± 0.585
CysAsp: 1.605 ± 1.07
CysGlu: 0.0 ± 0.0
CysPhe: 2.14 ± 1.039
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.07 ± 1.17
CysLys: 3.21 ± 0.888
CysLeu: 5.35 ± 2.238
CysMet: 0.0 ± 0.0
CysAsn: 1.605 ± 1.07
CysPro: 2.675 ± 1.254
CysGln: 0.535 ± 0.357
CysArg: 0.0 ± 0.0
CysSer: 2.675 ± 0.894
CysThr: 1.07 ± 0.519
CysVal: 0.535 ± 0.357
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.14 ± 0.787
AspCys: 0.0 ± 0.0
AspAsp: 0.535 ± 0.456
AspGlu: 2.675 ± 1.305
AspPhe: 2.675 ± 0.894
AspGly: 4.28 ± 1.028
AspHis: 0.0 ± 0.0
AspIle: 2.14 ± 0.574
AspLys: 3.21 ± 0.888
AspLeu: 5.35 ± 0.842
AspMet: 1.605 ± 0.909
AspAsn: 0.535 ± 0.357
AspPro: 3.21 ± 0.914
AspGln: 1.605 ± 0.831
AspArg: 1.605 ± 0.672
AspSer: 2.675 ± 1.377
AspThr: 1.605 ± 0.672
AspVal: 3.745 ± 1.439
AspTrp: 1.07 ± 0.808
AspTyr: 0.535 ± 0.357
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.956 ± 2.567
GluCys: 0.535 ± 0.585
GluAsp: 2.675 ± 0.882
GluGlu: 7.491 ± 1.888
GluPhe: 2.675 ± 0.894
GluGly: 2.675 ± 0.79
GluHis: 0.535 ± 0.357
GluIle: 2.14 ± 0.973
GluLys: 4.815 ± 2.623
GluLeu: 5.35 ± 1.157
GluMet: 0.535 ± 0.567
GluAsn: 2.14 ± 0.973
GluPro: 3.745 ± 1.69
GluGln: 3.745 ± 1.333
GluArg: 2.675 ± 1.041
GluSer: 4.815 ± 1.693
GluThr: 2.675 ± 1.214
GluVal: 4.28 ± 1.535
GluTrp: 0.535 ± 0.456
GluTyr: 2.14 ± 0.638
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.28 ± 0.654
PheCys: 4.28 ± 1.112
PheAsp: 1.07 ± 0.713
PheGlu: 6.421 ± 1.89
PhePhe: 0.0 ± 0.0
PheGly: 2.675 ± 0.724
PheHis: 0.0 ± 0.0
PheIle: 2.14 ± 0.704
PheLys: 3.745 ± 1.1
PheLeu: 3.21 ± 1.184
PheMet: 0.0 ± 0.0
PheAsn: 0.535 ± 0.357
PhePro: 2.14 ± 0.704
PheGln: 2.675 ± 1.597
PheArg: 1.07 ± 0.448
PheSer: 5.35 ± 0.988
PheThr: 4.28 ± 1.252
PheVal: 1.07 ± 0.913
PheTrp: 0.0 ± 0.0
PheTyr: 1.605 ± 0.532
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.21 ± 1.525
GlyCys: 0.535 ± 0.357
GlyAsp: 3.745 ± 0.453
GlyGlu: 3.21 ± 0.697
GlyPhe: 1.07 ± 0.448
GlyGly: 5.886 ± 1.28
GlyHis: 0.535 ± 0.62
GlyIle: 5.886 ± 1.927
GlyLys: 2.675 ± 0.702
GlyLeu: 10.166 ± 3.605
GlyMet: 0.0 ± 0.0
GlyAsn: 0.535 ± 0.456
GlyPro: 3.21 ± 1.661
GlyGln: 2.675 ± 0.79
GlyArg: 2.14 ± 0.856
GlySer: 3.745 ± 1.513
GlyThr: 2.14 ± 0.622
GlyVal: 4.815 ± 0.756
GlyTrp: 1.07 ± 0.808
GlyTyr: 2.675 ± 0.862
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.675 ± 1.305
HisCys: 1.07 ± 0.713
HisAsp: 1.605 ± 0.669
HisGlu: 1.605 ± 0.669
HisPhe: 1.07 ± 0.808
HisGly: 0.535 ± 0.62
HisHis: 1.605 ± 0.669
HisIle: 0.535 ± 0.456
HisLys: 1.07 ± 0.448
HisLeu: 2.675 ± 1.235
HisMet: 1.07 ± 0.448
HisAsn: 0.0 ± 0.0
HisPro: 3.745 ± 1.485
HisGln: 2.14 ± 1.616
HisArg: 0.535 ± 0.357
HisSer: 0.535 ± 0.357
HisThr: 1.07 ± 0.7
HisVal: 1.07 ± 0.808
HisTrp: 0.0 ± 0.0
HisTyr: 0.535 ± 0.357
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.14 ± 1.23
IleCys: 1.07 ± 0.617
IleAsp: 1.07 ± 0.913
IleGlu: 2.675 ± 1.254
IlePhe: 1.07 ± 0.519
IleGly: 2.675 ± 0.724
IleHis: 0.535 ± 0.456
IleIle: 0.535 ± 0.585
IleLys: 2.14 ± 1.127
IleLeu: 3.21 ± 0.907
IleMet: 2.14 ± 0.942
IleAsn: 2.14 ± 1.427
IlePro: 1.07 ± 0.7
IleGln: 2.675 ± 1.597
IleArg: 3.21 ± 1.371
IleSer: 5.35 ± 1.969
IleThr: 1.605 ± 0.769
IleVal: 3.21 ± 0.684
IleTrp: 1.07 ± 0.519
IleTyr: 1.07 ± 0.617
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.28 ± 1.535
LysCys: 3.21 ± 1.343
LysAsp: 1.07 ± 0.656
LysGlu: 4.815 ± 1.628
LysPhe: 0.535 ± 0.357
LysGly: 3.21 ± 0.888
LysHis: 1.07 ± 0.713
LysIle: 4.815 ± 0.564
LysLys: 7.491 ± 2.408
LysLeu: 5.886 ± 0.944
LysMet: 2.14 ± 0.942
LysAsn: 3.745 ± 1.513
LysPro: 1.07 ± 0.617
LysGln: 1.605 ± 1.185
LysArg: 3.745 ± 0.782
LysSer: 2.675 ± 1.204
LysThr: 4.815 ± 1.677
LysVal: 2.14 ± 1.426
LysTrp: 1.605 ± 0.823
LysTyr: 0.535 ± 0.357
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.745 ± 0.859
LeuCys: 1.605 ± 0.668
LeuAsp: 5.886 ± 2.008
LeuGlu: 8.561 ± 1.713
LeuPhe: 5.35 ± 2.295
LeuGly: 4.28 ± 2.162
LeuHis: 6.421 ± 1.741
LeuIle: 4.815 ± 1.066
LeuLys: 4.815 ± 1.56
LeuLeu: 19.262 ± 3.561
LeuMet: 3.21 ± 1.226
LeuAsn: 8.026 ± 0.688
LeuPro: 9.096 ± 2.867
LeuGln: 4.28 ± 1.258
LeuArg: 6.421 ± 1.198
LeuSer: 7.491 ± 2.69
LeuThr: 3.21 ± 0.68
LeuVal: 8.026 ± 1.658
LeuTrp: 2.14 ± 1.615
LeuTyr: 1.605 ± 0.823
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.21 ± 1.14
MetCys: 0.535 ± 0.357
MetAsp: 4.28 ± 1.45
MetGlu: 0.535 ± 0.62
MetPhe: 1.605 ± 0.853
MetGly: 1.605 ± 0.497
MetHis: 0.535 ± 0.62
MetIle: 0.535 ± 0.567
MetLys: 1.605 ± 0.823
MetLeu: 3.21 ± 0.76
MetMet: 0.0 ± 0.0
MetAsn: 2.675 ± 0.807
MetPro: 0.535 ± 0.456
MetGln: 0.0 ± 0.0
MetArg: 0.535 ± 0.357
MetSer: 0.535 ± 0.456
MetThr: 3.21 ± 0.487
MetVal: 0.535 ± 0.62
MetTrp: 1.07 ± 0.656
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.535 ± 0.357
AsnCys: 0.535 ± 0.357
AsnAsp: 1.07 ± 0.913
AsnGlu: 1.605 ± 0.831
AsnPhe: 1.07 ± 0.713
AsnGly: 0.535 ± 0.357
AsnHis: 0.535 ± 0.357
AsnIle: 0.535 ± 0.456
AsnLys: 2.14 ± 1.426
AsnLeu: 4.815 ± 1.382
AsnMet: 1.605 ± 0.669
AsnAsn: 2.675 ± 1.08
AsnPro: 4.815 ± 2.157
AsnGln: 3.21 ± 0.896
AsnArg: 0.535 ± 0.357
AsnSer: 3.21 ± 1.155
AsnThr: 1.605 ± 0.668
AsnVal: 5.886 ± 1.121
AsnTrp: 1.605 ± 0.853
AsnTyr: 0.535 ± 0.456
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.28 ± 0.975
ProCys: 0.535 ± 0.357
ProAsp: 4.28 ± 0.987
ProGlu: 1.07 ± 0.713
ProPhe: 2.14 ± 0.704
ProGly: 3.745 ± 0.631
ProHis: 0.535 ± 0.357
ProIle: 3.745 ± 1.237
ProLys: 4.28 ± 1.013
ProLeu: 6.956 ± 2.022
ProMet: 2.14 ± 1.313
ProAsn: 1.605 ± 0.54
ProPro: 10.166 ± 3.484
ProGln: 5.886 ± 1.795
ProArg: 2.675 ± 1.168
ProSer: 8.026 ± 1.429
ProThr: 2.14 ± 0.638
ProVal: 2.14 ± 0.787
ProTrp: 0.0 ± 0.0
ProTyr: 0.535 ± 0.456
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.956 ± 1.006
GlnCys: 1.07 ± 0.519
GlnAsp: 2.14 ± 0.991
GlnGlu: 2.14 ± 0.638
GlnPhe: 3.745 ± 1.449
GlnGly: 4.815 ± 0.82
GlnHis: 2.675 ± 1.75
GlnIle: 1.605 ± 0.54
GlnLys: 2.14 ± 1.234
GlnLeu: 8.026 ± 1.687
GlnMet: 2.14 ± 0.896
GlnAsn: 0.0 ± 0.0
GlnPro: 1.605 ± 0.54
GlnGln: 1.605 ± 0.672
GlnArg: 2.14 ± 0.727
GlnSer: 0.535 ± 0.357
GlnThr: 3.745 ± 0.631
GlnVal: 2.675 ± 1.041
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.535 ± 0.456
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.14 ± 0.638
ArgCys: 0.535 ± 0.357
ArgAsp: 2.675 ± 0.702
ArgGlu: 2.675 ± 1.147
ArgPhe: 2.675 ± 0.548
ArgGly: 2.675 ± 0.862
ArgHis: 1.07 ± 0.808
ArgIle: 0.535 ± 0.357
ArgLys: 2.675 ± 2.39
ArgLeu: 4.815 ± 1.35
ArgMet: 1.07 ± 0.656
ArgAsn: 1.07 ± 0.617
ArgPro: 1.07 ± 0.7
ArgGln: 1.605 ± 1.185
ArgArg: 2.675 ± 0.714
ArgSer: 2.675 ± 0.667
ArgThr: 0.535 ± 0.62
ArgVal: 1.605 ± 0.831
ArgTrp: 1.07 ± 0.808
ArgTyr: 3.745 ± 1.569
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.745 ± 0.946
SerCys: 4.815 ± 1.583
SerAsp: 1.07 ± 0.713
SerGlu: 0.535 ± 0.456
SerPhe: 3.745 ± 1.994
SerGly: 5.886 ± 1.198
SerHis: 2.675 ± 0.807
SerIle: 2.14 ± 1.273
SerLys: 2.675 ± 1.08
SerLeu: 9.631 ± 2.028
SerMet: 1.07 ± 0.915
SerAsn: 2.14 ± 0.704
SerPro: 4.815 ± 2.19
SerGln: 4.815 ± 1.218
SerArg: 3.745 ± 1.75
SerSer: 9.096 ± 2.103
SerThr: 7.491 ± 1.049
SerVal: 1.07 ± 0.939
SerTrp: 0.535 ± 0.357
SerTyr: 1.07 ± 0.713
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.745 ± 0.453
ThrCys: 1.07 ± 0.7
ThrAsp: 0.535 ± 0.357
ThrGlu: 2.675 ± 1.254
ThrPhe: 3.745 ± 1.039
ThrGly: 4.28 ± 1.352
ThrHis: 1.07 ± 0.448
ThrIle: 1.605 ± 0.853
ThrLys: 2.675 ± 0.794
ThrLeu: 6.421 ± 1.231
ThrMet: 2.675 ± 0.823
ThrAsn: 2.675 ± 0.548
ThrPro: 6.421 ± 0.967
ThrGln: 2.675 ± 0.548
ThrArg: 0.535 ± 0.357
ThrSer: 1.605 ± 0.707
ThrThr: 5.886 ± 0.93
ThrVal: 5.886 ± 1.429
ThrTrp: 1.07 ± 0.519
ThrTyr: 0.535 ± 0.357
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.815 ± 1.981
ValCys: 1.07 ± 0.519
ValAsp: 0.535 ± 0.357
ValGlu: 6.956 ± 1.168
ValPhe: 3.21 ± 0.914
ValGly: 2.675 ± 2.282
ValHis: 1.605 ± 0.532
ValIle: 2.675 ± 1.041
ValLys: 2.14 ± 0.973
ValLeu: 5.35 ± 0.906
ValMet: 2.14 ± 1.273
ValAsn: 4.815 ± 2.007
ValPro: 3.21 ± 1.155
ValGln: 2.14 ± 0.991
ValArg: 2.14 ± 1.127
ValSer: 3.21 ± 0.914
ValThr: 5.35 ± 1.098
ValVal: 2.675 ± 1.187
ValTrp: 0.535 ± 0.585
ValTyr: 1.07 ± 0.713
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.605 ± 0.672
TrpCys: 1.07 ± 0.808
TrpAsp: 0.535 ± 0.585
TrpGlu: 1.07 ± 0.7
TrpPhe: 0.535 ± 0.585
TrpGly: 1.07 ± 0.7
TrpHis: 1.07 ± 0.519
TrpIle: 0.0 ± 0.0
TrpLys: 0.535 ± 0.357
TrpLeu: 1.07 ± 0.808
TrpMet: 0.535 ± 0.62
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.07 ± 0.519
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.07 ± 0.808
TrpVal: 1.605 ± 1.034
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.605 ± 0.668
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.605 ± 0.831
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 1.07 ± 0.448
TyrPhe: 1.07 ± 0.448
TyrGly: 3.745 ± 1.076
TyrHis: 1.07 ± 0.448
TyrIle: 1.605 ± 0.532
TyrLys: 0.535 ± 0.357
TyrLeu: 2.14 ± 1.234
TyrMet: 0.535 ± 0.357
TyrAsn: 1.605 ± 0.672
TyrPro: 1.07 ± 0.913
TyrGln: 1.07 ± 0.519
TyrArg: 1.07 ± 0.713
TyrSer: 2.14 ± 0.895
TyrThr: 0.535 ± 0.62
TyrVal: 0.535 ± 0.357
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.535 ± 0.456
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1870 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
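
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the reported error is assumed to be the standard deviation across replicates. The names `dipeptide_counts` and `bootstrap_sd` and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_sd(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over bootstrap replicates.

    Resampling is done at the protein level: each replicate draws
    len(proteins) whole proteins with replacement. n_boot must be >= 2.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

With only a handful of proteins, as here, the replicates differ mainly in which proteins are drawn more than once, which is why rare dipeptides can show an error comparable to their frequency.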

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski