Amino acid dipeptide frequency for Bovine polyomavirus (BPyV) (Bos taurus polyomavirus 1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
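The counting described above can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the code used to build this page: the function names, the one-letter input alphabet, the toy sequences, and the choice to count overlapping pairs within each protein and normalize by the total number of pairs are all assumptions.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def expected_per_mille(n_letters=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_letters ** 2)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs.

    Pairs are counted inside each protein only (never across two proteins),
    and any letter outside the 20 standard ones is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def mark(value, expected):
    """Label a frequency as over- or underrepresented relative to expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy_proteins = ["MAAK", "AAL"]  # hypothetical sequences, not BPyV proteins
    freqs = dipeptide_per_mille(toy_proteins)
    expected = expected_per_mille(20)
    for pair in sorted(freqs):
        print(pair, round(freqs[pair], 3), mark(freqs[pair], expected))
```

Passing `n_letters=21` to `expected_per_mille` gives the expectation for the 441-dipeptide case that includes Xaa.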

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
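As a small worked example of the unit conversion, the sketch below converts one value from the table (GluGlu, 16.022‰) to a fraction and to a percentage, and compares it with the uniform expectation of 1000/400 per mille. The helper names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    glu_glu = 16.022          # GluGlu value from the table, in per mille
    uniform = 1000.0 / 400    # uniform expectation for 400 dipeptides, in per mille
    print("fraction:", per_mille_to_fraction(glu_glu))
    print("percent:", per_mille_to_percent(glu_glu))
    print("fold over uniform expectation:", glu_glu / uniform)
```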

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.525 ± 1.602
AlaCys: 2.762 ± 1.408
AlaAsp: 2.762 ± 1.4
AlaGlu: 7.735 ± 4.123
AlaPhe: 3.315 ± 1.275
AlaGly: 3.315 ± 0.521
AlaHis: 1.657 ± 0.767
AlaIle: 1.657 ± 0.889
AlaLys: 3.867 ± 1.039
AlaLeu: 7.182 ± 1.901
AlaMet: 2.762 ± 1.184
AlaAsn: 3.315 ± 2.088
AlaPro: 4.42 ± 0.97
AlaGln: 1.105 ± 0.761
AlaArg: 3.315 ± 1.386
AlaSer: 2.762 ± 0.629
AlaThr: 4.42 ± 2.128
AlaVal: 3.867 ± 0.605
AlaTrp: 1.105 ± 0.761
AlaTyr: 2.762 ± 1.004
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.105 ± 0.81
CysAsp: 1.657 ± 1.16
CysGlu: 0.552 ± 0.456
CysPhe: 0.552 ± 0.387
CysGly: 1.657 ± 0.772
CysHis: 0.552 ± 0.456
CysIle: 2.21 ± 1.247
CysLys: 2.762 ± 0.99
CysLeu: 1.657 ± 2.626
CysMet: 0.0 ± 0.0
CysAsn: 1.105 ± 0.773
CysPro: 1.105 ± 0.502
CysGln: 0.0 ± 0.0
CysArg: 0.552 ± 0.875
CysSer: 1.657 ± 0.919
CysThr: 2.21 ± 1.547
CysVal: 0.552 ± 0.387
CysTrp: 0.552 ± 0.456
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.552 ± 0.875
AspCys: 0.0 ± 0.0
AspAsp: 2.21 ± 1.63
AspGlu: 3.315 ± 0.692
AspPhe: 2.762 ± 1.575
AspGly: 1.657 ± 0.83
AspHis: 1.105 ± 0.773
AspIle: 4.972 ± 1.137
AspLys: 3.867 ± 2.003
AspLeu: 3.867 ± 1.091
AspMet: 1.657 ± 0.772
AspAsn: 2.762 ± 1.074
AspPro: 3.315 ± 1.11
AspGln: 1.105 ± 0.502
AspArg: 2.21 ± 1.521
AspSer: 2.21 ± 0.878
AspThr: 1.657 ± 0.812
AspVal: 2.21 ± 0.878
AspTrp: 1.105 ± 0.761
AspTyr: 0.552 ± 0.875
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.735 ± 2.378
GluCys: 1.105 ± 0.502
GluAsp: 2.21 ± 1.004
GluGlu: 16.022 ± 2.631
GluPhe: 1.657 ± 1.16
GluGly: 2.762 ± 1.155
GluHis: 2.21 ± 0.752
GluIle: 1.105 ± 0.951
GluLys: 4.972 ± 1.246
GluLeu: 10.497 ± 1.685
GluMet: 1.657 ± 0.47
GluAsn: 6.077 ± 1.335
GluPro: 2.21 ± 0.526
GluGln: 3.867 ± 1.351
GluArg: 1.657 ± 1.16
GluSer: 1.105 ± 0.951
GluThr: 3.867 ± 1.217
GluVal: 6.077 ± 1.496
GluTrp: 0.0 ± 0.0
GluTyr: 5.525 ± 0.911
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.657 ± 0.83
PheCys: 0.552 ± 0.875
PheAsp: 0.552 ± 0.387
PheGlu: 4.42 ± 1.577
PhePhe: 2.21 ± 0.526
PheGly: 1.657 ± 0.877
PheHis: 0.0 ± 0.0
PheIle: 2.21 ± 0.658
PheLys: 0.552 ± 0.456
PheLeu: 4.972 ± 1.529
PheMet: 0.0 ± 0.0
PheAsn: 2.21 ± 1.154
PhePro: 2.762 ± 0.77
PheGln: 1.657 ± 0.812
PheArg: 3.315 ± 1.208
PheSer: 0.552 ± 0.456
PheThr: 4.42 ± 1.683
PheVal: 1.105 ± 0.761
PheTrp: 0.552 ± 0.73
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.077 ± 2.566
GlyCys: 0.0 ± 0.0
GlyAsp: 6.077 ± 1.948
GlyGlu: 3.315 ± 1.248
GlyPhe: 2.762 ± 1.056
GlyGly: 5.525 ± 1.17
GlyHis: 0.0 ± 0.0
GlyIle: 3.867 ± 1.43
GlyLys: 3.867 ± 2.122
GlyLeu: 8.287 ± 0.999
GlyMet: 1.657 ± 0.65
GlyAsn: 2.762 ± 1.995
GlyPro: 2.21 ± 1.066
GlyGln: 2.21 ± 1.823
GlyArg: 2.21 ± 1.521
GlySer: 2.762 ± 0.848
GlyThr: 3.867 ± 0.877
GlyVal: 5.525 ± 2.359
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.105 ± 0.834
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.552 ± 0.387
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.105 ± 0.951
HisGly: 0.552 ± 0.456
HisHis: 2.762 ± 0.702
HisIle: 0.552 ± 0.476
HisLys: 0.0 ± 0.0
HisLeu: 3.315 ± 1.3
HisMet: 0.0 ± 0.0
HisAsn: 3.867 ± 2.107
HisPro: 3.315 ± 0.445
HisGln: 0.552 ± 0.387
HisArg: 1.657 ± 1.16
HisSer: 1.105 ± 0.816
HisThr: 3.315 ± 1.3
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.63 ± 1.973
IleCys: 2.21 ± 0.989
IleAsp: 2.762 ± 1.59
IleGlu: 3.315 ± 1.369
IlePhe: 4.42 ± 1.177
IleGly: 4.42 ± 1.244
IleHis: 1.105 ± 0.761
IleIle: 2.21 ± 0.783
IleLys: 3.315 ± 0.445
IleLeu: 4.972 ± 1.838
IleMet: 0.0 ± 0.0
IleAsn: 1.657 ± 0.877
IlePro: 1.105 ± 0.502
IleGln: 1.657 ± 0.772
IleArg: 0.552 ± 0.387
IleSer: 3.867 ± 0.621
IleThr: 2.21 ± 0.658
IleVal: 3.315 ± 1.108
IleTrp: 0.0 ± 0.0
IleTyr: 1.657 ± 0.711
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.525 ± 3.032
LysCys: 2.21 ± 1.154
LysAsp: 1.657 ± 0.877
LysGlu: 2.762 ± 1.455
LysPhe: 0.552 ± 0.456
LysGly: 6.63 ± 1.434
LysHis: 2.762 ± 0.986
LysIle: 3.315 ± 2.464
LysLys: 4.42 ± 1.668
LysLeu: 3.867 ± 1.746
LysMet: 2.21 ± 0.87
LysAsn: 4.42 ± 2.195
LysPro: 1.657 ± 1.16
LysGln: 1.105 ± 0.502
LysArg: 6.63 ± 1.359
LysSer: 4.42 ± 0.851
LysThr: 2.762 ± 1.476
LysVal: 3.867 ± 1.098
LysTrp: 0.552 ± 0.387
LysTyr: 1.657 ± 1.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.525 ± 1.814
LeuCys: 2.21 ± 0.878
LeuAsp: 5.525 ± 1.354
LeuGlu: 5.525 ± 1.452
LeuPhe: 4.42 ± 1.114
LeuGly: 6.63 ± 1.848
LeuHis: 2.762 ± 1.004
LeuIle: 4.972 ± 1.13
LeuLys: 7.735 ± 3.544
LeuLeu: 16.022 ± 2.216
LeuMet: 2.21 ± 1.247
LeuAsn: 4.42 ± 0.857
LeuPro: 6.63 ± 2.027
LeuGln: 4.42 ± 1.055
LeuArg: 4.42 ± 2.494
LeuSer: 1.657 ± 0.928
LeuThr: 7.735 ± 1.295
LeuVal: 2.762 ± 0.831
LeuTrp: 0.0 ± 0.0
LeuTyr: 9.392 ± 1.763
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.867 ± 1.039
MetCys: 0.552 ± 0.875
MetAsp: 0.552 ± 0.456
MetGlu: 2.21 ± 1.082
MetPhe: 0.552 ± 0.875
MetGly: 1.105 ± 0.555
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 3.315 ± 0.836
MetLeu: 0.552 ± 0.875
MetMet: 0.552 ± 0.875
MetAsn: 0.552 ± 0.456
MetPro: 1.105 ± 0.502
MetGln: 0.0 ± 0.0
MetArg: 1.105 ± 0.834
MetSer: 1.657 ± 1.129
MetThr: 0.0 ± 0.0
MetVal: 1.105 ± 0.502
MetTrp: 1.657 ± 0.772
MetTyr: 1.105 ± 0.773
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.315 ± 1.759
AsnCys: 2.21 ± 1.154
AsnAsp: 0.552 ± 0.456
AsnGlu: 4.972 ± 0.78
AsnPhe: 1.105 ± 0.761
AsnGly: 1.105 ± 0.502
AsnHis: 1.105 ± 0.761
AsnIle: 3.867 ± 1.098
AsnLys: 5.525 ± 1.356
AsnLeu: 4.972 ± 1.806
AsnMet: 3.315 ± 1.552
AsnAsn: 1.657 ± 1.061
AsnPro: 4.972 ± 0.862
AsnGln: 2.21 ± 0.658
AsnArg: 0.552 ± 0.387
AsnSer: 2.21 ± 2.043
AsnThr: 2.21 ± 1.305
AsnVal: 1.657 ± 0.65
AsnTrp: 0.552 ± 0.387
AsnTyr: 2.762 ± 0.561
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.657 ± 0.65
ProCys: 0.552 ± 0.387
ProAsp: 7.182 ± 0.907
ProGlu: 2.762 ± 1.455
ProPhe: 1.105 ± 0.761
ProGly: 4.42 ± 0.964
ProHis: 0.552 ± 0.73
ProIle: 1.105 ± 0.911
ProLys: 3.867 ± 1.721
ProLeu: 4.972 ± 1.199
ProMet: 0.0 ± 0.0
ProAsn: 1.105 ± 0.502
ProPro: 7.182 ± 1.698
ProGln: 3.867 ± 2.107
ProArg: 2.762 ± 1.355
ProSer: 4.42 ± 1.085
ProThr: 1.657 ± 0.898
ProVal: 4.972 ± 1.44
ProTrp: 0.0 ± 0.0
ProTyr: 3.315 ± 1.467
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.315 ± 1.818
GlnCys: 0.0 ± 0.0
GlnAsp: 2.21 ± 0.752
GlnGlu: 3.867 ± 1.353
GlnPhe: 2.21 ± 1.305
GlnGly: 3.315 ± 1.543
GlnHis: 1.105 ± 0.761
GlnIle: 2.762 ± 1.361
GlnLys: 2.21 ± 0.752
GlnLeu: 1.657 ± 0.734
GlnMet: 0.0 ± 0.0
GlnAsn: 1.657 ± 0.812
GlnPro: 2.21 ± 0.955
GlnGln: 1.105 ± 0.911
GlnArg: 2.762 ± 0.561
GlnSer: 3.867 ± 1.522
GlnThr: 0.552 ± 0.387
GlnVal: 2.21 ± 0.989
GlnTrp: 1.657 ± 0.919
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.105 ± 0.761
ArgCys: 0.0 ± 0.0
ArgAsp: 0.552 ± 0.387
ArgGlu: 2.762 ± 1.626
ArgPhe: 1.657 ± 0.989
ArgGly: 3.867 ± 0.564
ArgHis: 0.552 ± 0.387
ArgIle: 2.762 ± 0.703
ArgLys: 2.21 ± 1.066
ArgLeu: 4.42 ± 0.836
ArgMet: 2.21 ± 1.004
ArgAsn: 1.657 ± 1.252
ArgPro: 0.0 ± 0.0
ArgGln: 4.42 ± 3.043
ArgArg: 3.867 ± 1.351
ArgSer: 2.762 ± 1.361
ArgThr: 5.525 ± 1.701
ArgVal: 1.657 ± 0.65
ArgTrp: 2.762 ± 1.361
ArgTyr: 3.867 ± 2.177
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.63 ± 2.171
SerCys: 0.552 ± 0.456
SerAsp: 2.762 ± 2.481
SerGlu: 2.762 ± 0.597
SerPhe: 1.105 ± 0.773
SerGly: 3.315 ± 1.825
SerHis: 1.105 ± 0.761
SerIle: 4.42 ± 1.486
SerLys: 3.867 ± 0.547
SerLeu: 4.972 ± 1.942
SerMet: 0.0 ± 0.0
SerAsn: 1.105 ± 0.502
SerPro: 2.762 ± 0.739
SerGln: 4.42 ± 1.668
SerArg: 3.315 ± 1.467
SerSer: 5.525 ± 1.716
SerThr: 3.315 ± 1.796
SerVal: 3.315 ± 1.049
SerTrp: 0.0 ± 0.0
SerTyr: 1.105 ± 0.761
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.077 ± 2.61
ThrCys: 1.657 ± 0.877
ThrAsp: 2.762 ± 0.739
ThrGlu: 4.42 ± 1.276
ThrPhe: 1.105 ± 0.773
ThrGly: 3.315 ± 0.977
ThrHis: 0.0 ± 0.0
ThrIle: 4.972 ± 1.589
ThrLys: 0.552 ± 0.387
ThrLeu: 4.42 ± 1.15
ThrMet: 0.552 ± 0.387
ThrAsn: 2.762 ± 0.739
ThrPro: 3.867 ± 0.621
ThrGln: 1.657 ± 0.772
ThrArg: 3.315 ± 1.208
ThrSer: 6.077 ± 1.305
ThrThr: 6.63 ± 1.105
ThrVal: 4.972 ± 1.859
ThrTrp: 1.105 ± 0.494
ThrTyr: 1.105 ± 0.502
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.762 ± 1.397
ValCys: 1.657 ± 0.919
ValAsp: 0.552 ± 0.387
ValGlu: 5.525 ± 1.383
ValPhe: 1.105 ± 0.494
ValGly: 4.972 ± 1.041
ValHis: 1.105 ± 0.668
ValIle: 0.552 ± 0.387
ValLys: 2.762 ± 1.074
ValLeu: 5.525 ± 1.586
ValMet: 1.105 ± 0.668
ValAsn: 3.867 ± 1.135
ValPro: 4.42 ± 1.052
ValGln: 1.657 ± 0.772
ValArg: 2.21 ± 0.955
ValSer: 4.42 ± 1.052
ValThr: 3.315 ± 1.193
ValVal: 2.762 ± 1.243
ValTrp: 2.21 ± 1.463
ValTyr: 1.657 ± 0.734
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.105 ± 0.761
TrpCys: 0.552 ± 0.456
TrpAsp: 0.0 ± 0.0
TrpGlu: 3.867 ± 1.091
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.657 ± 0.749
TrpLeu: 3.867 ± 1.039
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.552 ± 0.387
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.105 ± 0.761
TrpVal: 0.552 ± 0.875
TrpTrp: 0.552 ± 0.387
TrpTyr: 0.552 ± 0.73
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.552 ± 0.875
TyrCys: 0.0 ± 0.0
TyrAsp: 0.552 ± 0.387
TyrGlu: 2.21 ± 0.765
TyrPhe: 1.657 ± 0.711
TyrGly: 3.867 ± 1.207
TyrHis: 2.21 ± 0.955
TyrIle: 3.867 ± 0.564
TyrLys: 1.657 ± 0.772
TyrLeu: 4.972 ± 1.515
TyrMet: 1.105 ± 0.502
TyrAsn: 4.42 ± 1.417
TyrPro: 2.21 ± 0.783
TyrGln: 1.105 ± 0.502
TyrArg: 2.21 ± 1.521
TyrSer: 3.315 ± 1.467
TyrThr: 0.552 ± 0.456
TyrVal: 1.657 ± 0.65
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.657 ± 0.734
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1811 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
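One way such a protein-level bootstrap could work is sketched below in Python. The page states only that bootstrapping was repeated 100 times at the protein level, so everything else here is an assumption: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Per mille frequency of each overlapping adjacent pair, counted per protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue or the dipeptide.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    replicates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        replicates.append(dipeptide_freqs(sample).get(pair, 0.0))
    return statistics.stdev(replicates)


if __name__ == "__main__":
    toy_proteins = ["AAAA", "KLKL", "AAKL"]  # hypothetical, not BPyV proteins
    value = dipeptide_freqs(toy_proteins).get("AA", 0.0)
    error = bootstrap_error(toy_proteins, "AA")
    print(f"AA: {value:.3f} ± {error:.3f}")
```

With only a handful of proteins, as in this proteome, each replicate can differ strongly from the full set, which is consistent with the relatively large errors reported in the table.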

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski