Amino acid dipeptide frequency for Bovine polyomavirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
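As an illustration of what such a calculation can look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that counts are pooled over the whole proteome, and that any non-standard letter is treated as Xaa. The function names and the toy sequences are hypothetical, and this is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the order used by the table; X stands for Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}

def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones becomes Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    return counts

def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille of observed pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }

# Toy sequences for illustration only, not the Bovine polyomavirus 3 proteome.
freqs = dipeptide_per_mille(["MAAK", "AAZ"])
print(round(freqs["AlaAla"], 1))  # 2 of 5 pairs -> 400.0
```

Pairs are never formed across protein boundaries in this sketch, so a proteome of N residues in P proteins yields N - P pairs.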

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
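The arithmetic behind the uniform expectation and the per-mille scale can be sketched as follows. This assumes that the over/under-representation marking compares each value against the uniform expectation of 1/400; the function names are hypothetical, and the two example values are copied from the table (LeuLeu and AlaCys).

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2

def per_mille_to_fraction(value):
    """Convert a per-mille table value to a plain fraction."""
    return value * 1e-3

def classify(value, expected):
    """Label a per-mille value relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

expected = expected_per_mille()        # 2.5 per mille, i.e. 0.25%
expected_with_xaa = expected_per_mille(21)  # about 2.27 per mille for 441 pairs

print(classify(11.966, expected))      # LeuLeu -> over
print(classify(0.57, expected))        # AlaCys -> under
print(per_mille_to_fraction(11.966))   # LeuLeu as a fraction, about 0.012
```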

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.256 ± 3.837
AlaCys: 0.57 ± 0.703
AlaAsp: 2.849 ± 1.465
AlaGlu: 3.989 ± 0.895
AlaPhe: 0.57 ± 0.652
AlaGly: 5.128 ± 2.365
AlaHis: 1.709 ± 1.036
AlaIle: 5.128 ± 2.645
AlaLys: 6.838 ± 2.508
AlaLeu: 6.838 ± 1.613
AlaMet: 0.57 ± 0.484
AlaAsn: 2.279 ± 1.59
AlaPro: 2.279 ± 1.201
AlaGln: 1.709 ± 0.637
AlaArg: 4.558 ± 1.881
AlaSer: 4.558 ± 1.27
AlaThr: 5.128 ± 1.594
AlaVal: 5.128 ± 1.774
AlaTrp: 1.709 ± 0.915
AlaTyr: 2.279 ± 1.356
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.57 ± 0.397
CysCys: 0.0 ± 0.0
CysAsp: 0.57 ± 0.397
CysGlu: 1.14 ± 0.489
CysPhe: 1.709 ± 0.915
CysGly: 2.279 ± 1.093
CysHis: 0.0 ± 0.0
CysIle: 1.709 ± 1.359
CysLys: 0.57 ± 0.397
CysLeu: 1.14 ± 0.489
CysMet: 0.0 ± 0.0
CysAsn: 0.57 ± 0.397
CysPro: 0.57 ± 0.397
CysGln: 0.57 ± 0.397
CysArg: 0.57 ± 0.397
CysSer: 0.57 ± 0.397
CysThr: 1.14 ± 0.713
CysVal: 1.709 ± 1.359
CysTrp: 1.14 ± 0.713
CysTyr: 3.419 ± 1.274
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.14 ± 0.536
AspCys: 1.14 ± 0.795
AspAsp: 1.709 ± 0.748
AspGlu: 3.419 ± 1.372
AspPhe: 1.14 ± 0.795
AspGly: 6.838 ± 1.467
AspHis: 0.57 ± 0.397
AspIle: 1.709 ± 0.626
AspLys: 2.279 ± 0.763
AspLeu: 3.419 ± 0.837
AspMet: 1.14 ± 0.795
AspAsn: 2.279 ± 0.763
AspPro: 9.687 ± 1.058
AspGln: 0.57 ± 0.397
AspArg: 1.14 ± 0.713
AspSer: 1.14 ± 0.738
AspThr: 1.14 ± 0.489
AspVal: 1.14 ± 0.489
AspTrp: 2.849 ± 0.705
AspTyr: 1.709 ± 0.915
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.698 ± 1.503
GluCys: 0.57 ± 0.703
GluAsp: 2.279 ± 1.093
GluGlu: 7.977 ± 3.679
GluPhe: 2.279 ± 1.093
GluGly: 6.268 ± 1.082
GluHis: 1.14 ± 0.834
GluIle: 3.419 ± 0.806
GluLys: 3.419 ± 1.174
GluLeu: 5.698 ± 1.363
GluMet: 0.57 ± 0.703
GluAsn: 1.709 ± 0.626
GluPro: 0.57 ± 0.397
GluGln: 3.989 ± 2.235
GluArg: 1.709 ± 1.357
GluSer: 4.558 ± 1.384
GluThr: 3.989 ± 1.505
GluVal: 4.558 ± 1.2
GluTrp: 0.0 ± 0.0
GluTyr: 1.14 ± 0.969
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.849 ± 0.769
PheCys: 0.0 ± 0.0
PheAsp: 0.57 ± 0.397
PheGlu: 2.279 ± 1.59
PhePhe: 2.279 ± 1.249
PheGly: 3.419 ± 1.212
PheHis: 0.0 ± 0.0
PheIle: 1.709 ± 1.192
PheLys: 0.57 ± 0.652
PheLeu: 2.849 ± 0.712
PheMet: 0.57 ± 0.397
PheAsn: 3.989 ± 1.348
PhePro: 1.709 ± 1.357
PheGln: 0.0 ± 0.0
PheArg: 1.709 ± 0.888
PheSer: 2.849 ± 1.021
PheThr: 1.709 ± 1.192
PheVal: 2.849 ± 0.487
PheTrp: 1.709 ± 0.637
PheTyr: 0.57 ± 0.397
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.407 ± 1.75
GlyCys: 1.14 ± 0.795
GlyAsp: 3.419 ± 0.381
GlyGlu: 5.128 ± 2.44
GlyPhe: 4.558 ± 0.724
GlyGly: 10.256 ± 2.44
GlyHis: 0.0 ± 0.0
GlyIle: 3.989 ± 1.45
GlyLys: 3.989 ± 1.734
GlyLeu: 7.407 ± 2.542
GlyMet: 1.14 ± 0.713
GlyAsn: 6.268 ± 2.305
GlyPro: 7.407 ± 1.181
GlyGln: 6.838 ± 2.509
GlyArg: 4.558 ± 1.759
GlySer: 6.268 ± 2.316
GlyThr: 2.849 ± 1.816
GlyVal: 5.698 ± 2.027
GlyTrp: 3.419 ± 1.184
GlyTyr: 1.14 ± 0.969
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.14 ± 0.795
HisCys: 1.709 ± 0.915
HisAsp: 0.57 ± 0.397
HisGlu: 1.14 ± 0.713
HisPhe: 1.14 ± 0.489
HisGly: 2.279 ± 0.615
HisHis: 1.709 ± 0.626
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 0.57 ± 0.397
HisMet: 0.57 ± 0.397
HisAsn: 0.0 ± 0.0
HisPro: 1.14 ± 0.713
HisGln: 1.709 ± 1.036
HisArg: 1.14 ± 0.795
HisSer: 1.14 ± 0.795
HisThr: 0.0 ± 0.0
HisVal: 0.57 ± 0.397
HisTrp: 2.279 ± 0.637
HisTyr: 0.57 ± 0.703
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.558 ± 1.224
IleCys: 2.849 ± 1.987
IleAsp: 2.279 ± 0.615
IleGlu: 0.57 ± 0.652
IlePhe: 1.14 ± 0.721
IleGly: 1.709 ± 0.888
IleHis: 1.14 ± 0.713
IleIle: 2.279 ± 0.502
IleLys: 3.419 ± 1.393
IleLeu: 4.558 ± 1.445
IleMet: 0.57 ± 0.397
IleAsn: 3.989 ± 0.728
IlePro: 2.279 ± 0.977
IleGln: 0.57 ± 0.397
IleArg: 1.14 ± 0.834
IleSer: 2.279 ± 0.502
IleThr: 3.989 ± 1.9
IleVal: 2.849 ± 1.68
IleTrp: 1.14 ± 0.834
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.279 ± 1.249
LysCys: 0.57 ± 0.397
LysAsp: 2.279 ± 0.637
LysGlu: 2.849 ± 1.564
LysPhe: 1.709 ± 1.192
LysGly: 5.128 ± 1.726
LysHis: 1.14 ± 0.795
LysIle: 1.709 ± 1.359
LysLys: 7.407 ± 1.712
LysLeu: 3.989 ± 2.291
LysMet: 0.57 ± 0.397
LysAsn: 1.709 ± 0.748
LysPro: 1.14 ± 0.489
LysGln: 1.14 ± 0.795
LysArg: 4.558 ± 1.048
LysSer: 1.14 ± 0.721
LysThr: 6.838 ± 1.611
LysVal: 5.128 ± 3.019
LysTrp: 0.0 ± 0.0
LysTyr: 2.849 ± 0.892
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.989 ± 1.001
LeuCys: 2.279 ± 0.977
LeuAsp: 2.849 ± 1.349
LeuGlu: 7.407 ± 1.116
LeuPhe: 1.14 ± 0.713
LeuGly: 9.687 ± 3.462
LeuHis: 1.709 ± 0.748
LeuIle: 2.849 ± 0.487
LeuLys: 2.279 ± 0.763
LeuLeu: 11.966 ± 2.141
LeuMet: 2.279 ± 0.763
LeuAsn: 2.849 ± 1.564
LeuPro: 9.117 ± 1.762
LeuGln: 5.128 ± 2.988
LeuArg: 3.419 ± 1.151
LeuSer: 3.419 ± 1.184
LeuThr: 5.128 ± 3.526
LeuVal: 3.989 ± 1.337
LeuTrp: 4.558 ± 2.311
LeuTyr: 2.279 ± 0.792
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.989 ± 0.42
MetCys: 0.57 ± 0.484
MetAsp: 1.709 ± 1.359
MetGlu: 1.14 ± 0.489
MetPhe: 0.57 ± 0.397
MetGly: 1.709 ± 0.477
MetHis: 0.0 ± 0.0
MetIle: 0.57 ± 0.397
MetLys: 1.14 ± 0.713
MetLeu: 2.279 ± 1.015
MetMet: 0.57 ± 0.451
MetAsn: 0.57 ± 0.484
MetPro: 2.279 ± 0.792
MetGln: 0.57 ± 0.397
MetArg: 2.849 ± 0.705
MetSer: 0.0 ± 0.0
MetThr: 1.14 ± 0.489
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.14 ± 0.489
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.709 ± 0.748
AsnCys: 1.14 ± 0.713
AsnAsp: 3.419 ± 1.372
AsnGlu: 0.57 ± 0.397
AsnPhe: 0.0 ± 0.0
AsnGly: 1.14 ± 0.969
AsnHis: 0.0 ± 0.0
AsnIle: 2.279 ± 1.093
AsnLys: 3.419 ± 0.642
AsnLeu: 6.268 ± 1.814
AsnMet: 3.419 ± 1.035
AsnAsn: 1.709 ± 0.626
AsnPro: 2.849 ± 0.712
AsnGln: 3.419 ± 1.461
AsnArg: 0.57 ± 0.397
AsnSer: 3.419 ± 1.831
AsnThr: 1.709 ± 0.748
AsnVal: 2.849 ± 0.487
AsnTrp: 1.709 ± 0.626
AsnTyr: 1.14 ± 0.834
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.419 ± 0.642
ProCys: 0.0 ± 0.0
ProAsp: 7.407 ± 2.152
ProGlu: 2.849 ± 1.156
ProPhe: 1.14 ± 0.795
ProGly: 6.838 ± 0.71
ProHis: 1.14 ± 0.795
ProIle: 1.709 ± 1.036
ProLys: 4.558 ± 1.937
ProLeu: 5.128 ± 1.725
ProMet: 0.0 ± 0.0
ProAsn: 1.14 ± 0.489
ProPro: 6.268 ± 2.75
ProGln: 3.419 ± 2.133
ProArg: 0.57 ± 0.484
ProSer: 1.709 ± 0.748
ProThr: 4.558 ± 1.47
ProVal: 6.268 ± 0.935
ProTrp: 0.57 ± 0.484
ProTyr: 1.14 ± 0.834
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.558 ± 0.375
GlnCys: 0.57 ± 0.397
GlnAsp: 2.849 ± 0.487
GlnGlu: 2.849 ± 1.42
GlnPhe: 0.57 ± 0.484
GlnGly: 2.849 ± 0.912
GlnHis: 0.57 ± 0.397
GlnIle: 2.279 ± 1.015
GlnLys: 2.849 ± 1.564
GlnLeu: 3.419 ± 2.133
GlnMet: 1.709 ± 1.453
GlnAsn: 2.849 ± 1.42
GlnPro: 1.14 ± 0.834
GlnGln: 1.709 ± 1.192
GlnArg: 2.849 ± 1.68
GlnSer: 1.709 ± 1.192
GlnThr: 6.268 ± 1.576
GlnVal: 3.419 ± 1.848
GlnTrp: 5.128 ± 3.071
GlnTyr: 1.14 ± 0.969
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.128 ± 2.155
ArgCys: 0.57 ± 0.397
ArgAsp: 2.279 ± 0.792
ArgGlu: 3.419 ± 1.255
ArgPhe: 0.57 ± 0.484
ArgGly: 4.558 ± 1.885
ArgHis: 0.57 ± 0.397
ArgIle: 3.419 ± 1.372
ArgLys: 3.419 ± 1.174
ArgLeu: 1.709 ± 0.915
ArgMet: 0.57 ± 0.786
ArgAsn: 0.57 ± 0.703
ArgPro: 2.279 ± 0.763
ArgGln: 3.419 ± 1.827
ArgArg: 5.128 ± 2.668
ArgSer: 1.709 ± 0.913
ArgThr: 1.709 ± 0.913
ArgVal: 4.558 ± 0.893
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.57 ± 0.397
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.709 ± 0.748
SerCys: 1.709 ± 0.915
SerAsp: 3.419 ± 1.453
SerGlu: 1.709 ± 0.888
SerPhe: 3.989 ± 1.64
SerGly: 3.419 ± 0.701
SerHis: 1.14 ± 0.713
SerIle: 2.849 ± 1.156
SerLys: 2.849 ± 1.564
SerLeu: 4.558 ± 2.37
SerMet: 1.14 ± 0.489
SerAsn: 1.709 ± 0.626
SerPro: 1.14 ± 0.489
SerGln: 4.558 ± 1.759
SerArg: 1.709 ± 0.913
SerSer: 2.849 ± 0.857
SerThr: 2.849 ± 1.036
SerVal: 5.698 ± 1.853
SerTrp: 0.0 ± 0.0
SerTyr: 1.14 ± 0.834
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.128 ± 1.821
ThrCys: 2.849 ± 1.924
ThrAsp: 0.57 ± 0.397
ThrGlu: 2.849 ± 0.912
ThrPhe: 1.709 ± 1.041
ThrGly: 5.128 ± 2.641
ThrHis: 0.57 ± 0.397
ThrIle: 2.849 ± 0.912
ThrLys: 2.279 ± 1.455
ThrLeu: 7.977 ± 1.233
ThrMet: 3.989 ± 1.248
ThrAsn: 2.849 ± 0.487
ThrPro: 2.849 ± 1.349
ThrGln: 6.268 ± 1.147
ThrArg: 2.849 ± 1.465
ThrSer: 3.419 ± 0.642
ThrThr: 3.989 ± 1.223
ThrVal: 5.698 ± 1.825
ThrTrp: 0.57 ± 0.703
ThrTyr: 1.14 ± 0.969
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.419 ± 0.916
ValCys: 0.57 ± 0.484
ValAsp: 2.849 ± 1.349
ValGlu: 5.698 ± 1.825
ValPhe: 4.558 ± 1.079
ValGly: 6.268 ± 3.333
ValHis: 3.989 ± 1.734
ValIle: 1.709 ± 1.453
ValLys: 2.279 ± 0.977
ValLeu: 5.698 ± 0.871
ValMet: 0.57 ± 0.351
ValAsn: 3.419 ± 1.848
ValPro: 2.849 ± 0.487
ValGln: 3.989 ± 0.95
ValArg: 2.849 ± 0.857
ValSer: 4.558 ± 0.375
ValThr: 5.698 ± 1.825
ValVal: 3.419 ± 1.215
ValTrp: 3.419 ± 0.381
ValTyr: 1.14 ± 0.489
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.558 ± 1.472
TrpCys: 0.57 ± 0.397
TrpAsp: 1.14 ± 0.713
TrpGlu: 3.989 ± 1.142
TrpPhe: 1.709 ± 1.036
TrpGly: 4.558 ± 2.588
TrpHis: 1.14 ± 0.834
TrpIle: 1.14 ± 0.795
TrpLys: 0.57 ± 0.397
TrpLeu: 0.57 ± 0.397
TrpMet: 1.14 ± 0.834
TrpAsn: 1.709 ± 0.626
TrpPro: 1.14 ± 0.489
TrpGln: 1.709 ± 0.626
TrpArg: 0.57 ± 0.703
TrpSer: 1.709 ± 1.036
TrpThr: 1.709 ± 0.913
TrpVal: 1.14 ± 0.834
TrpTrp: 2.279 ± 0.792
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.57 ± 0.484
TyrCys: 0.57 ± 0.703
TyrAsp: 1.14 ± 0.795
TyrGlu: 0.57 ± 0.397
TyrPhe: 1.709 ± 0.888
TyrGly: 3.419 ± 1.325
TyrHis: 1.14 ± 0.713
TyrIle: 0.0 ± 0.0
TyrLys: 0.0 ± 0.0
TyrLeu: 2.279 ± 0.879
TyrMet: 1.14 ± 0.664
TyrAsn: 0.57 ± 0.397
TyrPro: 1.14 ± 0.738
TyrGln: 0.0 ± 0.0
TyrArg: 2.279 ± 1.201
TyrSer: 1.14 ± 0.721
TyrThr: 3.989 ± 1.505
TyrVal: 2.279 ± 0.615
TyrTrp: 0.57 ± 0.703
TyrTyr: 1.14 ± 0.834
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1756 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
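A protein-level bootstrap of this kind could be sketched as below. The sketch assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 resamples, and that the reported error is the standard deviation of those estimates. The exact procedure used by Proteome-pI is not described here, so the function names, the choice of population standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Per-mille frequency of one two-letter pair over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

# Toy one-letter sequences for illustration only.
toy_proteome = ["MAAKLL", "GLLPDA", "AAGG", "KTLLE", "MLPDP"]
print(pair_frequency(toy_proteome, "LL"))
print(bootstrap_error(toy_proteome, "LL"))
```

With only five proteins, as in this proteome, each resample can differ substantially from the original set, which is one plausible reason the errors in the table are large relative to the values.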

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski