Amino acid dipeptide frequency for Bovine polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
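A minimal sketch of such a calculation is given below. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of counted pairs are illustrative assumptions; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Overlapping pairs are counted within each sequence only (never across
    protein boundaries). Any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MKLLA", "LLE"])
```

In this toy input six pairs are counted in total and `LL` occurs twice, so it receives one third of the total frequency.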

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
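The expectation and the over/under labelling can be sketched as follows. The helper names and the rule of comparing each value directly with the uniform expectation (ignoring the error estimate) are assumptions for illustration; the page does not state the exact threshold used for colouring. The four example values are taken from the table.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

def expected_per_mille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"

expected = expected_per_mille(N_STANDARD)  # 1000 / 400 = 2.5 per mille = 0.25%

observed = {"GluGlu": 16.04, "LeuLeu": 16.04, "AlaGln": 1.106, "CysAla": 0.0}
labels = {name: classify(value, expected) for name, value in observed.items()}
```

With 21 symbols the same function gives 1000 / 441, roughly 2.27‰, as the uniform expectation.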

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
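As a small worked example of this conversion (the helper names are hypothetical; the value is the AlaAla entry from the table):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0

ala_ala = 5.531                             # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)   # about 0.005531
percent = per_mille_to_percent(ala_ala)     # about 0.5531 %
```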

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.531 ± 1.141
AlaCys: 2.765 ± 1.499
AlaAsp: 2.765 ± 1.224
AlaGlu: 7.743 ± 4.233
AlaPhe: 3.319 ± 1.47
AlaGly: 3.319 ± 0.536
AlaHis: 1.659 ± 0.655
AlaIle: 1.659 ± 0.978
AlaLys: 3.872 ± 0.964
AlaLeu: 7.19 ± 2.289
AlaMet: 2.765 ± 0.91
AlaAsn: 3.319 ± 2.016
AlaPro: 4.425 ± 0.936
AlaGln: 1.106 ± 0.795
AlaArg: 3.319 ± 1.596
AlaSer: 2.765 ± 0.643
AlaThr: 4.425 ± 2.381
AlaVal: 3.872 ± 0.648
AlaTrp: 1.106 ± 0.795
AlaTyr: 2.765 ± 0.946
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.106 ± 0.533
CysAsp: 1.659 ± 1.091
CysGlu: 0.553 ± 0.546
CysPhe: 0.553 ± 0.364
CysGly: 1.659 ± 0.74
CysHis: 0.553 ± 0.546
CysIle: 2.212 ± 0.892
CysLys: 2.765 ± 1.239
CysLeu: 1.659 ± 1.566
CysMet: 0.0 ± 0.0
CysAsn: 1.106 ± 0.728
CysPro: 1.106 ± 0.538
CysGln: 0.0 ± 0.0
CysArg: 0.553 ± 0.522
CysSer: 1.659 ± 0.749
CysThr: 2.212 ± 1.455
CysVal: 0.553 ± 0.364
CysTrp: 0.553 ± 0.546
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.553 ± 0.522
AspCys: 0.0 ± 0.0
AspAsp: 2.212 ± 0.999
AspGlu: 3.319 ± 0.594
AspPhe: 2.765 ± 1.007
AspGly: 1.659 ± 0.609
AspHis: 1.106 ± 0.728
AspIle: 4.978 ± 1.073
AspLys: 3.319 ± 1.672
AspLeu: 3.872 ± 1.009
AspMet: 1.659 ± 0.74
AspAsn: 2.212 ± 0.713
AspPro: 3.319 ± 1.218
AspGln: 1.106 ± 0.538
AspArg: 2.212 ± 1.591
AspSer: 2.212 ± 0.713
AspThr: 2.212 ± 1.193
AspVal: 2.212 ± 0.713
AspTrp: 1.106 ± 0.795
AspTyr: 0.553 ± 0.522
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.743 ± 2.5
GluCys: 1.106 ± 0.538
GluAsp: 2.212 ± 1.077
GluGlu: 16.04 ± 2.897
GluPhe: 1.659 ± 1.091
GluGly: 2.765 ± 1.128
GluHis: 2.212 ± 0.737
GluIle: 1.106 ± 0.707
GluLys: 4.978 ± 1.044
GluLeu: 10.509 ± 1.655
GluMet: 1.659 ± 0.451
GluAsn: 6.084 ± 0.945
GluPro: 2.212 ± 0.451
GluGln: 3.872 ± 1.365
GluArg: 1.659 ± 1.091
GluSer: 1.106 ± 0.707
GluThr: 3.872 ± 1.146
GluVal: 6.084 ± 1.311
GluTrp: 0.0 ± 0.0
GluTyr: 5.531 ± 0.859
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.659 ± 0.609
PheCys: 0.553 ± 0.522
PheAsp: 0.553 ± 0.364
PheGlu: 4.425 ± 1.606
PhePhe: 2.212 ± 0.451
PheGly: 1.659 ± 1.021
PheHis: 0.0 ± 0.0
PheIle: 2.212 ± 0.586
PheLys: 0.553 ± 0.546
PheLeu: 4.978 ± 1.299
PheMet: 0.0 ± 0.0
PheAsn: 2.212 ± 1.05
PhePro: 2.765 ± 0.866
PheGln: 1.659 ± 0.931
PheArg: 3.319 ± 1.175
PheSer: 0.553 ± 0.546
PheThr: 4.425 ± 1.52
PheVal: 1.106 ± 0.795
PheTrp: 0.553 ± 0.738
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.084 ± 2.7
GlyCys: 0.0 ± 0.0
GlyAsp: 6.084 ± 2.161
GlyGlu: 3.319 ± 1.226
GlyPhe: 2.765 ± 1.025
GlyGly: 5.531 ± 1.501
GlyHis: 0.0 ± 0.0
GlyIle: 3.872 ± 1.519
GlyLys: 3.872 ± 2.165
GlyLeu: 8.296 ± 0.766
GlyMet: 1.659 ± 0.675
GlyAsn: 2.765 ± 2.192
GlyPro: 2.212 ± 1.027
GlyGln: 2.212 ± 2.182
GlyArg: 2.212 ± 1.591
GlySer: 2.765 ± 0.849
GlyThr: 3.872 ± 0.598
GlyVal: 5.531 ± 2.661
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.106 ± 0.928
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.553 ± 0.364
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.106 ± 0.707
HisGly: 0.553 ± 0.546
HisHis: 2.765 ± 0.794
HisIle: 0.553 ± 0.514
HisLys: 0.0 ± 0.0
HisLeu: 3.319 ± 1.35
HisMet: 0.0 ± 0.0
HisAsn: 3.872 ± 2.213
HisPro: 3.319 ± 0.507
HisGln: 0.553 ± 0.364
HisArg: 1.659 ± 1.091
HisSer: 1.106 ± 0.825
HisThr: 3.319 ± 1.35
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.637 ± 2.072
IleCys: 2.212 ± 1.234
IleAsp: 2.765 ± 1.286
IleGlu: 3.319 ± 1.479
IlePhe: 4.425 ± 1.264
IleGly: 4.425 ± 1.443
IleHis: 1.106 ± 0.795
IleIle: 2.212 ± 0.95
IleLys: 3.319 ± 0.507
IleLeu: 4.978 ± 1.649
IleMet: 0.0 ± 0.0
IleAsn: 1.659 ± 1.021
IlePro: 1.106 ± 0.538
IleGln: 1.659 ± 0.74
IleArg: 0.553 ± 0.364
IleSer: 3.872 ± 0.631
IleThr: 2.212 ± 0.586
IleVal: 3.319 ± 0.797
IleTrp: 0.0 ± 0.0
IleTyr: 1.659 ± 0.866
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.531 ± 2.964
LysCys: 2.212 ± 1.05
LysAsp: 1.106 ± 0.538
LysGlu: 2.765 ± 1.381
LysPhe: 0.553 ± 0.546
LysGly: 6.637 ± 1.602
LysHis: 2.765 ± 1.023
LysIle: 3.319 ± 2.175
LysLys: 4.425 ± 1.778
LysLeu: 3.872 ± 1.98
LysMet: 2.212 ± 0.67
LysAsn: 4.425 ± 2.611
LysPro: 1.659 ± 1.091
LysGln: 1.106 ± 0.538
LysArg: 6.637 ± 1.23
LysSer: 4.425 ± 0.857
LysThr: 2.765 ± 1.362
LysVal: 3.872 ± 1.18
LysTrp: 0.553 ± 0.364
LysTyr: 1.659 ± 1.039
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.531 ± 2.084
LeuCys: 2.212 ± 0.713
LeuAsp: 5.531 ± 1.174
LeuGlu: 5.531 ± 1.294
LeuPhe: 4.425 ± 0.873
LeuGly: 6.637 ± 1.736
LeuHis: 2.765 ± 0.946
LeuIle: 4.978 ± 1.08
LeuLys: 7.743 ± 2.637
LeuLeu: 16.04 ± 2.239
LeuMet: 2.212 ± 0.892
LeuAsn: 4.425 ± 0.585
LeuPro: 6.637 ± 2.125
LeuGln: 4.425 ± 1.106
LeuArg: 4.425 ± 1.784
LeuSer: 1.659 ± 0.942
LeuThr: 7.743 ± 1.244
LeuVal: 2.765 ± 0.767
LeuTrp: 0.0 ± 0.0
LeuTyr: 9.403 ± 1.845
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.872 ± 0.964
MetCys: 0.553 ± 0.522
MetAsp: 0.553 ± 0.546
MetGlu: 2.212 ± 1.193
MetPhe: 0.553 ± 0.522
MetGly: 1.106 ± 0.572
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 3.319 ± 0.937
MetLeu: 0.553 ± 0.522
MetMet: 0.553 ± 0.522
MetAsn: 0.553 ± 0.546
MetPro: 1.106 ± 0.538
MetGln: 0.0 ± 0.0
MetArg: 1.106 ± 0.928
MetSer: 1.659 ± 1.332
MetThr: 0.0 ± 0.0
MetVal: 1.106 ± 0.538
MetTrp: 1.659 ± 0.74
MetTyr: 1.106 ± 0.728
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.319 ± 1.826
AsnCys: 2.212 ± 1.05
AsnAsp: 0.553 ± 0.546
AsnGlu: 4.425 ± 0.669
AsnPhe: 1.106 ± 0.795
AsnGly: 1.106 ± 0.538
AsnHis: 1.106 ± 0.795
AsnIle: 3.872 ± 1.18
AsnLys: 5.531 ± 1.197
AsnLeu: 4.978 ± 1.797
AsnMet: 3.319 ± 1.227
AsnAsn: 1.659 ± 1.039
AsnPro: 4.978 ± 0.979
AsnGln: 2.212 ± 0.586
AsnArg: 0.553 ± 0.364
AsnSer: 2.212 ± 1.626
AsnThr: 2.212 ± 1.546
AsnVal: 1.659 ± 0.675
AsnTrp: 0.553 ± 0.364
AsnTyr: 2.765 ± 0.388
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.659 ± 0.675
ProCys: 0.553 ± 0.364
ProAsp: 7.19 ± 1.084
ProGlu: 2.765 ± 1.381
ProPhe: 1.106 ± 0.795
ProGly: 4.425 ± 1.154
ProHis: 0.553 ± 0.738
ProIle: 1.106 ± 1.091
ProLys: 3.872 ± 2.141
ProLeu: 4.978 ± 1.095
ProMet: 0.0 ± 0.0
ProAsn: 1.106 ± 0.538
ProPro: 7.19 ± 1.503
ProGln: 3.872 ± 2.213
ProArg: 2.765 ± 1.539
ProSer: 4.425 ± 1.169
ProThr: 1.659 ± 0.993
ProVal: 4.978 ± 1.529
ProTrp: 0.0 ± 0.0
ProTyr: 3.319 ± 1.443
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.319 ± 1.729
GlnCys: 0.0 ± 0.0
GlnAsp: 2.212 ± 0.737
GlnGlu: 3.872 ± 1.231
GlnPhe: 2.212 ± 1.546
GlnGly: 3.319 ± 1.479
GlnHis: 1.106 ± 0.795
GlnIle: 2.765 ± 1.429
GlnLys: 2.212 ± 0.737
GlnLeu: 1.659 ± 0.721
GlnMet: 0.0 ± 0.0
GlnAsn: 1.659 ± 0.931
GlnPro: 2.212 ± 1.002
GlnGln: 1.106 ± 1.091
GlnArg: 2.765 ± 0.388
GlnSer: 3.872 ± 1.496
GlnThr: 0.553 ± 0.364
GlnVal: 2.212 ± 1.234
GlnTrp: 1.659 ± 0.749
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.106 ± 0.795
ArgCys: 0.0 ± 0.0
ArgAsp: 0.553 ± 0.364
ArgGlu: 2.765 ± 1.776
ArgPhe: 1.659 ± 0.963
ArgGly: 3.872 ± 0.692
ArgHis: 0.553 ± 0.364
ArgIle: 2.765 ± 0.678
ArgLys: 2.212 ± 1.027
ArgLeu: 4.425 ± 0.859
ArgMet: 2.212 ± 1.077
ArgAsn: 1.659 ± 0.83
ArgPro: 0.0 ± 0.0
ArgGln: 4.425 ± 3.181
ArgArg: 3.872 ± 1.365
ArgSer: 2.765 ± 1.429
ArgThr: 5.531 ± 1.592
ArgVal: 1.659 ± 0.675
ArgTrp: 2.765 ± 1.429
ArgTyr: 3.872 ± 2.563
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.637 ± 2.053
SerCys: 0.553 ± 0.546
SerAsp: 2.765 ± 1.473
SerGlu: 2.765 ± 0.756
SerPhe: 1.106 ± 0.728
SerGly: 3.319 ± 1.477
SerHis: 1.106 ± 0.795
SerIle: 4.425 ± 1.44
SerLys: 3.872 ± 0.604
SerLeu: 4.978 ± 1.439
SerMet: 0.0 ± 0.0
SerAsn: 1.106 ± 0.538
SerPro: 2.765 ± 0.739
SerGln: 4.425 ± 1.778
SerArg: 3.319 ± 1.443
SerSer: 5.531 ± 2.09
SerThr: 3.319 ± 1.986
SerVal: 3.319 ± 0.978
SerTrp: 0.0 ± 0.0
SerTyr: 1.106 ± 0.795
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.084 ± 2.675
ThrCys: 1.659 ± 1.021
ThrAsp: 2.765 ± 0.739
ThrGlu: 4.978 ± 1.353
ThrPhe: 1.106 ± 0.728
ThrGly: 3.319 ± 0.967
ThrHis: 0.0 ± 0.0
ThrIle: 4.978 ± 1.494
ThrLys: 0.553 ± 0.364
ThrLeu: 4.425 ± 1.408
ThrMet: 0.553 ± 0.364
ThrAsn: 2.765 ± 0.739
ThrPro: 3.872 ± 0.631
ThrGln: 1.659 ± 0.74
ThrArg: 3.319 ± 1.175
ThrSer: 6.084 ± 1.668
ThrThr: 6.637 ± 1.246
ThrVal: 4.978 ± 1.609
ThrTrp: 1.106 ± 0.529
ThrTyr: 1.106 ± 0.538
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.765 ± 1.491
ValCys: 1.659 ± 0.749
ValAsp: 0.553 ± 0.364
ValGlu: 5.531 ± 1.213
ValPhe: 1.106 ± 0.529
ValGly: 4.978 ± 0.875
ValHis: 1.106 ± 0.757
ValIle: 0.553 ± 0.364
ValLys: 2.765 ± 0.953
ValLeu: 5.531 ± 1.733
ValMet: 1.106 ± 0.757
ValAsn: 3.872 ± 0.916
ValPro: 4.425 ± 0.901
ValGln: 1.659 ± 0.74
ValArg: 2.212 ± 1.002
ValSer: 4.425 ± 0.901
ValThr: 3.319 ± 1.302
ValVal: 2.765 ± 1.241
ValTrp: 2.212 ± 1.157
ValTyr: 1.659 ± 0.721
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.106 ± 0.795
TrpCys: 0.553 ± 0.546
TrpAsp: 0.0 ± 0.0
TrpGlu: 3.872 ± 1.009
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.659 ± 0.748
TrpLeu: 3.872 ± 0.964
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.553 ± 0.364
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.106 ± 0.795
TrpVal: 0.553 ± 0.522
TrpTrp: 0.553 ± 0.364
TrpTyr: 0.553 ± 0.738
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.553 ± 0.522
TyrCys: 0.0 ± 0.0
TyrAsp: 0.553 ± 0.364
TyrGlu: 2.212 ± 0.804
TyrPhe: 1.659 ± 0.866
TyrGly: 3.872 ± 1.347
TyrHis: 2.212 ± 1.002
TyrIle: 3.872 ± 0.692
TyrLys: 1.659 ± 0.74
TyrLeu: 4.978 ± 1.742
TyrMet: 1.106 ± 0.538
TyrAsn: 4.425 ± 1.473
TyrPro: 2.212 ± 0.849
TyrGln: 1.106 ± 0.538
TyrArg: 2.212 ± 1.591
TyrSer: 3.319 ± 1.443
TyrThr: 0.553 ± 0.546
TyrVal: 1.659 ± 0.675
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.659 ± 0.721
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1809 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
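A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function names, the toy sequences, the normalisation by the number of counted pairs, and the use of the standard deviation across replicates as the error are not confirmed by this page.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Std. dev. (per mille) of the frequency of `pair` over resamples.

    Whole proteins are resampled with replacement, so every replicate
    contains as many proteins as the original set.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "LLE", "AAAA"]
err_ll = bootstrap_error(toy, "LL")
err_ww = bootstrap_error(toy, "WW")  # pair never observed
```

A dipeptide that never occurs has frequency zero in every replicate and therefore an error of zero, which matches the "0.0 ± 0.0" entries in the table.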

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski