Amino acid dipeptide frequency for Cebus albifrons polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
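The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used to build the table: the function names, the one-letter alphabet (with X standing in for Xaa), the choice to count overlapping pairs within each protein only, and the toy sequences are all hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X plays the role of Xaa (assumption)

# Uniform expectation quoted in the text: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs; pairs never span two proteins.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def classify(freq, expected=EXPECTED_PER_MILLE):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy_proteins = ["MKLLAT", "LLGTV"]  # made-up sequences, not this proteome
    freqs = dipeptide_per_mille(toy_proteins)
    print(freqs["LL"], classify(freqs["LL"]))
```

In this sketch "over" and "under" correspond to the red and blue markings mentioned above, using the uniform 2.5‰ value as the only reference point.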

All values are given in per mille (‰); multiply by 10⁻³ to obtain a fraction.
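As a small worked example of the unit conversion, the snippet below converts one table value (LeuLeu, 14.056‰) to a fraction and a percentage, and expresses the uniform expectation of 1/400 in per mille. The helper name is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


leu_leu_per_mille = 14.056  # LeuLeu entry from the table
leu_leu_fraction = per_mille_to_fraction(leu_leu_per_mille)
leu_leu_percent = leu_leu_fraction * 100

# Uniform expectation over 400 dipeptides, in per mille.
uniform_per_mille = 1000 / 400

print(leu_leu_fraction, leu_leu_percent, uniform_per_mille)
```

Read this way, LeuLeu accounts for roughly 1.4% of all dipeptides, several times the 0.25% expected if all 400 dipeptides were equally likely.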

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.693 ± 4.622
AlaCys: 0.669 ± 0.431
AlaAsp: 0.669 ± 0.616
AlaGlu: 6.693 ± 3.587
AlaPhe: 4.685 ± 0.674
AlaGly: 0.669 ± 0.431
AlaHis: 1.339 ± 1.438
AlaIle: 4.685 ± 0.784
AlaLys: 5.355 ± 1.663
AlaLeu: 6.693 ± 2.942
AlaMet: 0.669 ± 0.616
AlaAsn: 4.016 ± 1.977
AlaPro: 2.677 ± 1.105
AlaGln: 2.677 ± 1.105
AlaArg: 1.339 ± 1.438
AlaSer: 2.008 ± 1.182
AlaThr: 4.016 ± 2.62
AlaVal: 7.363 ± 4.292
AlaTrp: 0.669 ± 0.616
AlaTyr: 0.669 ± 0.719
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.339 ± 0.572
CysCys: 0.0 ± 0.0
CysAsp: 1.339 ± 0.862
CysGlu: 2.677 ± 0.955
CysPhe: 2.677 ± 1.967
CysGly: 0.669 ± 0.431
CysHis: 0.669 ± 0.431
CysIle: 1.339 ± 0.572
CysLys: 3.347 ± 1.069
CysLeu: 1.339 ± 2.228
CysMet: 0.0 ± 0.0
CysAsn: 0.669 ± 0.431
CysPro: 0.0 ± 0.0
CysGln: 0.669 ± 0.431
CysArg: 0.669 ± 1.114
CysSer: 1.339 ± 0.862
CysThr: 1.339 ± 0.983
CysVal: 0.669 ± 0.431
CysTrp: 0.669 ± 0.431
CysTyr: 2.677 ± 2.064
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.677 ± 1.318
AspCys: 2.008 ± 1.031
AspAsp: 4.016 ± 1.229
AspGlu: 3.347 ± 1.54
AspPhe: 2.008 ± 0.85
AspGly: 2.677 ± 1.144
AspHis: 4.016 ± 1.249
AspIle: 4.016 ± 0.53
AspLys: 2.677 ± 0.955
AspLeu: 5.355 ± 2.21
AspMet: 3.347 ± 1.327
AspAsn: 0.669 ± 0.431
AspPro: 2.677 ± 1.725
AspGln: 5.355 ± 1.91
AspArg: 0.669 ± 0.719
AspSer: 2.008 ± 0.804
AspThr: 1.339 ± 0.659
AspVal: 1.339 ± 0.659
AspTrp: 1.339 ± 1.226
AspTyr: 2.008 ± 0.804
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.693 ± 2.702
GluCys: 1.339 ± 0.572
GluAsp: 4.016 ± 1.076
GluGlu: 6.693 ± 1.58
GluPhe: 1.339 ± 0.862
GluGly: 2.677 ± 1.156
GluHis: 2.008 ± 2.23
GluIle: 0.669 ± 1.114
GluLys: 4.685 ± 1.18
GluLeu: 7.363 ± 3.389
GluMet: 0.669 ± 0.431
GluAsn: 5.355 ± 2.288
GluPro: 2.677 ± 1.144
GluGln: 2.677 ± 0.955
GluArg: 0.0 ± 0.0
GluSer: 3.347 ± 0.565
GluThr: 4.016 ± 1.217
GluVal: 7.363 ± 1.982
GluTrp: 0.669 ± 0.431
GluTyr: 3.347 ± 0.854
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.347 ± 0.807
PheCys: 2.008 ± 1.031
PheAsp: 3.347 ± 2.154
PheGlu: 4.016 ± 1.076
PhePhe: 0.669 ± 0.616
PheGly: 2.008 ± 1.627
PheHis: 2.008 ± 0.507
PheIle: 2.677 ± 1.156
PheLys: 2.008 ± 0.804
PheLeu: 6.024 ± 2.817
PheMet: 0.0 ± 0.0
PheAsn: 2.677 ± 1.238
PhePro: 5.355 ± 0.43
PheGln: 1.339 ± 0.76
PheArg: 2.008 ± 1.026
PheSer: 2.008 ± 1.108
PheThr: 5.355 ± 1.148
PheVal: 4.016 ± 1.7
PheTrp: 0.669 ± 0.719
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.677 ± 2.008
GlyCys: 0.669 ± 0.431
GlyAsp: 3.347 ± 1.12
GlyGlu: 1.339 ± 0.76
GlyPhe: 2.008 ± 1.847
GlyGly: 6.693 ± 1.115
GlyHis: 0.0 ± 0.0
GlyIle: 3.347 ± 1.327
GlyLys: 1.339 ± 0.862
GlyLeu: 8.032 ± 0.508
GlyMet: 0.669 ± 0.616
GlyAsn: 2.008 ± 1.026
GlyPro: 2.677 ± 1.699
GlyGln: 4.685 ± 1.304
GlyArg: 1.339 ± 1.226
GlySer: 3.347 ± 1.947
GlyThr: 2.677 ± 0.853
GlyVal: 4.685 ± 1.18
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.669 ± 0.431
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.669 ± 0.431
HisAsp: 1.339 ± 1.438
HisGlu: 1.339 ± 0.659
HisPhe: 2.008 ± 0.804
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.669 ± 0.719
HisLys: 2.008 ± 1.031
HisLeu: 2.008 ± 1.292
HisMet: 0.0 ± 0.0
HisAsn: 0.669 ± 0.431
HisPro: 2.008 ± 1.026
HisGln: 0.0 ± 0.0
HisArg: 1.339 ± 0.862
HisSer: 1.339 ± 0.76
HisThr: 2.008 ± 2.057
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.339 ± 0.862
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.677 ± 0.853
IleCys: 0.669 ± 0.431
IleAsp: 2.008 ± 0.85
IleGlu: 3.347 ± 0.807
IlePhe: 5.355 ± 2.021
IleGly: 2.008 ± 1.228
IleHis: 0.0 ± 0.0
IleIle: 1.339 ± 1.232
IleLys: 1.339 ± 0.862
IleLeu: 8.032 ± 1.539
IleMet: 1.339 ± 0.761
IleAsn: 2.677 ± 1.156
IlePro: 4.016 ± 1.076
IleGln: 1.339 ± 1.438
IleArg: 0.669 ± 0.616
IleSer: 6.024 ± 1.394
IleThr: 4.016 ± 1.593
IleVal: 0.669 ± 0.616
IleTrp: 0.0 ± 0.0
IleTyr: 2.008 ± 0.85
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.347 ± 1.919
LysCys: 2.677 ± 1.967
LysAsp: 2.677 ± 1.238
LysGlu: 2.677 ± 1.156
LysPhe: 2.677 ± 1.723
LysGly: 4.016 ± 1.32
LysHis: 3.347 ± 2.154
LysIle: 2.677 ± 1.377
LysLys: 6.693 ± 2.121
LysLeu: 7.363 ± 2.303
LysMet: 2.677 ± 1.059
LysAsn: 2.677 ± 1.156
LysPro: 2.008 ± 0.804
LysGln: 2.677 ± 0.955
LysArg: 6.693 ± 1.568
LysSer: 5.355 ± 1.522
LysThr: 4.685 ± 1.879
LysVal: 2.008 ± 1.031
LysTrp: 0.669 ± 0.431
LysTyr: 2.677 ± 1.156
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.363 ± 5.321
LeuCys: 3.347 ± 1.327
LeuAsp: 5.355 ± 1.605
LeuGlu: 6.693 ± 1.347
LeuPhe: 4.016 ± 0.53
LeuGly: 2.677 ± 0.916
LeuHis: 1.339 ± 0.983
LeuIle: 7.363 ± 2.344
LeuLys: 6.024 ± 1.98
LeuLeu: 14.056 ± 2.359
LeuMet: 4.016 ± 1.852
LeuAsn: 9.371 ± 2.206
LeuPro: 7.363 ± 1.474
LeuGln: 5.355 ± 2.359
LeuArg: 2.008 ± 0.507
LeuSer: 6.024 ± 0.977
LeuThr: 9.371 ± 1.851
LeuVal: 4.016 ± 1.234
LeuTrp: 0.669 ± 1.114
LeuTyr: 4.016 ± 1.217
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.669 ± 0.719
MetCys: 0.669 ± 0.431
MetAsp: 2.008 ± 1.031
MetGlu: 3.347 ± 1.969
MetPhe: 0.0 ± 0.0
MetGly: 2.677 ± 0.853
MetHis: 0.0 ± 0.0
MetIle: 1.339 ± 0.983
MetLys: 2.008 ± 0.804
MetLeu: 2.008 ± 1.026
MetMet: 0.0 ± 0.0
MetAsn: 0.669 ± 0.616
MetPro: 2.677 ± 1.144
MetGln: 0.0 ± 0.0
MetArg: 2.008 ± 1.026
MetSer: 0.0 ± 0.0
MetThr: 0.669 ± 0.616
MetVal: 0.0 ± 0.0
MetTrp: 0.669 ± 0.616
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.339 ± 0.862
AsnCys: 1.339 ± 0.983
AsnAsp: 1.339 ± 0.572
AsnGlu: 2.677 ± 1.144
AsnPhe: 4.016 ± 1.32
AsnGly: 0.669 ± 0.616
AsnHis: 0.0 ± 0.0
AsnIle: 4.685 ± 0.575
AsnLys: 2.677 ± 1.723
AsnLeu: 6.693 ± 1.588
AsnMet: 2.677 ± 0.914
AsnAsn: 0.0 ± 0.0
AsnPro: 2.677 ± 0.916
AsnGln: 2.008 ± 0.85
AsnArg: 0.0 ± 0.0
AsnSer: 1.339 ± 0.572
AsnThr: 4.685 ± 1.286
AsnVal: 2.677 ± 1.156
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.677 ± 1.318
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.677 ± 0.916
ProCys: 0.669 ± 0.616
ProAsp: 6.024 ± 1.715
ProGlu: 1.339 ± 0.572
ProPhe: 2.677 ± 0.889
ProGly: 4.016 ± 1.014
ProHis: 0.0 ± 0.0
ProIle: 2.008 ± 0.507
ProLys: 3.347 ± 1.232
ProLeu: 5.355 ± 1.373
ProMet: 2.008 ± 1.026
ProAsn: 1.339 ± 0.572
ProPro: 4.016 ± 1.324
ProGln: 1.339 ± 1.438
ProArg: 2.677 ± 1.725
ProSer: 2.008 ± 1.182
ProThr: 4.685 ± 1.944
ProVal: 4.685 ± 2.211
ProTrp: 0.0 ± 0.0
ProTyr: 1.339 ± 0.76
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.016 ± 1.653
GlnCys: 0.669 ± 0.431
GlnAsp: 1.339 ± 0.572
GlnGlu: 3.347 ± 0.565
GlnPhe: 1.339 ± 0.572
GlnGly: 1.339 ± 0.572
GlnHis: 0.669 ± 0.431
GlnIle: 0.669 ± 0.431
GlnLys: 4.685 ± 1.582
GlnLeu: 4.685 ± 0.575
GlnMet: 0.0 ± 0.0
GlnAsn: 1.339 ± 1.438
GlnPro: 2.008 ± 0.804
GlnGln: 2.008 ± 1.031
GlnArg: 2.008 ± 2.157
GlnSer: 5.355 ± 1.953
GlnThr: 2.008 ± 1.31
GlnVal: 4.685 ± 2.469
GlnTrp: 1.339 ± 0.659
GlnTyr: 2.677 ± 0.955
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 0.0 ± 0.0
ArgAsp: 2.677 ± 0.955
ArgGlu: 2.677 ± 1.577
ArgPhe: 2.677 ± 0.555
ArgGly: 0.669 ± 0.616
ArgHis: 0.669 ± 0.431
ArgIle: 1.339 ± 0.659
ArgLys: 4.016 ± 0.53
ArgLeu: 2.677 ± 0.853
ArgMet: 1.339 ± 1.251
ArgAsn: 2.008 ± 1.228
ArgPro: 1.339 ± 1.226
ArgGln: 0.669 ± 0.719
ArgArg: 2.008 ± 0.85
ArgSer: 2.008 ± 1.108
ArgThr: 1.339 ± 1.251
ArgVal: 2.008 ± 0.85
ArgTrp: 0.669 ± 0.719
ArgTyr: 5.355 ± 1.704
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.677 ± 1.144
SerCys: 0.0 ± 0.0
SerAsp: 2.008 ± 0.85
SerGlu: 4.016 ± 1.608
SerPhe: 3.347 ± 1.327
SerGly: 4.016 ± 0.914
SerHis: 0.669 ± 0.431
SerIle: 1.339 ± 0.76
SerLys: 4.016 ± 0.779
SerLeu: 7.363 ± 2.103
SerMet: 0.0 ± 0.0
SerAsn: 3.347 ± 1.54
SerPro: 2.008 ± 1.182
SerGln: 4.685 ± 1.944
SerArg: 2.008 ± 0.85
SerSer: 2.008 ± 1.292
SerThr: 3.347 ± 2.063
SerVal: 6.024 ± 2.263
SerTrp: 2.677 ± 0.889
SerTyr: 0.669 ± 0.616
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.032 ± 4.102
ThrCys: 2.677 ± 3.374
ThrAsp: 5.355 ± 1.52
ThrGlu: 5.355 ± 1.445
ThrPhe: 3.347 ± 1.002
ThrGly: 5.355 ± 1.221
ThrHis: 0.0 ± 0.0
ThrIle: 4.016 ± 1.014
ThrLys: 4.016 ± 1.608
ThrLeu: 7.363 ± 1.523
ThrMet: 0.669 ± 0.616
ThrAsn: 2.008 ± 1.182
ThrPro: 3.347 ± 0.565
ThrGln: 2.008 ± 1.292
ThrArg: 2.677 ± 1.238
ThrSer: 4.016 ± 2.62
ThrThr: 5.355 ± 0.653
ThrVal: 4.016 ± 1.014
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.008 ± 0.981
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.355 ± 1.11
ValCys: 1.339 ± 0.983
ValAsp: 1.339 ± 0.659
ValGlu: 4.685 ± 1.901
ValPhe: 2.008 ± 0.507
ValGly: 4.685 ± 2.626
ValHis: 0.669 ± 0.616
ValIle: 2.008 ± 1.292
ValLys: 5.355 ± 1.663
ValLeu: 4.016 ± 1.653
ValMet: 0.669 ± 0.398
ValAsn: 2.677 ± 1.176
ValPro: 0.669 ± 0.616
ValGln: 2.677 ± 1.377
ValArg: 4.016 ± 1.778
ValSer: 4.685 ± 1.297
ValThr: 7.363 ± 2.637
ValVal: 2.677 ± 1.144
ValTrp: 2.008 ± 1.228
ValTyr: 0.669 ± 0.616
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.339 ± 1.438
TrpCys: 0.669 ± 0.431
TrpAsp: 1.339 ± 0.572
TrpGlu: 1.339 ± 0.76
TrpPhe: 1.339 ± 0.983
TrpGly: 0.669 ± 1.114
TrpHis: 0.669 ± 0.616
TrpIle: 0.669 ± 0.719
TrpLys: 2.008 ± 1.031
TrpLeu: 0.669 ± 0.719
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.339 ± 0.659
TrpArg: 0.669 ± 1.114
TrpSer: 0.669 ± 0.431
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.669 ± 0.431
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.339 ± 0.659
TyrCys: 2.008 ± 1.031
TyrAsp: 2.008 ± 0.981
TyrGlu: 0.0 ± 0.0
TyrPhe: 3.347 ± 1.218
TyrGly: 4.016 ± 0.872
TyrHis: 0.669 ± 0.431
TyrIle: 2.677 ± 0.853
TyrLys: 3.347 ± 1.54
TyrLeu: 3.347 ± 2.154
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 2.008 ± 1.182
TyrGln: 2.677 ± 1.318
TyrArg: 1.339 ± 1.232
TyrSer: 1.339 ± 0.572
TyrThr: 3.347 ± 0.807
TyrVal: 0.669 ± 0.719
TyrTrp: 0.669 ± 0.431
TyrTyr: 1.339 ± 0.76
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1495 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
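One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The source states only the number of rounds (100) and the resampling level, so the function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins, not individual residues or dipeptides.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteins = ["MKLLA", "LLGT", "AKLV", "MLLLK"]  # made-up sequences
    value = pair_per_mille(toy_proteins, "LL")
    error = bootstrap_error(toy_proteins, "LL")
    print(f"LL: {value:.3f} ± {error:.3f}")
```

With only a handful of proteins, each resample can differ a lot from the original set, which is consistent with some errors in the table being comparable in size to the values themselves.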

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski