Amino acid dipepetide frequency for Capybara microvirus Cap1_SP_48

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.386AlaAla: 3.386 ± 0.701
0.564AlaCys: 0.564 ± 0.468
3.386AlaAsp: 3.386 ± 1.125
3.95AlaGlu: 3.95 ± 2.715
3.386AlaPhe: 3.386 ± 0.508
3.386AlaGly: 3.386 ± 1.511
1.129AlaHis: 1.129 ± 0.511
2.822AlaIle: 2.822 ± 0.64
4.515AlaLys: 4.515 ± 1.378
6.772AlaLeu: 6.772 ± 1.866
0.564AlaMet: 0.564 ± 0.429
4.515AlaAsn: 4.515 ± 3.451
2.257AlaPro: 2.257 ± 1.427
6.208AlaGln: 6.208 ± 4.253
1.693AlaArg: 1.693 ± 1.2
2.822AlaSer: 2.822 ± 1.626
3.386AlaThr: 3.386 ± 1.363
2.257AlaVal: 2.257 ± 0.934
1.129AlaTrp: 1.129 ± 0.905
1.693AlaTyr: 1.693 ± 0.904
0.0AlaXaa: 0.0 ± 0.0
Cys
1.129CysAla: 1.129 ± 0.936
0.564CysCys: 0.564 ± 0.468
1.129CysAsp: 1.129 ± 0.626
0.564CysGlu: 0.564 ± 0.614
1.129CysPhe: 1.129 ± 0.511
1.129CysGly: 1.129 ± 0.626
1.129CysHis: 1.129 ± 0.803
0.564CysIle: 0.564 ± 0.468
1.693CysLys: 1.693 ± 0.93
0.564CysLeu: 0.564 ± 0.468
0.564CysMet: 0.564 ± 0.452
1.129CysAsn: 1.129 ± 0.936
0.564CysPro: 0.564 ± 0.468
0.564CysGln: 0.564 ± 0.468
1.129CysArg: 1.129 ± 0.511
1.693CysSer: 1.693 ± 0.93
0.0CysThr: 0.0 ± 0.0
1.693CysVal: 1.693 ± 0.844
0.0CysTrp: 0.0 ± 0.0
1.129CysTyr: 1.129 ± 0.905
0.0CysXaa: 0.0 ± 0.0
Asp
5.643AspAla: 5.643 ± 1.454
0.564AspCys: 0.564 ± 0.452
2.822AspAsp: 2.822 ± 1.786
3.386AspGlu: 3.386 ± 1.363
6.208AspPhe: 6.208 ± 1.706
2.257AspGly: 2.257 ± 1.285
1.129AspHis: 1.129 ± 0.936
5.079AspIle: 5.079 ± 1.836
2.822AspLys: 2.822 ± 1.074
5.079AspLeu: 5.079 ± 1.85
1.693AspMet: 1.693 ± 1.923
2.257AspAsn: 2.257 ± 0.724
3.95AspPro: 3.95 ± 1.358
1.693AspGln: 1.693 ± 0.844
0.564AspArg: 0.564 ± 0.452
3.95AspSer: 3.95 ± 1.887
1.693AspThr: 1.693 ± 0.904
3.95AspVal: 3.95 ± 2.282
0.0AspTrp: 0.0 ± 0.0
2.257AspTyr: 2.257 ± 1.81
0.0AspXaa: 0.0 ± 0.0
Glu
3.386GluAla: 3.386 ± 1.385
0.564GluCys: 0.564 ± 0.452
2.822GluAsp: 2.822 ± 1.161
2.257GluGlu: 2.257 ± 1.497
3.386GluPhe: 3.386 ± 1.796
4.515GluGly: 4.515 ± 0.896
1.693GluHis: 1.693 ± 1.571
5.643GluIle: 5.643 ± 1.689
1.693GluLys: 1.693 ± 1.646
5.079GluLeu: 5.079 ± 1.597
0.564GluMet: 0.564 ± 0.614
2.822GluAsn: 2.822 ± 1.199
0.0GluPro: 0.0 ± 0.0
1.129GluGln: 1.129 ± 0.842
1.129GluArg: 1.129 ± 0.936
3.386GluSer: 3.386 ± 0.508
4.515GluThr: 4.515 ± 2.571
2.822GluVal: 2.822 ± 0.64
0.564GluTrp: 0.564 ± 0.452
3.386GluTyr: 3.386 ± 1.124
0.0GluXaa: 0.0 ± 0.0
Phe
2.257PheAla: 2.257 ± 1.722
1.129PheCys: 1.129 ± 0.511
5.643PheAsp: 5.643 ± 1.2
2.822PheGlu: 2.822 ± 0.986
3.386PhePhe: 3.386 ± 1.533
4.515PheGly: 4.515 ± 2.509
1.129PheHis: 1.129 ± 0.803
5.079PheIle: 5.079 ± 1.149
3.386PheLys: 3.386 ± 1.306
5.079PheLeu: 5.079 ± 1.605
1.129PheMet: 1.129 ± 0.729
2.822PheAsn: 2.822 ± 0.924
2.257PhePro: 2.257 ± 1.254
2.257PheGln: 2.257 ± 1.147
2.822PheArg: 2.822 ± 1.33
4.515PheSer: 4.515 ± 1.942
4.515PheThr: 4.515 ± 1.478
3.386PheVal: 3.386 ± 2.127
0.0PheTrp: 0.0 ± 0.0
1.693PheTyr: 1.693 ± 0.869
0.0PheXaa: 0.0 ± 0.0
Gly
3.95GlyAla: 3.95 ± 1.278
0.564GlyCys: 0.564 ± 0.452
2.822GlyAsp: 2.822 ± 1.561
3.95GlyGlu: 3.95 ± 1.245
3.95GlyPhe: 3.95 ± 1.518
2.822GlyGly: 2.822 ± 0.986
0.564GlyHis: 0.564 ± 0.614
4.515GlyIle: 4.515 ± 1.453
4.515GlyLys: 4.515 ± 1.073
6.208GlyLeu: 6.208 ± 1.723
1.693GlyMet: 1.693 ± 0.904
1.693GlyAsn: 1.693 ± 1.024
0.0GlyPro: 0.0 ± 0.0
2.822GlyGln: 2.822 ± 0.828
2.822GlyArg: 2.822 ± 1.18
8.465GlySer: 8.465 ± 2.656
0.564GlyThr: 0.564 ± 0.452
5.079GlyVal: 5.079 ± 1.737
0.564GlyTrp: 0.564 ± 0.452
3.386GlyTyr: 3.386 ± 2.133
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.129HisCys: 1.129 ± 0.936
1.129HisAsp: 1.129 ± 0.511
1.693HisGlu: 1.693 ± 1.087
5.643HisPhe: 5.643 ± 1.679
1.129HisGly: 1.129 ± 0.511
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.693HisLeu: 1.693 ± 1.404
0.564HisMet: 0.564 ± 0.614
1.129HisAsn: 1.129 ± 0.511
2.257HisPro: 2.257 ± 0.934
0.564HisGln: 0.564 ± 0.452
0.564HisArg: 0.564 ± 0.468
2.257HisSer: 2.257 ± 0.724
0.564HisThr: 0.564 ± 0.452
2.822HisVal: 2.822 ± 2.017
0.0HisTrp: 0.0 ± 0.0
2.257HisTyr: 2.257 ± 1.299
0.0HisXaa: 0.0 ± 0.0
Ile
2.822IleAla: 2.822 ± 1.016
1.129IleCys: 1.129 ± 0.748
6.208IleAsp: 6.208 ± 2.233
5.643IleGlu: 5.643 ± 2.473
3.386IlePhe: 3.386 ± 1.688
2.257IleGly: 2.257 ± 1.101
0.0IleHis: 0.0 ± 0.0
2.822IleIle: 2.822 ± 1.32
3.95IleLys: 3.95 ± 1.091
6.208IleLeu: 6.208 ± 1.348
1.693IleMet: 1.693 ± 1.923
1.693IleAsn: 1.693 ± 0.974
3.95IlePro: 3.95 ± 2.083
1.693IleGln: 1.693 ± 0.869
2.257IleArg: 2.257 ± 1.045
5.079IleSer: 5.079 ± 1.135
3.386IleThr: 3.386 ± 1.622
1.129IleVal: 1.129 ± 0.905
0.564IleTrp: 0.564 ± 0.452
0.564IleTyr: 0.564 ± 0.468
0.0IleXaa: 0.0 ± 0.0
Lys
5.643LysAla: 5.643 ± 2.343
1.693LysCys: 1.693 ± 0.869
2.822LysAsp: 2.822 ± 0.686
1.129LysGlu: 1.129 ± 0.936
0.564LysPhe: 0.564 ± 0.452
2.257LysGly: 2.257 ± 1.497
1.129LysHis: 1.129 ± 0.905
3.386LysIle: 3.386 ± 0.944
4.515LysLys: 4.515 ± 0.986
5.079LysLeu: 5.079 ± 2.689
2.257LysMet: 2.257 ± 1.12
3.386LysAsn: 3.386 ± 1.738
1.693LysPro: 1.693 ± 0.904
1.129LysGln: 1.129 ± 0.905
2.257LysArg: 2.257 ± 1.294
6.772LysSer: 6.772 ± 1.345
2.822LysThr: 2.822 ± 1.708
5.643LysVal: 5.643 ± 1.519
0.0LysTrp: 0.0 ± 0.0
3.386LysTyr: 3.386 ± 1.555
0.0LysXaa: 0.0 ± 0.0
Leu
3.95LeuAla: 3.95 ± 1.113
1.693LeuCys: 1.693 ± 0.385
5.079LeuAsp: 5.079 ± 1.311
3.95LeuGlu: 3.95 ± 1.146
2.257LeuPhe: 2.257 ± 1.497
9.029LeuGly: 9.029 ± 2.062
3.386LeuHis: 3.386 ± 1.217
3.386LeuIle: 3.386 ± 1.363
4.515LeuLys: 4.515 ± 0.932
5.643LeuLeu: 5.643 ± 1.713
2.822LeuMet: 2.822 ± 1.027
7.336LeuAsn: 7.336 ± 1.674
1.129LeuPro: 1.129 ± 0.511
1.129LeuGln: 1.129 ± 1.282
6.772LeuArg: 6.772 ± 1.413
9.029LeuSer: 9.029 ± 1.825
3.386LeuThr: 3.386 ± 0.546
5.079LeuVal: 5.079 ± 2.021
1.129LeuTrp: 1.129 ± 0.551
3.386LeuTyr: 3.386 ± 1.217
0.0LeuXaa: 0.0 ± 0.0
Met
1.693MetAla: 1.693 ± 1.365
0.0MetCys: 0.0 ± 0.0
1.129MetAsp: 1.129 ± 0.551
0.564MetGlu: 0.564 ± 0.641
1.693MetPhe: 1.693 ± 0.642
1.693MetGly: 1.693 ± 1.365
0.0MetHis: 0.0 ± 0.0
1.693MetIle: 1.693 ± 1.155
0.564MetLys: 0.564 ± 0.452
2.257MetLeu: 2.257 ± 0.472
1.693MetMet: 1.693 ± 0.778
1.129MetAsn: 1.129 ± 1.128
0.564MetPro: 0.564 ± 0.452
1.129MetGln: 1.129 ± 0.551
0.564MetArg: 0.564 ± 0.468
1.129MetSer: 1.129 ± 0.905
1.129MetThr: 1.129 ± 0.842
1.129MetVal: 1.129 ± 1.227
0.564MetTrp: 0.564 ± 0.641
1.129MetTyr: 1.129 ± 0.511
0.0MetXaa: 0.0 ± 0.0
Asn
8.465AsnAla: 8.465 ± 3.384
1.693AsnCys: 1.693 ± 0.93
2.822AsnAsp: 2.822 ± 0.758
2.822AsnGlu: 2.822 ± 1.626
1.693AsnPhe: 1.693 ± 0.778
4.515AsnGly: 4.515 ± 1.073
2.822AsnHis: 2.822 ± 1.33
1.129AsnIle: 1.129 ± 0.842
4.515AsnLys: 4.515 ± 1.138
6.208AsnLeu: 6.208 ± 1.656
0.564AsnMet: 0.564 ± 0.972
3.95AsnAsn: 3.95 ± 1.293
2.257AsnPro: 2.257 ± 1.275
2.822AsnGln: 2.822 ± 1.841
3.95AsnArg: 3.95 ± 1.105
7.336AsnSer: 7.336 ± 1.24
3.95AsnThr: 3.95 ± 1.352
1.693AsnVal: 1.693 ± 0.844
0.0AsnTrp: 0.0 ± 0.0
0.564AsnTyr: 0.564 ± 0.468
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
1.129ProCys: 1.129 ± 0.748
0.0ProAsp: 0.0 ± 0.0
2.257ProGlu: 2.257 ± 0.822
1.129ProPhe: 1.129 ± 0.842
3.95ProGly: 3.95 ± 1.822
1.129ProHis: 1.129 ± 0.936
2.257ProIle: 2.257 ± 0.657
1.129ProLys: 1.129 ± 0.511
1.693ProLeu: 1.693 ± 0.869
0.564ProMet: 0.564 ± 0.452
1.693ProAsn: 1.693 ± 0.642
1.693ProPro: 1.693 ± 1.526
2.257ProGln: 2.257 ± 1.101
1.693ProArg: 1.693 ± 0.974
3.95ProSer: 3.95 ± 2.257
1.693ProThr: 1.693 ± 0.984
4.515ProVal: 4.515 ± 1.543
0.0ProTrp: 0.0 ± 0.0
2.822ProTyr: 2.822 ± 1.347
0.0ProXaa: 0.0 ± 0.0
Gln
2.257GlnAla: 2.257 ± 1.812
0.564GlnCys: 0.564 ± 0.452
0.0GlnAsp: 0.0 ± 0.0
1.693GlnGlu: 1.693 ± 1.2
2.257GlnPhe: 2.257 ± 0.537
2.257GlnGly: 2.257 ± 0.657
1.129GlnHis: 1.129 ± 0.647
3.386GlnIle: 3.386 ± 1.992
3.386GlnLys: 3.386 ± 0.701
2.822GlnLeu: 2.822 ± 0.924
0.0GlnMet: 0.0 ± 0.0
3.95GlnAsn: 3.95 ± 2.421
0.564GlnPro: 0.564 ± 0.452
0.0GlnGln: 0.0 ± 0.0
3.386GlnArg: 3.386 ± 1.5
2.257GlnSer: 2.257 ± 1.684
2.822GlnThr: 2.822 ± 0.924
4.515GlnVal: 4.515 ± 1.664
0.0GlnTrp: 0.0 ± 0.0
1.693GlnTyr: 1.693 ± 0.984
0.0GlnXaa: 0.0 ± 0.0
Arg
1.693ArgAla: 1.693 ± 1.087
0.564ArgCys: 0.564 ± 0.468
2.257ArgAsp: 2.257 ± 0.472
1.693ArgGlu: 1.693 ± 1.2
4.515ArgPhe: 4.515 ± 0.501
1.693ArgGly: 1.693 ± 0.844
0.564ArgHis: 0.564 ± 0.452
2.257ArgIle: 2.257 ± 0.472
1.693ArgLys: 1.693 ± 1.034
2.822ArgLeu: 2.822 ± 0.758
0.0ArgMet: 0.0 ± 0.0
3.386ArgAsn: 3.386 ± 1.286
2.257ArgPro: 2.257 ± 0.822
1.693ArgGln: 1.693 ± 1.2
1.693ArgArg: 1.693 ± 1.087
5.643ArgSer: 5.643 ± 0.402
3.95ArgThr: 3.95 ± 1.778
2.257ArgVal: 2.257 ± 1.045
0.564ArgTrp: 0.564 ± 0.468
5.079ArgTyr: 5.079 ± 3.045
0.0ArgXaa: 0.0 ± 0.0
Ser
4.515SerAla: 4.515 ± 0.783
1.129SerCys: 1.129 ± 0.748
5.643SerAsp: 5.643 ± 1.173
2.822SerGlu: 2.822 ± 0.416
5.643SerPhe: 5.643 ± 1.524
7.336SerGly: 7.336 ± 2.315
2.257SerHis: 2.257 ± 1.022
4.515SerIle: 4.515 ± 3.109
5.079SerLys: 5.079 ± 1.57
7.901SerLeu: 7.901 ± 1.611
1.693SerMet: 1.693 ± 0.794
5.643SerAsn: 5.643 ± 0.994
3.386SerPro: 3.386 ± 1.426
6.208SerGln: 6.208 ± 1.901
4.515SerArg: 4.515 ± 0.986
7.901SerSer: 7.901 ± 2.833
3.95SerThr: 3.95 ± 0.817
3.386SerVal: 3.386 ± 1.426
0.564SerTrp: 0.564 ± 0.641
6.772SerTyr: 6.772 ± 1.633
0.0SerXaa: 0.0 ± 0.0
Thr
1.693ThrAla: 1.693 ± 0.955
1.129ThrCys: 1.129 ± 0.511
5.079ThrAsp: 5.079 ± 1.412
2.822ThrGlu: 2.822 ± 0.416
3.386ThrPhe: 3.386 ± 1.688
1.129ThrGly: 1.129 ± 0.626
0.564ThrHis: 0.564 ± 0.452
5.079ThrIle: 5.079 ± 1.602
0.564ThrLys: 0.564 ± 0.641
7.336ThrLeu: 7.336 ± 1.012
1.129ThrMet: 1.129 ± 0.647
6.772ThrAsn: 6.772 ± 2.343
2.257ThrPro: 2.257 ± 1.058
1.129ThrGln: 1.129 ± 0.551
1.693ThrArg: 1.693 ± 0.869
6.208ThrSer: 6.208 ± 1.696
3.386ThrThr: 3.386 ± 2.127
0.0ThrVal: 0.0 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
1.693ThrTyr: 1.693 ± 1.357
0.0ThrXaa: 0.0 ± 0.0
Val
3.386ValAla: 3.386 ± 1.845
0.564ValCys: 0.564 ± 0.811
2.822ValAsp: 2.822 ± 0.64
4.515ValGlu: 4.515 ± 1.034
1.129ValPhe: 1.129 ± 0.551
3.386ValGly: 3.386 ± 1.285
1.693ValHis: 1.693 ± 1.404
2.257ValIle: 2.257 ± 0.926
5.643ValLys: 5.643 ± 0.729
1.693ValLeu: 1.693 ± 0.778
1.129ValMet: 1.129 ± 1.227
5.079ValAsn: 5.079 ± 1.704
2.822ValPro: 2.822 ± 1.686
3.386ValGln: 3.386 ± 1.838
1.693ValArg: 1.693 ± 0.643
5.079ValSer: 5.079 ± 3.496
3.95ValThr: 3.95 ± 2.55
6.208ValVal: 6.208 ± 2.013
1.693ValTrp: 1.693 ± 1.357
2.257ValTyr: 2.257 ± 1.254
0.0ValXaa: 0.0 ± 0.0
Trp
0.564TrpAla: 0.564 ± 0.641
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.564TrpPhe: 0.564 ± 0.452
0.0TrpGly: 0.0 ± 0.0
0.564TrpHis: 0.564 ± 0.452
0.564TrpIle: 0.564 ± 0.452
0.564TrpLys: 0.564 ± 0.641
1.129TrpLeu: 1.129 ± 0.511
0.0TrpMet: 0.0 ± 0.0
1.129TrpAsn: 1.129 ± 0.551
0.0TrpPro: 0.0 ± 0.0
0.564TrpGln: 0.564 ± 0.468
1.129TrpArg: 1.129 ± 0.905
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.564TrpVal: 0.564 ± 0.452
0.0TrpTrp: 0.0 ± 0.0
0.564TrpTyr: 0.564 ± 0.452
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.386TyrAla: 3.386 ± 1.688
1.693TyrCys: 1.693 ± 0.869
3.386TyrAsp: 3.386 ± 2.714
2.822TyrGlu: 2.822 ± 0.986
4.515TyrPhe: 4.515 ± 1.843
1.693TyrGly: 1.693 ± 0.658
3.386TyrHis: 3.386 ± 0.901
0.0TyrIle: 0.0 ± 0.0
2.822TyrLys: 2.822 ± 1.32
2.822TyrLeu: 2.822 ± 0.982
0.564TyrMet: 0.564 ± 0.452
2.822TyrAsn: 2.822 ± 1.347
1.693TyrPro: 1.693 ± 0.658
0.564TyrGln: 0.564 ± 0.452
3.95TyrArg: 3.95 ± 1.549
3.386TyrSer: 3.386 ± 0.701
3.386TyrThr: 3.386 ± 1.762
2.257TyrVal: 2.257 ± 0.537
0.564TyrTrp: 0.564 ± 0.468
5.643TyrTyr: 5.643 ± 1.844
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1773 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski