Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_147

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
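As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal sketch under assumptions not stated in the source: sequences are given in one-letter code, pairs are not counted across protein boundaries, any non-standard letter is mapped to X (Xaa), and frequencies are normalized by the total number of dipeptides. The function name `dipeptide_per_mille` is hypothetical, and the database's own normalization may differ.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and anything outside the 20 standard residues becomes 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described here.
freqs = dipeptide_per_mille(["MKKA", "AAKU"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Counting within each protein avoids creating artificial pairs from the last residue of one protein and the first residue of the next.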

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
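The expected value and the red/blue marking can be sketched as a simple comparison against the uniform baseline. This is only an illustration: the function names are hypothetical, and the source does not say whether the page colours cells relative to 1/400 or 1/441, so the alphabet size is left as a parameter (20 by default, an assumption).

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    'over' would correspond to red and 'under' to blue in the table
    (assuming the page uses this baseline).
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and CysCys.
print(expected_per_mille())      # baseline for 400 dipeptides
print(classify(11.283))          # AlaAla
print(classify(0.705))           # CysCys
```

With 21 symbols (including Xaa) the baseline drops to 1000/441, roughly 2.27‰, so a few borderline cells could change class depending on which convention is used.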

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
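For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 11.283  # AlaAla, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```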

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.283AlaAla: 11.283 ± 7.687
2.116AlaCys: 2.116 ± 0.784
4.937AlaAsp: 4.937 ± 1.139
2.821AlaGlu: 2.821 ± 1.236
2.116AlaPhe: 2.116 ± 0.784
5.642AlaGly: 5.642 ± 4.429
1.41AlaHis: 1.41 ± 1.458
0.0AlaIle: 0.0 ± 0.0
1.41AlaLys: 1.41 ± 1.458
4.937AlaLeu: 4.937 ± 1.636
1.41AlaMet: 1.41 ± 1.096
6.347AlaAsn: 6.347 ± 2.906
2.821AlaPro: 2.821 ± 1.289
2.116AlaGln: 2.116 ± 2.187
3.526AlaArg: 3.526 ± 0.826
5.642AlaSer: 5.642 ± 3.221
4.231AlaThr: 4.231 ± 2.725
2.821AlaVal: 2.821 ± 0.994
1.41AlaTrp: 1.41 ± 0.958
2.116AlaTyr: 2.116 ± 0.995
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.705CysCys: 0.705 ± 0.844
0.0CysAsp: 0.0 ± 0.0
1.41CysGlu: 1.41 ± 0.816
0.705CysPhe: 0.705 ± 0.597
2.116CysGly: 2.116 ± 1.33
0.0CysHis: 0.0 ± 0.0
2.116CysIle: 2.116 ± 1.072
1.41CysLys: 1.41 ± 1.194
0.705CysLeu: 0.705 ± 0.479
0.0CysMet: 0.0 ± 0.0
0.705CysAsn: 0.705 ± 0.597
0.0CysPro: 0.0 ± 0.0
0.705CysGln: 0.705 ± 0.597
0.705CysArg: 0.705 ± 0.597
0.0CysSer: 0.0 ± 0.0
0.705CysThr: 0.705 ± 0.88
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.705CysTyr: 0.705 ± 0.844
0.0CysXaa: 0.0 ± 0.0
Asp
2.821AspAla: 2.821 ± 0.86
0.705AspCys: 0.705 ± 0.597
2.116AspAsp: 2.116 ± 0.736
2.821AspGlu: 2.821 ± 0.925
7.052AspPhe: 7.052 ± 1.722
2.116AspGly: 2.116 ± 0.995
0.705AspHis: 0.705 ± 0.479
4.231AspIle: 4.231 ± 2.822
2.116AspLys: 2.116 ± 0.877
4.937AspLeu: 4.937 ± 1.013
1.41AspMet: 1.41 ± 0.789
2.821AspAsn: 2.821 ± 1.475
4.231AspPro: 4.231 ± 1.146
2.116AspGln: 2.116 ± 1.072
1.41AspArg: 1.41 ± 0.958
0.705AspSer: 0.705 ± 0.88
5.642AspThr: 5.642 ± 2.687
4.937AspVal: 4.937 ± 1.482
0.705AspTrp: 0.705 ± 0.729
3.526AspTyr: 3.526 ± 0.671
0.0AspXaa: 0.0 ± 0.0
Glu
6.347GluAla: 6.347 ± 2.275
2.116GluCys: 2.116 ± 1.685
2.821GluAsp: 2.821 ± 1.154
2.116GluGlu: 2.116 ± 1.33
2.116GluPhe: 2.116 ± 1.054
0.705GluGly: 0.705 ± 0.479
0.705GluHis: 0.705 ± 0.479
4.231GluIle: 4.231 ± 0.984
4.231GluLys: 4.231 ± 2.139
7.757GluLeu: 7.757 ± 3.897
0.0GluMet: 0.0 ± 0.726
1.41GluAsn: 1.41 ± 0.958
2.116GluPro: 2.116 ± 1.072
1.41GluGln: 1.41 ± 0.637
2.116GluArg: 2.116 ± 1.072
0.705GluSer: 0.705 ± 0.479
2.821GluThr: 2.821 ± 0.81
4.937GluVal: 4.937 ± 1.888
0.705GluTrp: 0.705 ± 0.479
4.231GluTyr: 4.231 ± 1.472
0.0GluXaa: 0.0 ± 0.0
Phe
0.705PheAla: 0.705 ± 0.597
0.0PheCys: 0.0 ± 0.0
2.116PheAsp: 2.116 ± 1.008
1.41PheGlu: 1.41 ± 0.637
4.231PhePhe: 4.231 ± 1.504
5.642PheGly: 5.642 ± 2.579
0.705PheHis: 0.705 ± 0.844
2.821PheIle: 2.821 ± 1.65
3.526PheLys: 3.526 ± 1.486
2.821PheLeu: 2.821 ± 0.595
0.705PheMet: 0.705 ± 0.597
9.168PheAsn: 9.168 ± 3.398
0.705PhePro: 0.705 ± 0.844
1.41PheGln: 1.41 ± 0.577
2.116PheArg: 2.116 ± 1.008
3.526PheSer: 3.526 ± 1.406
3.526PheThr: 3.526 ± 1.405
2.116PheVal: 2.116 ± 0.877
0.705PheTrp: 0.705 ± 0.479
2.116PheTyr: 2.116 ± 1.562
0.0PheXaa: 0.0 ± 0.0
Gly
4.937GlyAla: 4.937 ± 2.52
0.0GlyCys: 0.0 ± 0.0
5.642GlyAsp: 5.642 ± 0.988
4.231GlyGlu: 4.231 ± 1.459
2.116GlyPhe: 2.116 ± 0.784
4.231GlyGly: 4.231 ± 2.118
1.41GlyHis: 1.41 ± 1.458
3.526GlyIle: 3.526 ± 1.065
3.526GlyLys: 3.526 ± 1.86
9.168GlyLeu: 9.168 ± 3.976
1.41GlyMet: 1.41 ± 0.637
3.526GlyAsn: 3.526 ± 0.944
0.705GlyPro: 0.705 ± 0.479
0.705GlyGln: 0.705 ± 0.479
1.41GlyArg: 1.41 ± 0.577
6.347GlySer: 6.347 ± 2.275
4.937GlyThr: 4.937 ± 1.878
2.116GlyVal: 2.116 ± 1.072
1.41GlyTrp: 1.41 ± 0.789
2.821GlyTyr: 2.821 ± 1.351
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
2.116HisAsp: 2.116 ± 1.33
2.821HisGlu: 2.821 ± 1.549
1.41HisPhe: 1.41 ± 0.577
1.41HisGly: 1.41 ± 0.958
1.41HisHis: 1.41 ± 0.789
0.0HisIle: 0.0 ± 0.0
0.705HisLys: 0.705 ± 0.597
0.705HisLeu: 0.705 ± 0.597
0.705HisMet: 0.705 ± 0.442
0.0HisAsn: 0.0 ± 0.0
1.41HisPro: 1.41 ± 0.941
2.821HisGln: 2.821 ± 1.273
1.41HisArg: 1.41 ± 1.194
0.705HisSer: 0.705 ± 0.479
0.705HisThr: 0.705 ± 0.479
0.705HisVal: 0.705 ± 0.844
0.705HisTrp: 0.705 ± 0.479
0.705HisTyr: 0.705 ± 0.597
0.0HisXaa: 0.0 ± 0.0
Ile
5.642IleAla: 5.642 ± 1.614
0.705IleCys: 0.705 ± 0.479
4.231IleAsp: 4.231 ± 1.146
2.821IleGlu: 2.821 ± 1.475
0.705IlePhe: 0.705 ± 0.597
5.642IleGly: 5.642 ± 1.189
1.41IleHis: 1.41 ± 1.194
2.116IleIle: 2.116 ± 1.791
0.0IleLys: 0.0 ± 0.0
4.937IleLeu: 4.937 ± 1.983
1.41IleMet: 1.41 ± 1.084
4.937IleAsn: 4.937 ± 1.888
4.231IlePro: 4.231 ± 1.374
2.821IleGln: 2.821 ± 0.642
3.526IleArg: 3.526 ± 0.899
2.116IleSer: 2.116 ± 0.877
3.526IleThr: 3.526 ± 1.628
2.116IleVal: 2.116 ± 0.997
0.0IleTrp: 0.0 ± 0.0
2.821IleTyr: 2.821 ± 0.595
0.0IleXaa: 0.0 ± 0.0
Lys
1.41LysAla: 1.41 ± 0.577
0.705LysCys: 0.705 ± 0.597
4.937LysAsp: 4.937 ± 0.864
4.231LysGlu: 4.231 ± 3.118
0.705LysPhe: 0.705 ± 0.479
1.41LysGly: 1.41 ± 0.958
0.705LysHis: 0.705 ± 0.597
3.526LysIle: 3.526 ± 1.065
9.873LysLys: 9.873 ± 5.435
2.116LysLeu: 2.116 ± 1.072
2.116LysMet: 2.116 ± 0.736
3.526LysAsn: 3.526 ± 1.065
1.41LysPro: 1.41 ± 1.194
1.41LysGln: 1.41 ± 1.194
2.821LysArg: 2.821 ± 1.083
4.231LysSer: 4.231 ± 1.731
4.937LysThr: 4.937 ± 1.351
1.41LysVal: 1.41 ± 0.808
0.0LysTrp: 0.0 ± 0.0
8.463LysTyr: 8.463 ± 2.563
0.0LysXaa: 0.0 ± 0.0
Leu
3.526LeuAla: 3.526 ± 1.246
0.0LeuCys: 0.0 ± 0.0
0.0LeuAsp: 0.0 ± 0.0
4.937LeuGlu: 4.937 ± 2.031
4.937LeuPhe: 4.937 ± 1.229
2.821LeuGly: 2.821 ± 1.289
3.526LeuHis: 3.526 ± 1.018
7.757LeuIle: 7.757 ± 2.055
5.642LeuLys: 5.642 ± 2.54
2.821LeuLeu: 2.821 ± 0.81
2.821LeuMet: 2.821 ± 0.82
8.463LeuAsn: 8.463 ± 3.765
3.526LeuPro: 3.526 ± 1.761
4.231LeuGln: 4.231 ± 1.165
2.821LeuArg: 2.821 ± 0.844
9.168LeuSer: 9.168 ± 3.081
6.347LeuThr: 6.347 ± 2.006
2.821LeuVal: 2.821 ± 1.289
0.705LeuTrp: 0.705 ± 0.597
0.705LeuTyr: 0.705 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
2.821MetAla: 2.821 ± 2.917
0.705MetCys: 0.705 ± 0.597
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.116MetGly: 2.116 ± 0.859
0.0MetHis: 0.0 ± 0.0
1.41MetIle: 1.41 ± 0.577
2.116MetLys: 2.116 ± 0.784
0.705MetLeu: 0.705 ± 0.844
1.41MetMet: 1.41 ± 0.637
3.526MetAsn: 3.526 ± 1.1
2.821MetPro: 2.821 ± 1.348
0.705MetGln: 0.705 ± 0.479
0.0MetArg: 0.0 ± 0.0
2.116MetSer: 2.116 ± 1.419
0.705MetThr: 0.705 ± 0.844
0.705MetVal: 0.705 ± 0.479
0.0MetTrp: 0.0 ± 0.0
2.116MetTyr: 2.116 ± 0.736
0.0MetXaa: 0.0 ± 0.0
Asn
6.347AsnAla: 6.347 ± 2.48
0.705AsnCys: 0.705 ± 0.597
2.821AsnAsp: 2.821 ± 1.083
4.937AsnGlu: 4.937 ± 1.856
4.231AsnPhe: 4.231 ± 2.228
4.937AsnGly: 4.937 ± 1.379
2.116AsnHis: 2.116 ± 0.736
1.41AsnIle: 1.41 ± 0.789
4.937AsnLys: 4.937 ± 1.083
4.231AsnLeu: 4.231 ± 1.704
0.705AsnMet: 0.705 ± 0.729
2.821AsnAsn: 2.821 ± 1.344
5.642AsnPro: 5.642 ± 1.484
4.231AsnGln: 4.231 ± 3.581
3.526AsnArg: 3.526 ± 0.944
6.347AsnSer: 6.347 ± 2.583
7.052AsnThr: 7.052 ± 2.227
3.526AsnVal: 3.526 ± 1.761
0.0AsnTrp: 0.0 ± 0.0
4.937AsnTyr: 4.937 ± 2.176
0.0AsnXaa: 0.0 ± 0.0
Pro
1.41ProAla: 1.41 ± 0.816
0.705ProCys: 0.705 ± 0.597
0.705ProAsp: 0.705 ± 0.479
2.116ProGlu: 2.116 ± 0.877
4.231ProPhe: 4.231 ± 1.088
5.642ProGly: 5.642 ± 2.027
1.41ProHis: 1.41 ± 1.194
2.116ProIle: 2.116 ± 1.436
0.705ProLys: 0.705 ± 0.479
2.821ProLeu: 2.821 ± 1.154
1.41ProMet: 1.41 ± 0.958
4.231ProAsn: 4.231 ± 1.19
1.41ProPro: 1.41 ± 0.577
2.116ProGln: 2.116 ± 1.436
0.705ProArg: 0.705 ± 0.597
3.526ProSer: 3.526 ± 2.394
4.231ProThr: 4.231 ± 0.866
9.168ProVal: 9.168 ± 2.322
0.0ProTrp: 0.0 ± 0.0
1.41ProTyr: 1.41 ± 1.194
0.0ProXaa: 0.0 ± 0.0
Gln
2.821GlnAla: 2.821 ± 1.545
0.705GlnCys: 0.705 ± 0.597
3.526GlnAsp: 3.526 ± 1.16
1.41GlnGlu: 1.41 ± 0.637
2.821GlnPhe: 2.821 ± 1.348
2.116GlnGly: 2.116 ± 1.436
0.705GlnHis: 0.705 ± 0.479
0.0GlnIle: 0.0 ± 0.0
2.116GlnLys: 2.116 ± 0.859
3.526GlnLeu: 3.526 ± 1.001
1.41GlnMet: 1.41 ± 1.084
3.526GlnAsn: 3.526 ± 1.86
1.41GlnPro: 1.41 ± 0.816
1.41GlnGln: 1.41 ± 0.637
1.41GlnArg: 1.41 ± 0.637
2.821GlnSer: 2.821 ± 1.154
0.705GlnThr: 0.705 ± 0.729
3.526GlnVal: 3.526 ± 0.826
1.41GlnTrp: 1.41 ± 0.577
2.116GlnTyr: 2.116 ± 1.791
0.0GlnXaa: 0.0 ± 0.0
Arg
0.705ArgAla: 0.705 ± 0.479
0.705ArgCys: 0.705 ± 0.597
2.116ArgAsp: 2.116 ± 0.523
1.41ArgGlu: 1.41 ± 0.637
2.116ArgPhe: 2.116 ± 1.072
1.41ArgGly: 1.41 ± 0.577
0.0ArgHis: 0.0 ± 0.0
2.116ArgIle: 2.116 ± 0.997
2.821ArgLys: 2.821 ± 1.083
4.937ArgLeu: 4.937 ± 1.983
1.41ArgMet: 1.41 ± 0.958
2.116ArgAsn: 2.116 ± 1.072
2.116ArgPro: 2.116 ± 0.877
0.705ArgGln: 0.705 ± 0.597
0.0ArgArg: 0.0 ± 0.0
2.821ArgSer: 2.821 ± 0.844
0.705ArgThr: 0.705 ± 0.729
1.41ArgVal: 1.41 ± 0.789
0.705ArgTrp: 0.705 ± 0.479
4.231ArgTyr: 4.231 ± 1.753
0.0ArgXaa: 0.0 ± 0.0
Ser
7.757SerAla: 7.757 ± 3.769
1.41SerCys: 1.41 ± 0.577
4.231SerAsp: 4.231 ± 1.945
4.937SerGlu: 4.937 ± 1.594
2.821SerPhe: 2.821 ± 1.145
4.231SerGly: 4.231 ± 0.806
1.41SerHis: 1.41 ± 0.958
6.347SerIle: 6.347 ± 1.358
0.705SerLys: 0.705 ± 0.729
7.052SerLeu: 7.052 ± 2.028
2.116SerMet: 2.116 ± 0.877
5.642SerAsn: 5.642 ± 2.403
3.526SerPro: 3.526 ± 1.736
3.526SerGln: 3.526 ± 0.944
2.116SerArg: 2.116 ± 1.072
11.283SerSer: 11.283 ± 5.678
3.526SerThr: 3.526 ± 2.704
3.526SerVal: 3.526 ± 1.216
1.41SerTrp: 1.41 ± 0.577
1.41SerTyr: 1.41 ± 1.458
0.0SerXaa: 0.0 ± 0.0
Thr
2.821ThrAla: 2.821 ± 1.273
0.0ThrCys: 0.0 ± 0.0
7.052ThrAsp: 7.052 ± 3.077
4.231ThrGlu: 4.231 ± 1.99
1.41ThrPhe: 1.41 ± 0.808
6.347ThrGly: 6.347 ± 2.927
1.41ThrHis: 1.41 ± 0.577
3.526ThrIle: 3.526 ± 1.246
6.347ThrLys: 6.347 ± 1.57
5.642ThrLeu: 5.642 ± 1.01
0.705ThrMet: 0.705 ± 0.729
2.116ThrAsn: 2.116 ± 0.859
4.231ThrPro: 4.231 ± 1.088
2.821ThrGln: 2.821 ± 1.577
0.705ThrArg: 0.705 ± 0.479
5.642ThrSer: 5.642 ± 3.054
2.821ThrThr: 2.821 ± 0.595
0.0ThrVal: 0.0 ± 0.0
0.705ThrTrp: 0.705 ± 0.479
2.821ThrTyr: 2.821 ± 1.154
0.0ThrXaa: 0.0 ± 0.0
Val
2.116ValAla: 2.116 ± 0.523
0.0ValCys: 0.0 ± 0.0
4.937ValAsp: 4.937 ± 1.033
1.41ValGlu: 1.41 ± 1.194
2.821ValPhe: 2.821 ± 1.351
0.705ValGly: 0.705 ± 0.844
0.705ValHis: 0.705 ± 0.844
2.821ValIle: 2.821 ± 1.348
4.937ValLys: 4.937 ± 1.577
3.526ValLeu: 3.526 ± 1.065
0.705ValMet: 0.705 ± 0.729
3.526ValAsn: 3.526 ± 1.758
6.347ValPro: 6.347 ± 2.118
1.41ValGln: 1.41 ± 0.958
1.41ValArg: 1.41 ± 0.577
5.642ValSer: 5.642 ± 1.319
1.41ValThr: 1.41 ± 0.577
1.41ValVal: 1.41 ± 0.958
0.0ValTrp: 0.0 ± 0.0
3.526ValTyr: 3.526 ± 1.126
0.0ValXaa: 0.0 ± 0.0
Trp
0.705TrpAla: 0.705 ± 0.597
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.705TrpPhe: 0.705 ± 0.479
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.705TrpIle: 0.705 ± 0.479
1.41TrpLys: 1.41 ± 0.958
0.705TrpLeu: 0.705 ± 0.479
0.705TrpMet: 0.705 ± 0.479
0.705TrpAsn: 0.705 ± 0.479
1.41TrpPro: 1.41 ± 0.577
1.41TrpGln: 1.41 ± 0.808
0.0TrpArg: 0.0 ± 0.0
1.41TrpSer: 1.41 ± 0.958
0.705TrpThr: 0.705 ± 0.479
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.41TrpTyr: 1.41 ± 0.789
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.937TyrAla: 4.937 ± 0.798
0.705TyrCys: 0.705 ± 0.88
2.821TyrAsp: 2.821 ± 1.273
4.231TyrGlu: 4.231 ± 1.498
2.821TyrPhe: 2.821 ± 0.81
4.937TyrGly: 4.937 ± 1.543
0.705TyrHis: 0.705 ± 0.597
4.937TyrIle: 4.937 ± 2.622
1.41TyrLys: 1.41 ± 0.941
3.526TyrLeu: 3.526 ± 1.587
0.705TyrMet: 0.705 ± 0.479
6.347TyrAsn: 6.347 ± 1.061
0.0TyrPro: 0.0 ± 0.0
1.41TyrGln: 1.41 ± 0.941
2.821TyrArg: 2.821 ± 1.348
4.231TyrSer: 4.231 ± 0.806
2.116TyrThr: 2.116 ± 0.995
2.116TyrVal: 2.116 ± 0.877
1.41TyrTrp: 1.41 ± 0.958
2.116TyrTyr: 2.116 ± 1.072
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1419 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
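One way to read "bootstrapping at the protein level" is: resample whole proteins with replacement, recompute the dipeptide frequencies for each resampled set, and take the spread across rounds as the error. The sketch below follows that reading. It is not the database's actual code; the function names, the use of the population standard deviation as the error, the fixed seed, and the normalization by total dipeptide count are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Overlapping dipeptide frequencies (per mille), counted within proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Each round draws len(proteins) proteins with replacement and
    recomputes the frequencies; the error is the standard deviation
    of each dipeptide's frequency across rounds (assumed definition).
    """
    rng = random.Random(seed)
    rounds = []
    for _ in range(n_rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        rounds.append(dipeptide_per_mille(resampled))
    pairs = set().union(*rounds)
    return {
        pair: statistics.pstdev([r.get(pair, 0.0) for r in rounds])
        for pair in pairs
    }


# Toy sequences, not the five proteins of this proteome.
toy = ["MKKA", "AAKA", "MKLA"]
errors = bootstrap_error(toy)
for pair in sorted(errors):
    print(pair, round(errors[pair], 3))
```

With only a handful of proteins, as here, the resampled sets differ strongly from round to round, which is consistent with the fairly large errors relative to the values in the table.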

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski