Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_437

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides occurred with equal probability in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
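The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by the database: the function names `dipeptide_permille` and `classify` are hypothetical, the choice to normalize by the total number of dipeptides is an assumption (the database may normalize differently), and the red/blue threshold is assumed to be the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"              # X stands for Xaa (non-standard or unknown)
EXPECTED_PERMILLE = 1000.0 / 400       # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(proteins):
    """Return a dict of all 441 dipeptides -> frequency in per mille.

    Assumption: frequencies are normalized by the total number of
    overlapping dipeptides counted across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard residue to X before counting.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; dipeptides never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the assumed uniform expectation."""
    if value > expected:
        return "over"    # would be marked red
    if value < expected:
        return "under"   # would be marked blue
    return "expected"
```

For example, `dipeptide_permille(["ACAC", "AA"])` counts four dipeptides (AC, CA, AC, AA), so AC gets 500‰ and is classified as "over", while any dipeptide that never occurs gets 0‰ and is classified as "under".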

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.077AlaAla: 3.077 ± 1.919
0.615AlaCys: 0.615 ± 0.627
3.692AlaAsp: 3.692 ± 1.69
3.692AlaGlu: 3.692 ± 1.609
3.692AlaPhe: 3.692 ± 1.582
3.692AlaGly: 3.692 ± 1.939
0.0AlaHis: 0.0 ± 0.0
4.923AlaIle: 4.923 ± 2.863
4.923AlaLys: 4.923 ± 2.185
6.769AlaLeu: 6.769 ± 4.194
0.615AlaMet: 0.615 ± 0.403
6.154AlaAsn: 6.154 ± 2.435
0.615AlaPro: 0.615 ± 0.627
1.846AlaGln: 1.846 ± 1.679
0.615AlaArg: 0.615 ± 0.403
8.615AlaSer: 8.615 ± 4.082
0.615AlaThr: 0.615 ± 0.495
4.308AlaVal: 4.308 ± 1.144
0.615AlaTrp: 0.615 ± 0.627
3.692AlaTyr: 3.692 ± 1.879
0.0AlaXaa: 0.0 ± 0.0
Cys
0.615CysAla: 0.615 ± 0.403
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.231CysGlu: 1.231 ± 0.825
0.0CysPhe: 0.0 ± 0.0
1.846CysGly: 1.846 ± 1.647
1.231CysHis: 1.231 ± 0.807
1.846CysIle: 1.846 ± 1.476
0.0CysLys: 0.0 ± 0.0
0.615CysLeu: 0.615 ± 0.403
1.231CysMet: 1.231 ± 1.255
0.0CysAsn: 0.0 ± 0.0
1.231CysPro: 1.231 ± 0.545
1.231CysGln: 1.231 ± 1.196
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.231CysThr: 1.231 ± 1.255
1.231CysVal: 1.231 ± 0.825
0.0CysTrp: 0.0 ± 0.0
0.615CysTyr: 0.615 ± 0.627
0.0CysXaa: 0.0 ± 0.0
Asp
3.077AspAla: 3.077 ± 1.499
1.231AspCys: 1.231 ± 0.807
3.692AspAsp: 3.692 ± 1.451
1.846AspGlu: 1.846 ± 1.544
0.0AspPhe: 0.0 ± 0.0
3.077AspGly: 3.077 ± 0.883
2.462AspHis: 2.462 ± 1.185
1.846AspIle: 1.846 ± 0.447
3.692AspLys: 3.692 ± 1.895
5.538AspLeu: 5.538 ± 3.059
2.462AspMet: 2.462 ± 0.572
4.923AspAsn: 4.923 ± 1.048
3.692AspPro: 3.692 ± 2.6
1.846AspGln: 1.846 ± 0.447
1.231AspArg: 1.231 ± 0.807
3.692AspSer: 3.692 ± 2.841
3.077AspThr: 3.077 ± 2.09
4.923AspVal: 4.923 ± 1.929
0.615AspTrp: 0.615 ± 0.495
7.385AspTyr: 7.385 ± 1.969
0.0AspXaa: 0.0 ± 0.0
Glu
3.077GluAla: 3.077 ± 1.833
0.615GluCys: 0.615 ± 0.627
4.308GluAsp: 4.308 ± 2.103
1.846GluGlu: 1.846 ± 1.486
3.077GluPhe: 3.077 ± 0.883
1.846GluGly: 1.846 ± 1.679
0.615GluHis: 0.615 ± 0.403
3.077GluIle: 3.077 ± 1.506
3.692GluLys: 3.692 ± 3.233
6.154GluLeu: 6.154 ± 3.395
0.615GluMet: 0.615 ± 1.514
4.923GluAsn: 4.923 ± 1.542
0.615GluPro: 0.615 ± 0.403
3.077GluGln: 3.077 ± 1.004
3.077GluArg: 3.077 ± 1.018
6.769GluSer: 6.769 ± 2.987
1.846GluThr: 1.846 ± 1.425
0.615GluVal: 0.615 ± 0.495
0.615GluTrp: 0.615 ± 0.627
3.077GluTyr: 3.077 ± 0.866
0.0GluXaa: 0.0 ± 0.0
Phe
1.846PheAla: 1.846 ± 1.21
0.615PheCys: 0.615 ± 0.627
1.846PheAsp: 1.846 ± 0.725
2.462PheGlu: 2.462 ± 2.979
1.846PhePhe: 1.846 ± 0.725
3.077PheGly: 3.077 ± 0.866
0.615PheHis: 0.615 ± 0.403
2.462PheIle: 2.462 ± 1.381
3.077PheLys: 3.077 ± 0.866
3.692PheLeu: 3.692 ± 1.408
1.231PheMet: 1.231 ± 0.825
1.846PheAsn: 1.846 ± 0.725
0.0PhePro: 0.0 ± 0.0
0.615PheGln: 0.615 ± 0.495
1.846PheArg: 1.846 ± 1.679
4.308PheSer: 4.308 ± 2.824
3.692PheThr: 3.692 ± 2.42
1.846PheVal: 1.846 ± 0.951
0.0PheTrp: 0.0 ± 0.0
1.846PheTyr: 1.846 ± 0.725
0.0PheXaa: 0.0 ± 0.0
Gly
3.692GlyAla: 3.692 ± 1.184
0.0GlyCys: 0.0 ± 0.0
4.308GlyAsp: 4.308 ± 1.284
2.462GlyGlu: 2.462 ± 0.771
2.462GlyPhe: 2.462 ± 1.386
1.846GlyGly: 1.846 ± 0.97
2.462GlyHis: 2.462 ± 1.499
1.231GlyIle: 1.231 ± 0.553
3.077GlyLys: 3.077 ± 1.216
2.462GlyLeu: 2.462 ± 1.372
0.0GlyMet: 0.0 ± 0.0
3.692GlyAsn: 3.692 ± 1.704
1.846GlyPro: 1.846 ± 0.725
0.615GlyGln: 0.615 ± 0.495
2.462GlyArg: 2.462 ± 1.711
3.692GlySer: 3.692 ± 1.779
1.846GlyThr: 1.846 ± 1.092
1.231GlyVal: 1.231 ± 0.807
0.0GlyTrp: 0.0 ± 0.0
1.846GlyTyr: 1.846 ± 0.97
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.615HisCys: 0.615 ± 0.627
1.846HisAsp: 1.846 ± 1.104
1.231HisGlu: 1.231 ± 0.807
1.846HisPhe: 1.846 ± 1.21
1.231HisGly: 1.231 ± 1.255
0.0HisHis: 0.0 ± 0.0
4.308HisIle: 4.308 ± 1.742
0.0HisLys: 0.0 ± 0.0
0.615HisLeu: 0.615 ± 0.884
0.0HisMet: 0.0 ± 0.0
1.231HisAsn: 1.231 ± 0.545
0.0HisPro: 0.0 ± 0.0
0.615HisGln: 0.615 ± 0.403
3.077HisArg: 3.077 ± 1.151
1.846HisSer: 1.846 ± 1.693
0.615HisThr: 0.615 ± 0.403
0.615HisVal: 0.615 ± 0.495
0.615HisTrp: 0.615 ± 0.627
2.462HisTyr: 2.462 ± 1.499
0.0HisXaa: 0.0 ± 0.0
Ile
4.923IleAla: 4.923 ± 1.609
0.0IleCys: 0.0 ± 0.0
9.846IleAsp: 9.846 ± 2.493
1.846IleGlu: 1.846 ± 0.447
2.462IlePhe: 2.462 ± 1.185
2.462IleGly: 2.462 ± 0.572
1.231IleHis: 1.231 ± 1.196
4.923IleIle: 4.923 ± 0.928
2.462IleLys: 2.462 ± 1.252
4.308IleLeu: 4.308 ± 1.891
1.846IleMet: 1.846 ± 0.447
3.692IleAsn: 3.692 ± 1.403
3.692IlePro: 3.692 ± 1.898
4.308IleGln: 4.308 ± 1.284
3.692IleArg: 3.692 ± 1.635
4.923IleSer: 4.923 ± 1.134
3.077IleThr: 3.077 ± 2.344
1.846IleVal: 1.846 ± 1.902
0.0IleTrp: 0.0 ± 0.0
4.308IleTyr: 4.308 ± 1.819
0.0IleXaa: 0.0 ± 0.0
Lys
3.077LysAla: 3.077 ± 2.161
0.615LysCys: 0.615 ± 1.106
3.692LysAsp: 3.692 ± 2.146
2.462LysGlu: 2.462 ± 2.538
1.846LysPhe: 1.846 ± 1.21
2.462LysGly: 2.462 ± 1.263
2.462LysHis: 2.462 ± 1.711
3.077LysIle: 3.077 ± 2.226
6.154LysLys: 6.154 ± 2.084
6.769LysLeu: 6.769 ± 1.809
1.846LysMet: 1.846 ± 1.364
2.462LysAsn: 2.462 ± 1.437
1.846LysPro: 1.846 ± 0.946
3.692LysGln: 3.692 ± 0.895
5.538LysArg: 5.538 ± 2.488
4.923LysSer: 4.923 ± 1.467
2.462LysThr: 2.462 ± 0.856
3.692LysVal: 3.692 ± 1.137
0.0LysTrp: 0.0 ± 0.0
4.308LysTyr: 4.308 ± 1.017
0.0LysXaa: 0.0 ± 0.0
Leu
5.538LeuAla: 5.538 ± 2.222
0.615LeuCys: 0.615 ± 0.884
3.692LeuAsp: 3.692 ± 1.69
6.154LeuGlu: 6.154 ± 2.142
3.692LeuPhe: 3.692 ± 2.511
1.231LeuGly: 1.231 ± 0.807
2.462LeuHis: 2.462 ± 1.711
7.385LeuIle: 7.385 ± 2.092
5.538LeuLys: 5.538 ± 0.935
5.538LeuLeu: 5.538 ± 1.354
3.692LeuMet: 3.692 ± 0.969
4.308LeuAsn: 4.308 ± 0.827
2.462LeuPro: 2.462 ± 1.614
3.077LeuGln: 3.077 ± 1.282
2.462LeuArg: 2.462 ± 1.185
6.154LeuSer: 6.154 ± 0.921
5.538LeuThr: 5.538 ± 1.133
2.462LeuVal: 2.462 ± 1.307
0.0LeuTrp: 0.0 ± 0.0
4.923LeuTyr: 4.923 ± 5.046
0.0LeuXaa: 0.0 ± 0.0
Met
2.462MetAla: 2.462 ± 1.931
1.231MetCys: 1.231 ± 0.545
0.615MetAsp: 0.615 ± 0.403
0.615MetGlu: 0.615 ± 0.627
1.846MetPhe: 1.846 ± 0.725
0.615MetGly: 0.615 ± 0.403
0.0MetHis: 0.0 ± 0.0
0.615MetIle: 0.615 ± 0.884
2.462MetLys: 2.462 ± 2.223
1.846MetLeu: 1.846 ± 1.157
0.615MetMet: 0.615 ± 0.627
2.462MetAsn: 2.462 ± 1.106
1.846MetPro: 1.846 ± 1.882
1.231MetGln: 1.231 ± 0.553
1.231MetArg: 1.231 ± 0.545
3.077MetSer: 3.077 ± 1.354
0.615MetThr: 0.615 ± 0.403
0.615MetVal: 0.615 ± 0.627
0.0MetTrp: 0.0 ± 0.0
0.615MetTyr: 0.615 ± 0.627
0.0MetXaa: 0.0 ± 0.0
Asn
4.308AsnAla: 4.308 ± 2.007
3.077AsnCys: 3.077 ± 1.734
1.846AsnAsp: 1.846 ± 1.104
6.769AsnGlu: 6.769 ± 1.82
1.231AsnPhe: 1.231 ± 0.545
1.846AsnGly: 1.846 ± 1.104
0.615AsnHis: 0.615 ± 0.627
3.077AsnIle: 3.077 ± 1.562
7.385AsnLys: 7.385 ± 1.29
4.308AsnLeu: 4.308 ± 2.132
2.462AsnMet: 2.462 ± 0.771
6.769AsnAsn: 6.769 ± 1.706
8.0AsnPro: 8.0 ± 1.14
4.308AsnGln: 4.308 ± 1.894
3.692AsnArg: 3.692 ± 1.184
5.538AsnSer: 5.538 ± 2.029
7.385AsnThr: 7.385 ± 2.927
2.462AsnVal: 2.462 ± 1.039
0.0AsnTrp: 0.0 ± 0.0
6.769AsnTyr: 6.769 ± 2.638
0.0AsnXaa: 0.0 ± 0.0
Pro
1.846ProAla: 1.846 ± 0.832
0.615ProCys: 0.615 ± 0.627
1.846ProAsp: 1.846 ± 1.503
2.462ProGlu: 2.462 ± 1.263
2.462ProPhe: 2.462 ± 1.039
1.846ProGly: 1.846 ± 1.082
1.231ProHis: 1.231 ± 0.545
2.462ProIle: 2.462 ± 1.09
0.0ProLys: 0.0 ± 0.0
5.538ProLeu: 5.538 ± 1.326
0.615ProMet: 0.615 ± 0.403
6.154ProAsn: 6.154 ± 2.852
0.0ProPro: 0.0 ± 0.0
0.0ProGln: 0.0 ± 0.0
1.231ProArg: 1.231 ± 0.545
6.154ProSer: 6.154 ± 1.756
0.615ProThr: 0.615 ± 1.6
1.231ProVal: 1.231 ± 0.807
0.615ProTrp: 0.615 ± 0.495
3.692ProTyr: 3.692 ± 1.505
0.0ProXaa: 0.0 ± 0.0
Gln
1.846GlnAla: 1.846 ± 0.946
0.0GlnCys: 0.0 ± 0.0
1.231GlnAsp: 1.231 ± 0.553
0.615GlnGlu: 0.615 ± 0.495
2.462GlnPhe: 2.462 ± 0.841
1.846GlnGly: 1.846 ± 0.97
1.231GlnHis: 1.231 ± 1.196
3.692GlnIle: 3.692 ± 1.296
2.462GlnLys: 2.462 ± 1.437
2.462GlnLeu: 2.462 ± 1.106
0.615GlnMet: 0.615 ± 0.627
4.308GlnAsn: 4.308 ± 1.017
0.615GlnPro: 0.615 ± 0.403
1.231GlnGln: 1.231 ± 0.825
4.308GlnArg: 4.308 ± 1.281
4.923GlnSer: 4.923 ± 1.965
1.846GlnThr: 1.846 ± 1.072
3.077GlnVal: 3.077 ± 1.562
0.0GlnTrp: 0.0 ± 0.0
3.077GlnTyr: 3.077 ± 0.883
0.0GlnXaa: 0.0 ± 0.0
Arg
2.462ArgAla: 2.462 ± 1.252
0.615ArgCys: 0.615 ± 0.627
1.231ArgAsp: 1.231 ± 0.807
3.692ArgGlu: 3.692 ± 1.238
2.462ArgPhe: 2.462 ± 1.206
1.846ArgGly: 1.846 ± 0.725
0.615ArgHis: 0.615 ± 0.403
2.462ArgIle: 2.462 ± 2.336
3.692ArgLys: 3.692 ± 1.895
1.846ArgLeu: 1.846 ± 1.679
1.846ArgMet: 1.846 ± 1.104
6.769ArgAsn: 6.769 ± 2.21
2.462ArgPro: 2.462 ± 1.386
4.308ArgGln: 4.308 ± 1.982
1.846ArgArg: 1.846 ± 0.946
1.846ArgSer: 1.846 ± 1.425
3.077ArgThr: 3.077 ± 0.883
1.846ArgVal: 1.846 ± 0.832
0.0ArgTrp: 0.0 ± 0.0
2.462ArgTyr: 2.462 ± 1.185
0.0ArgXaa: 0.0 ± 0.0
Ser
8.615SerAla: 8.615 ± 1.572
1.231SerCys: 1.231 ± 1.061
7.385SerAsp: 7.385 ± 2.071
5.538SerGlu: 5.538 ± 1.516
0.615SerPhe: 0.615 ± 1.6
4.923SerGly: 4.923 ± 1.491
1.231SerHis: 1.231 ± 0.545
8.0SerIle: 8.0 ± 1.478
3.692SerLys: 3.692 ± 1.664
11.077SerLeu: 11.077 ± 4.419
1.846SerMet: 1.846 ± 1.425
3.692SerAsn: 3.692 ± 1.69
2.462SerPro: 2.462 ± 1.695
3.077SerGln: 3.077 ± 1.526
3.692SerArg: 3.692 ± 1.252
6.154SerSer: 6.154 ± 2.597
6.154SerThr: 6.154 ± 2.513
4.923SerVal: 4.923 ± 1.181
0.615SerTrp: 0.615 ± 0.403
6.154SerTyr: 6.154 ± 1.363
0.0SerXaa: 0.0 ± 0.0
Thr
4.308ThrAla: 4.308 ± 3.348
0.0ThrCys: 0.0 ± 0.0
1.846ThrAsp: 1.846 ± 1.544
1.846ThrGlu: 1.846 ± 1.163
3.077ThrPhe: 3.077 ± 2.017
1.846ThrGly: 1.846 ± 0.97
0.615ThrHis: 0.615 ± 0.627
3.692ThrIle: 3.692 ± 3.075
3.692ThrLys: 3.692 ± 2.548
1.846ThrLeu: 1.846 ± 0.725
0.615ThrMet: 0.615 ± 0.627
8.0ThrAsn: 8.0 ± 3.709
3.692ThrPro: 3.692 ± 1.117
1.231ThrGln: 1.231 ± 0.553
0.615ThrArg: 0.615 ± 1.106
6.154ThrSer: 6.154 ± 2.085
3.692ThrThr: 3.692 ± 3.127
2.462ThrVal: 2.462 ± 1.09
1.231ThrTrp: 1.231 ± 0.807
3.077ThrTyr: 3.077 ± 1.218
0.0ThrXaa: 0.0 ± 0.0
Val
3.692ValAla: 3.692 ± 1.857
0.615ValCys: 0.615 ± 0.627
3.077ValAsp: 3.077 ± 1.101
1.231ValGlu: 1.231 ± 0.807
1.846ValPhe: 1.846 ± 0.447
1.846ValGly: 1.846 ± 1.503
1.846ValHis: 1.846 ± 1.21
1.231ValIle: 1.231 ± 0.545
1.846ValLys: 1.846 ± 1.157
2.462ValLeu: 2.462 ± 0.572
1.231ValMet: 1.231 ± 0.752
3.077ValAsn: 3.077 ± 0.787
3.077ValPro: 3.077 ± 2.017
3.077ValGln: 3.077 ± 0.866
3.077ValArg: 3.077 ± 1.526
5.538ValSer: 5.538 ± 1.864
4.308ValThr: 4.308 ± 1.068
3.692ValVal: 3.692 ± 1.334
0.0ValTrp: 0.0 ± 0.0
0.615ValTyr: 0.615 ± 1.6
0.0ValXaa: 0.0 ± 0.0
Trp
0.615TrpAla: 0.615 ± 0.403
0.0TrpCys: 0.0 ± 0.0
0.615TrpAsp: 0.615 ± 0.627
0.615TrpGlu: 0.615 ± 0.403
0.615TrpPhe: 0.615 ± 0.495
0.0TrpGly: 0.0 ± 0.0
0.615TrpHis: 0.615 ± 0.627
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.615TrpLeu: 0.615 ± 0.627
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.615TrpArg: 0.615 ± 0.495
1.231TrpSer: 1.231 ± 1.255
0.0TrpThr: 0.0 ± 0.0
0.615TrpVal: 0.615 ± 0.627
0.0TrpTrp: 0.0 ± 0.0
0.615TrpTyr: 0.615 ± 0.403
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.692TyrAla: 3.692 ± 1.779
1.846TyrCys: 1.846 ± 1.693
3.692TyrAsp: 3.692 ± 1.664
4.923TyrGlu: 4.923 ± 2.878
1.231TyrPhe: 1.231 ± 0.807
1.846TyrGly: 1.846 ± 0.725
1.231TyrHis: 1.231 ± 0.825
5.538TyrIle: 5.538 ± 2.829
5.538TyrLys: 5.538 ± 0.902
2.462TyrLeu: 2.462 ± 1.206
0.615TyrMet: 0.615 ± 1.193
7.385TyrAsn: 7.385 ± 2.11
2.462TyrPro: 2.462 ± 1.09
2.462TyrGln: 2.462 ± 1.106
3.077TyrArg: 3.077 ± 1.454
5.538TyrSer: 5.538 ± 2.005
1.846TyrThr: 1.846 ± 0.951
3.692TyrVal: 3.692 ± 1.248
1.846TyrTrp: 1.846 ± 1.882
4.308TyrTyr: 4.308 ± 2.927
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1626 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
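A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: the function name `bootstrap_error` is hypothetical, and the use of the standard deviation across resamples as the reported error, as well as normalization by the total number of dipeptides, are assumptions. Whole proteins are resampled with replacement, so the unit of resampling is the protein and not the individual dipeptide.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate mean and spread (in per mille) of one dipeptide's frequency.

    Each round draws len(proteins) proteins with replacement and
    recomputes the frequency on that resampled proteome.
    """
    rng = random.Random(seed)          # fixed seed for reproducibility
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    # Assumption: the error is the standard deviation over bootstrap rounds.
    return statistics.mean(estimates), statistics.pstdev(estimates)
```

With a small proteome (here 6 proteins), many resamples omit one or more proteins entirely, which is one reason the errors in the table can be large relative to the values. A dipeptide that occurs in no protein yields 0.0 in every round, consistent with the "0.0 ± 0.0" entries.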

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski