Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_367

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
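The calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the choice to count overlapping pairs within each protein, and the mapping of every non-standard letter to `X` (Xaa) are all hypothetical, and the input is a toy example rather than the real proteome.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not confirmed by the source): dipeptides are counted as
    overlapping pairs within each protein, never across protein boundaries,
    and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not the real proteome.
toy = ["MKKLAV", "AVKK"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected = 1000.0 / 400
overrepresented = sorted(dp for dp, f in freqs.items() if f > expected)
underrepresented = sorted(dp for dp, f in freqs.items() if f < expected)
print(len(freqs), expected, overrepresented)
```

Comparing each value against the uniform expectation of 2.5‰ is what separates the "red" (overrepresented) from the "blue" (underrepresented) cells; in a proteome this small, many cells are zero simply because there are too few residues to observe every pair.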

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
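As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a plain fraction and into a percentage. The helper names are hypothetical and exist only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 5.464  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value as a percentage
print(per_mille_to_percent(2.5))       # uniform expectation as a percentage
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰, which is the number to compare the table entries against.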

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.464 ± 3.241
AlaCys: 0.0 ± 0.0
AlaAsp: 5.464 ± 3.189
AlaGlu: 5.464 ± 4.999
AlaPhe: 3.036 ± 0.813
AlaGly: 3.036 ± 1.73
AlaHis: 0.0 ± 0.0
AlaIle: 1.821 ± 0.781
AlaLys: 5.464 ± 5.052
AlaLeu: 3.643 ± 0.451
AlaMet: 1.214 ± 0.884
AlaAsn: 4.25 ± 1.343
AlaPro: 1.821 ± 0.225
AlaGln: 3.643 ± 2.159
AlaArg: 3.036 ± 0.756
AlaSer: 3.036 ± 1.325
AlaThr: 0.0 ± 0.0
AlaVal: 4.25 ± 1.054
AlaTrp: 0.607 ± 0.442
AlaTyr: 3.643 ± 0.654
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.214 ± 0.575
CysCys: 1.214 ± 0.48
CysAsp: 1.214 ± 0.575
CysGlu: 1.214 ± 0.884
CysPhe: 1.821 ± 0.867
CysGly: 0.607 ± 0.492
CysHis: 0.0 ± 0.0
CysIle: 1.821 ± 0.625
CysLys: 1.214 ± 0.985
CysLeu: 1.821 ± 1.325
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.607 ± 0.492
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.214 ± 0.575
CysTrp: 0.0 ± 0.0
CysTyr: 1.821 ± 1.477
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.25 ± 1.343
AspCys: 1.821 ± 0.225
AspAsp: 3.643 ± 1.034
AspGlu: 3.643 ± 0.899
AspPhe: 6.679 ± 0.963
AspGly: 4.25 ± 1.68
AspHis: 1.214 ± 0.48
AspIle: 4.857 ± 0.69
AspLys: 4.25 ± 1.918
AspLeu: 6.679 ± 1.352
AspMet: 3.036 ± 0.946
AspAsn: 4.857 ± 1.721
AspPro: 0.607 ± 0.75
AspGln: 0.0 ± 0.0
AspArg: 1.821 ± 0.625
AspSer: 7.286 ± 1.998
AspThr: 5.464 ± 2.001
AspVal: 3.036 ± 1.593
AspTrp: 2.429 ± 0.854
AspTyr: 4.857 ± 0.604
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.643 ± 3.096
GluCys: 1.214 ± 0.985
GluAsp: 4.25 ± 0.671
GluGlu: 3.643 ± 2.159
GluPhe: 3.643 ± 1.441
GluGly: 5.464 ± 0.396
GluHis: 0.607 ± 0.655
GluIle: 3.036 ± 0.813
GluLys: 4.857 ± 3.512
GluLeu: 1.214 ± 1.31
GluMet: 1.214 ± 1.31
GluAsn: 4.25 ± 1.322
GluPro: 0.0 ± 0.0
GluGln: 1.214 ± 0.575
GluArg: 3.036 ± 0.946
GluSer: 2.429 ± 1.193
GluThr: 3.036 ± 1.339
GluVal: 1.821 ± 1.038
GluTrp: 0.607 ± 0.655
GluTyr: 4.25 ± 0.478
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.607 ± 0.442
PheCys: 1.214 ± 0.48
PheAsp: 6.072 ± 2.556
PheGlu: 1.214 ± 0.48
PhePhe: 2.429 ± 1.174
PheGly: 3.643 ± 1.561
PheHis: 0.0 ± 0.0
PheIle: 4.25 ± 0.79
PheLys: 4.25 ± 0.586
PheLeu: 4.25 ± 1.918
PheMet: 0.0 ± 0.0
PheAsn: 4.25 ± 1.054
PhePro: 1.214 ± 0.48
PheGln: 0.607 ± 0.442
PheArg: 1.821 ± 1.477
PheSer: 3.036 ± 1.207
PheThr: 3.643 ± 0.654
PheVal: 1.214 ± 0.48
PheTrp: 0.607 ± 0.655
PheTyr: 2.429 ± 0.961
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.643 ± 1.79
GlyCys: 0.607 ± 0.442
GlyAsp: 6.679 ± 2.22
GlyGlu: 1.214 ± 0.48
GlyPhe: 2.429 ± 1.174
GlyGly: 2.429 ± 1.193
GlyHis: 0.0 ± 0.0
GlyIle: 5.464 ± 2.033
GlyLys: 6.679 ± 0.986
GlyLeu: 7.286 ± 3.677
GlyMet: 1.821 ± 0.225
GlyAsn: 2.429 ± 0.401
GlyPro: 0.0 ± 0.0
GlyGln: 1.821 ± 0.225
GlyArg: 2.429 ± 1.175
GlySer: 6.072 ± 3.004
GlyThr: 1.821 ± 0.867
GlyVal: 6.072 ± 1.201
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.643 ± 0.654
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.607 ± 0.492
HisGlu: 0.607 ± 0.655
HisPhe: 1.214 ± 0.884
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.214 ± 0.48
HisLys: 1.214 ± 0.985
HisLeu: 0.607 ± 0.442
HisMet: 1.214 ± 0.48
HisAsn: 0.607 ± 0.442
HisPro: 1.821 ± 0.625
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.214 ± 0.985
HisThr: 1.214 ± 0.48
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.821 ± 1.477
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.857 ± 1.708
IleCys: 0.0 ± 0.0
IleAsp: 4.25 ± 2.013
IleGlu: 4.25 ± 3.356
IlePhe: 1.214 ± 0.597
IleGly: 1.821 ± 0.781
IleHis: 1.214 ± 0.48
IleIle: 3.036 ± 0.825
IleLys: 5.464 ± 1.536
IleLeu: 3.643 ± 0.654
IleMet: 0.0 ± 0.0
IleAsn: 3.036 ± 0.946
IlePro: 3.643 ± 1.371
IleGln: 3.036 ± 0.756
IleArg: 3.036 ± 1.219
IleSer: 7.893 ± 2.859
IleThr: 3.643 ± 2.022
IleVal: 3.036 ± 0.754
IleTrp: 0.0 ± 0.0
IleTyr: 1.214 ± 0.48
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.464 ± 2.779
LysCys: 1.214 ± 0.884
LysAsp: 6.072 ± 2.696
LysGlu: 4.857 ± 2.109
LysPhe: 4.857 ± 1.544
LysGly: 5.464 ± 0.676
LysHis: 0.607 ± 0.492
LysIle: 5.464 ± 2.473
LysLys: 3.643 ± 2.285
LysLeu: 4.25 ± 1.911
LysMet: 0.0 ± 0.0
LysAsn: 3.643 ± 1.733
LysPro: 1.214 ± 1.151
LysGln: 1.214 ± 1.31
LysArg: 0.607 ± 0.655
LysSer: 7.286 ± 1.342
LysThr: 1.821 ± 0.225
LysVal: 5.464 ± 1.352
LysTrp: 0.0 ± 0.0
LysTyr: 6.679 ± 2.555
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.25 ± 0.478
LeuCys: 0.607 ± 0.442
LeuAsp: 6.679 ± 0.494
LeuGlu: 4.25 ± 1.911
LeuPhe: 1.821 ± 0.847
LeuGly: 5.464 ± 0.676
LeuHis: 0.607 ± 0.442
LeuIle: 3.036 ± 0.754
LeuLys: 2.429 ± 0.48
LeuLeu: 3.643 ± 1.441
LeuMet: 0.0 ± 0.661
LeuAsn: 4.857 ± 0.604
LeuPro: 3.036 ± 0.946
LeuGln: 1.821 ± 1.038
LeuArg: 4.25 ± 1.36
LeuSer: 9.715 ± 2.686
LeuThr: 3.643 ± 0.769
LeuVal: 2.429 ± 1.073
LeuTrp: 0.607 ± 0.442
LeuTyr: 3.036 ± 0.366
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.821 ± 0.82
MetCys: 0.607 ± 0.492
MetAsp: 1.821 ± 0.781
MetGlu: 0.0 ± 0.0
MetPhe: 1.214 ± 0.884
MetGly: 0.607 ± 0.442
MetHis: 0.0 ± 0.0
MetIle: 1.821 ± 0.625
MetLys: 1.821 ± 0.847
MetLeu: 1.214 ± 0.48
MetMet: 0.607 ± 0.655
MetAsn: 3.036 ± 1.092
MetPro: 1.821 ± 0.225
MetGln: 1.821 ± 1.131
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 1.214 ± 0.597
MetVal: 1.214 ± 0.597
MetTrp: 0.607 ± 0.655
MetTyr: 2.429 ± 0.737
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.25 ± 0.478
AsnCys: 0.0 ± 0.0
AsnAsp: 6.679 ± 1.099
AsnGlu: 3.036 ± 0.366
AsnPhe: 3.036 ± 0.946
AsnGly: 4.25 ± 1.194
AsnHis: 0.0 ± 0.0
AsnIle: 6.072 ± 3.155
AsnLys: 3.036 ± 0.366
AsnLeu: 2.429 ± 1.559
AsnMet: 1.821 ± 1.221
AsnAsn: 4.25 ± 0.586
AsnPro: 4.857 ± 0.952
AsnGln: 3.036 ± 1.325
AsnArg: 1.821 ± 0.847
AsnSer: 4.857 ± 1.73
AsnThr: 3.036 ± 0.366
AsnVal: 3.643 ± 0.451
AsnTrp: 1.214 ± 0.597
AsnTyr: 4.25 ± 2.112
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.607 ± 0.655
ProCys: 0.607 ± 0.492
ProAsp: 1.214 ± 0.48
ProGlu: 3.036 ± 0.366
ProPhe: 0.0 ± 0.0
ProGly: 4.25 ± 0.586
ProHis: 1.214 ± 0.48
ProIle: 0.607 ± 0.442
ProLys: 3.643 ± 2.213
ProLeu: 3.036 ± 0.946
ProMet: 1.214 ± 0.884
ProAsn: 2.429 ± 0.854
ProPro: 0.0 ± 0.0
ProGln: 1.214 ± 0.48
ProArg: 0.607 ± 0.492
ProSer: 3.036 ± 0.366
ProThr: 3.036 ± 1.593
ProVal: 3.036 ± 1.246
ProTrp: 0.0 ± 0.0
ProTyr: 1.821 ± 0.225
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.25 ± 1.343
GlnCys: 0.607 ± 0.492
GlnAsp: 2.429 ± 0.737
GlnGlu: 0.607 ± 0.492
GlnPhe: 1.214 ± 0.575
GlnGly: 2.429 ± 1.151
GlnHis: 0.607 ± 0.492
GlnIle: 1.214 ± 0.597
GlnLys: 1.214 ± 1.151
GlnLeu: 2.429 ± 1.193
GlnMet: 1.821 ± 0.82
GlnAsn: 3.643 ± 2.346
GlnPro: 0.0 ± 0.0
GlnGln: 2.429 ± 2.621
GlnArg: 3.643 ± 0.896
GlnSer: 2.429 ± 1.193
GlnThr: 1.821 ± 0.82
GlnVal: 1.821 ± 0.867
GlnTrp: 1.214 ± 0.985
GlnTyr: 1.821 ± 1.716
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.607 ± 0.442
ArgCys: 0.0 ± 0.0
ArgAsp: 3.036 ± 1.219
ArgGlu: 3.643 ± 2.159
ArgPhe: 0.607 ± 0.442
ArgGly: 3.643 ± 0.654
ArgHis: 1.214 ± 0.48
ArgIle: 1.214 ± 0.597
ArgLys: 2.429 ± 0.48
ArgLeu: 2.429 ± 0.401
ArgMet: 1.821 ± 1.032
ArgAsn: 3.036 ± 0.754
ArgPro: 3.643 ± 1.441
ArgGln: 1.214 ± 0.48
ArgArg: 1.821 ± 0.867
ArgSer: 3.643 ± 1.733
ArgThr: 1.214 ± 0.758
ArgVal: 2.429 ± 0.48
ArgTrp: 0.607 ± 0.442
ArgTyr: 3.036 ± 1.207
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.643 ± 1.638
SerCys: 0.607 ± 0.655
SerAsp: 0.607 ± 0.442
SerGlu: 6.072 ± 1.95
SerPhe: 2.429 ± 0.825
SerGly: 6.679 ± 2.766
SerHis: 2.429 ± 0.772
SerIle: 5.464 ± 1.352
SerLys: 5.464 ± 2.15
SerLeu: 9.107 ± 1.198
SerMet: 3.036 ± 1.317
SerAsn: 4.25 ± 0.824
SerPro: 3.036 ± 0.756
SerGln: 4.857 ± 0.372
SerArg: 1.821 ± 0.225
SerSer: 6.072 ± 1.385
SerThr: 5.464 ± 0.676
SerVal: 8.5 ± 1.49
SerTrp: 0.607 ± 0.442
SerTyr: 4.857 ± 2.171
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.429 ± 0.401
ThrCys: 0.0 ± 0.0
ThrAsp: 4.25 ± 1.063
ThrGlu: 2.429 ± 1.325
ThrPhe: 1.821 ± 0.625
ThrGly: 0.607 ± 0.442
ThrHis: 0.0 ± 0.0
ThrIle: 2.429 ± 1.433
ThrLys: 1.821 ± 0.847
ThrLeu: 4.25 ± 0.671
ThrMet: 1.821 ± 1.173
ThrAsn: 3.036 ± 1.362
ThrPro: 4.25 ± 1.054
ThrGln: 2.429 ± 1.804
ThrArg: 2.429 ± 0.961
ThrSer: 4.857 ± 2.849
ThrThr: 4.857 ± 1.416
ThrVal: 4.857 ± 1.562
ThrTrp: 0.0 ± 0.0
ThrTyr: 5.464 ± 2.15
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.643 ± 1.356
ValCys: 1.821 ± 0.998
ValAsp: 5.464 ± 1.874
ValGlu: 1.214 ± 1.31
ValPhe: 3.036 ± 1.593
ValGly: 4.25 ± 1.978
ValHis: 1.821 ± 1.477
ValIle: 3.643 ± 1.561
ValLys: 5.464 ± 1.759
ValLeu: 3.036 ± 0.754
ValMet: 1.214 ± 0.887
ValAsn: 3.036 ± 0.756
ValPro: 2.429 ± 1.174
ValGln: 3.036 ± 1.518
ValArg: 3.643 ± 1.356
ValSer: 5.464 ± 2.033
ValThr: 4.25 ± 0.478
ValVal: 3.036 ± 0.754
ValTrp: 0.0 ± 0.0
ValTyr: 2.429 ± 0.961
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.607 ± 0.655
TrpCys: 0.607 ± 0.492
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.607 ± 0.655
TrpPhe: 0.607 ± 0.492
TrpGly: 0.0 ± 0.0
TrpHis: 0.607 ± 0.442
TrpIle: 0.607 ± 0.442
TrpLys: 0.607 ± 0.492
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.821 ± 1.173
TrpPro: 0.0 ± 0.0
TrpGln: 1.821 ± 1.131
TrpArg: 0.607 ± 0.492
TrpSer: 0.607 ± 0.442
TrpThr: 0.0 ± 0.0
TrpVal: 1.214 ± 0.884
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.643 ± 0.896
TyrCys: 3.036 ± 1.802
TyrAsp: 4.25 ± 1.196
TyrGlu: 2.429 ± 0.48
TyrPhe: 3.643 ± 2.285
TyrGly: 3.643 ± 0.769
TyrHis: 1.214 ± 0.985
TyrIle: 1.214 ± 0.985
TyrLys: 4.857 ± 1.488
TyrLeu: 1.821 ± 0.847
TyrMet: 1.214 ± 0.48
TyrAsn: 4.857 ± 0.69
TyrPro: 1.214 ± 0.884
TyrGln: 1.821 ± 0.781
TyrArg: 4.857 ± 1.189
TyrSer: 6.072 ± 1.698
TyrThr: 4.857 ± 1.075
TyrVal: 3.643 ± 1.034
TyrTrp: 1.214 ± 0.985
TyrTyr: 6.679 ± 3.035
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1648 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
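A protein-level bootstrap of this kind can be sketched as follows. The source states only that the error comes from bootstrapping (×100) at the protein level; everything else here is an assumption made for illustration: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation of those estimates is reported as the error. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein rather than a single residue.
    Using the standard deviation as the error is an assumption.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input, not the real proteome.
toy = ["MKKLAV", "AVKK", "MAVLKK", "KLAVAV"]
print(dipeptide_freq(toy, "AV"), bootstrap_error(toy, "AV"))
```

With only 4 proteins, each resample often omits one or more of them entirely, which is one plausible reason why many errors in the table are comparable in size to the values themselves.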

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski