Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_612

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
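The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names, the use of one-letter codes (with X standing in for Xaa), the choice to count overlapping pairs within each protein only, and the normalisation by the total number of pairs are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption of this sketch)
EXPECTED_PERMILLE = 1000.0 / 400   # 2.5 per mille if all 400 pairs were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted inside each protein (never across protein
    boundaries) and normalised by the total number of pairs observed.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]  # all 441 pairs
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}


def representation(freq_permille):
    """Classify a frequency against the uniform 2.5 per mille expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be shown in red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be shown in blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["ACAC", "AAB"])  # toy sequences, not real data
    for pair in ("AC", "CA", "AA", "AX"):
        print(pair, freqs[pair], representation(freqs[pair]))
```

Comparing against a flat 2.5‰ ignores the fact that the single amino acids themselves are not equally frequent; the sketch simply follows the uniform expectation stated in the text.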

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Each cell: frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.315 ± 3.026 | 0.759 ± 0.701 | 4.556 ± 1.41 | 6.834 ± 2.598 | 2.278 ± 2.559 | 5.315 ± 1.426 | 1.519 ± 0.791 | 3.797 ± 1.997 | 1.519 ± 1.514 | 4.556 ± 1.825 | 0.759 ± 0.523 | 3.797 ± 1.036 | 3.797 ± 1.722 | 3.037 ± 1.235 | 2.278 ± 1.192 | 2.278 ± 1.192 | 4.556 ± 1.41 | 1.519 ± 1.205 | 1.519 ± 1.046 | 3.037 ± 1.411 | 0.0 ± 0.0
Cys | 1.519 ± 1.402 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.519 ± 1.514 | 0.759 ± 0.523 | 0.759 ± 0.701 | 1.519 ± 0.667 | 1.519 ± 1.402 | 0.759 ± 0.523 | 0.759 ± 0.523 | 0.759 ± 0.523 | 0.0 ± 0.0 | 0.759 ± 0.701 | 0.0 ± 0.0 | 1.519 ± 0.667 | 0.759 ± 0.701 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.278 ± 1.22 | 0.759 ± 0.701 | 2.278 ± 1.851 | 4.556 ± 1.74 | 2.278 ± 0.919 | 2.278 ± 1.74 | 0.0 ± 0.0 | 2.278 ± 1.317 | 4.556 ± 2.529 | 1.519 ± 1.796 | 0.0 ± 0.0 | 4.556 ± 1.943 | 0.759 ± 0.701 | 1.519 ± 0.667 | 2.278 ± 0.956 | 3.797 ± 1.874 | 1.519 ± 0.618 | 4.556 ± 1.771 | 0.759 ± 1.36 | 6.074 ± 1.082 | 0.0 ± 0.0
Glu | 3.797 ± 2.508 | 0.0 ± 0.0 | 1.519 ± 1.073 | 6.074 ± 3.053 | 5.315 ± 1.076 | 4.556 ± 1.838 | 0.759 ± 0.523 | 8.352 ± 2.537 | 0.759 ± 0.523 | 6.834 ± 1.233 | 2.278 ± 2.045 | 3.797 ± 1.476 | 3.037 ± 1.583 | 3.797 ± 1.135 | 3.037 ± 0.621 | 2.278 ± 1.575 | 5.315 ± 1.923 | 3.037 ± 1.362 | 0.759 ± 0.682 | 4.556 ± 0.986 | 0.0 ± 0.0
Phe | 2.278 ± 1.349 | 0.759 ± 0.523 | 2.278 ± 0.919 | 2.278 ± 1.192 | 0.759 ± 0.523 | 3.797 ± 2.614 | 0.0 ± 0.0 | 3.037 ± 1.669 | 2.278 ± 0.971 | 3.037 ± 1.84 | 0.0 ± 0.0 | 6.834 ± 2.132 | 0.759 ± 0.898 | 1.519 ± 0.805 | 3.037 ± 0.835 | 2.278 ± 0.919 | 3.037 ± 2.092 | 4.556 ± 1.13 | 0.0 ± 0.0 | 1.519 ± 0.667 | 0.0 ± 0.0
Gly | 3.037 ± 1.18 | 0.0 ± 0.0 | 3.037 ± 1.333 | 6.074 ± 1.42 | 1.519 ± 0.667 | 3.797 ± 1.893 | 2.278 ± 0.971 | 7.593 ± 1.583 | 0.759 ± 1.36 | 3.797 ± 1.623 | 0.759 ± 0.701 | 3.037 ± 1.235 | 0.0 ± 0.0 | 3.037 ± 2.092 | 3.797 ± 0.87 | 5.315 ± 0.921 | 5.315 ± 2.023 | 5.315 ± 2.372 | 0.0 ± 0.0 | 3.797 ± 0.87 | 0.0 ± 0.0
His | 0.759 ± 0.701 | 0.759 ± 0.523 | 0.759 ± 0.701 | 0.0 ± 0.0 | 0.759 ± 0.523 | 1.519 ± 0.667 | 0.0 ± 0.0 | 0.759 ± 0.523 | 0.0 ± 0.0 | 3.037 ± 1.749 | 0.759 ± 0.701 | 0.0 ± 0.0 | 0.759 ± 0.898 | 0.0 ± 0.0 | 1.519 ± 0.618 | 2.278 ± 0.807 | 0.759 ± 0.523 | 2.278 ± 1.569 | 0.0 ± 0.0 | 4.556 ± 1.555 | 0.0 ± 0.0
Ile | 3.037 ± 1.18 | 0.0 ± 0.0 | 3.037 ± 1.043 | 6.834 ± 1.565 | 3.037 ± 1.333 | 3.797 ± 1.176 | 0.759 ± 0.523 | 1.519 ± 1.249 | 3.037 ± 1.669 | 6.074 ± 1.42 | 1.519 ± 0.963 | 6.834 ± 1.107 | 4.556 ± 1.165 | 1.519 ± 0.805 | 5.315 ± 2.018 | 6.834 ± 3.013 | 4.556 ± 1.41 | 0.759 ± 0.898 | 0.0 ± 0.0 | 4.556 ± 0.743 | 0.0 ± 0.0
Lys | 3.037 ± 1.18 | 0.759 ± 0.523 | 3.797 ± 1.91 | 5.315 ± 2.171 | 3.797 ± 1.476 | 4.556 ± 1.53 | 3.037 ± 0.986 | 3.037 ± 1.333 | 2.278 ± 0.493 | 3.797 ± 0.954 | 1.519 ± 1.176 | 4.556 ± 3.269 | 1.519 ± 0.667 | 0.759 ± 0.523 | 3.037 ± 2.475 | 3.037 ± 0.835 | 3.037 ± 1.142 | 1.519 ± 0.667 | 0.0 ± 0.0 | 0.759 ± 0.701 | 0.0 ± 0.0
Leu | 6.834 ± 1.785 | 1.519 ± 1.046 | 2.278 ± 0.493 | 3.037 ± 1.272 | 4.556 ± 1.286 | 4.556 ± 0.743 | 1.519 ± 0.791 | 3.797 ± 1.582 | 4.556 ± 1.853 | 6.834 ± 2.324 | 3.797 ± 1.211 | 6.834 ± 1.951 | 6.834 ± 2.914 | 3.797 ± 1.535 | 4.556 ± 0.986 | 3.797 ± 1.135 | 5.315 ± 2.678 | 2.278 ± 1.507 | 0.759 ± 0.682 | 1.519 ± 0.618 | 0.0 ± 0.0
Met | 2.278 ± 0.919 | 0.0 ± 0.0 | 0.759 ± 0.701 | 1.519 ± 0.618 | 0.0 ± 0.0 | 2.278 ± 0.996 | 1.519 ± 0.618 | 0.759 ± 0.701 | 0.759 ± 1.36 | 0.759 ± 0.701 | 1.519 ± 0.618 | 2.278 ± 0.956 | 2.278 ± 0.919 | 0.0 ± 0.0 | 0.759 ± 0.701 | 2.278 ± 0.971 | 2.278 ± 1.22 | 0.759 ± 0.523 | 0.759 ± 0.523 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 5.315 ± 2.399 | 0.0 ± 0.0 | 5.315 ± 0.781 | 4.556 ± 1.408 | 4.556 ± 1.043 | 6.074 ± 1.083 | 1.519 ± 0.791 | 8.352 ± 1.296 | 3.037 ± 1.411 | 8.352 ± 2.85 | 1.519 ± 1.402 | 2.278 ± 1.221 | 4.556 ± 1.827 | 3.037 ± 1.235 | 3.797 ± 2.619 | 6.074 ± 3.154 | 2.278 ± 1.221 | 3.037 ± 1.545 | 0.0 ± 0.0 | 5.315 ± 1.99 | 0.0 ± 0.0
Pro | 2.278 ± 2.047 | 1.519 ± 1.402 | 3.797 ± 1.55 | 4.556 ± 1.912 | 3.037 ± 1.411 | 2.278 ± 0.971 | 0.759 ± 0.701 | 4.556 ± 1.52 | 2.278 ± 1.569 | 3.037 ± 1.411 | 0.759 ± 0.523 | 3.797 ± 1.535 | 0.759 ± 0.682 | 3.037 ± 0.835 | 2.278 ± 1.264 | 6.074 ± 2.239 | 0.759 ± 1.36 | 0.759 ± 0.523 | 0.0 ± 0.0 | 3.797 ± 1.847 | 0.0 ± 0.0
Gln | 0.759 ± 0.523 | 0.0 ± 0.0 | 0.759 ± 0.898 | 2.278 ± 1.569 | 1.519 ± 1.046 | 3.797 ± 1.893 | 1.519 ± 0.667 | 2.278 ± 0.807 | 3.797 ± 1.861 | 2.278 ± 0.493 | 1.519 ± 0.618 | 3.797 ± 1.335 | 2.278 ± 0.493 | 0.0 ± 0.0 | 3.037 ± 1.933 | 3.797 ± 0.87 | 1.519 ± 1.046 | 2.278 ± 0.919 | 1.519 ± 1.402 | 1.519 ± 1.462 | 0.0 ± 0.0
Arg | 5.315 ± 0.921 | 1.519 ± 1.402 | 3.037 ± 1.043 | 2.278 ± 1.22 | 0.759 ± 0.523 | 1.519 ± 0.791 | 0.0 ± 0.0 | 2.278 ± 1.22 | 6.074 ± 2.895 | 3.037 ± 1.043 | 1.519 ± 1.046 | 5.315 ± 2.072 | 5.315 ± 1.481 | 2.278 ± 2.104 | 1.519 ± 1.046 | 0.759 ± 0.523 | 1.519 ± 1.205 | 1.519 ± 0.667 | 0.759 ± 0.682 | 3.797 ± 2.619 | 0.0 ± 0.0
Ser | 4.556 ± 1.484 | 2.278 ± 1.297 | 0.759 ± 0.682 | 4.556 ± 1.365 | 0.759 ± 0.523 | 3.037 ± 1.841 | 0.0 ± 0.0 | 7.593 ± 1.588 | 5.315 ± 0.968 | 4.556 ± 1.043 | 1.519 ± 0.791 | 5.315 ± 0.864 | 4.556 ± 2.621 | 4.556 ± 1.435 | 3.037 ± 1.985 | 3.037 ± 2.409 | 5.315 ± 2.344 | 3.797 ± 1.476 | 0.0 ± 0.0 | 1.519 ± 0.667 | 0.0 ± 0.0
Thr | 3.797 ± 1.357 | 0.0 ± 0.0 | 3.797 ± 1.428 | 2.278 ± 1.607 | 1.519 ± 0.805 | 3.797 ± 1.771 | 0.0 ± 0.0 | 3.037 ± 0.986 | 3.037 ± 3.135 | 9.871 ± 1.123 | 0.0 ± 0.0 | 3.037 ± 1.641 | 3.037 ± 2.092 | 2.278 ± 0.971 | 3.037 ± 1.411 | 4.556 ± 1.408 | 8.352 ± 7.521 | 3.037 ± 2.51 | 0.0 ± 0.0 | 2.278 ± 1.264 | 0.0 ± 0.0
Val | 3.037 ± 1.362 | 0.759 ± 0.701 | 2.278 ± 0.919 | 4.556 ± 1.827 | 3.037 ± 0.835 | 0.759 ± 0.523 | 0.759 ± 0.523 | 1.519 ± 0.618 | 3.797 ± 0.563 | 1.519 ± 1.249 | 1.519 ± 1.046 | 5.315 ± 0.864 | 2.278 ± 0.971 | 2.278 ± 0.996 | 1.519 ± 0.667 | 4.556 ± 3.914 | 0.759 ± 0.523 | 2.278 ± 1.349 | 0.0 ± 0.0 | 3.037 ± 1.381 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.759 ± 0.682 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.519 ± 0.667 | 0.0 ± 0.0 | 1.519 ± 0.618 | 0.759 ± 0.682 | 0.0 ± 0.0 | 0.759 ± 1.36 | 0.759 ± 0.523 | 0.759 ± 0.701 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.519 ± 0.667 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 3.797 ± 1.216 | 1.519 ± 1.402 | 4.556 ± 2.605 | 2.278 ± 0.971 | 4.556 ± 2.0 | 3.037 ± 0.986 | 3.037 ± 2.189 | 2.278 ± 0.971 | 2.278 ± 1.297 | 3.797 ± 1.036 | 0.759 ± 0.523 | 6.074 ± 1.273 | 1.519 ± 0.667 | 3.037 ± 0.621 | 1.519 ± 0.618 | 3.037 ± 0.793 | 1.519 ± 0.667 | 1.519 ± 1.046 | 1.519 ± 0.667 | 3.037 ± 1.333 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 5 proteins (1318 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
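A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the function names, the fixed seed, the pair-counting rule, and the use of the population standard deviation over the replicates as the reported error are all choices made here for the example.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Dipeptide frequencies in per mille; pairs counted within each protein."""
    counts = Counter(a + b for seq in proteins for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Each round draws len(proteins) proteins with replacement, recomputes the
    frequencies, and the spread over all rounds is returned. The spread is the
    population standard deviation (an assumption of this sketch).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(resampled))
    # A dipeptide missing from a replicate has frequency 0 in that replicate.
    seen = set().union(*replicates)
    return {
        pair: statistics.pstdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in seen
    }


if __name__ == "__main__":
    toy = ["MKTAYIAK", "MKLLTTAA", "MTTTTLEK"]  # made-up sequences, not the real proteome
    errors = bootstrap_error(toy)
    for pair in sorted(errors)[:5]:
        print(pair, round(errors[pair], 3))
```

Because whole proteins are resampled, a dipeptide concentrated in a single protein can vary strongly between replicates, which is one way a large error relative to the frequency can arise in a proteome of only 5 proteins.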

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski