Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_143

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.
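The calculation described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the code used by the database: the function names `dipeptide_permille` and `classify` are hypothetical, pairs are counted as overlapping windows within each protein (never across two proteins), and any residue outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

# Uniform expectation: each of the 20 * 20 = 400 dipeptides equally likely.
EXPECTED_PERMILLE = 1000 / (len(STANDARD) ** 2)  # 2.5 per mille = 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Non-standard or unknown residues are collapsed into X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be marked red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be marked blue
    return "expected"


if __name__ == "__main__":
    toy = ["MKKA", "AAK"]  # made-up sequences, not from this proteome
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With this convention a table value such as 8.802‰ for AlaAla is classified as overrepresented and 0.677‰ for AlaCys as underrepresented, since the threshold in per mille units is 2.5.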

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.802 ± 4.932
AlaCys: 0.677 ± 0.535
AlaAsp: 4.739 ± 1.996
AlaGlu: 5.416 ± 2.141
AlaPhe: 1.354 ± 0.529
AlaGly: 6.093 ± 0.874
AlaHis: 0.677 ± 0.601
AlaIle: 2.031 ± 1.134
AlaLys: 2.708 ± 0.547
AlaLeu: 6.093 ± 2.298
AlaMet: 2.708 ± 1.6
AlaAsn: 7.448 ± 3.58
AlaPro: 3.385 ± 1.522
AlaGln: 6.093 ± 2.823
AlaArg: 4.062 ± 0.847
AlaSer: 10.156 ± 3.295
AlaThr: 3.385 ± 1.63
AlaVal: 4.739 ± 0.758
AlaTrp: 1.354 ± 0.945
AlaTyr: 4.062 ± 1.111
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.677 ± 0.72
CysGlu: 0.0 ± 0.0
CysPhe: 0.677 ± 0.725
CysGly: 0.677 ± 0.535
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.354 ± 1.071
CysMet: 0.0 ± 0.0
CysAsn: 0.677 ± 0.472
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.677 ± 0.535
CysSer: 1.354 ± 1.071
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.739 ± 1.648
AspCys: 0.0 ± 0.0
AspAsp: 4.062 ± 1.45
AspGlu: 4.062 ± 1.318
AspPhe: 3.385 ± 1.663
AspGly: 2.031 ± 0.644
AspHis: 2.031 ± 0.904
AspIle: 3.385 ± 1.206
AspLys: 2.031 ± 0.965
AspLeu: 5.416 ± 1.472
AspMet: 2.031 ± 1.21
AspAsn: 4.062 ± 0.929
AspPro: 4.062 ± 1.316
AspGln: 2.708 ± 1.89
AspArg: 3.385 ± 1.348
AspSer: 2.708 ± 0.79
AspThr: 3.385 ± 1.094
AspVal: 4.739 ± 2.245
AspTrp: 0.677 ± 0.601
AspTyr: 4.739 ± 1.685
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.093 ± 2.038
GluCys: 0.677 ± 0.72
GluAsp: 2.031 ± 0.904
GluGlu: 2.031 ± 0.946
GluPhe: 1.354 ± 1.071
GluGly: 2.031 ± 1.687
GluHis: 1.354 ± 0.483
GluIle: 2.708 ± 1.345
GluLys: 4.062 ± 1.727
GluLeu: 1.354 ± 0.733
GluMet: 2.031 ± 0.803
GluAsn: 2.031 ± 0.904
GluPro: 0.677 ± 0.535
GluGln: 1.354 ± 0.529
GluArg: 3.385 ± 0.883
GluSer: 3.385 ± 1.832
GluThr: 1.354 ± 1.439
GluVal: 4.739 ± 0.83
GluTrp: 1.354 ± 0.787
GluTyr: 3.385 ± 0.897
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.385 ± 0.883
PheCys: 1.354 ± 1.071
PheAsp: 4.739 ± 1.531
PheGlu: 1.354 ± 0.483
PhePhe: 3.385 ± 0.883
PheGly: 2.031 ± 1.417
PheHis: 0.0 ± 0.0
PheIle: 0.677 ± 0.725
PheLys: 1.354 ± 0.945
PheLeu: 1.354 ± 0.529
PheMet: 1.354 ± 0.483
PheAsn: 4.739 ± 1.504
PhePro: 0.677 ± 0.472
PheGln: 0.677 ± 0.472
PheArg: 1.354 ± 0.945
PheSer: 3.385 ± 1.488
PheThr: 5.416 ± 1.943
PheVal: 7.448 ± 2.163
PheTrp: 0.677 ± 0.472
PheTyr: 2.031 ± 0.904
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.354 ± 1.202
GlyCys: 0.677 ± 0.535
GlyAsp: 4.739 ± 1.891
GlyGlu: 3.385 ± 1.224
GlyPhe: 2.031 ± 0.792
GlyGly: 3.385 ± 0.897
GlyHis: 0.677 ± 0.72
GlyIle: 2.708 ± 1.89
GlyLys: 2.708 ± 1.675
GlyLeu: 5.416 ± 1.362
GlyMet: 0.677 ± 0.472
GlyAsn: 5.416 ± 2.065
GlyPro: 0.677 ± 0.472
GlyGln: 1.354 ± 1.202
GlyArg: 2.031 ± 0.424
GlySer: 6.093 ± 1.989
GlyThr: 6.093 ± 2.283
GlyVal: 4.062 ± 0.929
GlyTrp: 0.677 ± 0.472
GlyTyr: 2.708 ± 0.518
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.031 ± 0.644
HisCys: 0.0 ± 0.0
HisAsp: 1.354 ± 0.483
HisGlu: 0.677 ± 0.535
HisPhe: 1.354 ± 0.483
HisGly: 1.354 ± 0.483
HisHis: 0.677 ± 0.472
HisIle: 0.677 ± 0.472
HisLys: 0.677 ± 0.472
HisLeu: 3.385 ± 1.925
HisMet: 1.354 ± 0.708
HisAsn: 0.0 ± 0.0
HisPro: 1.354 ± 0.929
HisGln: 0.677 ± 0.601
HisArg: 0.677 ± 0.725
HisSer: 2.031 ± 0.792
HisThr: 0.677 ± 0.535
HisVal: 0.677 ± 0.535
HisTrp: 0.677 ± 0.472
HisTyr: 1.354 ± 0.483
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.385 ± 1.554
IleCys: 0.677 ± 0.472
IleAsp: 2.708 ± 1.211
IleGlu: 2.708 ± 0.754
IlePhe: 3.385 ± 0.883
IleGly: 4.739 ± 1.618
IleHis: 1.354 ± 0.483
IleIle: 1.354 ± 0.483
IleLys: 2.708 ± 0.808
IleLeu: 1.354 ± 1.202
IleMet: 1.354 ± 1.071
IleAsn: 4.062 ± 1.887
IlePro: 4.062 ± 2.118
IleGln: 4.062 ± 1.404
IleArg: 0.677 ± 0.472
IleSer: 5.416 ± 2.026
IleThr: 0.677 ± 0.472
IleVal: 1.354 ± 0.733
IleTrp: 0.677 ± 0.535
IleTyr: 4.062 ± 1.127
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.677 ± 0.601
LysCys: 0.677 ± 0.535
LysAsp: 4.062 ± 1.274
LysGlu: 3.385 ± 1.421
LysPhe: 2.708 ± 1.682
LysGly: 2.031 ± 1.417
LysHis: 0.677 ± 0.535
LysIle: 3.385 ± 0.984
LysLys: 4.739 ± 2.289
LysLeu: 2.708 ± 0.949
LysMet: 1.354 ± 1.093
LysAsn: 3.385 ± 0.682
LysPro: 1.354 ± 0.483
LysGln: 1.354 ± 1.071
LysArg: 2.031 ± 1.471
LysSer: 3.385 ± 1.554
LysThr: 4.739 ± 1.369
LysVal: 2.708 ± 0.967
LysTrp: 0.677 ± 0.601
LysTyr: 2.031 ± 0.644
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.416 ± 1.641
LeuCys: 0.0 ± 0.0
LeuAsp: 4.739 ± 1.369
LeuGlu: 3.385 ± 1.832
LeuPhe: 2.708 ± 0.667
LeuGly: 5.416 ± 1.796
LeuHis: 0.677 ± 0.472
LeuIle: 4.739 ± 1.474
LeuLys: 1.354 ± 0.945
LeuLeu: 2.708 ± 1.277
LeuMet: 3.385 ± 1.276
LeuAsn: 2.708 ± 1.6
LeuPro: 6.093 ± 1.746
LeuGln: 2.708 ± 0.972
LeuArg: 3.385 ± 1.005
LeuSer: 8.802 ± 2.674
LeuThr: 2.708 ± 0.547
LeuVal: 4.739 ± 1.213
LeuTrp: 1.354 ± 1.45
LeuTyr: 4.739 ± 0.758
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.708 ± 0.921
MetCys: 0.0 ± 0.0
MetAsp: 2.708 ± 1.102
MetGlu: 0.677 ± 0.601
MetPhe: 0.677 ± 0.601
MetGly: 0.677 ± 0.472
MetHis: 0.677 ± 0.535
MetIle: 0.0 ± 0.0
MetLys: 1.354 ± 1.071
MetLeu: 3.385 ± 0.829
MetMet: 0.677 ± 0.601
MetAsn: 0.0 ± 0.0
MetPro: 2.708 ± 0.79
MetGln: 0.677 ± 0.472
MetArg: 0.677 ± 0.472
MetSer: 5.416 ± 2.461
MetThr: 0.677 ± 0.72
MetVal: 0.677 ± 0.601
MetTrp: 0.677 ± 0.472
MetTyr: 1.354 ± 0.733
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 8.802 ± 3.403
AsnCys: 0.0 ± 0.0
AsnAsp: 1.354 ± 0.787
AsnGlu: 2.708 ± 0.754
AsnPhe: 2.031 ± 0.803
AsnGly: 2.708 ± 0.921
AsnHis: 1.354 ± 0.483
AsnIle: 3.385 ± 1.224
AsnLys: 2.708 ± 0.79
AsnLeu: 4.739 ± 1.203
AsnMet: 2.031 ± 1.803
AsnAsn: 1.354 ± 1.202
AsnPro: 6.093 ± 1.019
AsnGln: 0.677 ± 0.601
AsnArg: 2.708 ± 1.211
AsnSer: 8.125 ± 3.354
AsnThr: 4.739 ± 3.377
AsnVal: 2.708 ± 0.808
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.385 ± 1.276
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.093 ± 2.116
ProCys: 0.677 ± 0.535
ProAsp: 3.385 ± 1.224
ProGlu: 2.708 ± 0.967
ProPhe: 4.062 ± 2.118
ProGly: 1.354 ± 0.945
ProHis: 1.354 ± 0.483
ProIle: 2.708 ± 0.518
ProLys: 3.385 ± 1.665
ProLeu: 2.708 ± 0.754
ProMet: 0.0 ± 0.0
ProAsn: 4.062 ± 0.847
ProPro: 0.0 ± 0.0
ProGln: 3.385 ± 1.826
ProArg: 0.0 ± 0.0
ProSer: 2.708 ± 1.211
ProThr: 2.708 ± 0.79
ProVal: 4.739 ± 0.728
ProTrp: 0.0 ± 0.0
ProTyr: 0.677 ± 0.535
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.031 ± 0.644
GlnCys: 0.677 ± 0.725
GlnAsp: 2.031 ± 1.011
GlnGlu: 2.031 ± 1.029
GlnPhe: 0.677 ± 0.472
GlnGly: 2.708 ± 1.89
GlnHis: 2.031 ± 0.825
GlnIle: 1.354 ± 0.945
GlnLys: 2.708 ± 0.518
GlnLeu: 3.385 ± 0.897
GlnMet: 2.031 ± 1.357
GlnAsn: 3.385 ± 0.636
GlnPro: 0.0 ± 0.0
GlnGln: 0.677 ± 0.472
GlnArg: 1.354 ± 0.529
GlnSer: 4.062 ± 1.946
GlnThr: 1.354 ± 0.945
GlnVal: 1.354 ± 0.529
GlnTrp: 0.677 ± 0.535
GlnTyr: 0.677 ± 0.601
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.739 ± 0.82
ArgCys: 0.0 ± 0.0
ArgAsp: 2.031 ± 0.803
ArgGlu: 3.385 ± 1.464
ArgPhe: 3.385 ± 0.883
ArgGly: 0.677 ± 0.472
ArgHis: 0.677 ± 0.535
ArgIle: 2.031 ± 1.212
ArgLys: 2.708 ± 1.146
ArgLeu: 5.416 ± 1.731
ArgMet: 1.354 ± 0.945
ArgAsn: 1.354 ± 0.733
ArgPro: 2.031 ± 0.644
ArgGln: 0.0 ± 0.0
ArgArg: 2.031 ± 1.134
ArgSer: 4.062 ± 1.45
ArgThr: 2.031 ± 0.965
ArgVal: 2.031 ± 0.965
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.062 ± 0.851
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.156 ± 4.355
SerCys: 0.0 ± 0.0
SerAsp: 5.416 ± 1.823
SerGlu: 2.708 ± 0.667
SerPhe: 0.677 ± 0.472
SerGly: 8.125 ± 2.463
SerHis: 3.385 ± 1.233
SerIle: 7.448 ± 2.065
SerLys: 4.739 ± 1.956
SerLeu: 6.77 ± 1.502
SerMet: 2.031 ± 1.228
SerAsn: 8.125 ± 1.89
SerPro: 6.093 ± 1.284
SerGln: 2.708 ± 0.518
SerArg: 8.125 ± 1.697
SerSer: 8.802 ± 2.675
SerThr: 4.739 ± 1.242
SerVal: 7.448 ± 2.105
SerTrp: 1.354 ± 0.483
SerTyr: 4.739 ± 2.034
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.739 ± 1.878
ThrCys: 0.0 ± 0.0
ThrAsp: 2.031 ± 0.644
ThrGlu: 2.031 ± 1.417
ThrPhe: 4.062 ± 1.488
ThrGly: 3.385 ± 0.386
ThrHis: 1.354 ± 0.945
ThrIle: 1.354 ± 0.945
ThrLys: 0.677 ± 0.472
ThrLeu: 4.739 ± 1.508
ThrMet: 0.0 ± 0.0
ThrAsn: 3.385 ± 1.63
ThrPro: 3.385 ± 0.912
ThrGln: 0.0 ± 0.0
ThrArg: 2.708 ± 1.207
ThrSer: 8.802 ± 1.136
ThrThr: 3.385 ± 0.636
ThrVal: 2.708 ± 1.465
ThrTrp: 0.677 ± 0.601
ThrTyr: 6.093 ± 2.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.708 ± 0.754
ValCys: 0.0 ± 0.0
ValAsp: 6.77 ± 0.789
ValGlu: 3.385 ± 1.663
ValPhe: 2.708 ± 1.49
ValGly: 4.062 ± 0.847
ValHis: 0.0 ± 0.0
ValIle: 5.416 ± 1.493
ValLys: 4.739 ± 1.545
ValLeu: 3.385 ± 1.012
ValMet: 0.677 ± 0.697
ValAsn: 3.385 ± 1.276
ValPro: 2.708 ± 0.967
ValGln: 2.031 ± 0.792
ValArg: 2.031 ± 0.792
ValSer: 7.448 ± 1.351
ValThr: 3.385 ± 1.4
ValVal: 2.708 ± 1.615
ValTrp: 0.677 ± 0.472
ValTyr: 2.708 ± 0.547
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.354 ± 0.529
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.354 ± 0.483
TrpGly: 0.0 ± 0.0
TrpHis: 0.677 ± 0.472
TrpIle: 2.031 ± 0.803
TrpLys: 0.677 ± 0.472
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.677 ± 0.472
TrpPro: 0.0 ± 0.0
TrpGln: 1.354 ± 0.84
TrpArg: 0.0 ± 0.0
TrpSer: 2.031 ± 0.825
TrpThr: 2.031 ± 0.825
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 6.77 ± 2.794
TyrCys: 0.0 ± 0.0
TyrAsp: 4.739 ± 1.547
TyrGlu: 1.354 ± 0.841
TyrPhe: 5.416 ± 1.934
TyrGly: 3.385 ± 1.496
TyrHis: 2.031 ± 0.904
TyrIle: 3.385 ± 1.224
TyrLys: 2.031 ± 0.904
TyrLeu: 6.093 ± 0.563
TyrMet: 0.677 ± 0.601
TyrAsn: 1.354 ± 1.202
TyrPro: 1.354 ± 0.483
TyrGln: 2.708 ± 1.345
TyrArg: 2.708 ± 0.949
TyrSer: 5.416 ± 1.633
TyrThr: 2.031 ± 0.825
TyrVal: 1.354 ± 0.529
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.354 ± 0.945
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1478 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
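As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of the resulting dipeptide frequency. It is an assumption-laden example rather than the database's actual procedure: the function names `dipeptide_freq` and `bootstrap_error` are hypothetical, the fixed seed is only there for reproducibility, and the use of the sample standard deviation as the error measure is an assumption.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, pooled over all proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):  # overlapping pairs within a protein
            total += 1
            if a + b == dipeptide:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed: an assumption, for repeatability
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKAAS", "AASK", "MSAA", "KKSA", "AAAK"]  # made-up sequences
    print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 5 proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with the relatively large errors shown in the table.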

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski