Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_632

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
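As an illustration, dipeptide frequencies can be counted with a sliding window of length two over each protein sequence. The sketch below is not the database's own code: the function name `dipeptide_permille` and the toy sequences are hypothetical, and it assumes that dipeptides are counted within each protein only (never across protein boundaries) and that any residue outside the 20 standard ones is mapped to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (not taken from the proteome described on this page).
freqs = dipeptide_permille(["MKKLA", "AAKB"])
for dipeptide, value in sorted(freqs.items()):
    print(f"{dipeptide}: {value:.3f}")
```

Frequencies are returned in per mille so that they are directly comparable with the values in the table.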

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
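The comparison against the uniform expectation and the per mille conversion can be sketched as follows. The page does not state the exact rule used for colouring (for example, whether the error estimates are taken into account), so the simple threshold in `classify` is an assumption, and both function names are hypothetical. The three example values are copied from the table below.

```python
# Uniform expectation for 400 equally likely dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


def classify(permille, expected=UNIFORM_PERMILLE):
    """Assumed colouring rule: above expectation is red, below is blue."""
    if permille > expected:
        return "red"
    if permille < expected:
        return "blue"
    return "neutral"


# Values taken from the table (AlaAla, AlaCys, SerIle).
examples = {"AlaAla": 8.567, "AlaCys": 0.779, "SerIle": 11.682}
for name, value in examples.items():
    print(name, to_fraction(value), classify(value))
```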

For more information, see the sequence space article on Wikipedia.

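Rows give the first residue of the dipeptide; columns give the second residue, in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry below shows the frequency in ‰, followed by the dipeptide name and the frequency ± its estimated error.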
Ala
8.567AlaAla: 8.567 ± 4.4
0.779AlaCys: 0.779 ± 0.5
5.452AlaAsp: 5.452 ± 1.801
1.558AlaGlu: 1.558 ± 0.839
1.558AlaPhe: 1.558 ± 0.839
5.452AlaGly: 5.452 ± 1.801
0.779AlaHis: 0.779 ± 0.824
1.558AlaIle: 1.558 ± 1.08
2.336AlaLys: 2.336 ± 2.0
3.115AlaLeu: 3.115 ± 1.306
0.779AlaMet: 0.779 ± 0.824
5.452AlaAsn: 5.452 ± 1.494
0.779AlaPro: 0.779 ± 0.5
3.115AlaGln: 3.115 ± 3.296
3.894AlaArg: 3.894 ± 1.63
3.115AlaSer: 3.115 ± 1.75
2.336AlaThr: 2.336 ± 1.484
3.894AlaVal: 3.894 ± 1.941
0.779AlaTrp: 0.779 ± 0.5
3.894AlaTyr: 3.894 ± 1.298
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.779CysCys: 0.779 ± 0.712
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
2.336CysPhe: 2.336 ± 2.135
1.558CysGly: 1.558 ± 1.423
0.0CysHis: 0.0 ± 0.0
0.779CysIle: 0.779 ± 1.013
0.779CysLys: 0.779 ± 0.5
3.115CysLeu: 3.115 ± 1.088
1.558CysMet: 1.558 ± 0.544
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.558CysArg: 1.558 ± 1.423
2.336CysSer: 2.336 ± 1.771
0.779CysThr: 0.779 ± 0.712
0.779CysVal: 0.779 ± 0.5
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.115AspAla: 3.115 ± 1.306
0.779AspCys: 0.779 ± 0.712
5.452AspAsp: 5.452 ± 3.999
6.231AspGlu: 6.231 ± 1.105
3.894AspPhe: 3.894 ± 2.05
1.558AspGly: 1.558 ± 0.839
2.336AspHis: 2.336 ± 1.499
2.336AspIle: 2.336 ± 1.499
3.115AspLys: 3.115 ± 1.034
7.009AspLeu: 7.009 ± 0.45
0.0AspMet: 0.0 ± 0.0
4.673AspAsn: 4.673 ± 1.945
1.558AspPro: 1.558 ± 0.839
0.0AspGln: 0.0 ± 0.0
1.558AspArg: 1.558 ± 0.999
6.231AspSer: 6.231 ± 3.327
3.894AspThr: 3.894 ± 1.298
6.231AspVal: 6.231 ± 2.006
2.336AspTrp: 2.336 ± 0.764
9.346AspTyr: 9.346 ± 1.576
0.0AspXaa: 0.0 ± 0.0
Glu
3.894GluAla: 3.894 ± 0.869
0.0GluCys: 0.0 ± 0.0
2.336GluAsp: 2.336 ± 1.499
3.115GluGlu: 3.115 ± 1.36
3.115GluPhe: 3.115 ± 2.186
3.115GluGly: 3.115 ± 1.071
1.558GluHis: 1.558 ± 0.999
4.673GluIle: 4.673 ± 1.741
0.0GluLys: 0.0 ± 0.0
1.558GluLeu: 1.558 ± 0.544
2.336GluMet: 2.336 ± 0.728
3.894GluAsn: 3.894 ± 0.999
2.336GluPro: 2.336 ± 1.57
2.336GluGln: 2.336 ± 0.956
2.336GluArg: 2.336 ± 1.013
3.115GluSer: 3.115 ± 1.071
1.558GluThr: 1.558 ± 1.648
1.558GluVal: 1.558 ± 0.74
0.0GluTrp: 0.0 ± 0.0
2.336GluTyr: 2.336 ± 0.764
0.0GluXaa: 0.0 ± 0.0
Phe
3.115PheAla: 3.115 ± 1.36
0.0PheCys: 0.0 ± 0.0
6.231PheAsp: 6.231 ± 2.303
0.0PheGlu: 0.0 ± 0.0
7.009PhePhe: 7.009 ± 1.741
5.452PheGly: 5.452 ± 1.74
1.558PheHis: 1.558 ± 1.074
4.673PheIle: 4.673 ± 3.257
1.558PheLys: 1.558 ± 1.08
3.894PheLeu: 3.894 ± 0.999
1.558PheMet: 1.558 ± 0.74
6.231PheAsn: 6.231 ± 1.573
1.558PhePro: 1.558 ± 1.255
0.779PheGln: 0.779 ± 0.938
3.115PheArg: 3.115 ± 1.088
5.452PheSer: 5.452 ± 1.54
5.452PheThr: 5.452 ± 2.19
3.894PheVal: 3.894 ± 1.229
0.0PheTrp: 0.0 ± 0.0
5.452PheTyr: 5.452 ± 1.528
0.0PheXaa: 0.0 ± 0.0
Gly
1.558GlyAla: 1.558 ± 0.74
1.558GlyCys: 1.558 ± 1.423
7.788GlyAsp: 7.788 ± 2.801
6.231GlyGlu: 6.231 ± 2.303
5.452GlyPhe: 5.452 ± 1.037
4.673GlyGly: 4.673 ± 1.259
1.558GlyHis: 1.558 ± 0.999
3.894GlyIle: 3.894 ± 1.63
4.673GlyLys: 4.673 ± 3.257
3.894GlyLeu: 3.894 ± 1.182
0.0GlyMet: 0.0 ± 0.0
3.894GlyAsn: 3.894 ± 0.935
0.779GlyPro: 0.779 ± 0.5
1.558GlyGln: 1.558 ± 0.544
0.779GlyArg: 0.779 ± 0.5
7.788GlySer: 7.788 ± 2.549
3.115GlyThr: 3.115 ± 1.061
3.115GlyVal: 3.115 ± 1.334
0.0GlyTrp: 0.0 ± 0.0
3.115GlyTyr: 3.115 ± 1.488
0.0GlyXaa: 0.0 ± 0.0
His
0.779HisAla: 0.779 ± 0.5
0.0HisCys: 0.0 ± 0.0
2.336HisAsp: 2.336 ± 0.728
0.0HisGlu: 0.0 ± 0.0
2.336HisPhe: 2.336 ± 1.164
0.779HisGly: 0.779 ± 0.5
0.0HisHis: 0.0 ± 0.0
0.779HisIle: 0.779 ± 1.013
2.336HisLys: 2.336 ± 1.499
0.779HisLeu: 0.779 ± 0.5
0.0HisMet: 0.0 ± 0.0
0.779HisAsn: 0.779 ± 0.5
0.779HisPro: 0.779 ± 0.5
0.779HisGln: 0.779 ± 0.824
0.779HisArg: 0.779 ± 0.5
3.115HisSer: 3.115 ± 1.171
0.779HisThr: 0.779 ± 0.5
0.779HisVal: 0.779 ± 0.5
0.779HisTrp: 0.779 ± 0.824
0.779HisTyr: 0.779 ± 0.5
0.0HisXaa: 0.0 ± 0.0
Ile
5.452IleAla: 5.452 ± 2.064
2.336IleCys: 2.336 ± 1.055
6.231IleAsp: 6.231 ± 3.177
2.336IleGlu: 2.336 ± 0.728
3.894IlePhe: 3.894 ± 2.081
2.336IleGly: 2.336 ± 1.055
1.558IleHis: 1.558 ± 1.395
2.336IleIle: 2.336 ± 2.168
4.673IleLys: 4.673 ± 1.898
3.115IleLeu: 3.115 ± 0.627
0.779IleMet: 0.779 ± 0.5
0.779IleAsn: 0.779 ± 1.013
3.894IlePro: 3.894 ± 1.63
3.115IleGln: 3.115 ± 1.071
3.115IleArg: 3.115 ± 1.586
5.452IleSer: 5.452 ± 1.08
0.779IleThr: 0.779 ± 0.712
0.0IleVal: 0.0 ± 0.0
2.336IleTrp: 2.336 ± 0.956
2.336IleTyr: 2.336 ± 1.499
0.0IleXaa: 0.0 ± 0.0
Lys
2.336LysAla: 2.336 ± 1.097
1.558LysCys: 1.558 ± 1.423
3.894LysAsp: 3.894 ± 2.259
2.336LysGlu: 2.336 ± 1.097
6.231LysPhe: 6.231 ± 2.044
2.336LysGly: 2.336 ± 1.164
0.0LysHis: 0.0 ± 0.0
2.336LysIle: 2.336 ± 0.749
3.115LysLys: 3.115 ± 2.267
4.673LysLeu: 4.673 ± 1.501
0.0LysMet: 0.0 ± 0.0
3.894LysAsn: 3.894 ± 1.344
1.558LysPro: 1.558 ± 0.544
3.894LysGln: 3.894 ± 1.783
2.336LysArg: 2.336 ± 2.135
5.452LysSer: 5.452 ± 1.69
6.231LysThr: 6.231 ± 0.608
4.673LysVal: 4.673 ± 2.126
0.0LysTrp: 0.0 ± 0.0
2.336LysTyr: 2.336 ± 0.749
0.0LysXaa: 0.0 ± 0.0
Leu
3.115LeuAla: 3.115 ± 1.071
0.0LeuCys: 0.0 ± 0.0
4.673LeuAsp: 4.673 ± 1.272
2.336LeuGlu: 2.336 ± 0.728
3.115LeuPhe: 3.115 ± 1.851
9.346LeuGly: 9.346 ± 2.623
0.0LeuHis: 0.0 ± 0.0
4.673LeuIle: 4.673 ± 2.027
6.231LeuLys: 6.231 ± 1.955
2.336LeuLeu: 2.336 ± 1.005
0.0LeuMet: 0.0 ± 0.0
5.452LeuAsn: 5.452 ± 1.08
6.231LeuPro: 6.231 ± 2.199
3.115LeuGln: 3.115 ± 1.334
1.558LeuArg: 1.558 ± 0.544
4.673LeuSer: 4.673 ± 2.286
5.452LeuThr: 5.452 ± 1.528
3.115LeuVal: 3.115 ± 1.034
0.0LeuTrp: 0.0 ± 0.0
4.673LeuTyr: 4.673 ± 1.367
0.0LeuXaa: 0.0 ± 0.0
Met
0.779MetAla: 0.779 ± 0.824
0.0MetCys: 0.0 ± 0.0
0.779MetAsp: 0.779 ± 0.5
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.779MetGly: 0.779 ± 0.5
0.779MetHis: 0.779 ± 0.5
0.779MetIle: 0.779 ± 1.013
1.558MetLys: 1.558 ± 1.08
0.779MetLeu: 0.779 ± 0.712
0.0MetMet: 0.0 ± 0.0
0.779MetAsn: 0.779 ± 0.5
2.336MetPro: 2.336 ± 0.956
0.779MetGln: 0.779 ± 0.824
0.779MetArg: 0.779 ± 0.5
2.336MetSer: 2.336 ± 0.749
0.779MetThr: 0.779 ± 0.5
0.0MetVal: 0.0 ± 0.0
0.779MetTrp: 0.779 ± 0.5
0.779MetTyr: 0.779 ± 0.938
0.0MetXaa: 0.0 ± 0.0
Asn
3.894AsnAla: 3.894 ± 1.771
0.779AsnCys: 0.779 ± 0.712
3.894AsnAsp: 3.894 ± 1.462
3.115AsnGlu: 3.115 ± 1.586
4.673AsnPhe: 4.673 ± 0.822
4.673AsnGly: 4.673 ± 1.548
0.779AsnHis: 0.779 ± 0.5
3.894AsnIle: 3.894 ± 2.522
1.558AsnLys: 1.558 ± 1.255
3.115AsnLeu: 3.115 ± 1.0
0.779AsnMet: 0.779 ± 0.488
3.115AsnAsn: 3.115 ± 1.26
3.115AsnPro: 3.115 ± 1.334
3.115AsnGln: 3.115 ± 1.334
5.452AsnArg: 5.452 ± 1.311
7.009AsnSer: 7.009 ± 1.796
3.115AsnThr: 3.115 ± 1.75
7.009AsnVal: 7.009 ± 1.837
0.0AsnTrp: 0.0 ± 0.0
3.894AsnTyr: 3.894 ± 2.941
0.0AsnXaa: 0.0 ± 0.0
Pro
0.779ProAla: 0.779 ± 0.712
2.336ProCys: 2.336 ± 2.135
2.336ProAsp: 2.336 ± 0.764
3.115ProGlu: 3.115 ± 0.627
3.115ProPhe: 3.115 ± 1.171
1.558ProGly: 1.558 ± 0.999
0.779ProHis: 0.779 ± 0.712
1.558ProIle: 1.558 ± 0.999
3.115ProLys: 3.115 ± 1.903
4.673ProLeu: 4.673 ± 2.108
0.779ProMet: 0.779 ± 0.669
1.558ProAsn: 1.558 ± 0.999
2.336ProPro: 2.336 ± 1.164
2.336ProGln: 2.336 ± 1.233
0.0ProArg: 0.0 ± 0.0
1.558ProSer: 1.558 ± 0.839
4.673ProThr: 4.673 ± 2.234
3.894ProVal: 3.894 ± 0.935
0.779ProTrp: 0.779 ± 0.5
0.779ProTyr: 0.779 ± 0.5
0.0ProXaa: 0.0 ± 0.0
Gln
0.779GlnAla: 0.779 ± 0.824
0.0GlnCys: 0.0 ± 0.0
3.894GlnAsp: 3.894 ± 1.298
1.558GlnGlu: 1.558 ± 1.648
0.0GlnPhe: 0.0 ± 0.0
2.336GlnGly: 2.336 ± 0.764
0.0GlnHis: 0.0 ± 0.0
3.115GlnIle: 3.115 ± 3.296
3.894GlnLys: 3.894 ± 1.635
5.452GlnLeu: 5.452 ± 1.346
0.779GlnMet: 0.779 ± 0.5
0.779GlnAsn: 0.779 ± 0.5
1.558GlnPro: 1.558 ± 0.999
1.558GlnGln: 1.558 ± 0.74
2.336GlnArg: 2.336 ± 1.097
2.336GlnSer: 2.336 ± 1.893
1.558GlnThr: 1.558 ± 0.74
0.779GlnVal: 0.779 ± 0.5
0.0GlnTrp: 0.0 ± 0.0
1.558GlnTyr: 1.558 ± 1.648
0.0GlnXaa: 0.0 ± 0.0
Arg
2.336ArgAla: 2.336 ± 0.764
2.336ArgCys: 2.336 ± 0.764
2.336ArgAsp: 2.336 ± 1.963
1.558ArgGlu: 1.558 ± 1.648
2.336ArgPhe: 2.336 ± 1.708
2.336ArgGly: 2.336 ± 0.764
0.0ArgHis: 0.0 ± 0.0
2.336ArgIle: 2.336 ± 0.764
2.336ArgLys: 2.336 ± 1.164
3.894ArgLeu: 3.894 ± 1.672
0.779ArgMet: 0.779 ± 0.5
2.336ArgAsn: 2.336 ± 0.749
2.336ArgPro: 2.336 ± 1.164
0.779ArgGln: 0.779 ± 0.5
2.336ArgArg: 2.336 ± 1.499
3.115ArgSer: 3.115 ± 1.998
3.115ArgThr: 3.115 ± 0.627
0.779ArgVal: 0.779 ± 0.5
0.0ArgTrp: 0.0 ± 0.0
3.115ArgTyr: 3.115 ± 1.171
0.0ArgXaa: 0.0 ± 0.0
Ser
7.788SerAla: 7.788 ± 2.615
2.336SerCys: 2.336 ± 1.164
5.452SerAsp: 5.452 ± 2.346
3.894SerGlu: 3.894 ± 1.26
6.231SerPhe: 6.231 ± 1.54
7.788SerGly: 7.788 ± 2.271
1.558SerHis: 1.558 ± 0.999
11.682SerIle: 11.682 ± 2.363
5.452SerLys: 5.452 ± 2.742
6.231SerLeu: 6.231 ± 1.255
0.0SerMet: 0.0 ± 0.772
8.567SerAsn: 8.567 ± 2.924
6.231SerPro: 6.231 ± 1.244
0.779SerGln: 0.779 ± 0.938
1.558SerArg: 1.558 ± 1.648
7.009SerSer: 7.009 ± 1.939
0.779SerThr: 0.779 ± 1.013
6.231SerVal: 6.231 ± 0.659
0.0SerTrp: 0.0 ± 0.0
3.894SerTyr: 3.894 ± 1.344
0.0SerXaa: 0.0 ± 0.0
Thr
2.336ThrAla: 2.336 ± 1.005
0.0ThrCys: 0.0 ± 0.0
0.779ThrAsp: 0.779 ± 0.5
2.336ThrGlu: 2.336 ± 1.499
3.894ThrPhe: 3.894 ± 1.319
3.115ThrGly: 3.115 ± 1.178
1.558ThrHis: 1.558 ± 0.999
1.558ThrIle: 1.558 ± 0.839
3.894ThrLys: 3.894 ± 0.694
3.115ThrLeu: 3.115 ± 1.17
0.779ThrMet: 0.779 ± 0.5
2.336ThrAsn: 2.336 ± 0.956
0.779ThrPro: 0.779 ± 0.5
2.336ThrGln: 2.336 ± 2.472
2.336ThrArg: 2.336 ± 0.764
10.125ThrSer: 10.125 ± 1.402
0.0ThrThr: 0.0 ± 0.0
3.894ThrVal: 3.894 ± 2.384
0.779ThrTrp: 0.779 ± 0.5
3.115ThrTyr: 3.115 ± 1.088
0.0ThrXaa: 0.0 ± 0.0
Val
5.452ValAla: 5.452 ± 1.989
0.0ValCys: 0.0 ± 0.0
3.115ValAsp: 3.115 ± 1.061
1.558ValGlu: 1.558 ± 0.544
3.115ValPhe: 3.115 ± 1.941
2.336ValGly: 2.336 ± 1.383
1.558ValHis: 1.558 ± 0.544
0.779ValIle: 0.779 ± 0.5
5.452ValLys: 5.452 ± 1.92
3.115ValLeu: 3.115 ± 1.061
0.779ValMet: 0.779 ± 0.804
6.231ValAsn: 6.231 ± 2.199
1.558ValPro: 1.558 ± 0.999
0.779ValGln: 0.779 ± 0.5
3.115ValArg: 3.115 ± 1.088
7.009ValSer: 7.009 ± 2.307
3.115ValThr: 3.115 ± 1.17
3.115ValVal: 3.115 ± 2.846
1.558ValTrp: 1.558 ± 1.423
3.115ValTyr: 3.115 ± 1.171
0.0ValXaa: 0.0 ± 0.0
Trp
1.558TrpAla: 1.558 ± 0.544
0.0TrpCys: 0.0 ± 0.0
0.779TrpAsp: 0.779 ± 0.5
0.0TrpGlu: 0.0 ± 0.0
1.558TrpPhe: 1.558 ± 0.999
0.779TrpGly: 0.779 ± 0.712
1.558TrpHis: 1.558 ± 0.999
0.0TrpIle: 0.0 ± 0.0
0.779TrpLys: 0.779 ± 0.712
0.779TrpLeu: 0.779 ± 0.5
0.779TrpMet: 0.779 ± 0.78
0.779TrpAsn: 0.779 ± 0.824
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.779TrpSer: 0.779 ± 0.5
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.558TyrAla: 1.558 ± 1.181
0.779TyrCys: 0.779 ± 1.013
3.894TyrAsp: 3.894 ± 0.999
3.894TyrGlu: 3.894 ± 0.935
3.115TyrPhe: 3.115 ± 1.171
3.115TyrGly: 3.115 ± 1.17
1.558TyrHis: 1.558 ± 0.544
3.115TyrIle: 3.115 ± 1.17
2.336TyrLys: 2.336 ± 1.499
5.452TyrLeu: 5.452 ± 2.594
2.336TyrMet: 2.336 ± 0.956
5.452TyrAsn: 5.452 ± 1.785
2.336TyrPro: 2.336 ± 1.013
3.115TyrGln: 3.115 ± 1.088
1.558TyrArg: 1.558 ± 0.544
6.231TyrSer: 6.231 ± 3.21
1.558TyrThr: 1.558 ± 0.74
3.115TyrVal: 3.115 ± 1.851
0.0TyrTrp: 0.0 ± 0.0
6.231TyrTyr: 6.231 ± 2.256
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1285 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
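Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic for each resample. The sketch below illustrates the idea for a single dipeptide with 100 resamples. It is not the database's implementation: the function names are hypothetical, and reporting the standard deviation of the resampled estimates as the error is an assumption, since the page does not say which spread measure is used.

```python
import random
from statistics import stdev


def permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return stdev(estimates)


# Toy sequences (not from this proteome).
toy = ["MKKLAAK", "AAKKSSI", "SIKKAA"]
print(permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only a handful of proteins, as here, the resamples differ strongly from one another, which is one reason the estimated errors can be large relative to the frequencies themselves.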

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski