Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_206

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
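The counting described above can be sketched in Python as follows. This is only an illustration, not the code used to build the table: the function names `dipeptide_permille` and `classify`, the toy sequences, the choice to count overlapping pairs within each protein separately, and the folding of every non-standard letter into X (Xaa) are all assumptions of this sketch.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Overlapping pairs are counted within each protein; any letter outside
    the 20 standard ones is folded into 'X' (assumptions of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(value_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not the real Cap1_SP_206 proteins.
toy = ["SSSG", "GSA"]
freqs = dipeptide_permille(toy)
print(freqs["SS"], classify(freqs["SS"]))
```

With real data, the input would be the protein sequences of the proteome; the resulting dictionary then has one entry per cell of the table.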

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
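As a small worked example of the unit conversion, the snippet below converts the SerSer value from the table into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper name `permille_to_fraction` is introduced here only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ser_ser = 15.482               # SerSer value from the table, in per mille
uniform = 1000.0 / 400         # uniform expectation: 2.5 per mille

fraction = permille_to_fraction(ser_ser)
percent = fraction * 100.0
ratio = ser_ser / uniform      # how many times above the uniform expectation

print(f"fraction={fraction:.6f}, percent={percent:.4f}%, ratio={ratio:.2f}")
```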

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.519 ± 2.711
AlaCys: 0.704 ± 0.69
AlaAsp: 2.815 ± 0.783
AlaGlu: 3.519 ± 2.462
AlaPhe: 2.815 ± 1.404
AlaGly: 2.815 ± 1.157
AlaHis: 2.111 ± 1.17
AlaIle: 2.111 ± 0.937
AlaLys: 3.519 ± 1.582
AlaLeu: 4.926 ± 2.116
AlaMet: 3.519 ± 1.593
AlaAsn: 4.222 ± 1.917
AlaPro: 1.407 ± 0.953
AlaGln: 1.407 ± 0.76
AlaArg: 1.407 ± 0.712
AlaSer: 5.63 ± 1.913
AlaThr: 2.815 ± 0.748
AlaVal: 4.926 ± 0.841
AlaTrp: 0.704 ± 0.477
AlaTyr: 2.815 ± 0.748
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.704 ± 0.477
CysCys: 0.704 ± 0.69
CysAsp: 0.0 ± 0.0
CysGlu: 0.704 ± 0.69
CysPhe: 0.704 ± 0.69
CysGly: 0.704 ± 0.69
CysHis: 0.704 ± 0.998
CysIle: 1.407 ± 0.957
CysLys: 0.704 ± 0.998
CysLeu: 2.815 ± 1.301
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.704 ± 0.477
CysSer: 0.704 ± 0.75
CysThr: 0.704 ± 0.69
CysVal: 0.704 ± 0.477
CysTrp: 0.704 ± 0.69
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.407 ± 1.487
AspCys: 2.111 ± 1.074
AspAsp: 4.926 ± 2.211
AspGlu: 4.926 ± 1.749
AspPhe: 6.334 ± 1.314
AspGly: 1.407 ± 0.76
AspHis: 1.407 ± 0.568
AspIle: 4.926 ± 1.836
AspLys: 2.111 ± 1.172
AspLeu: 7.037 ± 3.282
AspMet: 1.407 ± 0.921
AspAsn: 0.704 ± 0.477
AspPro: 5.63 ± 2.808
AspGln: 0.704 ± 0.75
AspArg: 1.407 ± 0.942
AspSer: 2.815 ± 1.71
AspThr: 2.111 ± 1.124
AspVal: 3.519 ± 1.83
AspTrp: 0.0 ± 0.0
AspTyr: 7.037 ± 2.077
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.63 ± 1.657
GluCys: 2.111 ± 1.382
GluAsp: 4.926 ± 1.317
GluGlu: 4.926 ± 3.337
GluPhe: 4.222 ± 2.343
GluGly: 0.704 ± 0.75
GluHis: 0.704 ± 0.69
GluIle: 2.111 ± 1.159
GluLys: 5.63 ± 1.727
GluLeu: 7.741 ± 2.518
GluMet: 3.519 ± 2.696
GluAsn: 3.519 ± 2.117
GluPro: 2.111 ± 1.43
GluGln: 1.407 ± 0.953
GluArg: 2.815 ± 1.174
GluSer: 4.926 ± 1.843
GluThr: 2.111 ± 0.891
GluVal: 1.407 ± 0.976
GluTrp: 2.111 ± 0.79
GluTyr: 3.519 ± 2.22
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.111 ± 1.43
PheCys: 0.0 ± 0.0
PheAsp: 5.63 ± 2.441
PheGlu: 3.519 ± 1.314
PhePhe: 0.0 ± 0.0
PheGly: 4.222 ± 0.795
PheHis: 0.704 ± 0.477
PheIle: 3.519 ± 1.167
PheLys: 4.926 ± 3.295
PheLeu: 3.519 ± 0.932
PheMet: 2.111 ± 1.274
PheAsn: 2.815 ± 1.512
PhePro: 1.407 ± 0.953
PheGln: 2.111 ± 0.79
PheArg: 2.111 ± 1.011
PheSer: 2.815 ± 1.512
PheThr: 1.407 ± 0.953
PheVal: 2.815 ± 1.141
PheTrp: 0.0 ± 0.0
PheTyr: 2.815 ± 1.182
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.334 ± 2.028
GlyCys: 0.0 ± 0.0
GlyAsp: 2.111 ± 1.165
GlyGlu: 1.407 ± 1.023
GlyPhe: 5.63 ± 1.496
GlyGly: 4.222 ± 1.875
GlyHis: 0.704 ± 0.743
GlyIle: 3.519 ± 1.768
GlyLys: 3.519 ± 1.239
GlyLeu: 2.815 ± 2.183
GlyMet: 0.0 ± 0.0
GlyAsn: 7.741 ± 3.896
GlyPro: 0.704 ± 0.477
GlyGln: 0.704 ± 0.477
GlyArg: 2.815 ± 2.759
GlySer: 9.852 ± 3.362
GlyThr: 6.334 ± 2.622
GlyVal: 5.63 ± 2.305
GlyTrp: 1.407 ± 0.976
GlyTyr: 4.926 ± 1.831
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.704 ± 0.743
HisCys: 0.0 ± 0.0
HisAsp: 2.111 ± 1.074
HisGlu: 0.704 ± 1.015
HisPhe: 0.704 ± 0.69
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.704 ± 0.477
HisLys: 2.815 ± 1.357
HisLeu: 1.407 ± 0.957
HisMet: 1.407 ± 1.322
HisAsn: 0.704 ± 0.477
HisPro: 0.0 ± 0.0
HisGln: 0.704 ± 0.75
HisArg: 1.407 ± 0.978
HisSer: 1.407 ± 0.921
HisThr: 0.0 ± 0.0
HisVal: 0.704 ± 0.69
HisTrp: 0.0 ± 0.0
HisTyr: 2.111 ± 1.471
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.815 ± 0.748
IleCys: 0.0 ± 0.0
IleAsp: 4.926 ± 1.899
IleGlu: 4.222 ± 2.818
IlePhe: 3.519 ± 1.433
IleGly: 4.926 ± 1.285
IleHis: 0.0 ± 0.0
IleIle: 2.815 ± 1.023
IleLys: 2.815 ± 2.096
IleLeu: 4.222 ± 1.143
IleMet: 0.704 ± 0.69
IleAsn: 2.111 ± 1.423
IlePro: 2.815 ± 1.174
IleGln: 2.111 ± 0.79
IleArg: 2.815 ± 1.054
IleSer: 3.519 ± 1.302
IleThr: 2.815 ± 0.848
IleVal: 4.926 ± 1.171
IleTrp: 2.111 ± 1.011
IleTyr: 2.111 ± 1.914
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.407 ± 0.978
LysCys: 0.704 ± 0.998
LysAsp: 4.222 ± 2.184
LysGlu: 7.037 ± 4.479
LysPhe: 1.407 ± 0.76
LysGly: 5.63 ± 1.376
LysHis: 2.111 ± 1.324
LysIle: 1.407 ± 1.15
LysLys: 8.445 ± 5.478
LysLeu: 7.741 ± 2.399
LysMet: 0.704 ± 0.69
LysAsn: 3.519 ± 1.806
LysPro: 3.519 ± 1.609
LysGln: 1.407 ± 0.942
LysArg: 3.519 ± 1.063
LysSer: 4.222 ± 0.956
LysThr: 4.222 ± 1.035
LysVal: 3.519 ± 1.291
LysTrp: 0.0 ± 0.0
LysTyr: 2.111 ± 2.069
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.222 ± 1.407
LeuCys: 0.0 ± 0.0
LeuAsp: 2.815 ± 1.842
LeuGlu: 6.334 ± 1.119
LeuPhe: 3.519 ± 1.488
LeuGly: 7.741 ± 1.804
LeuHis: 2.815 ± 1.082
LeuIle: 5.63 ± 3.102
LeuLys: 8.445 ± 2.602
LeuLeu: 4.926 ± 1.748
LeuMet: 2.815 ± 1.913
LeuAsn: 4.926 ± 1.19
LeuPro: 2.815 ± 1.301
LeuGln: 2.111 ± 0.728
LeuArg: 4.926 ± 2.258
LeuSer: 8.445 ± 1.573
LeuThr: 7.037 ± 2.335
LeuVal: 6.334 ± 1.634
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.519 ± 2.229
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.704 ± 0.743
MetCys: 0.704 ± 0.69
MetAsp: 1.407 ± 1.317
MetGlu: 2.111 ± 1.461
MetPhe: 0.0 ± 0.0
MetGly: 2.111 ± 0.957
MetHis: 1.407 ± 0.957
MetIle: 0.704 ± 0.69
MetLys: 0.704 ± 0.998
MetLeu: 0.704 ± 1.015
MetMet: 0.0 ± 0.0
MetAsn: 0.704 ± 0.69
MetPro: 0.0 ± 0.0
MetGln: 0.704 ± 0.743
MetArg: 1.407 ± 0.921
MetSer: 3.519 ± 0.754
MetThr: 2.111 ± 1.774
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.407 ± 0.953
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.926 ± 2.452
AsnCys: 0.704 ± 0.477
AsnAsp: 4.926 ± 1.914
AsnGlu: 5.63 ± 1.605
AsnPhe: 2.111 ± 1.647
AsnGly: 2.111 ± 1.43
AsnHis: 0.0 ± 0.0
AsnIle: 2.815 ± 0.888
AsnLys: 3.519 ± 0.823
AsnLeu: 6.334 ± 2.145
AsnMet: 0.0 ± 0.0
AsnAsn: 4.222 ± 1.685
AsnPro: 0.704 ± 0.477
AsnGln: 0.704 ± 0.69
AsnArg: 2.111 ± 1.038
AsnSer: 8.445 ± 2.602
AsnThr: 2.111 ± 1.43
AsnVal: 0.704 ± 0.477
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.407 ± 0.953
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.704 ± 0.477
ProCys: 0.704 ± 0.69
ProAsp: 2.111 ± 1.43
ProGlu: 2.815 ± 1.082
ProPhe: 2.111 ± 1.024
ProGly: 2.815 ± 1.906
ProHis: 0.0 ± 0.0
ProIle: 2.111 ± 1.17
ProLys: 2.111 ± 0.891
ProLeu: 4.222 ± 1.012
ProMet: 0.0 ± 0.0
ProAsn: 1.407 ± 0.953
ProPro: 0.704 ± 0.477
ProGln: 2.815 ± 1.4
ProArg: 1.407 ± 0.942
ProSer: 3.519 ± 1.83
ProThr: 1.407 ± 0.953
ProVal: 2.815 ± 1.383
ProTrp: 0.704 ± 0.477
ProTyr: 1.407 ± 0.712
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.704 ± 0.743
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.111 ± 1.011
GlnPhe: 2.111 ± 1.43
GlnGly: 1.407 ± 0.568
GlnHis: 0.704 ± 1.015
GlnIle: 1.407 ± 1.317
GlnLys: 2.111 ± 0.79
GlnLeu: 2.111 ± 1.557
GlnMet: 0.704 ± 0.998
GlnAsn: 2.111 ± 1.376
GlnPro: 2.815 ± 0.867
GlnGln: 0.0 ± 0.0
GlnArg: 0.704 ± 0.477
GlnSer: 3.519 ± 1.128
GlnThr: 2.815 ± 1.174
GlnVal: 0.704 ± 0.477
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.111 ± 0.728
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.334 ± 2.367
ArgCys: 1.407 ± 0.568
ArgAsp: 4.222 ± 1.102
ArgGlu: 2.111 ± 1.17
ArgPhe: 1.407 ± 0.953
ArgGly: 2.111 ± 0.728
ArgHis: 1.407 ± 0.953
ArgIle: 2.111 ± 1.43
ArgLys: 1.407 ± 1.023
ArgLeu: 4.926 ± 1.181
ArgMet: 0.704 ± 0.743
ArgAsn: 1.407 ± 0.712
ArgPro: 0.704 ± 0.69
ArgGln: 0.704 ± 0.998
ArgArg: 2.111 ± 1.43
ArgSer: 1.407 ± 0.953
ArgThr: 2.111 ± 1.011
ArgVal: 0.0 ± 0.0
ArgTrp: 1.407 ± 0.921
ArgTyr: 5.63 ± 2.163
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.037 ± 2.868
SerCys: 0.704 ± 0.69
SerAsp: 6.334 ± 1.355
SerGlu: 7.741 ± 1.967
SerPhe: 4.222 ± 1.914
SerGly: 8.445 ± 2.42
SerHis: 0.0 ± 0.0
SerIle: 6.334 ± 1.385
SerLys: 8.445 ± 2.481
SerLeu: 8.445 ± 3.098
SerMet: 0.704 ± 0.836
SerAsn: 4.222 ± 1.121
SerPro: 1.407 ± 1.224
SerGln: 2.111 ± 0.911
SerArg: 4.926 ± 1.56
SerSer: 15.482 ± 5.631
SerThr: 6.334 ± 1.957
SerVal: 2.111 ± 0.911
SerTrp: 0.704 ± 0.69
SerTyr: 3.519 ± 1.076
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.815 ± 1.431
ThrCys: 0.704 ± 0.69
ThrAsp: 0.704 ± 0.69
ThrGlu: 1.407 ± 0.76
ThrPhe: 2.815 ± 1.186
ThrGly: 7.741 ± 3.113
ThrHis: 0.0 ± 0.0
ThrIle: 4.926 ± 2.74
ThrLys: 2.111 ± 1.074
ThrLeu: 4.926 ± 1.749
ThrMet: 0.0 ± 0.0
ThrAsn: 2.111 ± 1.43
ThrPro: 3.519 ± 0.823
ThrGln: 4.222 ± 1.545
ThrArg: 2.111 ± 1.011
ThrSer: 6.334 ± 1.468
ThrThr: 2.815 ± 1.71
ThrVal: 3.519 ± 1.599
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.815 ± 0.888
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.222 ± 1.646
ValCys: 0.0 ± 0.0
ValAsp: 2.111 ± 1.024
ValGlu: 1.407 ± 0.953
ValPhe: 2.111 ± 0.835
ValGly: 5.63 ± 1.406
ValHis: 2.111 ± 1.352
ValIle: 4.926 ± 2.395
ValLys: 2.815 ± 1.934
ValLeu: 3.519 ± 1.355
ValMet: 0.704 ± 0.75
ValAsn: 2.815 ± 1.334
ValPro: 3.519 ± 1.832
ValGln: 1.407 ± 1.288
ValArg: 0.704 ± 0.69
ValSer: 4.926 ± 1.794
ValThr: 2.815 ± 0.848
ValVal: 0.704 ± 0.75
ValTrp: 0.0 ± 0.0
ValTyr: 3.519 ± 1.076
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.704 ± 0.477
TrpAsp: 0.704 ± 0.711
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.704 ± 0.69
TrpLeu: 2.111 ± 1.135
TrpMet: 0.0 ± 0.0
TrpAsn: 0.704 ± 0.75
TrpPro: 0.0 ± 0.0
TrpGln: 1.407 ± 0.568
TrpArg: 0.704 ± 0.477
TrpSer: 2.111 ± 1.43
TrpThr: 0.0 ± 0.0
TrpVal: 1.407 ± 0.568
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.407 ± 1.5
TyrCys: 0.704 ± 1.015
TyrAsp: 4.926 ± 1.794
TyrGlu: 3.519 ± 1.202
TyrPhe: 2.815 ± 1.835
TyrGly: 5.63 ± 1.727
TyrHis: 0.704 ± 0.69
TyrIle: 2.815 ± 1.082
TyrLys: 0.0 ± 0.0
TyrLeu: 4.926 ± 1.831
TyrMet: 0.704 ± 0.743
TyrAsn: 3.519 ± 1.128
TyrPro: 2.111 ± 1.074
TyrGln: 1.407 ± 0.568
TyrArg: 3.519 ± 1.096
TyrSer: 6.334 ± 1.782
TyrThr: 3.519 ± 1.291
TyrVal: 3.519 ± 1.074
TyrTrp: 0.704 ± 0.477
TyrTyr: 2.815 ± 1.301
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1422 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
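One way to read the note above is sketched below: proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. The exact procedure used for the table is not described here, so the use of the standard deviation, the fixed random seed, and the names `pair_permille` and `bootstrap_error` are assumptions of this sketch; the sequences are hypothetical toy data.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not the real Cap1_SP_206 proteins.
toy = ["MSSSGK", "MKLSSA", "MGSLLK", "MSSKLE"]
print(pair_permille(toy, "SS"), bootstrap_error(toy, "SS"))
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, which is what "at the protein level" suggests.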

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski