Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_83

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
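The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the toy sequences, the use of one-letter amino acid codes, and the choice of the total number of within-protein dipeptides as the denominator are all assumptions.

```python
from collections import Counter

# Expected frequency under the uniform assumption: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping pairs are counted within each protein only,
    so no dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"   # would be marked red in the table
    if freq < expected:
        return "under"  # would be marked blue in the table
    return "expected"


# Hypothetical toy sequences, not taken from the proteome.
toy_proteins = ["MSLSLK", "MSSLA"]
freqs = dipeptide_per_mille(toy_proteins)
```

With real data, `classify(19.495)` (the SerLeu value in the table) gives "over" and `classify(0.573)` gives "under".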

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
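As a small worked example of the unit conversion, the snippet below takes the SerLeu value from the table (19.495‰) and converts it to a fraction and a percentage, then compares it with the 2.5‰ expected under the uniform assumption. The helper name is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ser_leu = 19.495                           # SerLeu value from the table, in per mille
fraction = per_mille_to_fraction(ser_leu)  # about 0.019495
percent = fraction * 100                   # about 1.9495 %
fold = ser_leu / 2.5                       # ratio to the uniform expectation, about 7.8
```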

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.44AlaAla: 3.44 ± 0.769
0.573AlaCys: 0.573 ± 0.498
3.44AlaAsp: 3.44 ± 0.917
1.72AlaGlu: 1.72 ± 0.653
2.867AlaPhe: 2.867 ± 0.604
1.147AlaGly: 1.147 ± 1.037
1.147AlaHis: 1.147 ± 0.434
1.147AlaIle: 1.147 ± 0.941
1.72AlaLys: 1.72 ± 1.157
5.161AlaLeu: 5.161 ± 1.59
0.573AlaMet: 0.573 ± 0.399
1.147AlaAsn: 1.147 ± 0.798
2.294AlaPro: 2.294 ± 0.493
4.587AlaGln: 4.587 ± 1.482
1.72AlaArg: 1.72 ± 0.972
5.734AlaSer: 5.734 ± 1.84
2.294AlaThr: 2.294 ± 1.519
3.44AlaVal: 3.44 ± 1.306
0.0AlaTrp: 0.0 ± 0.0
5.734AlaTyr: 5.734 ± 2.212
0.0AlaXaa: 0.0 ± 0.0
Cys
0.573CysAla: 0.573 ± 0.498
0.0CysCys: 0.0 ± 0.0
0.573CysAsp: 0.573 ± 0.399
1.147CysGlu: 1.147 ± 0.798
1.147CysPhe: 1.147 ± 0.804
0.573CysGly: 0.573 ± 0.498
0.0CysHis: 0.0 ± 0.0
0.573CysIle: 0.573 ± 0.399
0.0CysLys: 0.0 ± 0.0
0.573CysLeu: 0.573 ± 0.701
0.573CysMet: 0.573 ± 0.498
1.147CysAsn: 1.147 ± 0.781
1.147CysPro: 1.147 ± 0.886
0.573CysGln: 0.573 ± 0.399
1.147CysArg: 1.147 ± 0.521
4.014CysSer: 4.014 ± 1.503
0.573CysThr: 0.573 ± 0.498
1.147CysVal: 1.147 ± 0.521
0.0CysTrp: 0.0 ± 0.0
0.573CysTyr: 0.573 ± 0.498
0.0CysXaa: 0.0 ± 0.0
Asp
2.867AspAla: 2.867 ± 0.814
1.147AspCys: 1.147 ± 0.997
6.307AspAsp: 6.307 ± 1.956
1.147AspGlu: 1.147 ± 1.037
5.161AspPhe: 5.161 ± 1.568
2.294AspGly: 2.294 ± 1.363
0.573AspHis: 0.573 ± 0.811
4.587AspIle: 4.587 ± 0.793
4.014AspLys: 4.014 ± 1.103
6.881AspLeu: 6.881 ± 1.138
1.147AspMet: 1.147 ± 0.434
4.014AspAsn: 4.014 ± 1.36
2.294AspPro: 2.294 ± 1.147
3.44AspGln: 3.44 ± 1.184
3.44AspArg: 3.44 ± 1.302
12.615AspSer: 12.615 ± 1.949
1.72AspThr: 1.72 ± 0.862
5.734AspVal: 5.734 ± 1.008
1.147AspTrp: 1.147 ± 0.798
6.307AspTyr: 6.307 ± 1.137
0.0AspXaa: 0.0 ± 0.0
Glu
3.44GluAla: 3.44 ± 2.484
0.573GluCys: 0.573 ± 0.399
3.44GluAsp: 3.44 ± 1.404
0.573GluGlu: 0.573 ± 0.518
1.72GluPhe: 1.72 ± 0.611
1.147GluGly: 1.147 ± 0.997
1.147GluHis: 1.147 ± 0.816
2.867GluIle: 2.867 ± 0.814
2.294GluLys: 2.294 ± 0.741
1.147GluLeu: 1.147 ± 0.434
1.147GluMet: 1.147 ± 0.878
1.72GluAsn: 1.72 ± 0.938
0.573GluPro: 0.573 ± 0.399
2.294GluGln: 2.294 ± 2.073
1.147GluArg: 1.147 ± 0.521
4.587GluSer: 4.587 ± 1.457
0.573GluThr: 0.573 ± 0.518
0.573GluVal: 0.573 ± 0.399
0.0GluTrp: 0.0 ± 0.0
2.294GluTyr: 2.294 ± 1.16
0.0GluXaa: 0.0 ± 0.0
Phe
2.867PheAla: 2.867 ± 1.5
1.72PheCys: 1.72 ± 1.247
5.734PheAsp: 5.734 ± 1.303
2.294PheGlu: 2.294 ± 0.868
5.734PhePhe: 5.734 ± 1.83
5.161PheGly: 5.161 ± 1.506
2.294PheHis: 2.294 ± 1.042
4.587PheIle: 4.587 ± 1.536
1.72PheLys: 1.72 ± 1.247
9.748PheLeu: 9.748 ± 3.025
0.0PheMet: 0.0 ± 0.0
2.867PheAsn: 2.867 ± 1.62
1.147PhePro: 1.147 ± 1.402
0.573PheGln: 0.573 ± 0.399
2.867PheArg: 2.867 ± 0.746
12.041PheSer: 12.041 ± 2.097
2.294PheThr: 2.294 ± 0.992
4.587PheVal: 4.587 ± 2.558
0.0PheTrp: 0.0 ± 0.0
1.72PheTyr: 1.72 ± 0.884
0.0PheXaa: 0.0 ± 0.0
Gly
2.867GlyAla: 2.867 ± 0.932
0.0GlyCys: 0.0 ± 0.0
5.734GlyAsp: 5.734 ± 1.521
1.72GlyGlu: 1.72 ± 0.376
4.587GlyPhe: 4.587 ± 1.834
1.147GlyGly: 1.147 ± 0.521
2.867GlyHis: 2.867 ± 1.109
4.014GlyIle: 4.014 ± 1.693
1.72GlyLys: 1.72 ± 1.555
4.014GlyLeu: 4.014 ± 1.634
0.573GlyMet: 0.573 ± 0.811
1.147GlyAsn: 1.147 ± 0.997
0.573GlyPro: 0.573 ± 0.518
1.72GlyGln: 1.72 ± 0.869
3.44GlyArg: 3.44 ± 0.783
7.454GlySer: 7.454 ± 2.153
1.72GlyThr: 1.72 ± 0.376
1.147GlyVal: 1.147 ± 0.798
0.0GlyTrp: 0.0 ± 0.0
1.72GlyTyr: 1.72 ± 1.197
0.0GlyXaa: 0.0 ± 0.0
His
0.573HisAla: 0.573 ± 0.518
0.573HisCys: 0.573 ± 0.498
2.294HisAsp: 2.294 ± 1.692
0.573HisGlu: 0.573 ± 0.399
3.44HisPhe: 3.44 ± 1.565
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.72HisIle: 1.72 ± 0.986
0.0HisLys: 0.0 ± 0.0
2.294HisLeu: 2.294 ± 1.242
0.573HisMet: 0.573 ± 0.399
1.147HisAsn: 1.147 ± 0.798
0.573HisPro: 0.573 ± 0.399
0.573HisGln: 0.573 ± 0.518
1.147HisArg: 1.147 ± 0.798
0.0HisSer: 0.0 ± 0.0
0.573HisThr: 0.573 ± 0.399
1.72HisVal: 1.72 ± 1.495
0.0HisTrp: 0.0 ± 0.0
2.294HisTyr: 2.294 ± 1.111
0.0HisXaa: 0.0 ± 0.0
Ile
1.147IleAla: 1.147 ± 0.804
0.0IleCys: 0.0 ± 0.0
4.587IleAsp: 4.587 ± 1.471
0.573IleGlu: 0.573 ± 0.498
1.72IlePhe: 1.72 ± 0.784
3.44IleGly: 3.44 ± 1.302
1.147IleHis: 1.147 ± 0.886
2.294IleIle: 2.294 ± 0.971
1.72IleLys: 1.72 ± 1.003
4.587IleLeu: 4.587 ± 1.732
1.72IleMet: 1.72 ± 1.696
3.44IleAsn: 3.44 ± 1.671
4.587IlePro: 4.587 ± 1.689
1.72IleGln: 1.72 ± 0.869
4.014IleArg: 4.014 ± 1.103
5.161IleSer: 5.161 ± 1.184
2.867IleThr: 2.867 ± 0.814
2.867IleVal: 2.867 ± 0.883
0.573IleTrp: 0.573 ± 0.701
1.147IleTyr: 1.147 ± 0.886
0.0IleXaa: 0.0 ± 0.0
Lys
1.72LysAla: 1.72 ± 0.869
0.573LysCys: 0.573 ± 0.498
2.294LysAsp: 2.294 ± 0.769
2.294LysGlu: 2.294 ± 1.147
8.028LysPhe: 8.028 ± 1.418
0.573LysGly: 0.573 ± 0.518
0.573LysHis: 0.573 ± 0.518
2.294LysIle: 2.294 ± 0.793
0.573LysLys: 0.573 ± 0.399
5.161LysLeu: 5.161 ± 1.378
1.72LysMet: 1.72 ± 0.799
2.294LysAsn: 2.294 ± 1.609
1.147LysPro: 1.147 ± 0.997
1.147LysGln: 1.147 ± 0.521
1.72LysArg: 1.72 ± 0.972
4.014LysSer: 4.014 ± 1.635
3.44LysThr: 3.44 ± 1.4
2.867LysVal: 2.867 ± 1.122
0.0LysTrp: 0.0 ± 0.0
2.867LysTyr: 2.867 ± 1.433
0.0LysXaa: 0.0 ± 0.0
Leu
4.587LeuAla: 4.587 ± 0.793
0.573LeuCys: 0.573 ± 0.498
8.028LeuAsp: 8.028 ± 3.293
1.72LeuGlu: 1.72 ± 0.692
6.307LeuPhe: 6.307 ± 2.463
5.161LeuGly: 5.161 ± 2.328
1.72LeuHis: 1.72 ± 0.938
4.587LeuIle: 4.587 ± 2.477
5.161LeuLys: 5.161 ± 2.928
10.321LeuLeu: 10.321 ± 2.302
2.294LeuMet: 2.294 ± 0.469
7.454LeuAsn: 7.454 ± 1.397
5.161LeuPro: 5.161 ± 2.223
2.867LeuGln: 2.867 ± 0.932
6.881LeuArg: 6.881 ± 0.923
16.628LeuSer: 16.628 ± 2.898
5.161LeuThr: 5.161 ± 1.575
4.587LeuVal: 4.587 ± 2.382
0.573LeuTrp: 0.573 ± 0.518
4.014LeuTyr: 4.014 ± 1.231
0.0LeuXaa: 0.0 ± 0.0
Met
0.573MetAla: 0.573 ± 0.518
1.147MetCys: 1.147 ± 0.521
0.573MetAsp: 0.573 ± 0.399
1.72MetGlu: 1.72 ± 0.986
2.294MetPhe: 2.294 ± 1.229
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.573MetIle: 0.573 ± 0.518
1.147MetLys: 1.147 ± 0.521
1.72MetLeu: 1.72 ± 1.247
0.0MetMet: 0.0 ± 0.0
0.573MetAsn: 0.573 ± 0.701
0.573MetPro: 0.573 ± 0.518
2.294MetGln: 2.294 ± 0.493
0.0MetArg: 0.0 ± 0.0
1.72MetSer: 1.72 ± 0.884
0.573MetThr: 0.573 ± 0.811
1.72MetVal: 1.72 ± 1.197
0.0MetTrp: 0.0 ± 0.0
0.573MetTyr: 0.573 ± 0.399
0.0MetXaa: 0.0 ± 0.0
Asn
2.867AsnAla: 2.867 ± 1.122
0.573AsnCys: 0.573 ± 0.701
4.014AsnAsp: 4.014 ± 1.774
2.294AsnGlu: 2.294 ± 1.798
2.294AsnPhe: 2.294 ± 1.147
2.867AsnGly: 2.867 ± 0.964
0.0AsnHis: 0.0 ± 0.0
5.734AsnIle: 5.734 ± 2.505
1.147AsnLys: 1.147 ± 1.017
5.161AsnLeu: 5.161 ± 0.733
1.147AsnMet: 1.147 ± 0.941
3.44AsnAsn: 3.44 ± 1.71
4.014AsnPro: 4.014 ± 0.781
1.72AsnGln: 1.72 ± 0.653
3.44AsnArg: 3.44 ± 2.067
4.014AsnSer: 4.014 ± 1.098
1.72AsnThr: 1.72 ± 0.692
2.867AsnVal: 2.867 ± 1.123
0.573AsnTrp: 0.573 ± 0.399
2.867AsnTyr: 2.867 ± 1.268
0.0AsnXaa: 0.0 ± 0.0
Pro
2.294ProAla: 2.294 ± 0.868
0.573ProCys: 0.573 ± 0.498
5.734ProAsp: 5.734 ± 1.415
0.573ProGlu: 0.573 ± 0.811
3.44ProPhe: 3.44 ± 1.569
1.72ProGly: 1.72 ± 0.784
1.72ProHis: 1.72 ± 0.938
1.147ProIle: 1.147 ± 0.886
1.72ProLys: 1.72 ± 0.376
4.014ProLeu: 4.014 ± 1.9
1.147ProMet: 1.147 ± 0.798
1.72ProAsn: 1.72 ± 0.963
2.294ProPro: 2.294 ± 1.21
0.573ProGln: 0.573 ± 0.399
1.147ProArg: 1.147 ± 0.798
6.881ProSer: 6.881 ± 1.009
0.573ProThr: 0.573 ± 0.399
2.294ProVal: 2.294 ± 0.493
0.0ProTrp: 0.0 ± 0.0
1.147ProTyr: 1.147 ± 0.598
0.0ProXaa: 0.0 ± 0.0
Gln
2.294GlnAla: 2.294 ± 2.073
0.573GlnCys: 0.573 ± 0.399
1.72GlnAsp: 1.72 ± 1.197
1.72GlnGlu: 1.72 ± 1.555
2.867GlnPhe: 2.867 ± 0.957
2.867GlnGly: 2.867 ± 2.592
0.573GlnHis: 0.573 ± 0.399
2.294GlnIle: 2.294 ± 0.992
1.147GlnLys: 1.147 ± 0.792
3.44GlnLeu: 3.44 ± 1.302
0.573GlnMet: 0.573 ± 0.399
0.0GlnAsn: 0.0 ± 0.0
1.72GlnPro: 1.72 ± 1.197
0.573GlnGln: 0.573 ± 0.518
1.72GlnArg: 1.72 ± 0.938
4.587GlnSer: 4.587 ± 1.066
2.294GlnThr: 2.294 ± 0.493
2.867GlnVal: 2.867 ± 0.916
0.0GlnTrp: 0.0 ± 0.0
2.294GlnTyr: 2.294 ± 0.741
0.0GlnXaa: 0.0 ± 0.0
Arg
4.587ArgAla: 4.587 ± 0.96
0.573ArgCys: 0.573 ± 0.399
2.294ArgAsp: 2.294 ± 1.196
1.147ArgGlu: 1.147 ± 0.434
4.587ArgPhe: 4.587 ± 2.648
4.014ArgGly: 4.014 ± 0.568
1.72ArgHis: 1.72 ± 0.376
2.867ArgIle: 2.867 ± 0.631
6.881ArgLys: 6.881 ± 1.617
4.014ArgLeu: 4.014 ± 1.816
0.573ArgMet: 0.573 ± 0.518
2.867ArgAsn: 2.867 ± 2.156
2.294ArgPro: 2.294 ± 1.042
0.573ArgGln: 0.573 ± 0.399
1.72ArgArg: 1.72 ± 0.938
5.161ArgSer: 5.161 ± 1.377
1.147ArgThr: 1.147 ± 0.434
1.147ArgVal: 1.147 ± 0.816
0.573ArgTrp: 0.573 ± 0.498
2.294ArgTyr: 2.294 ± 0.769
0.0ArgXaa: 0.0 ± 0.0
Ser
5.734SerAla: 5.734 ± 3.374
3.44SerCys: 3.44 ± 1.873
6.881SerAsp: 6.881 ± 1.799
6.307SerGlu: 6.307 ± 1.783
3.44SerPhe: 3.44 ± 1.747
6.881SerGly: 6.881 ± 1.747
2.294SerHis: 2.294 ± 1.042
2.867SerIle: 2.867 ± 1.057
9.748SerLys: 9.748 ± 2.352
19.495SerLeu: 19.495 ± 2.869
0.573SerMet: 0.573 ± 0.518
5.734SerAsn: 5.734 ± 0.898
4.587SerPro: 4.587 ± 0.957
6.307SerGln: 6.307 ± 3.052
6.881SerArg: 6.881 ± 0.941
16.628SerSer: 16.628 ± 4.01
2.294SerThr: 2.294 ± 1.409
12.041SerVal: 12.041 ± 2.563
1.147SerTrp: 1.147 ± 0.521
8.028SerTyr: 8.028 ± 1.889
0.0SerXaa: 0.0 ± 0.0
Thr
1.147ThrAla: 1.147 ± 0.521
1.72ThrCys: 1.72 ± 0.862
2.294ThrAsp: 2.294 ± 0.741
1.72ThrGlu: 1.72 ± 0.908
4.014ThrPhe: 4.014 ± 2.983
4.014ThrGly: 4.014 ± 1.086
0.573ThrHis: 0.573 ± 0.399
0.573ThrIle: 0.573 ± 0.498
0.0ThrLys: 0.0 ± 0.0
3.44ThrLeu: 3.44 ± 1.558
1.147ThrMet: 1.147 ± 0.798
1.72ThrAsn: 1.72 ± 0.783
2.294ThrPro: 2.294 ± 1.128
1.147ThrGln: 1.147 ± 0.434
2.294ThrArg: 2.294 ± 0.493
5.161ThrSer: 5.161 ± 2.146
1.72ThrThr: 1.72 ± 0.862
1.147ThrVal: 1.147 ± 1.017
0.0ThrTrp: 0.0 ± 0.0
1.147ThrTyr: 1.147 ± 0.521
0.0ThrXaa: 0.0 ± 0.0
Val
4.014ValAla: 4.014 ± 0.794
1.147ValCys: 1.147 ± 0.434
4.587ValAsp: 4.587 ± 0.96
2.294ValGlu: 2.294 ± 0.493
3.44ValPhe: 3.44 ± 1.562
1.72ValGly: 1.72 ± 0.939
0.573ValHis: 0.573 ± 0.399
3.44ValIle: 3.44 ± 1.54
2.294ValLys: 2.294 ± 0.493
5.734ValLeu: 5.734 ± 2.245
0.0ValMet: 0.0 ± 0.0
5.734ValAsn: 5.734 ± 1.654
2.294ValPro: 2.294 ± 0.899
1.147ValGln: 1.147 ± 0.792
2.867ValArg: 2.867 ± 1.923
9.174ValSer: 9.174 ± 1.807
1.72ValThr: 1.72 ± 0.986
2.294ValVal: 2.294 ± 1.042
1.147ValTrp: 1.147 ± 1.134
3.44ValTyr: 3.44 ± 0.712
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.72TrpAsp: 1.72 ± 0.653
0.573TrpGlu: 0.573 ± 0.399
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.147TrpLeu: 1.147 ± 1.134
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.573TrpGln: 0.573 ± 0.498
0.573TrpArg: 0.573 ± 0.701
0.573TrpSer: 0.573 ± 0.498
0.0TrpThr: 0.0 ± 0.0
0.573TrpVal: 0.573 ± 0.399
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.294TyrAla: 2.294 ± 1.042
0.573TyrCys: 0.573 ± 0.498
4.587TyrAsp: 4.587 ± 0.986
1.72TyrGlu: 1.72 ± 0.376
2.867TyrPhe: 2.867 ± 1.506
3.44TyrGly: 3.44 ± 1.057
1.147TyrHis: 1.147 ± 0.677
1.147TyrIle: 1.147 ± 0.798
2.294TyrLys: 2.294 ± 0.647
5.734TyrLeu: 5.734 ± 0.485
1.72TyrMet: 1.72 ± 0.938
4.587TyrAsn: 4.587 ± 2.499
1.147TyrPro: 1.147 ± 0.598
1.72TyrGln: 1.72 ± 0.783
2.867TyrArg: 2.867 ± 0.814
5.161TyrSer: 5.161 ± 1.778
4.014TyrThr: 4.014 ± 1.76
3.44TyrVal: 3.44 ± 1.245
0.0TyrTrp: 0.0 ± 0.0
5.161TyrTyr: 5.161 ± 1.736
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1745 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level
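A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's implementation: it assumes that whole proteins are resampled with replacement, that the statistic is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences in one-letter code, not taken from the proteome.
toy_proteins = ["MSLSLK", "MSSLA", "MKLLSV"]
err_sl = bootstrap_error(toy_proteins, "SL")
err_ww = bootstrap_error(toy_proteins, "WW")  # absent pair
```

A dipeptide that never occurs has a frequency of zero in every replicate, which is consistent with the `0.0 ± 0.0` entries in the table.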

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski