Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_550

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
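A minimal sketch of how such dipeptide frequencies could be computed is shown below. It is an illustration only, not the code behind this page: the function name `dipeptide_frequencies` is hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and counting pairs only within each protein (never across protein boundaries) is an assumption.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Overlapping adjacent pairs are counted inside each protein only
    (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale each count to per mille of all observed dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two short made-up sequences, not taken from this proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Pairs that never occur are simply absent from the returned dictionary; a full table like the one on this page would fill them in as 0.0.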

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
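The uniform expectation and the per mille convention can be illustrated with a short sketch. The helper names `expected_per_mille` and `classify` are hypothetical, and comparing each value directly against the uniform expectation is an assumption; the exact rule used for the red and blue marking on the page is not stated here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# A few values taken from the table on this page (in per mille).
table_values = {"AlaAla": 8.203, "AlaCys": 1.491, "GlySer": 10.44}
for name, value in table_values.items():
    fraction = value * 1e-3  # per mille -> fraction
    print(name, fraction, classify(value))
```

With 20 amino acids the expectation is 2.5‰; passing `alphabet_size=21` gives the slightly lower value that applies when Xaa is counted as well.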

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency ± error (‰); rows are grouped by the first residue.

Ala
AlaAla: 8.203 ± 3.07
AlaCys: 1.491 ± 0.565
AlaAsp: 2.983 ± 1.758
AlaGlu: 5.966 ± 1.449
AlaPhe: 4.474 ± 1.487
AlaGly: 2.983 ± 1.278
AlaHis: 0.746 ± 0.508
AlaIle: 7.457 ± 1.274
AlaLys: 8.949 ± 4.263
AlaLeu: 8.949 ± 2.656
AlaMet: 1.491 ± 1.445
AlaAsn: 4.474 ± 1.671
AlaPro: 0.746 ± 0.508
AlaGln: 2.983 ± 2.389
AlaArg: 2.983 ± 1.406
AlaSer: 8.949 ± 3.574
AlaThr: 5.966 ± 1.75
AlaVal: 3.729 ± 0.987
AlaTrp: 1.491 ± 0.792
AlaTyr: 6.711 ± 1.209
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.746 ± 1.074
CysPhe: 0.0 ± 0.0
CysGly: 0.746 ± 0.661
CysHis: 0.746 ± 0.661
CysIle: 0.746 ± 1.043
CysLys: 0.746 ± 0.508
CysLeu: 1.491 ± 1.009
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.746 ± 0.661
CysGln: 0.0 ± 0.0
CysArg: 2.237 ± 0.847
CysSer: 0.0 ± 0.0
CysThr: 1.491 ± 1.323
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.746 ± 0.508
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.491 ± 1.286
AspCys: 0.0 ± 0.0
AspAsp: 2.983 ± 1.775
AspGlu: 6.711 ± 2.692
AspPhe: 5.22 ± 1.018
AspGly: 1.491 ± 0.792
AspHis: 2.237 ± 0.847
AspIle: 2.237 ± 1.65
AspLys: 3.729 ± 2.105
AspLeu: 5.22 ± 1.145
AspMet: 2.237 ± 0.847
AspAsn: 3.729 ± 2.052
AspPro: 2.237 ± 0.947
AspGln: 0.0 ± 0.0
AspArg: 1.491 ± 1.016
AspSer: 4.474 ± 1.487
AspThr: 2.983 ± 1.454
AspVal: 1.491 ± 1.622
AspTrp: 0.0 ± 0.0
AspTyr: 5.22 ± 1.196
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.711 ± 2.434
GluCys: 1.491 ± 1.262
GluAsp: 0.746 ± 0.508
GluGlu: 5.966 ± 4.585
GluPhe: 1.491 ± 0.792
GluGly: 0.746 ± 0.896
GluHis: 2.237 ± 1.121
GluIle: 2.983 ± 1.593
GluLys: 5.966 ± 3.172
GluLeu: 5.22 ± 3.131
GluMet: 1.491 ± 0.565
GluAsn: 1.491 ± 0.565
GluPro: 0.746 ± 0.508
GluGln: 0.746 ± 0.508
GluArg: 1.491 ± 1.016
GluSer: 5.966 ± 1.757
GluThr: 8.203 ± 1.8
GluVal: 9.694 ± 4.076
GluTrp: 1.491 ± 1.009
GluTyr: 3.729 ± 0.776
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.22 ± 1.364
PheCys: 0.746 ± 0.508
PheAsp: 1.491 ± 1.243
PheGlu: 3.729 ± 1.743
PhePhe: 1.491 ± 1.262
PheGly: 5.966 ± 1.693
PheHis: 0.746 ± 0.508
PheIle: 3.729 ± 1.266
PheLys: 2.983 ± 1.42
PheLeu: 3.729 ± 1.283
PheMet: 1.491 ± 0.565
PheAsn: 1.491 ± 0.769
PhePro: 1.491 ± 1.016
PheGln: 0.0 ± 0.0
PheArg: 2.237 ± 1.121
PheSer: 2.983 ± 1.575
PheThr: 2.983 ± 2.032
PheVal: 0.746 ± 0.508
PheTrp: 0.746 ± 0.508
PheTyr: 0.746 ± 0.508
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.729 ± 1.747
GlyCys: 0.0 ± 0.0
GlyAsp: 0.746 ± 0.508
GlyGlu: 5.22 ± 1.895
GlyPhe: 4.474 ± 1.141
GlyGly: 2.237 ± 1.054
GlyHis: 0.746 ± 0.661
GlyIle: 5.22 ± 1.406
GlyLys: 3.729 ± 1.818
GlyLeu: 3.729 ± 1.647
GlyMet: 0.0 ± 0.0
GlyAsn: 5.966 ± 3.247
GlyPro: 0.0 ± 0.0
GlyGln: 0.746 ± 0.508
GlyArg: 0.746 ± 0.508
GlySer: 10.44 ± 3.28
GlyThr: 2.983 ± 1.081
GlyVal: 7.457 ± 2.756
GlyTrp: 0.0 ± 0.0
GlyTyr: 5.966 ± 1.223
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.237 ± 1.418
HisCys: 0.0 ± 0.0
HisAsp: 2.983 ± 1.454
HisGlu: 0.746 ± 0.661
HisPhe: 0.0 ± 0.0
HisGly: 4.474 ± 2.883
HisHis: 0.0 ± 0.0
HisIle: 1.491 ± 0.565
HisLys: 0.746 ± 0.661
HisLeu: 2.983 ± 0.956
HisMet: 0.0 ± 0.0
HisAsn: 0.746 ± 0.661
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.491 ± 1.016
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 2.237 ± 1.121
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.949 ± 3.655
IleCys: 0.746 ± 0.508
IleAsp: 5.966 ± 1.262
IleGlu: 0.746 ± 0.508
IlePhe: 2.983 ± 1.219
IleGly: 3.729 ± 0.948
IleHis: 0.746 ± 1.043
IleIle: 2.237 ± 1.054
IleLys: 3.729 ± 1.172
IleLeu: 2.237 ± 0.773
IleMet: 0.746 ± 0.508
IleAsn: 2.983 ± 1.514
IlePro: 2.983 ± 0.956
IleGln: 0.746 ± 0.661
IleArg: 0.746 ± 1.074
IleSer: 4.474 ± 1.793
IleThr: 3.729 ± 1.325
IleVal: 3.729 ± 1.647
IleTrp: 0.746 ± 0.508
IleTyr: 1.491 ± 2.086
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.22 ± 1.547
LysCys: 1.491 ± 0.565
LysAsp: 2.983 ± 2.115
LysGlu: 6.711 ± 3.66
LysPhe: 2.237 ± 1.524
LysGly: 2.983 ± 1.758
LysHis: 2.983 ± 1.388
LysIle: 1.491 ± 0.947
LysLys: 8.203 ± 3.451
LysLeu: 3.729 ± 1.282
LysMet: 5.22 ± 2.836
LysAsn: 2.237 ± 0.986
LysPro: 5.966 ± 1.805
LysGln: 2.983 ± 1.575
LysArg: 1.491 ± 1.016
LysSer: 3.729 ± 0.892
LysThr: 5.22 ± 1.564
LysVal: 3.729 ± 2.398
LysTrp: 0.746 ± 0.661
LysTyr: 1.491 ± 1.243
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.474 ± 1.101
LeuCys: 0.0 ± 0.0
LeuAsp: 5.22 ± 1.79
LeuGlu: 3.729 ± 1.778
LeuPhe: 2.983 ± 1.575
LeuGly: 6.711 ± 1.988
LeuHis: 2.983 ± 1.278
LeuIle: 4.474 ± 1.483
LeuLys: 6.711 ± 2.261
LeuLeu: 3.729 ± 1.412
LeuMet: 0.746 ± 1.043
LeuAsn: 5.22 ± 2.598
LeuPro: 5.22 ± 1.44
LeuGln: 2.983 ± 1.278
LeuArg: 2.983 ± 1.429
LeuSer: 3.729 ± 2.255
LeuThr: 5.966 ± 1.451
LeuVal: 7.457 ± 2.199
LeuTrp: 1.491 ± 1.016
LeuTyr: 2.237 ± 1.696
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.729 ± 1.683
MetCys: 1.491 ± 1.323
MetAsp: 0.0 ± 0.0
MetGlu: 0.746 ± 1.074
MetPhe: 0.746 ± 0.508
MetGly: 0.746 ± 0.508
MetHis: 0.746 ± 0.661
MetIle: 2.237 ± 2.197
MetLys: 0.0 ± 0.0
MetLeu: 0.746 ± 0.508
MetMet: 0.746 ± 0.661
MetAsn: 1.491 ± 0.792
MetPro: 0.0 ± 0.0
MetGln: 1.491 ± 1.323
MetArg: 0.746 ± 0.508
MetSer: 0.746 ± 0.896
MetThr: 0.746 ± 0.508
MetVal: 0.746 ± 1.043
MetTrp: 0.0 ± 0.0
MetTyr: 1.491 ± 1.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.474 ± 1.702
AsnCys: 0.746 ± 0.508
AsnAsp: 2.237 ± 0.847
AsnGlu: 5.966 ± 1.805
AsnPhe: 2.237 ± 1.039
AsnGly: 2.983 ± 1.278
AsnHis: 0.0 ± 0.0
AsnIle: 1.491 ± 1.016
AsnLys: 5.966 ± 1.987
AsnLeu: 5.966 ± 2.693
AsnMet: 1.491 ± 0.813
AsnAsn: 1.491 ± 0.565
AsnPro: 2.237 ± 0.847
AsnGln: 0.746 ± 0.508
AsnArg: 2.237 ± 1.984
AsnSer: 2.237 ± 1.881
AsnThr: 5.22 ± 2.764
AsnVal: 2.237 ± 1.12
AsnTrp: 0.746 ± 1.074
AsnTyr: 2.983 ± 1.538
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.491 ± 1.016
ProCys: 0.746 ± 0.661
ProAsp: 1.491 ± 1.016
ProGlu: 3.729 ± 2.187
ProPhe: 3.729 ± 1.747
ProGly: 5.22 ± 2.006
ProHis: 0.746 ± 0.661
ProIle: 2.237 ± 0.847
ProLys: 0.0 ± 0.0
ProLeu: 4.474 ± 1.049
ProMet: 0.0 ± 0.0
ProAsn: 0.746 ± 0.508
ProPro: 0.746 ± 0.508
ProGln: 2.983 ± 0.956
ProArg: 0.746 ± 0.661
ProSer: 2.237 ± 0.773
ProThr: 1.491 ± 1.016
ProVal: 2.983 ± 1.131
ProTrp: 0.746 ± 1.074
ProTyr: 2.237 ± 1.054
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.966 ± 3.359
GlnCys: 0.0 ± 0.0
GlnAsp: 0.746 ± 0.508
GlnGlu: 0.0 ± 0.0
GlnPhe: 2.237 ± 0.847
GlnGly: 1.491 ± 0.565
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 2.983 ± 0.981
GlnLeu: 1.491 ± 1.016
GlnMet: 0.0 ± 0.0
GlnAsn: 2.237 ± 0.773
GlnPro: 0.746 ± 0.508
GlnGln: 0.746 ± 1.043
GlnArg: 2.237 ± 1.591
GlnSer: 2.237 ± 1.498
GlnThr: 1.491 ± 1.016
GlnVal: 1.491 ± 1.323
GlnTrp: 2.237 ± 1.121
GlnTyr: 0.746 ± 0.508
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.729 ± 0.776
ArgCys: 0.0 ± 0.0
ArgAsp: 2.983 ± 0.726
ArgGlu: 0.746 ± 1.043
ArgPhe: 2.983 ± 1.278
ArgGly: 2.237 ± 1.524
ArgHis: 0.0 ± 0.0
ArgIle: 1.491 ± 1.016
ArgLys: 1.491 ± 1.323
ArgLeu: 2.237 ± 0.7
ArgMet: 0.0 ± 0.0
ArgAsn: 1.491 ± 1.016
ArgPro: 1.491 ± 1.323
ArgGln: 0.0 ± 0.0
ArgArg: 2.237 ± 0.847
ArgSer: 2.237 ± 1.182
ArgThr: 1.491 ± 1.016
ArgVal: 0.746 ± 0.508
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.983 ± 1.751
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.22 ± 1.638
SerCys: 0.746 ± 1.074
SerAsp: 4.474 ± 1.789
SerGlu: 3.729 ± 2.139
SerPhe: 1.491 ± 0.769
SerGly: 5.22 ± 0.83
SerHis: 1.491 ± 1.286
SerIle: 2.983 ± 1.278
SerLys: 7.457 ± 3.231
SerLeu: 8.203 ± 2.199
SerMet: 0.746 ± 0.967
SerAsn: 7.457 ± 1.784
SerPro: 2.237 ± 0.947
SerGln: 2.237 ± 1.498
SerArg: 0.746 ± 0.508
SerSer: 9.694 ± 5.486
SerThr: 4.474 ± 2.471
SerVal: 6.711 ± 1.309
SerTrp: 0.746 ± 0.508
SerTyr: 3.729 ± 1.649
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.203 ± 1.694
ThrCys: 0.0 ± 0.0
ThrAsp: 5.22 ± 2.273
ThrGlu: 4.474 ± 1.193
ThrPhe: 0.746 ± 0.811
ThrGly: 8.203 ± 3.12
ThrHis: 0.746 ± 0.896
ThrIle: 2.237 ± 1.099
ThrLys: 2.983 ± 1.278
ThrLeu: 3.729 ± 1.016
ThrMet: 0.0 ± 0.0
ThrAsn: 3.729 ± 2.54
ThrPro: 3.729 ± 1.661
ThrGln: 2.983 ± 1.081
ThrArg: 1.491 ± 1.323
ThrSer: 5.966 ± 2.384
ThrThr: 0.0 ± 0.0
ThrVal: 0.746 ± 0.508
ThrTrp: 0.746 ± 0.508
ThrTyr: 3.729 ± 1.647
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.203 ± 1.815
ValCys: 0.0 ± 0.0
ValAsp: 2.237 ± 1.955
ValGlu: 3.729 ± 0.898
ValPhe: 1.491 ± 1.262
ValGly: 4.474 ± 1.049
ValHis: 0.0 ± 0.0
ValIle: 5.22 ± 3.282
ValLys: 2.983 ± 1.216
ValLeu: 3.729 ± 1.172
ValMet: 1.491 ± 0.792
ValAsn: 4.474 ± 2.077
ValPro: 5.22 ± 1.397
ValGln: 0.0 ± 0.0
ValArg: 2.237 ± 1.524
ValSer: 5.966 ± 1.251
ValThr: 2.983 ± 1.406
ValVal: 5.966 ± 2.219
ValTrp: 0.0 ± 0.0
ValTyr: 2.983 ± 1.584
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.746 ± 1.074
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 1.491 ± 0.947
TrpIle: 1.491 ± 1.009
TrpLys: 0.746 ± 0.508
TrpLeu: 1.491 ± 1.009
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.746 ± 0.661
TrpArg: 0.746 ± 0.508
TrpSer: 1.491 ± 1.016
TrpThr: 0.746 ± 0.508
TrpVal: 2.237 ± 0.847
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.746 ± 0.508
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.22 ± 1.864
TyrCys: 0.746 ± 1.043
TyrAsp: 9.694 ± 1.119
TyrGlu: 4.474 ± 0.992
TyrPhe: 3.729 ± 1.348
TyrGly: 1.491 ± 0.792
TyrHis: 0.746 ± 0.661
TyrIle: 2.983 ± 1.405
TyrLys: 0.746 ± 0.508
TyrLeu: 5.966 ± 2.702
TyrMet: 0.746 ± 0.896
TyrAsn: 2.237 ± 1.524
TyrPro: 2.237 ± 0.847
TyrGln: 5.966 ± 1.451
TyrArg: 0.746 ± 0.661
TyrSer: 0.746 ± 0.661
TyrThr: 1.491 ± 1.243
TyrVal: 1.491 ± 0.565
TyrTrp: 0.746 ± 0.508
TyrTyr: 1.491 ± 1.016
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (1342 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
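One way a protein-level bootstrap could work is sketched below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is summarized as a population standard deviation. The function names, the choice of standard deviation, and the fixed seed are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled sets of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Toy input: made-up sequences, not taken from this proteome.
toy = ["MKKLA", "AKKK", "GGSA"]
print(bootstrap_error(toy, "KK"))
```

With only a handful of proteins, as in this proteome, each replicate can differ a lot from the original set, which would be consistent with errors that are large relative to the frequencies themselves.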

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski