Amino acid dipeptide frequency for Helicobasidium mompa totivirus 1-17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
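The counting described above can be sketched in Python as follows. This is a minimal illustration and not the code used to build this table: the function names `dipeptide_permille` and `classify`, the one-letter input alphabet, the choice to count pairs within each protein only, and the use of the uniform 0.25% (2.5‰) value as the red/blue threshold are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code, in the same order as the
# three-letter column labels of the table (Ala, Cys, Asp, ...).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # X stands for Xaa (non-standard or unknown)

# Uniform expectation: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Overlapping pairs are counted within each protein only, so no pair
    spans two proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"   # would be marked red
    if freq_permille < UNIFORM_PERMILLE:
        return "under"  # would be marked blue
    return "expected"


# Toy sequences, made up for illustration only.
freqs = dipeptide_permille(["AAAC", "ACB"])
print(freqs["AA"], classify(freqs["AA"]))
```

With real data, the input would be the protein sequences of the proteome; the toy strings here only demonstrate the mechanics.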

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
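As a small worked example of the unit conversion, the snippet below turns the AlaAla value from the table (18.182‰) into a fraction and a percentage, and compares it with the uniform expectation of 0.25% (2.5‰). The helper name `permille_to_fraction` is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_permille = 18.182          # AlaAla entry from the table
uniform_permille = 2.5             # 0.25% expressed in per mille

fraction = permille_to_fraction(ala_ala_permille)   # about 0.018182
percent = fraction * 100                            # about 1.8182 %
fold = ala_ala_permille / uniform_permille          # about 7.27 times the uniform value

print(f"fraction={fraction:.6f} percent={percent:.4f} fold={fold:.2f}")
```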

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
18.182AlaAla: 18.182 ± 1.659
3.519AlaCys: 3.519 ± 0.968
5.865AlaAsp: 5.865 ± 1.406
3.519AlaGlu: 3.519 ± 0.63
1.76AlaPhe: 1.76 ± 0.484
9.971AlaGly: 9.971 ± 2.869
1.173AlaHis: 1.173 ± 0.21
4.106AlaIle: 4.106 ± 0.796
5.279AlaLys: 5.279 ± 0.982
9.384AlaLeu: 9.384 ± 3.272
2.346AlaMet: 2.346 ± 0.871
6.452AlaAsn: 6.452 ± 1.782
7.625AlaPro: 7.625 ± 4.186
2.933AlaGln: 2.933 ± 1.925
7.625AlaArg: 7.625 ± 1.48
6.452AlaSer: 6.452 ± 1.271
7.038AlaThr: 7.038 ± 1.414
9.384AlaVal: 9.384 ± 1.644
1.173AlaTrp: 1.173 ± 0.817
1.173AlaTyr: 1.173 ± 2.265
0.0AlaXaa: 0.0 ± 0.0
Cys
2.346CysAla: 2.346 ± 0.42
0.0CysCys: 0.0 ± 0.0
2.346CysAsp: 2.346 ± 0.871
1.76CysGlu: 1.76 ± 0.545
0.0CysPhe: 0.0 ± 0.0
0.587CysGly: 0.587 ± 0.409
1.173CysHis: 1.173 ± 0.817
1.173CysIle: 1.173 ± 0.817
0.587CysLys: 0.587 ± 0.433
2.346CysLeu: 2.346 ± 0.871
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.76CysPro: 1.76 ± 0.545
0.0CysGln: 0.0 ± 0.0
2.933CysArg: 2.933 ± 0.703
1.76CysSer: 1.76 ± 0.484
0.587CysThr: 0.587 ± 0.433
1.76CysVal: 1.76 ± 0.545
0.587CysTrp: 0.587 ± 0.409
1.173CysTyr: 1.173 ± 0.21
0.0CysXaa: 0.0 ± 0.0
Asp
6.452AspAla: 6.452 ± 1.176
1.76AspCys: 1.76 ± 0.484
2.346AspAsp: 2.346 ± 0.871
2.933AspGlu: 2.933 ± 1.389
1.173AspPhe: 1.173 ± 0.817
3.519AspGly: 3.519 ± 0.63
1.173AspHis: 1.173 ± 0.21
0.587AspIle: 0.587 ± 0.409
1.173AspLys: 1.173 ± 0.817
4.692AspLeu: 4.692 ± 1.24
0.587AspMet: 0.587 ± 0.409
0.587AspAsn: 0.587 ± 0.433
4.106AspPro: 4.106 ± 1.348
1.76AspGln: 1.76 ± 2.132
1.76AspArg: 1.76 ± 0.545
4.106AspSer: 4.106 ± 0.796
3.519AspThr: 3.519 ± 0.63
3.519AspVal: 3.519 ± 0.63
0.0AspTrp: 0.0 ± 0.0
0.587AspTyr: 0.587 ± 0.409
0.0AspXaa: 0.0 ± 0.0
Glu
7.038GluAla: 7.038 ± 1.936
0.0GluCys: 0.0 ± 0.0
0.0GluAsp: 0.0 ± 0.0
2.933GluGlu: 2.933 ± 2.043
1.76GluPhe: 1.76 ± 0.545
4.692GluGly: 4.692 ± 1.24
0.587GluHis: 0.587 ± 0.409
1.76GluIle: 1.76 ± 2.303
0.0GluLys: 0.0 ± 0.0
3.519GluLeu: 3.519 ± 1.676
1.173GluMet: 1.173 ± 0.21
1.173GluAsn: 1.173 ± 0.817
2.346GluPro: 2.346 ± 2.055
1.173GluGln: 1.173 ± 0.817
4.692GluArg: 4.692 ± 0.84
0.587GluSer: 0.587 ± 0.433
4.106GluThr: 4.106 ± 1.502
2.346GluVal: 2.346 ± 1.635
0.0GluTrp: 0.0 ± 0.0
1.76GluTyr: 1.76 ± 0.484
0.0GluXaa: 0.0 ± 0.0
Phe
2.933PheAla: 2.933 ± 1.389
0.0PheCys: 0.0 ± 0.0
2.346PheAsp: 2.346 ± 0.871
1.76PheGlu: 1.76 ± 0.484
1.76PhePhe: 1.76 ± 0.484
5.279PheGly: 5.279 ± 1.635
0.0PheHis: 0.0 ± 0.0
1.76PheIle: 1.76 ± 0.484
0.0PheLys: 0.0 ± 0.0
2.346PheLeu: 2.346 ± 0.871
1.173PheMet: 1.173 ± 0.867
1.76PheAsn: 1.76 ± 1.226
1.173PhePro: 1.173 ± 0.21
0.0PheGln: 0.0 ± 0.0
1.173PheArg: 1.173 ± 0.21
2.346PheSer: 2.346 ± 2.405
1.76PheThr: 1.76 ± 0.545
0.587PheVal: 0.587 ± 0.409
1.173PheTrp: 1.173 ± 0.21
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
9.971GlyAla: 9.971 ± 2.284
2.933GlyCys: 2.933 ± 0.703
3.519GlyAsp: 3.519 ± 0.63
2.933GlyGlu: 2.933 ± 0.624
1.173GlyPhe: 1.173 ± 0.867
9.384GlyGly: 9.384 ± 3.848
2.933GlyHis: 2.933 ± 0.624
4.106GlyIle: 4.106 ± 1.502
2.933GlyLys: 2.933 ± 1.271
8.798GlyLeu: 8.798 ± 2.802
0.587GlyMet: 0.587 ± 0.409
3.519GlyAsn: 3.519 ± 1.09
5.279GlyPro: 5.279 ± 1.781
2.933GlyGln: 2.933 ± 1.389
5.865GlyArg: 5.865 ± 2.045
4.106GlySer: 4.106 ± 0.883
4.106GlyThr: 4.106 ± 0.796
7.625GlyVal: 7.625 ± 1.471
1.173GlyTrp: 1.173 ± 0.21
1.76GlyTyr: 1.76 ± 0.545
0.0GlyXaa: 0.0 ± 0.0
His
1.76HisAla: 1.76 ± 1.226
1.173HisCys: 1.173 ± 0.867
1.173HisAsp: 1.173 ± 0.867
0.587HisGlu: 0.587 ± 0.433
1.173HisPhe: 1.173 ± 0.867
1.76HisGly: 1.76 ± 0.545
1.173HisHis: 1.173 ± 0.817
1.173HisIle: 1.173 ± 0.21
0.587HisLys: 0.587 ± 0.409
4.692HisLeu: 4.692 ± 0.84
0.0HisMet: 0.0 ± 0.0
1.173HisAsn: 1.173 ± 0.21
0.587HisPro: 0.587 ± 0.433
0.587HisGln: 0.587 ± 0.433
2.346HisArg: 2.346 ± 1.635
0.587HisSer: 0.587 ± 0.409
2.933HisThr: 2.933 ± 2.058
0.587HisVal: 0.587 ± 0.409
0.587HisTrp: 0.587 ± 0.433
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.279IleAla: 5.279 ± 1.781
0.0IleCys: 0.0 ± 0.0
2.346IleAsp: 2.346 ± 0.42
2.933IleGlu: 2.933 ± 2.123
0.587IlePhe: 0.587 ± 0.409
2.346IleGly: 2.346 ± 0.42
1.76IleHis: 1.76 ± 0.545
1.76IleIle: 1.76 ± 0.484
2.933IleLys: 2.933 ± 2.043
3.519IleLeu: 3.519 ± 0.63
1.173IleMet: 1.173 ± 0.21
2.933IleAsn: 2.933 ± 0.703
1.173IlePro: 1.173 ± 0.21
2.346IleGln: 2.346 ± 0.871
2.933IleArg: 2.933 ± 0.624
2.933IleSer: 2.933 ± 2.043
2.346IleThr: 2.346 ± 2.055
2.933IleVal: 2.933 ± 0.703
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.692LysAla: 4.692 ± 4.665
1.173LysCys: 1.173 ± 0.817
0.0LysAsp: 0.0 ± 0.0
1.173LysGlu: 1.173 ± 0.817
1.173LysPhe: 1.173 ± 0.817
1.173LysGly: 1.173 ± 0.817
1.173LysHis: 1.173 ± 0.21
1.173LysIle: 1.173 ± 0.817
1.173LysLys: 1.173 ± 0.21
2.933LysLeu: 2.933 ± 0.624
1.173LysMet: 1.173 ± 0.817
0.0LysAsn: 0.0 ± 0.0
1.173LysPro: 1.173 ± 0.817
0.587LysGln: 0.587 ± 0.409
2.346LysArg: 2.346 ± 1.635
1.173LysSer: 1.173 ± 0.817
1.76LysThr: 1.76 ± 0.484
4.692LysVal: 4.692 ± 1.742
0.0LysTrp: 0.0 ± 0.0
1.173LysTyr: 1.173 ± 0.867
0.0LysXaa: 0.0 ± 0.0
Leu
11.73LeuAla: 11.73 ± 0.384
1.76LeuCys: 1.76 ± 0.484
4.692LeuAsp: 4.692 ± 1.742
4.106LeuGlu: 4.106 ± 2.082
3.519LeuPhe: 3.519 ± 0.968
8.798LeuGly: 8.798 ± 3.786
1.76LeuHis: 1.76 ± 0.545
3.519LeuIle: 3.519 ± 1.676
1.173LeuLys: 1.173 ± 0.817
8.798LeuLeu: 8.798 ± 2.404
0.0LeuMet: 0.0 ± 0.0
5.865LeuAsn: 5.865 ± 2.543
8.211LeuPro: 8.211 ± 2.446
3.519LeuGln: 3.519 ± 0.63
6.452LeuArg: 6.452 ± 1.308
10.557LeuSer: 10.557 ± 1.58
2.346LeuThr: 2.346 ± 0.871
4.106LeuVal: 4.106 ± 1.718
2.346LeuTrp: 2.346 ± 0.962
0.587LeuTyr: 0.587 ± 0.409
0.0LeuXaa: 0.0 ± 0.0
Met
3.519MetAla: 3.519 ± 0.63
0.0MetCys: 0.0 ± 0.0
1.173MetAsp: 1.173 ± 0.21
0.587MetGlu: 0.587 ± 0.409
0.587MetPhe: 0.587 ± 0.409
1.76MetGly: 1.76 ± 0.545
0.587MetHis: 0.587 ± 0.409
1.173MetIle: 1.173 ± 0.867
1.173MetLys: 1.173 ± 0.817
2.346MetLeu: 2.346 ± 0.42
2.346MetMet: 2.346 ± 1.506
2.346MetAsn: 2.346 ± 0.871
0.587MetPro: 0.587 ± 0.409
1.173MetGln: 1.173 ± 0.21
1.173MetArg: 1.173 ± 0.21
2.346MetSer: 2.346 ± 2.083
1.76MetThr: 1.76 ± 0.484
0.587MetVal: 0.587 ± 0.409
0.0MetTrp: 0.0 ± 0.0
1.173MetTyr: 1.173 ± 0.817
0.0MetXaa: 0.0 ± 0.0
Asn
5.279AsnAla: 5.279 ± 1.781
0.587AsnCys: 0.587 ± 0.409
1.76AsnAsp: 1.76 ± 2.303
1.173AsnGlu: 1.173 ± 0.21
2.346AsnPhe: 2.346 ± 0.962
5.279AsnGly: 5.279 ± 1.635
1.76AsnHis: 1.76 ± 2.132
1.76AsnIle: 1.76 ± 0.545
0.587AsnLys: 0.587 ± 0.409
2.933AsnLeu: 2.933 ± 1.271
2.346AsnMet: 2.346 ± 0.42
1.76AsnAsn: 1.76 ± 0.545
1.76AsnPro: 1.76 ± 1.3
2.346AsnGln: 2.346 ± 0.962
3.519AsnArg: 3.519 ± 0.968
4.692AsnSer: 4.692 ± 1.952
1.76AsnThr: 1.76 ± 0.484
2.933AsnVal: 2.933 ± 0.624
1.76AsnTrp: 1.76 ± 2.262
2.346AsnTyr: 2.346 ± 0.42
0.0AsnXaa: 0.0 ± 0.0
Pro
3.519ProAla: 3.519 ± 2.247
0.0ProCys: 0.0 ± 0.0
0.587ProAsp: 0.587 ± 0.409
2.346ProGlu: 2.346 ± 2.332
2.346ProPhe: 2.346 ± 1.734
4.106ProGly: 4.106 ± 1.502
0.587ProHis: 0.587 ± 0.433
2.933ProIle: 2.933 ± 0.624
1.173ProLys: 1.173 ± 0.817
5.865ProLeu: 5.865 ± 1.248
0.587ProMet: 0.587 ± 0.433
1.173ProAsn: 1.173 ± 4.68
2.933ProPro: 2.933 ± 6.812
1.173ProGln: 1.173 ± 2.265
7.625ProArg: 7.625 ± 2.85
5.279ProSer: 5.279 ± 1.781
2.346ProThr: 2.346 ± 1.734
6.452ProVal: 6.452 ± 1.454
0.0ProTrp: 0.0 ± 0.0
1.173ProTyr: 1.173 ± 0.21
0.0ProXaa: 0.0 ± 0.0
Gln
2.933GlnAla: 2.933 ± 2.123
0.587GlnCys: 0.587 ± 0.409
1.76GlnAsp: 1.76 ± 0.484
1.173GlnGlu: 1.173 ± 0.817
0.0GlnPhe: 0.0 ± 0.0
2.346GlnGly: 2.346 ± 0.962
0.587GlnHis: 0.587 ± 0.409
1.173GlnIle: 1.173 ± 0.21
2.933GlnLys: 2.933 ± 4.509
2.346GlnLeu: 2.346 ± 0.42
1.173GlnMet: 1.173 ± 1.212
1.76GlnAsn: 1.76 ± 1.226
1.173GlnPro: 1.173 ± 2.281
1.173GlnGln: 1.173 ± 0.867
0.0GlnArg: 0.0 ± 0.0
2.346GlnSer: 2.346 ± 0.871
1.173GlnThr: 1.173 ± 0.21
4.106GlnVal: 4.106 ± 1.718
0.0GlnTrp: 0.0 ± 0.0
1.173GlnTyr: 1.173 ± 0.21
0.0GlnXaa: 0.0 ± 0.0
Arg
7.038ArgAla: 7.038 ± 1.328
2.933ArgCys: 2.933 ± 1.389
2.346ArgAsp: 2.346 ± 0.962
2.346ArgGlu: 2.346 ± 0.871
1.173ArgPhe: 1.173 ± 2.265
4.692ArgGly: 4.692 ± 1.24
2.346ArgHis: 2.346 ± 1.734
2.933ArgIle: 2.933 ± 2.058
4.106ArgLys: 4.106 ± 2.86
4.106ArgLeu: 4.106 ± 2.293
4.692ArgMet: 4.692 ± 1.742
5.279ArgAsn: 5.279 ± 4.184
1.76ArgPro: 1.76 ± 0.545
1.173ArgGln: 1.173 ± 0.867
7.625ArgArg: 7.625 ± 1.941
5.865ArgSer: 5.865 ± 1.508
3.519ArgThr: 3.519 ± 0.63
7.625ArgVal: 7.625 ± 1.471
1.76ArgTrp: 1.76 ± 1.226
0.587ArgTyr: 0.587 ± 0.433
0.0ArgXaa: 0.0 ± 0.0
Ser
7.038SerAla: 7.038 ± 1.259
1.76SerCys: 1.76 ± 0.484
2.933SerAsp: 2.933 ± 2.043
2.346SerGlu: 2.346 ± 0.962
4.106SerPhe: 4.106 ± 0.796
3.519SerGly: 3.519 ± 1.09
2.933SerHis: 2.933 ± 1.271
4.106SerIle: 4.106 ± 1.718
0.0SerLys: 0.0 ± 0.0
8.798SerLeu: 8.798 ± 1.069
1.173SerMet: 1.173 ± 0.817
2.933SerAsn: 2.933 ± 1.389
3.519SerPro: 3.519 ± 2.805
2.346SerGln: 2.346 ± 0.871
5.865SerArg: 5.865 ± 6.446
5.279SerSer: 5.279 ± 1.074
5.865SerThr: 5.865 ± 1.406
4.106SerVal: 4.106 ± 0.883
0.587SerTrp: 0.587 ± 0.409
3.519SerTyr: 3.519 ± 0.63
0.0SerXaa: 0.0 ± 0.0
Thr
4.692ThrAla: 4.692 ± 0.84
1.173ThrCys: 1.173 ± 0.867
2.346ThrAsp: 2.346 ± 0.42
2.346ThrGlu: 2.346 ± 0.871
1.76ThrPhe: 1.76 ± 0.545
7.038ThrGly: 7.038 ± 1.582
1.76ThrHis: 1.76 ± 0.545
1.173ThrIle: 1.173 ± 0.21
0.587ThrLys: 0.587 ± 0.409
4.692ThrLeu: 4.692 ± 1.952
4.106ThrMet: 4.106 ± 1.251
2.933ThrAsn: 2.933 ± 0.703
3.519ThrPro: 3.519 ± 0.968
2.346ThrGln: 2.346 ± 0.871
2.933ThrArg: 2.933 ± 0.624
3.519ThrSer: 3.519 ± 1.09
4.692ThrThr: 4.692 ± 2.683
3.519ThrVal: 3.519 ± 1.819
1.76ThrTrp: 1.76 ± 1.3
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
8.211ValAla: 8.211 ± 2.325
1.76ValCys: 1.76 ± 0.484
5.865ValAsp: 5.865 ± 1.406
3.519ValGlu: 3.519 ± 0.63
1.76ValPhe: 1.76 ± 0.545
7.625ValGly: 7.625 ± 2.589
1.173ValHis: 1.173 ± 0.21
2.933ValIle: 2.933 ± 0.624
2.933ValLys: 2.933 ± 0.624
8.211ValLeu: 8.211 ± 2.697
1.173ValMet: 1.173 ± 0.21
4.106ValAsn: 4.106 ± 0.883
2.933ValPro: 2.933 ± 2.058
2.346ValGln: 2.346 ± 4.53
2.933ValArg: 2.933 ± 2.469
5.279ValSer: 5.279 ± 1.635
2.346ValThr: 2.346 ± 0.962
3.519ValVal: 3.519 ± 1.819
0.587ValTrp: 0.587 ± 0.409
3.519ValTyr: 3.519 ± 0.968
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.587TrpCys: 0.587 ± 0.409
1.173TrpAsp: 1.173 ± 0.21
0.0TrpGlu: 0.0 ± 0.0
0.587TrpPhe: 0.587 ± 0.433
0.587TrpGly: 0.587 ± 0.409
0.0TrpHis: 0.0 ± 0.0
1.173TrpIle: 1.173 ± 0.21
0.0TrpLys: 0.0 ± 0.0
1.76TrpLeu: 1.76 ± 2.132
0.0TrpMet: 0.0 ± 0.0
1.76TrpAsn: 1.76 ± 0.545
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
2.346TrpArg: 2.346 ± 0.871
1.173TrpSer: 1.173 ± 0.817
1.173TrpThr: 1.173 ± 0.21
1.173TrpVal: 1.173 ± 0.21
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.346TyrAla: 2.346 ± 0.962
1.173TyrCys: 1.173 ± 0.817
1.76TyrAsp: 1.76 ± 0.484
1.173TyrGlu: 1.173 ± 0.21
0.587TyrPhe: 0.587 ± 0.409
1.173TyrGly: 1.173 ± 0.21
0.0TyrHis: 0.0 ± 0.0
1.76TyrIle: 1.76 ± 0.484
0.587TyrLys: 0.587 ± 0.409
1.76TyrLeu: 1.76 ± 0.545
0.0TyrMet: 0.0 ± 0.0
1.173TyrAsn: 1.173 ± 2.281
0.587TyrPro: 0.587 ± 0.409
0.587TyrGln: 0.587 ± 0.409
1.76TyrArg: 1.76 ± 0.484
2.346TyrSer: 2.346 ± 0.42
1.76TyrThr: 1.76 ± 0.545
1.173TyrVal: 1.173 ± 0.21
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1706 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
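A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. This is only an illustration under stated assumptions. The source does not say which spread statistic is used, so the standard deviation is assumed here, and the names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs per protein."""
    count = 0
    total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += pairs.count(pair)
        total += len(pairs)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement.
    Using the standard deviation as the error is an assumption.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences, made up for illustration only.
toy = ["AAAA", "ACAC", "CCCC"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 3 proteins, as in this proteome, each resample contains few distinct proteins, which may explain why some errors in the table are larger than the values themselves.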

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski