Amino acid dipeptide frequency for Microviridae Fen685_11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
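The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %
UNIFORM_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping residue pairs are counted inside each protein,
    and no pair is formed across two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard symbol to X before pairing
        cleaned = [aa if aa in AMINO_ACIDS else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy sequences, not taken from the proteome shown on this page
toy_proteins = ["MKVLAAGLA", "MAAXK"]
freqs = dipeptide_per_mille(toy_proteins)

# Split dipeptides into over- and underrepresented relative to the uniform value
over = sorted(p for p, f in freqs.items() if f > UNIFORM_PER_MILLE)
under = sorted(p for p, f in freqs.items() if f < UNIFORM_PER_MILLE)
```

With such short toy input almost every dipeptide is absent, so nearly all of them land in `under`; on a real proteome the split is far more informative.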

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
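As a quick illustration of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 9.084  # AlaAla frequency from the table, in per mille
ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.009084
ala_ala_percent = per_mille_to_percent(ala_ala)    # about 0.9084 %

# The uniform expectation of 0.25 % corresponds to 2.5 per mille
uniform_percent = per_mille_to_percent(2.5)
```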

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Each cell: frequency in ‰ ± error.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.084 ± 4.197 | 0.0 ± 0.0 | 9.084 ± 4.197 | 1.514 ± 1.284 | 4.542 ± 1.242 | 6.813 ± 2.325 | 3.028 ± 1.832 | 3.785 ± 0.917 | 4.542 ± 1.547 | 9.084 ± 3.054 | 0.0 ± 0.0 | 4.542 ± 1.243 | 5.299 ± 2.249 | 9.841 ± 4.213 | 5.299 ± 0.698 | 3.028 ± 1.214 | 6.813 ± 0.938 | 7.57 ± 2.144 | 1.514 ± 0.935 | 4.542 ± 0.72 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.514 ± 1.407 | 0.0 ± 0.0 | 0.757 ± 0.581 | 0.0 ± 0.0 | 1.514 ± 1.407 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.271 ± 1.534 | 0.757 ± 0.703 | 0.0 ± 0.0 | 1.514 ± 0.595 | 0.0 ± 0.0 | 0.757 ± 0.703 | 0.757 ± 0.703 | 0.757 ± 0.703 | 0.0 ± 0.0
Asp | 6.056 ± 0.945 | 0.757 ± 0.703 | 4.542 ± 1.73 | 0.757 ± 0.581 | 1.514 ± 1.161 | 5.299 ± 2.373 | 1.514 ± 0.595 | 0.757 ± 0.703 | 3.028 ± 1.225 | 5.299 ± 0.698 | 2.271 ± 0.999 | 1.514 ± 0.607 | 4.542 ± 2.038 | 0.757 ± 0.642 | 6.056 ± 2.081 | 4.542 ± 3.852 | 5.299 ± 1.268 | 2.271 ± 1.278 | 0.757 ± 0.581 | 4.542 ± 1.615 | 0.0 ± 0.0
Glu | 4.542 ± 1.509 | 0.0 ± 0.0 | 1.514 ± 0.607 | 1.514 ± 0.595 | 3.028 ± 0.531 | 0.757 ± 0.581 | 0.757 ± 0.581 | 2.271 ± 0.954 | 0.757 ± 0.642 | 3.785 ± 2.221 | 0.757 ± 0.626 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.542 ± 0.951 | 0.757 ± 0.703 | 0.757 ± 0.581 | 2.271 ± 0.826 | 2.271 ± 1.166 | 0.757 ± 0.581 | 2.271 ± 0.942 | 0.0 ± 0.0
Phe | 2.271 ± 1.742 | 0.0 ± 0.0 | 3.785 ± 1.713 | 0.0 ± 0.0 | 1.514 ± 0.944 | 2.271 ± 2.11 | 1.514 ± 0.595 | 4.542 ± 0.748 | 3.785 ± 1.11 | 5.299 ± 2.752 | 1.514 ± 0.924 | 0.757 ± 0.642 | 0.757 ± 0.581 | 1.514 ± 0.607 | 4.542 ± 1.989 | 2.271 ± 0.954 | 3.028 ± 1.148 | 1.514 ± 0.607 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Gly | 2.271 ± 0.999 | 0.757 ± 0.703 | 5.299 ± 4.064 | 3.785 ± 1.987 | 0.0 ± 0.0 | 3.785 ± 1.994 | 1.514 ± 0.595 | 2.271 ± 1.742 | 1.514 ± 0.807 | 6.056 ± 1.334 | 0.0 ± 0.0 | 3.028 ± 1.19 | 0.757 ± 0.581 | 3.028 ± 1.704 | 1.514 ± 1.085 | 4.542 ± 1.547 | 5.299 ± 3.125 | 3.028 ± 0.531 | 0.757 ± 0.581 | 3.785 ± 1.713 | 0.0 ± 0.0
His | 1.514 ± 1.161 | 0.0 ± 0.0 | 0.757 ± 0.815 | 0.757 ± 0.703 | 0.757 ± 0.581 | 2.271 ± 0.942 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.785 ± 1.713 | 3.028 ± 1.547 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.028 ± 1.001 | 0.0 ± 0.0 | 0.757 ± 0.581 | 3.028 ± 1.19 | 0.757 ± 0.581 | 2.271 ± 2.11 | 0.0 ± 0.0
Ile | 3.785 ± 1.858 | 1.514 ± 0.595 | 2.271 ± 0.999 | 1.514 ± 0.595 | 2.271 ± 1.737 | 3.028 ± 1.001 | 0.0 ± 0.0 | 0.757 ± 0.887 | 3.785 ± 1.107 | 4.542 ± 1.806 | 0.0 ± 0.0 | 1.514 ± 0.924 | 4.542 ± 2.057 | 0.757 ± 0.703 | 3.028 ± 1.448 | 5.299 ± 3.203 | 1.514 ± 1.085 | 4.542 ± 2.64 | 3.028 ± 1.533 | 1.514 ± 1.284 | 0.0 ± 0.0
Lys | 6.813 ± 3.008 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.514 ± 1.085 | 1.514 ± 0.607 | 0.757 ± 0.581 | 3.028 ± 2.274 | 1.514 ± 0.607 | 0.0 ± 0.0 | 3.785 ± 1.703 | 0.757 ± 0.581 | 1.514 ± 1.284 | 1.514 ± 0.607 | 3.785 ± 2.463 | 6.056 ± 4.548 | 0.757 ± 0.581 | 2.271 ± 1.926 | 1.514 ± 0.935 | 0.0 ± 0.0 | 1.514 ± 1.085 | 0.0 ± 0.0
Leu | 6.813 ± 2.222 | 0.757 ± 0.703 | 4.542 ± 0.951 | 3.785 ± 1.549 | 6.056 ± 3.306 | 6.813 ± 0.938 | 0.757 ± 0.581 | 6.056 ± 3.662 | 3.028 ± 1.367 | 9.841 ± 6.213 | 2.271 ± 0.965 | 3.785 ± 0.745 | 6.056 ± 3.958 | 5.299 ± 2.045 | 3.785 ± 2.473 | 9.084 ± 1.253 | 5.299 ± 3.23 | 9.084 ± 2.53 | 3.028 ± 1.001 | 1.514 ± 0.607 | 0.0 ± 0.0
Met | 2.271 ± 1.106 | 1.514 ± 0.595 | 1.514 ± 0.607 | 0.0 ± 0.0 | 0.757 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.757 ± 0.887 | 0.757 ± 0.642 | 3.785 ± 1.99 | 0.0 ± 0.0 | 1.514 ± 0.595 | 2.271 ± 2.11 | 2.271 ± 1.106 | 1.514 ± 0.807 | 1.514 ± 0.924 | 2.271 ± 0.826 | 0.0 ± 0.0 | 0.757 ± 0.581 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 7.57 ± 3.737 | 1.514 ± 1.407 | 1.514 ± 0.607 | 1.514 ± 1.085 | 2.271 ± 0.36 | 0.757 ± 0.581 | 0.0 ± 0.0 | 1.514 ± 1.284 | 2.271 ± 1.278 | 3.028 ± 0.854 | 0.757 ± 0.642 | 5.299 ± 2.801 | 1.514 ± 0.807 | 1.514 ± 0.607 | 4.542 ± 1.042 | 2.271 ± 0.999 | 4.542 ± 1.82 | 2.271 ± 1.366 | 1.514 ± 0.607 | 1.514 ± 0.807 | 0.0 ± 0.0
Pro | 4.542 ± 1.615 | 0.757 ± 0.703 | 3.785 ± 1.549 | 0.757 ± 0.581 | 2.271 ± 0.942 | 0.0 ± 0.0 | 1.514 ± 1.037 | 4.542 ± 1.785 | 1.514 ± 0.607 | 3.785 ± 1.41 | 0.757 ± 0.581 | 4.542 ± 1.243 | 1.514 ± 0.935 | 2.271 ± 0.942 | 4.542 ± 0.951 | 4.542 ± 1.904 | 0.757 ± 0.581 | 5.299 ± 0.698 | 0.757 ± 0.581 | 1.514 ± 0.807 | 0.0 ± 0.0
Gln | 6.056 ± 3.373 | 1.514 ± 1.407 | 0.757 ± 0.642 | 5.299 ± 1.268 | 2.271 ± 1.366 | 3.028 ± 1.517 | 0.757 ± 0.581 | 0.757 ± 0.581 | 1.514 ± 0.595 | 6.813 ± 2.588 | 1.514 ± 0.607 | 3.028 ± 2.568 | 4.542 ± 1.998 | 4.542 ± 3.086 | 3.785 ± 0.7 | 2.271 ± 1.153 | 1.514 ± 1.284 | 0.757 ± 0.703 | 2.271 ± 0.999 | 1.514 ± 0.935 | 0.0 ± 0.0
Arg | 5.299 ± 1.903 | 0.757 ± 0.703 | 5.299 ± 2.017 | 0.757 ± 0.642 | 4.542 ± 1.091 | 2.271 ± 0.942 | 1.514 ± 1.161 | 6.056 ± 1.543 | 4.542 ± 2.658 | 5.299 ± 1.851 | 2.271 ± 0.914 | 2.271 ± 1.166 | 1.514 ± 1.407 | 2.271 ± 2.11 | 2.271 ± 1.534 | 4.542 ± 1.845 | 3.028 ± 1.001 | 3.028 ± 1.001 | 0.0 ± 0.0 | 5.299 ± 1.235 | 0.0 ± 0.0
Ser | 9.841 ± 2.65 | 0.0 ± 0.0 | 3.028 ± 0.531 | 4.542 ± 2.054 | 1.514 ± 0.595 | 0.757 ± 0.642 | 0.757 ± 0.815 | 3.028 ± 1.517 | 2.271 ± 1.371 | 7.57 ± 3.683 | 2.271 ± 1.054 | 4.542 ± 1.586 | 3.028 ± 1.147 | 4.542 ± 0.951 | 4.542 ± 1.164 | 7.57 ± 2.994 | 3.028 ± 0.938 | 4.542 ± 1.839 | 1.514 ± 0.595 | 1.514 ± 0.595 | 0.0 ± 0.0
Thr | 10.598 ± 2.6 | 0.757 ± 0.887 | 5.299 ± 2.128 | 0.757 ± 0.581 | 0.0 ± 0.0 | 8.327 ± 3.283 | 0.0 ± 0.0 | 1.514 ± 0.607 | 0.0 ± 0.0 | 5.299 ± 2.825 | 1.514 ± 0.607 | 2.271 ± 1.926 | 2.271 ± 0.954 | 1.514 ± 0.595 | 1.514 ± 0.607 | 6.813 ± 1.486 | 4.542 ± 0.72 | 5.299 ± 0.822 | 0.757 ± 0.581 | 0.757 ± 0.703 | 0.0 ± 0.0
Val | 7.57 ± 3.737 | 0.0 ± 0.0 | 5.299 ± 0.822 | 3.785 ± 1.549 | 1.514 ± 1.629 | 1.514 ± 0.607 | 2.271 ± 0.942 | 6.056 ± 1.634 | 3.785 ± 2.143 | 4.542 ± 1.852 | 1.514 ± 0.924 | 2.271 ± 0.942 | 3.785 ± 2.703 | 3.028 ± 1.214 | 2.271 ± 1.371 | 6.813 ± 1.34 | 3.785 ± 1.057 | 3.785 ± 1.846 | 0.757 ± 0.581 | 0.0 ± 0.0 | 0.0 ± 0.0
Trp | 1.514 ± 1.407 | 0.0 ± 0.0 | 0.757 ± 0.581 | 0.757 ± 0.581 | 2.271 ± 0.942 | 0.757 ± 0.581 | 1.514 ± 1.161 | 0.757 ± 0.887 | 0.0 ± 0.0 | 1.514 ± 0.607 | 0.757 ± 0.703 | 3.028 ± 1.404 | 2.271 ± 1.742 | 0.757 ± 0.581 | 1.514 ± 0.595 | 1.514 ± 0.935 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.757 ± 0.581 | 0.757 ± 0.581 | 0.0 ± 0.0
Tyr | 1.514 ± 1.161 | 0.0 ± 0.0 | 3.028 ± 1.832 | 0.0 ± 0.0 | 2.271 ± 1.166 | 2.271 ± 1.278 | 2.271 ± 1.371 | 1.514 ± 0.595 | 1.514 ± 0.935 | 3.785 ± 1.465 | 0.757 ± 0.703 | 2.271 ± 1.926 | 1.514 ± 0.807 | 1.514 ± 1.161 | 2.271 ± 0.36 | 0.757 ± 0.581 | 3.785 ± 1.619 | 3.785 ± 1.11 | 0.757 ± 0.581 | 2.271 ± 0.942 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 5 proteins (1322 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
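A protein-level bootstrap can be sketched as follows. The page does not describe the exact procedure, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation over 100 rounds is reported. The function names, the fixed seed, and the toy sequences are assumptions made for the example.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Per mille frequency of each dipeptide observed in the given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein only
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of one dipeptide frequency over bootstrap resamples.

    Assumption: the resampling unit is the whole protein, drawn with
    replacement, and the error is the sample standard deviation.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_frequencies(resampled).get(pair, 0.0))
    return statistics.stdev(samples)


# Hypothetical toy proteome of 5 short sequences (not the real Fen685_11 proteins)
toy_proteome = ["MKVLAAGLA", "MAAGK", "MLLSTA", "MKRAA", "MTAVL"]
aa_error = bootstrap_error(toy_proteome, "AA")
```

Because only 5 proteins are resampled, a single long or unusual protein can dominate a resample, which may explain why some errors in the table are large relative to the values themselves.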

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski