Amino acid dipeptide frequency for Gremmeniella abietina RNA virus MS2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
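The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille`, the one-letter alphabet with `X` standing in for Xaa, and the choice to pool overlapping residue pairs across all proteins are not taken from the database, whose exact procedure is not described here.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns; "X" stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each protein and the
    counts are pooled over all proteins before normalising.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    toy = ["MKKLA", "AAK"]  # hypothetical sequences, not from this proteome
    freqs = dipeptide_per_mille(toy)
    print(len(freqs), "dipeptides; uniform expectation:", UNIFORM_PER_MILLE)
    print("KK:", round(freqs["KK"], 3))
```

Pairs are never counted across protein boundaries in this sketch, so a proteome with n residues in k proteins contributes n - k pairs.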

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
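As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform 2.5‰ expectation for the 400 standard dipeptides. The helper names are hypothetical, and treating 2.5‰ as the reference level is an assumption made for illustration.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def ratio_to_uniform(value):
    """How many times more (or less) frequent than the uniform expectation."""
    return value / UNIFORM_PER_MILLE


if __name__ == "__main__":
    # Two values taken from the table: SerSer and AlaCys.
    for name, value in [("SerSer", 17.241), ("AlaCys", 0.821)]:
        print(f"{name}: fraction {per_mille_to_fraction(value):.6f}, "
              f"{ratio_to_uniform(value):.2f}x uniform")
```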

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± bootstrap error. Rows give the first residue, columns the second.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 9.031 ± 4.035 | 0.821 ± 0.673 | 4.926 ± 4.037 | 4.105 ± 2.319 | 6.568 ± 1.488 | 5.747 ± 1.895 | 1.642 ± 0.479 | 0.821 ± 0.569 | 2.463 ± 0.808 | 7.389 ± 2.111 | 2.463 ± 1.152 | 2.463 ± 1.358 | 5.747 ± 1.895 | 1.642 ± 0.479 | 8.21 ± 0.455 | 7.389 ± 2.893 | 2.463 ± 1.02 | 6.568 ± 3.826 | 1.642 ± 1.346 | 1.642 ± 0.967 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 0.821 ± 0.965 | 0.0 ± 0.0 | 1.642 ± 1.137 | 0.821 ± 0.673 | 0.0 ± 0.0 | 0.821 ± 0.569 | 1.642 ± 0.967 | 1.642 ± 0.479 | 3.284 ± 1.913 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.821 ± 0.569 | 0.821 ± 0.569 | 0.0 ± 0.0 | 3.284 ± 2.711 | 0.0 ± 0.0 | 0.821 ± 0.673 | 0.0 ± 0.0 | 0.821 ± 0.673 | 0.0 ± 0.0 |
| Asp | 7.389 ± 3.283 | 0.821 ± 0.569 | 0.821 ± 0.569 | 0.0 ± 0.0 | 4.926 ± 0.879 | 2.463 ± 1.706 | 0.0 ± 0.0 | 3.284 ± 1.468 | 4.105 ± 1.297 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.821 ± 0.569 | 3.284 ± 2.275 | 1.642 ± 1.137 | 8.21 ± 2.752 | 4.926 ± 2.04 | 4.105 ± 2.843 | 2.463 ± 1.02 | 2.463 ± 1.02 | 2.463 ± 1.02 | 0.0 ± 0.0 |
| Glu | 4.105 ± 2.641 | 2.463 ± 0.808 | 1.642 ± 0.479 | 1.642 ± 0.897 | 5.747 ± 1.549 | 5.747 ± 1.635 | 0.0 ± 0.0 | 2.463 ± 0.808 | 3.284 ± 2.275 | 1.642 ± 1.346 | 0.821 ± 0.569 | 0.0 ± 0.0 | 3.284 ± 0.486 | 0.821 ± 0.569 | 2.463 ± 0.808 | 3.284 ± 0.486 | 3.284 ± 1.579 | 4.105 ± 1.2 | 3.284 ± 1.579 | 0.821 ± 0.569 | 0.0 ± 0.0 |
| Phe | 2.463 ± 2.019 | 1.642 ± 0.479 | 4.105 ± 1.854 | 0.821 ± 0.965 | 1.642 ± 0.967 | 3.284 ± 1.913 | 0.821 ± 0.673 | 1.642 ± 0.897 | 4.926 ± 0.879 | 2.463 ± 1.358 | 1.642 ± 1.562 | 0.821 ± 0.569 | 3.284 ± 0.79 | 1.642 ± 0.897 | 3.284 ± 2.275 | 8.21 ± 2.393 | 3.284 ± 1.661 | 3.284 ± 0.79 | 0.821 ± 0.569 | 0.821 ± 0.569 | 0.0 ± 0.0 |
| Gly | 5.747 ± 2.104 | 1.642 ± 1.929 | 6.568 ± 3.321 | 4.926 ± 1.436 | 3.284 ± 0.79 | 4.105 ± 1.854 | 2.463 ± 0.512 | 0.0 ± 0.0 | 6.568 ± 1.915 | 8.21 ± 0.703 | 3.284 ± 2.275 | 2.463 ± 1.152 | 0.0 ± 0.0 | 4.926 ± 2.04 | 2.463 ± 1.152 | 3.284 ± 1.661 | 1.642 ± 0.479 | 6.568 ± 0.688 | 1.642 ± 1.137 | 0.821 ± 0.569 | 0.0 ± 0.0 |
| His | 0.821 ± 0.673 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.821 ± 0.569 | 0.821 ± 0.569 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.284 ± 0.79 | 0.821 ± 0.673 | 0.821 ± 0.673 | 0.821 ± 0.569 | 0.821 ± 0.569 | 1.642 ± 1.137 | 0.821 ± 0.965 | 0.821 ± 0.965 | 2.463 ± 1.81 | 0.0 ± 0.0 | 0.821 ± 0.965 | 0.0 ± 0.0 |
| Ile | 2.463 ± 0.808 | 0.0 ± 0.0 | 2.463 ± 1.81 | 0.821 ± 0.569 | 2.463 ± 0.512 | 2.463 ± 0.808 | 1.642 ± 1.929 | 0.0 ± 0.0 | 1.642 ± 1.929 | 3.284 ± 2.275 | 0.0 ± 0.0 | 1.642 ± 0.479 | 1.642 ± 0.967 | 0.0 ± 0.0 | 1.642 ± 0.897 | 4.926 ± 1.7 | 0.0 ± 0.0 | 1.642 ± 0.897 | 0.0 ± 0.0 | 1.642 ± 1.137 | 0.0 ± 0.0 |
| Lys | 8.21 ± 1.131 | 3.284 ± 0.957 | 0.821 ± 0.569 | 1.642 ± 0.897 | 3.284 ± 0.486 | 2.463 ± 0.512 | 0.0 ± 0.0 | 1.642 ± 0.967 | 6.568 ± 2.254 | 2.463 ± 0.808 | 0.821 ± 0.535 | 2.463 ± 0.512 | 3.284 ± 1.468 | 2.463 ± 1.774 | 3.284 ± 0.486 | 4.926 ± 2.233 | 5.747 ± 0.548 | 3.284 ± 0.486 | 0.821 ± 0.569 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Leu | 9.031 ± 4.238 | 0.821 ± 0.965 | 7.389 ± 2.711 | 4.105 ± 1.297 | 2.463 ± 1.02 | 4.105 ± 1.2 | 0.821 ± 0.673 | 0.821 ± 0.569 | 4.926 ± 1.458 | 4.105 ± 0.228 | 0.0 ± 0.0 | 4.926 ± 1.361 | 4.926 ± 1.7 | 1.642 ± 0.897 | 5.747 ± 0.82 | 10.673 ± 4.042 | 1.642 ± 1.137 | 6.568 ± 2.034 | 0.821 ± 0.569 | 3.284 ± 1.579 | 0.0 ± 0.0 |
| Met | 0.821 ± 0.673 | 0.0 ± 0.0 | 1.642 ± 1.137 | 0.821 ± 0.569 | 0.821 ± 0.569 | 2.463 ± 0.808 | 0.0 ± 0.0 | 1.642 ± 1.137 | 0.0 ± 0.0 | 2.463 ± 0.808 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.642 ± 0.897 | 4.105 ± 0.228 | 0.0 ± 0.0 | 2.463 ± 1.152 | 0.821 ± 0.569 | 1.642 ± 1.137 | 0.0 ± 0.0 |
| Asn | 3.284 ± 2.692 | 0.0 ± 0.0 | 0.821 ± 0.965 | 0.821 ± 0.569 | 2.463 ± 0.808 | 1.642 ± 1.137 | 0.821 ± 0.673 | 2.463 ± 1.774 | 0.0 ± 0.0 | 4.105 ± 3.663 | 0.821 ± 0.673 | 0.0 ± 0.0 | 2.463 ± 1.02 | 0.821 ± 0.569 | 2.463 ± 1.774 | 2.463 ± 0.512 | 3.284 ± 1.312 | 0.821 ± 0.569 | 0.0 ± 0.0 | 0.821 ± 0.569 | 0.0 ± 0.0 |
| Pro | 2.463 ± 1.152 | 0.821 ± 0.569 | 3.284 ± 0.957 | 4.926 ± 1.361 | 0.0 ± 0.0 | 5.747 ± 2.673 | 0.821 ± 0.569 | 1.642 ± 0.479 | 4.105 ± 2.431 | 2.463 ± 1.774 | 1.642 ± 0.479 | 0.0 ± 0.0 | 2.463 ± 0.512 | 1.642 ± 1.346 | 1.642 ± 1.137 | 6.568 ± 1.755 | 4.105 ± 1.2 | 5.747 ± 1.149 | 1.642 ± 0.479 | 0.821 ± 0.569 | 0.0 ± 0.0 |
| Gln | 1.642 ± 0.479 | 0.0 ± 0.0 | 0.821 ± 0.569 | 0.821 ± 0.965 | 3.284 ± 2.692 | 3.284 ± 0.957 | 0.821 ± 0.965 | 0.821 ± 0.965 | 0.821 ± 0.569 | 2.463 ± 1.81 | 1.642 ± 1.137 | 2.463 ± 1.706 | 3.284 ± 1.794 | 0.821 ± 0.673 | 4.105 ± 1.854 | 3.284 ± 0.957 | 0.821 ± 0.673 | 1.642 ± 0.897 | 0.0 ± 0.0 | 0.821 ± 0.569 | 0.0 ± 0.0 |
| Arg | 5.747 ± 1.549 | 1.642 ± 0.967 | 6.568 ± 1.915 | 3.284 ± 1.579 | 4.105 ± 0.228 | 4.105 ± 1.854 | 0.0 ± 0.0 | 2.463 ± 0.512 | 1.642 ± 1.137 | 8.21 ± 3.969 | 1.642 ± 0.701 | 2.463 ± 1.774 | 0.821 ± 0.965 | 5.747 ± 1.635 | 6.568 ± 0.972 | 5.747 ± 0.82 | 0.821 ± 0.673 | 4.926 ± 1.458 | 1.642 ± 1.137 | 3.284 ± 1.661 | 0.0 ± 0.0 |
| Ser | 10.673 ± 5.195 | 1.642 ± 0.967 | 4.105 ± 1.2 | 7.389 ± 1.003 | 2.463 ± 1.152 | 9.031 ± 1.905 | 0.821 ± 0.965 | 3.284 ± 0.486 | 4.926 ± 3.548 | 7.389 ± 0.351 | 0.821 ± 0.965 | 3.284 ± 0.79 | 4.105 ± 1.297 | 4.105 ± 1.376 | 5.747 ± 0.82 | 17.241 ± 2.684 | 3.284 ± 2.692 | 7.389 ± 1.536 | 2.463 ± 0.808 | 4.926 ± 1.436 | 0.0 ± 0.0 |
| Thr | 3.284 ± 0.957 | 0.0 ± 0.0 | 1.642 ± 1.137 | 4.926 ± 2.409 | 1.642 ± 0.479 | 3.284 ± 1.661 | 1.642 ± 0.897 | 0.0 ± 0.0 | 1.642 ± 0.479 | 2.463 ± 1.706 | 0.0 ± 0.0 | 1.642 ± 0.897 | 4.926 ± 0.879 | 0.821 ± 0.569 | 4.105 ± 1.444 | 0.821 ± 0.965 | 5.747 ± 1.549 | 4.926 ± 4.037 | 0.821 ± 0.569 | 1.642 ± 1.137 | 0.0 ± 0.0 |
| Val | 4.105 ± 1.444 | 0.821 ± 0.673 | 2.463 ± 1.152 | 5.747 ± 2.01 | 1.642 ± 1.346 | 6.568 ± 0.688 | 0.821 ± 0.673 | 3.284 ± 1.468 | 6.568 ± 3.6 | 6.568 ± 2.692 | 2.463 ± 1.706 | 2.463 ± 0.512 | 4.105 ± 1.444 | 1.642 ± 0.897 | 4.926 ± 3.395 | 5.747 ± 0.548 | 1.642 ± 1.346 | 7.389 ± 2.893 | 3.284 ± 1.579 | 3.284 ± 1.312 | 0.0 ± 0.0 |
| Trp | 0.821 ± 0.673 | 0.0 ± 0.0 | 2.463 ± 0.808 | 0.821 ± 0.569 | 1.642 ± 1.137 | 0.821 ± 0.569 | 0.821 ± 0.569 | 0.821 ± 0.569 | 0.821 ± 0.673 | 4.105 ± 1.854 | 0.0 ± 0.0 | 0.821 ± 0.673 | 0.0 ± 0.0 | 0.821 ± 0.569 | 1.642 ± 1.137 | 1.642 ± 0.967 | 1.642 ± 0.897 | 1.642 ± 1.137 | 0.821 ± 0.569 | 0.821 ± 0.569 | 0.0 ± 0.0 |
| Tyr | 1.642 ± 0.479 | 0.0 ± 0.0 | 1.642 ± 0.967 | 0.821 ± 0.569 | 1.642 ± 1.137 | 3.284 ± 2.275 | 0.821 ± 0.569 | 1.642 ± 0.479 | 0.821 ± 0.569 | 2.463 ± 1.02 | 1.642 ± 0.479 | 0.821 ± 0.673 | 3.284 ± 1.579 | 0.821 ± 0.673 | 1.642 ± 1.137 | 5.747 ± 0.92 | 1.642 ± 1.137 | 0.821 ± 0.673 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (1219 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
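A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: the function names, the fixed random seed, the pooling of counts within each resample, and the use of the sample standard deviation as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency
    over n_rounds resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample proteins, not residues, so each draw keeps a protein intact.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLA", "AAK", "KKKA"]  # hypothetical sequences
    mean, err = bootstrap_error(toy, "KK")
    print(f"KK: {mean:.3f} ± {err:.3f} per mille")
```

Under this scheme a dipeptide that never occurs in any protein yields exactly 0.0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table. With only 3 proteins, the resamples are few in kind, so the errors should be read as rough.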

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski