Amino acid dipeptide frequency for Mulberry mosaic dwarf associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
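As a minimal sketch of what such a calculation might look like, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the decision not to count pairs that span two proteins are assumptions made for illustration; they do not describe how Proteome-pI computes its values.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: a protein of length n yields n - 1 dipeptides.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences (hypothetical, not taken from the virus proteome).
freqs = dipeptide_frequencies(["MKKA", "AKK"])
```

Here the two toy sequences contain five dipeptides in total, two of which are `KK`.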

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
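The uniform baseline can be reproduced with a few lines of Python. This is only a sketch: the page does not state the exact rule used to color the cells, so the simple comparison against the uniform expectation in `classify` is an assumption, as are the function names.

```python
def uniform_expected_per_mille(alphabet_size=20):
    """Expected per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, expected_per_mille):
    """Assumed rule: compare the observed value directly with the expectation."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"

expected_20 = uniform_expected_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expected_per_mille(21)  # 441 dipeptides, counting Xaa

# Two values taken from the table: ArgArg (14.599) and AlaAsp (0.912).
arg_arg = classify(14.599, expected_20)
ala_asp = classify(0.912, expected_20)
```

Under this assumed rule, ArgArg falls above the 2.5‰ baseline and AlaAsp below it.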

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
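As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input value is the ArgArg entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1

arg_arg_fraction = per_mille_to_fraction(14.599)  # about 0.0146
arg_arg_percent = per_mille_to_percent(14.599)    # about 1.46 percent
```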

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.474 ± 0.865
AlaCys: 2.737 ± 0.842
AlaAsp: 0.912 ± 0.734
AlaGlu: 5.474 ± 1.227
AlaPhe: 0.0 ± 0.0
AlaGly: 1.825 ± 2.027
AlaHis: 0.912 ± 1.013
AlaIle: 3.65 ± 1.991
AlaLys: 3.65 ± 1.793
AlaLeu: 5.474 ± 1.066
AlaMet: 2.737 ± 1.888
AlaAsn: 3.65 ± 2.573
AlaPro: 0.912 ± 1.059
AlaGln: 2.737 ± 1.844
AlaArg: 5.474 ± 1.227
AlaSer: 10.036 ± 2.079
AlaThr: 2.737 ± 0.661
AlaVal: 2.737 ± 0.842
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.912 ± 1.059
CysPhe: 0.0 ± 0.0
CysGly: 0.912 ± 0.644
CysHis: 1.825 ± 1.003
CysIle: 0.912 ± 0.644
CysLys: 1.825 ± 2.027
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.912 ± 0.734
CysPro: 1.825 ± 0.897
CysGln: 0.0 ± 0.0
CysArg: 1.825 ± 1.304
CysSer: 3.65 ± 1.634
CysThr: 0.912 ± 1.059
CysVal: 3.65 ± 1.634
CysTrp: 0.0 ± 0.0
CysTyr: 0.912 ± 0.734
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.912 ± 0.772
AspCys: 0.912 ± 1.013
AspAsp: 6.387 ± 2.323
AspGlu: 1.825 ± 0.897
AspPhe: 5.474 ± 2.082
AspGly: 0.0 ± 0.0
AspHis: 3.65 ± 0.648
AspIle: 1.825 ± 1.288
AspLys: 2.737 ± 1.378
AspLeu: 3.65 ± 1.173
AspMet: 2.737 ± 1.057
AspAsn: 5.474 ± 1.796
AspPro: 0.912 ± 0.734
AspGln: 1.825 ± 1.003
AspArg: 2.737 ± 1.285
AspSer: 3.65 ± 2.077
AspThr: 1.825 ± 1.544
AspVal: 0.912 ± 0.772
AspTrp: 1.825 ± 0.759
AspTyr: 3.65 ± 1.115
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.912 ± 1.013
GluCys: 0.0 ± 0.0
GluAsp: 4.562 ± 1.72
GluGlu: 5.474 ± 0.865
GluPhe: 2.737 ± 0.842
GluGly: 1.825 ± 0.897
GluHis: 3.65 ± 1.793
GluIle: 2.737 ± 0.842
GluLys: 5.474 ± 1.792
GluLeu: 3.65 ± 3.0
GluMet: 0.0 ± 0.663
GluAsn: 3.65 ± 2.479
GluPro: 4.562 ± 1.22
GluGln: 0.0 ± 0.0
GluArg: 2.737 ± 0.661
GluSer: 5.474 ± 1.792
GluThr: 5.474 ± 3.571
GluVal: 0.912 ± 0.734
GluTrp: 0.0 ± 0.0
GluTyr: 3.65 ± 1.948
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.737 ± 1.378
PheCys: 0.0 ± 0.0
PheAsp: 3.65 ± 1.048
PheGlu: 1.825 ± 0.938
PhePhe: 1.825 ± 0.897
PheGly: 3.65 ± 0.648
PheHis: 2.737 ± 0.661
PheIle: 2.737 ± 1.124
PheLys: 3.65 ± 1.496
PheLeu: 5.474 ± 1.796
PheMet: 0.912 ± 0.772
PheAsn: 2.737 ± 0.661
PhePro: 3.65 ± 1.793
PheGln: 0.912 ± 0.772
PheArg: 2.737 ± 1.378
PheSer: 2.737 ± 1.656
PheThr: 0.912 ± 0.644
PheVal: 1.825 ± 1.205
PheTrp: 0.912 ± 0.734
PheTyr: 1.825 ± 0.759
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.825 ± 0.759
GlyCys: 1.825 ± 2.027
GlyAsp: 1.825 ± 1.304
GlyGlu: 5.474 ± 0.865
GlyPhe: 2.737 ± 0.842
GlyGly: 1.825 ± 0.759
GlyHis: 1.825 ± 0.897
GlyIle: 3.65 ± 2.115
GlyLys: 3.65 ± 1.169
GlyLeu: 3.65 ± 1.173
GlyMet: 0.912 ± 1.013
GlyAsn: 1.825 ± 1.24
GlyPro: 8.212 ± 2.807
GlyGln: 0.0 ± 0.0
GlyArg: 4.562 ± 0.962
GlySer: 2.737 ± 1.389
GlyThr: 3.65 ± 1.123
GlyVal: 0.0 ± 0.0
GlyTrp: 0.912 ± 0.772
GlyTyr: 1.825 ± 1.544
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.65 ± 1.634
HisCys: 0.0 ± 0.0
HisAsp: 1.825 ± 0.779
HisGlu: 1.825 ± 1.24
HisPhe: 4.562 ± 1.374
HisGly: 2.737 ± 0.661
HisHis: 0.0 ± 0.0
HisIle: 1.825 ± 1.304
HisLys: 2.737 ± 0.842
HisLeu: 5.474 ± 1.853
HisMet: 0.0 ± 0.0
HisAsn: 0.912 ± 0.644
HisPro: 2.737 ± 0.661
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 2.737 ± 0.842
HisThr: 0.912 ± 0.772
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.825 ± 0.897
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.912 ± 1.059
IleCys: 0.912 ± 0.644
IleAsp: 1.825 ± 0.938
IleGlu: 2.737 ± 1.378
IlePhe: 3.65 ± 0.847
IleGly: 0.0 ± 0.0
IleHis: 0.0 ± 0.0
IleIle: 1.825 ± 0.759
IleLys: 2.737 ± 1.888
IleLeu: 9.124 ± 2.57
IleMet: 0.912 ± 1.013
IleAsn: 4.562 ± 3.152
IlePro: 3.65 ± 1.048
IleGln: 3.65 ± 1.793
IleArg: 3.65 ± 2.104
IleSer: 2.737 ± 1.932
IleThr: 4.562 ± 1.374
IleVal: 5.474 ± 1.73
IleTrp: 0.912 ± 1.013
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.912 ± 1.013
LysCys: 3.65 ± 1.634
LysAsp: 8.212 ± 1.485
LysGlu: 2.737 ± 0.842
LysPhe: 1.825 ± 0.759
LysGly: 1.825 ± 0.759
LysHis: 1.825 ± 0.897
LysIle: 5.474 ± 0.993
LysLys: 4.562 ± 1.567
LysLeu: 4.562 ± 1.149
LysMet: 0.912 ± 1.059
LysAsn: 1.825 ± 0.897
LysPro: 1.825 ± 1.288
LysGln: 0.912 ± 1.013
LysArg: 2.737 ± 1.888
LysSer: 0.912 ± 0.644
LysThr: 0.912 ± 0.644
LysVal: 0.912 ± 1.013
LysTrp: 0.0 ± 0.0
LysTyr: 6.387 ± 2.433
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.774 ± 2.215
LeuCys: 1.825 ± 1.003
LeuAsp: 2.737 ± 1.378
LeuGlu: 2.737 ± 0.661
LeuPhe: 1.825 ± 0.897
LeuGly: 4.562 ± 1.227
LeuHis: 0.0 ± 0.0
LeuIle: 1.825 ± 1.5
LeuLys: 2.737 ± 1.177
LeuLeu: 3.65 ± 1.115
LeuMet: 1.825 ± 0.931
LeuAsn: 2.737 ± 0.661
LeuPro: 4.562 ± 1.414
LeuGln: 3.65 ± 1.127
LeuArg: 4.562 ± 0.895
LeuSer: 2.737 ± 1.032
LeuThr: 8.212 ± 1.439
LeuVal: 5.474 ± 1.202
LeuTrp: 1.825 ± 1.304
LeuTyr: 4.562 ± 2.101
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.65 ± 1.115
MetCys: 0.912 ± 0.734
MetAsp: 1.825 ± 0.759
MetGlu: 1.825 ± 2.027
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.912 ± 0.772
MetIle: 0.0 ± 0.0
MetLys: 0.912 ± 0.644
MetLeu: 0.912 ± 1.013
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.825 ± 1.304
MetSer: 5.474 ± 1.924
MetThr: 0.912 ± 0.772
MetVal: 1.825 ± 1.544
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.912 ± 0.772
AsnCys: 0.912 ± 0.772
AsnAsp: 0.0 ± 0.0
AsnGlu: 2.737 ± 1.389
AsnPhe: 3.65 ± 1.793
AsnGly: 7.299 ± 3.401
AsnHis: 2.737 ± 1.888
AsnIle: 3.65 ± 0.847
AsnLys: 0.0 ± 0.0
AsnLeu: 0.0 ± 0.0
AsnMet: 0.912 ± 0.932
AsnAsn: 0.912 ± 0.772
AsnPro: 4.562 ± 1.149
AsnGln: 0.912 ± 0.772
AsnArg: 2.737 ± 2.146
AsnSer: 4.562 ± 1.374
AsnThr: 1.825 ± 0.897
AsnVal: 7.299 ± 2.095
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.825 ± 1.041
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.65 ± 1.588
ProCys: 0.912 ± 0.644
ProAsp: 3.65 ± 1.588
ProGlu: 5.474 ± 2.082
ProPhe: 0.0 ± 0.0
ProGly: 3.65 ± 1.123
ProHis: 0.0 ± 0.0
ProIle: 3.65 ± 1.048
ProLys: 0.912 ± 1.013
ProLeu: 0.912 ± 1.059
ProMet: 1.825 ± 1.113
ProAsn: 4.562 ± 1.736
ProPro: 1.825 ± 2.117
ProGln: 7.299 ± 2.768
ProArg: 5.474 ± 2.547
ProSer: 6.387 ± 2.19
ProThr: 0.912 ± 0.644
ProVal: 4.562 ± 1.149
ProTrp: 1.825 ± 0.759
ProTyr: 2.737 ± 1.124
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.912 ± 1.059
GlnCys: 2.737 ± 0.842
GlnAsp: 2.737 ± 0.842
GlnGlu: 2.737 ± 0.842
GlnPhe: 0.912 ± 1.059
GlnGly: 0.912 ± 1.013
GlnHis: 0.0 ± 0.0
GlnIle: 3.65 ± 1.127
GlnLys: 5.474 ± 1.792
GlnLeu: 2.737 ± 0.842
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.912 ± 0.734
GlnGln: 2.737 ± 1.507
GlnArg: 0.912 ± 0.772
GlnSer: 0.912 ± 1.013
GlnThr: 0.912 ± 0.734
GlnVal: 0.912 ± 0.772
GlnTrp: 3.65 ± 1.793
GlnTyr: 0.912 ± 0.734
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.912 ± 1.013
ArgCys: 0.912 ± 0.734
ArgAsp: 4.562 ± 1.799
ArgGlu: 1.825 ± 0.897
ArgPhe: 1.825 ± 1.304
ArgGly: 5.474 ± 2.547
ArgHis: 1.825 ± 1.041
ArgIle: 0.912 ± 0.644
ArgLys: 3.65 ± 1.123
ArgLeu: 5.474 ± 1.715
ArgMet: 0.912 ± 0.644
ArgAsn: 4.562 ± 1.404
ArgPro: 7.299 ± 1.488
ArgGln: 0.912 ± 0.734
ArgArg: 14.599 ± 2.599
ArgSer: 9.124 ± 2.255
ArgThr: 4.562 ± 3.296
ArgVal: 5.474 ± 3.198
ArgTrp: 3.65 ± 1.793
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.474 ± 1.323
SerCys: 0.0 ± 0.0
SerAsp: 0.912 ± 0.644
SerGlu: 5.474 ± 3.312
SerPhe: 5.474 ± 1.752
SerGly: 6.387 ± 1.892
SerHis: 5.474 ± 0.865
SerIle: 6.387 ± 2.224
SerLys: 1.825 ± 1.041
SerLeu: 8.212 ± 2.15
SerMet: 0.0 ± 0.0
SerAsn: 1.825 ± 0.897
SerPro: 5.474 ± 3.014
SerGln: 0.912 ± 0.734
SerArg: 3.65 ± 1.936
SerSer: 10.949 ± 1.857
SerThr: 4.562 ± 1.625
SerVal: 3.65 ± 1.694
SerTrp: 0.912 ± 0.644
SerTyr: 4.562 ± 1.799
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.737 ± 1.888
ThrCys: 0.0 ± 0.0
ThrAsp: 3.65 ± 1.793
ThrGlu: 1.825 ± 2.027
ThrPhe: 3.65 ± 1.123
ThrGly: 6.387 ± 0.829
ThrHis: 0.912 ± 0.772
ThrIle: 1.825 ± 1.24
ThrLys: 0.912 ± 1.013
ThrLeu: 3.65 ± 1.048
ThrMet: 0.0 ± 0.0
ThrAsn: 1.825 ± 1.544
ThrPro: 0.912 ± 0.772
ThrGln: 1.825 ± 0.938
ThrArg: 7.299 ± 1.81
ThrSer: 3.65 ± 0.648
ThrThr: 0.912 ± 1.059
ThrVal: 3.65 ± 1.173
ThrTrp: 2.737 ± 0.842
ThrTyr: 1.825 ± 0.897
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.474 ± 1.227
ValCys: 0.0 ± 0.0
ValAsp: 1.825 ± 1.003
ValGlu: 3.65 ± 2.45
ValPhe: 2.737 ± 1.124
ValGly: 2.737 ± 1.073
ValHis: 0.912 ± 1.013
ValIle: 3.65 ± 1.307
ValLys: 4.562 ± 1.149
ValLeu: 7.299 ± 1.81
ValMet: 1.825 ± 0.897
ValAsn: 1.825 ± 0.938
ValPro: 3.65 ± 1.588
ValGln: 4.562 ± 1.736
ValArg: 7.299 ± 1.281
ValSer: 1.825 ± 0.779
ValThr: 1.825 ± 1.544
ValVal: 1.825 ± 0.759
ValTrp: 0.0 ± 0.0
ValTyr: 1.825 ± 0.759
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.65 ± 1.793
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.912 ± 0.734
TrpPhe: 1.825 ± 1.288
TrpGly: 0.912 ± 0.772
TrpHis: 0.912 ± 1.013
TrpIle: 0.912 ± 0.772
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.825 ± 1.304
TrpPro: 1.825 ± 0.897
TrpGln: 0.0 ± 0.0
TrpArg: 0.912 ± 0.772
TrpSer: 0.912 ± 1.013
TrpThr: 1.825 ± 0.897
TrpVal: 2.737 ± 1.378
TrpTrp: 0.912 ± 0.734
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.65 ± 0.847
TyrCys: 1.825 ± 2.117
TyrAsp: 1.825 ± 0.759
TyrGlu: 0.912 ± 0.644
TyrPhe: 2.737 ± 1.389
TyrGly: 0.0 ± 0.0
TyrHis: 3.65 ± 1.123
TyrIle: 2.737 ± 1.378
TyrLys: 0.912 ± 1.059
TyrLeu: 0.912 ± 0.772
TyrMet: 3.65 ± 1.868
TyrAsn: 1.825 ± 0.759
TyrPro: 0.912 ± 1.059
TyrGln: 1.825 ± 1.5
TyrArg: 2.737 ± 1.507
TyrSer: 0.912 ± 1.059
TyrThr: 1.825 ± 0.897
TyrVal: 5.474 ± 1.227
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.912 ± 0.644
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1097 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
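One way to read this note is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The use of the standard deviation as the error measure, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Protein-level resampling: each replicate has as many proteins as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# Toy sequences (hypothetical, not taken from the virus proteome).
proteins = ["MKKAL", "AKKRR", "MRRSS"]
err = bootstrap_error(proteins, "KK")
```

With only a handful of proteins, as in this proteome, each replicate can differ substantially from the original set, which is consistent with the fairly large errors relative to the values in the table.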

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski