Amino acid dipeptide frequency for Soybean mild mottle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. Since this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
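The calculation described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function names, the toy sequences, and the use of one-letter codes are assumptions made for the example, and the normalization (dividing by the total number of overlapping pairs, counted within each protein) may differ from the one used to produce the table.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED_UNIFORM:
        return "over"   # would be marked red
    if freq < EXPECTED_UNIFORM:
        return "under"  # would be marked blue
    return "expected"


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

With real data, `classify(2.75)` (the AlaAla value) gives "over" and `classify(0.917)` gives "under", assuming the coloring threshold is exactly the uniform expectation.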

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
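As a worked example of this conversion, the AlaAla value from the table and the uniform expectation of 1/400 can be written in all three units (the notation assumes the amsmath and textcomp packages):

```latex
\[
2.75\,\text{\textperthousand} = 2.75 \times 10^{-3} = 0.00275 = 0.275\,\%,
\qquad
\frac{1}{400} = 0.0025 = 0.25\,\% = 2.5\,\text{\textperthousand}
\]
```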

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.75 ± 2.022
AlaCys: 1.833 ± 0.73
AlaAsp: 0.917 ± 0.963
AlaGlu: 1.833 ± 1.063
AlaPhe: 1.833 ± 1.181
AlaGly: 1.833 ± 0.73
AlaHis: 0.0 ± 0.0
AlaIle: 2.75 ± 1.177
AlaLys: 6.416 ± 1.413
AlaLeu: 7.333 ± 3.022
AlaMet: 0.0 ± 0.0
AlaAsn: 0.917 ± 0.674
AlaPro: 0.917 ± 0.752
AlaGln: 1.833 ± 1.377
AlaArg: 3.666 ± 2.696
AlaSer: 4.583 ± 1.851
AlaThr: 3.666 ± 2.229
AlaVal: 0.917 ± 0.752
AlaTrp: 2.75 ± 0.855
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.917 ± 0.674
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.917 ± 0.752
CysPhe: 1.833 ± 2.073
CysGly: 1.833 ± 1.035
CysHis: 0.0 ± 0.0
CysIle: 0.917 ± 0.752
CysLys: 2.75 ± 1.32
CysLeu: 0.917 ± 1.191
CysMet: 0.917 ± 0.963
CysAsn: 1.833 ± 0.73
CysPro: 2.75 ± 1.258
CysGln: 1.833 ± 1.035
CysArg: 1.833 ± 1.926
CysSer: 3.666 ± 2.074
CysThr: 2.75 ± 1.362
CysVal: 2.75 ± 1.025
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.75 ± 1.385
AspCys: 2.75 ± 2.065
AspAsp: 1.833 ± 0.73
AspGlu: 2.75 ± 1.293
AspPhe: 0.917 ± 0.752
AspGly: 2.75 ± 1.385
AspHis: 0.917 ± 1.065
AspIle: 1.833 ± 1.181
AspLys: 0.917 ± 0.674
AspLeu: 4.583 ± 1.4
AspMet: 0.0 ± 0.0
AspAsn: 2.75 ± 1.32
AspPro: 1.833 ± 1.063
AspGln: 1.833 ± 1.142
AspArg: 1.833 ± 1.504
AspSer: 3.666 ± 1.417
AspThr: 3.666 ± 1.373
AspVal: 5.5 ± 1.758
AspTrp: 0.917 ± 0.674
AspTyr: 2.75 ± 1.025
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.666 ± 1.282
GluCys: 0.0 ± 0.0
GluAsp: 1.833 ± 1.221
GluGlu: 5.5 ± 1.884
GluPhe: 1.833 ± 1.063
GluGly: 3.666 ± 1.31
GluHis: 1.833 ± 1.035
GluIle: 2.75 ± 2.259
GluLys: 0.917 ± 0.963
GluLeu: 4.583 ± 2.145
GluMet: 1.833 ± 1.369
GluAsn: 2.75 ± 1.32
GluPro: 3.666 ± 1.282
GluGln: 4.583 ± 1.066
GluArg: 3.666 ± 1.845
GluSer: 3.666 ± 1.513
GluThr: 0.917 ± 1.191
GluVal: 4.583 ± 2.364
GluTrp: 0.917 ± 0.674
GluTyr: 0.917 ± 0.674
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.833 ± 1.142
PheCys: 2.75 ± 1.01
PheAsp: 3.666 ± 2.02
PheGlu: 2.75 ± 1.177
PhePhe: 2.75 ± 1.687
PheGly: 0.917 ± 0.752
PheHis: 0.917 ± 0.674
PheIle: 0.917 ± 0.674
PheLys: 4.583 ± 1.895
PheLeu: 6.416 ± 3.891
PheMet: 0.917 ± 0.674
PheAsn: 4.583 ± 1.895
PhePro: 0.917 ± 0.963
PheGln: 4.583 ± 1.744
PheArg: 2.75 ± 1.604
PheSer: 3.666 ± 1.788
PheThr: 0.917 ± 0.674
PheVal: 0.917 ± 1.037
PheTrp: 0.0 ± 0.0
PheTyr: 2.75 ± 1.32
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.917 ± 0.674
GlyCys: 2.75 ± 0.855
GlyAsp: 2.75 ± 1.989
GlyGlu: 4.583 ± 1.4
GlyPhe: 1.833 ± 2.129
GlyGly: 4.583 ± 1.189
GlyHis: 0.917 ± 0.674
GlyIle: 0.917 ± 0.674
GlyLys: 5.5 ± 2.189
GlyLeu: 3.666 ± 1.63
GlyMet: 0.0 ± 0.0
GlyAsn: 0.917 ± 1.191
GlyPro: 1.833 ± 0.73
GlyGln: 2.75 ± 1.893
GlyArg: 2.75 ± 1.385
GlySer: 7.333 ± 1.745
GlyThr: 2.75 ± 1.362
GlyVal: 4.583 ± 2.076
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.917 ± 0.674
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.917 ± 0.752
HisCys: 0.917 ± 1.065
HisAsp: 0.917 ± 0.963
HisGlu: 0.917 ± 0.674
HisPhe: 2.75 ± 1.507
HisGly: 1.833 ± 1.377
HisHis: 0.0 ± 0.0
HisIle: 1.833 ± 2.381
HisLys: 0.917 ± 1.037
HisLeu: 2.75 ± 1.385
HisMet: 0.917 ± 1.019
HisAsn: 1.833 ± 1.348
HisPro: 0.917 ± 0.674
HisGln: 1.833 ± 0.73
HisArg: 3.666 ± 3.084
HisSer: 1.833 ± 1.43
HisThr: 5.5 ± 2.545
HisVal: 4.583 ± 2.063
HisTrp: 0.0 ± 0.0
HisTyr: 0.917 ± 0.674
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.917 ± 0.674
IleCys: 0.0 ± 0.0
IleAsp: 4.583 ± 1.919
IleGlu: 0.0 ± 0.0
IlePhe: 5.5 ± 2.452
IleGly: 2.75 ± 1.515
IleHis: 0.917 ± 1.037
IleIle: 1.833 ± 1.063
IleLys: 8.249 ± 1.384
IleLeu: 2.75 ± 1.94
IleMet: 0.917 ± 1.037
IleAsn: 0.917 ± 1.037
IlePro: 0.917 ± 0.674
IleGln: 4.583 ± 2.681
IleArg: 3.666 ± 1.723
IleSer: 5.5 ± 2.086
IleThr: 1.833 ± 0.73
IleVal: 1.833 ± 1.104
IleTrp: 0.917 ± 1.037
IleTyr: 2.75 ± 0.855
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.666 ± 1.045
LysCys: 1.833 ± 1.104
LysAsp: 2.75 ± 1.404
LysGlu: 6.416 ± 2.864
LysPhe: 3.666 ± 1.929
LysGly: 3.666 ± 1.429
LysHis: 3.666 ± 1.373
LysIle: 3.666 ± 1.272
LysLys: 0.917 ± 0.674
LysLeu: 2.75 ± 2.235
LysMet: 0.917 ± 1.191
LysAsn: 2.75 ± 2.022
LysPro: 1.833 ± 0.73
LysGln: 0.917 ± 0.752
LysArg: 4.583 ± 2.268
LysSer: 5.5 ± 1.131
LysThr: 3.666 ± 1.62
LysVal: 4.583 ± 1.447
LysTrp: 0.0 ± 0.0
LysTyr: 3.666 ± 1.014
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.917 ± 0.963
LeuCys: 4.583 ± 1.066
LeuAsp: 2.75 ± 1.293
LeuGlu: 7.333 ± 3.111
LeuPhe: 2.75 ± 1.258
LeuGly: 1.833 ± 1.035
LeuHis: 2.75 ± 1.507
LeuIle: 0.917 ± 0.674
LeuLys: 5.5 ± 1.584
LeuLeu: 5.5 ± 3.079
LeuMet: 1.833 ± 1.46
LeuAsn: 3.666 ± 1.506
LeuPro: 4.583 ± 1.256
LeuGln: 3.666 ± 2.126
LeuArg: 5.5 ± 3.877
LeuSer: 7.333 ± 1.948
LeuThr: 6.416 ± 1.236
LeuVal: 2.75 ± 1.677
LeuTrp: 0.917 ± 1.065
LeuTyr: 1.833 ± 1.181
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.833 ± 1.504
MetCys: 0.917 ± 1.191
MetAsp: 3.666 ± 1.42
MetGlu: 0.0 ± 0.0
MetPhe: 3.666 ± 2.33
MetGly: 2.75 ± 1.322
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.917 ± 0.674
MetLeu: 2.75 ± 1.831
MetMet: 1.833 ± 1.181
MetAsn: 0.917 ± 1.037
MetPro: 0.917 ± 0.674
MetGln: 0.0 ± 0.0
MetArg: 2.75 ± 0.855
MetSer: 1.833 ± 1.181
MetThr: 0.917 ± 0.752
MetVal: 0.917 ± 1.191
MetTrp: 1.833 ± 1.063
MetTyr: 1.833 ± 1.504
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.583 ± 2.425
AsnCys: 0.917 ± 1.037
AsnAsp: 2.75 ± 1.385
AsnGlu: 1.833 ± 0.73
AsnPhe: 0.0 ± 0.0
AsnGly: 1.833 ± 1.158
AsnHis: 4.583 ± 2.599
AsnIle: 4.583 ± 1.066
AsnLys: 3.666 ± 1.272
AsnLeu: 2.75 ± 1.177
AsnMet: 1.833 ± 1.419
AsnAsn: 4.583 ± 1.895
AsnPro: 4.583 ± 1.691
AsnGln: 0.917 ± 1.191
AsnArg: 0.917 ± 0.752
AsnSer: 0.917 ± 1.037
AsnThr: 0.917 ± 0.752
AsnVal: 6.416 ± 2.761
AsnTrp: 0.917 ± 1.037
AsnTyr: 1.833 ± 0.73
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.833 ± 1.504
ProCys: 1.833 ± 1.132
ProAsp: 0.0 ± 0.0
ProGlu: 1.833 ± 1.158
ProPhe: 1.833 ± 1.063
ProGly: 2.75 ± 1.989
ProHis: 2.75 ± 1.497
ProIle: 1.833 ± 1.035
ProLys: 1.833 ± 1.142
ProLeu: 4.583 ± 2.056
ProMet: 4.583 ± 2.748
ProAsn: 2.75 ± 0.855
ProPro: 2.75 ± 1.989
ProGln: 1.833 ± 1.54
ProArg: 5.5 ± 3.079
ProSer: 2.75 ± 0.855
ProThr: 3.666 ± 2.064
ProVal: 0.917 ± 0.752
ProTrp: 0.0 ± 0.0
ProTyr: 3.666 ± 1.63
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.75 ± 1.974
GlnCys: 0.917 ± 0.674
GlnAsp: 2.75 ± 1.974
GlnGlu: 3.666 ± 1.806
GlnPhe: 2.75 ± 1.507
GlnGly: 1.833 ± 1.348
GlnHis: 1.833 ± 1.601
GlnIle: 5.5 ± 2.995
GlnLys: 0.0 ± 0.0
GlnLeu: 4.583 ± 2.072
GlnMet: 0.917 ± 1.065
GlnAsn: 2.75 ± 1.94
GlnPro: 0.0 ± 0.0
GlnGln: 0.917 ± 1.065
GlnArg: 1.833 ± 0.73
GlnSer: 5.5 ± 1.825
GlnThr: 1.833 ± 1.43
GlnVal: 3.666 ± 1.973
GlnTrp: 1.833 ± 1.348
GlnTyr: 1.833 ± 0.73
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.666 ± 2.183
ArgCys: 3.666 ± 2.837
ArgAsp: 6.416 ± 1.986
ArgGlu: 2.75 ± 1.322
ArgPhe: 4.583 ± 2.298
ArgGly: 3.666 ± 1.043
ArgHis: 3.666 ± 1.513
ArgIle: 2.75 ± 1.395
ArgLys: 1.833 ± 1.181
ArgLeu: 2.75 ± 1.632
ArgMet: 0.917 ± 0.752
ArgAsn: 2.75 ± 1.974
ArgPro: 3.666 ± 1.459
ArgGln: 0.917 ± 0.963
ArgArg: 10.082 ± 4.534
ArgSer: 7.333 ± 2.542
ArgThr: 5.5 ± 3.399
ArgVal: 2.75 ± 1.6
ArgTrp: 0.917 ± 0.752
ArgTyr: 0.917 ± 0.963
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.583 ± 3.37
SerCys: 0.0 ± 0.0
SerAsp: 2.75 ± 1.186
SerGlu: 1.833 ± 1.43
SerPhe: 5.5 ± 1.121
SerGly: 0.917 ± 0.674
SerHis: 3.666 ± 2.183
SerIle: 9.166 ± 2.429
SerLys: 3.666 ± 1.806
SerLeu: 4.583 ± 2.417
SerMet: 1.833 ± 1.391
SerAsn: 8.249 ± 2.536
SerPro: 7.333 ± 3.577
SerGln: 5.5 ± 2.406
SerArg: 3.666 ± 1.237
SerSer: 6.416 ± 1.897
SerThr: 6.416 ± 2.186
SerVal: 3.666 ± 1.951
SerTrp: 0.0 ± 0.0
SerTyr: 1.833 ± 0.73
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.75 ± 1.687
ThrCys: 1.833 ± 1.926
ThrAsp: 0.917 ± 1.065
ThrGlu: 4.583 ± 1.39
ThrPhe: 0.917 ± 1.191
ThrGly: 4.583 ± 2.027
ThrHis: 5.5 ± 1.577
ThrIle: 3.666 ± 1.917
ThrLys: 3.666 ± 2.246
ThrLeu: 1.833 ± 0.73
ThrMet: 1.833 ± 1.286
ThrAsn: 1.833 ± 0.73
ThrPro: 4.583 ± 2.016
ThrGln: 2.75 ± 1.177
ThrArg: 2.75 ± 2.065
ThrSer: 4.583 ± 4.584
ThrThr: 2.75 ± 1.43
ThrVal: 1.833 ± 1.142
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.666 ± 1.282
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.917 ± 1.191
ValCys: 0.0 ± 0.0
ValAsp: 0.917 ± 0.674
ValGlu: 2.75 ± 1.43
ValPhe: 2.75 ± 0.855
ValGly: 2.75 ± 1.6
ValHis: 1.833 ± 1.377
ValIle: 3.666 ± 1.479
ValLys: 6.416 ± 2.089
ValLeu: 5.5 ± 1.448
ValMet: 3.666 ± 1.629
ValAsn: 1.833 ± 1.181
ValPro: 3.666 ± 1.591
ValGln: 5.5 ± 2.775
ValArg: 3.666 ± 2.782
ValSer: 3.666 ± 1.699
ValThr: 1.833 ± 1.132
ValVal: 1.833 ± 1.132
ValTrp: 0.917 ± 0.752
ValTyr: 5.5 ± 1.819
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.75 ± 2.022
TrpCys: 0.0 ± 0.0
TrpAsp: 0.917 ± 0.963
TrpGlu: 0.917 ± 1.037
TrpPhe: 0.0 ± 0.0
TrpGly: 1.833 ± 1.035
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.917 ± 0.752
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.917 ± 0.674
TrpArg: 2.75 ± 1.395
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.833 ± 0.73
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.833 ± 1.504
TyrCys: 0.917 ± 0.752
TyrAsp: 3.666 ± 2.362
TyrGlu: 0.917 ± 0.752
TyrPhe: 1.833 ± 0.73
TyrGly: 3.666 ± 1.014
TyrHis: 0.917 ± 0.674
TyrIle: 1.833 ± 1.104
TyrLys: 2.75 ± 1.186
TyrLeu: 2.75 ± 1.385
TyrMet: 2.75 ± 0.969
TyrAsn: 2.75 ± 1.171
TyrPro: 1.833 ± 1.063
TyrGln: 0.0 ± 0.0
TyrArg: 3.666 ± 3.008
TyrSer: 1.833 ± 1.063
TyrThr: 0.917 ± 0.674
TyrVal: 4.583 ± 1.287
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1092 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
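One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency on each resample, and report the spread of the results. The sketch below follows that reading only; the function names, the fixed seed, the toy sequences, and the choice of the population standard deviation as the error measure are assumptions, not details confirmed by the database.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy = ["MKKLA", "AKK", "GGSG"]
err_kk = bootstrap_error(toy, "KK")
err_absent = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is zero as well, which is consistent with the 0.0 ± 0.0 entries in the table.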

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski