Amino acid dipeptide frequency for Yacon necrotic mottle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
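As a rough illustration of the calculation, the following Python sketch counts overlapping pairs of adjacent residues in a set of protein sequences. It is not the code used by the database: the function name `dipeptide_frequencies`, the use of one-letter codes (with `X` standing for Xaa), and the choice to pool pairs across all proteins are assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Two short, made-up sequences: 7 pairs in total, 2 of them "EE".
freqs = dipeptide_frequencies(["MKEEL", "AEEK"])
print(freqs["EE"])
```

The result is a fraction between 0 and 1; the values in the table on this page are expressed on a different scale (per mille).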

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
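The uniform expectation and the unit conversion can be checked with a few lines of Python. This is a minimal sketch: the three observed values are copied from the table on this page, while the helper names and the rule of comparing each value directly against the uniform expectation (ignoring the error estimates) are assumptions made for illustration.

```python
N_STANDARD = 20

# Uniform expectation: 1000 / 400 = 2.5 per mille (equivalently 0.25%).
expected_permille = 1000 / N_STANDARD**2


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Values in per mille, taken from the table on this page.
observed = {"GluGlu": 16.512, "GluAla": 6.847, "AlaCys": 0.805}
labels = {pair: classify(value) for pair, value in observed.items()}

for pair, value in observed.items():
    print(pair, permille_to_fraction(value), labels[pair])
```

Under this rule GluGlu and GluAla count as overrepresented and AlaCys as underrepresented.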

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.611AlaAla: 1.611 ± 0.824
0.805AlaCys: 0.805 ± 0.412
3.222AlaAsp: 3.222 ± 1.1
2.014AlaGlu: 2.014 ± 1.031
1.611AlaPhe: 1.611 ± 0.825
2.416AlaGly: 2.416 ± 0.813
0.403AlaHis: 0.403 ± 0.206
4.43AlaIle: 4.43 ± 1.509
4.833AlaLys: 4.833 ± 1.692
6.444AlaLeu: 6.444 ± 1.372
2.819AlaMet: 2.819 ± 1.443
1.611AlaAsn: 1.611 ± 1.623
1.208AlaPro: 1.208 ± 1.097
3.625AlaGln: 3.625 ± 3.831
4.027AlaArg: 4.027 ± 1.283
2.819AlaSer: 2.819 ± 1.03
4.43AlaThr: 4.43 ± 1.955
3.222AlaVal: 3.222 ± 2.102
0.403AlaTrp: 0.403 ± 0.206
1.611AlaTyr: 1.611 ± 0.825
0.0AlaXaa: 0.0 ± 0.0
Cys
1.611CysAla: 1.611 ± 0.825
0.403CysCys: 0.403 ± 0.206
0.403CysAsp: 0.403 ± 0.206
1.611CysGlu: 1.611 ± 1.711
1.611CysPhe: 1.611 ± 0.825
0.805CysGly: 0.805 ± 0.412
0.805CysHis: 0.805 ± 0.412
2.416CysIle: 2.416 ± 1.378
2.014CysLys: 2.014 ± 1.031
0.805CysLeu: 0.805 ± 1.018
0.403CysMet: 0.403 ± 0.206
0.403CysAsn: 0.403 ± 0.206
0.805CysPro: 0.805 ± 0.412
1.208CysGln: 1.208 ± 0.619
0.403CysArg: 0.403 ± 0.206
0.805CysSer: 0.805 ± 0.412
1.208CysThr: 1.208 ± 0.619
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.403CysTyr: 0.403 ± 0.206
0.0CysXaa: 0.0 ± 0.0
Asp
1.208AspAla: 1.208 ± 0.619
0.805AspCys: 0.805 ± 0.412
3.222AspAsp: 3.222 ± 1.65
4.43AspGlu: 4.43 ± 2.268
2.014AspPhe: 2.014 ± 1.913
2.416AspGly: 2.416 ± 0.813
2.014AspHis: 2.014 ± 1.54
3.222AspIle: 3.222 ± 1.65
3.222AspLys: 3.222 ± 1.65
5.638AspLeu: 5.638 ± 1.79
1.611AspMet: 1.611 ± 0.825
2.416AspAsn: 2.416 ± 1.302
3.222AspPro: 3.222 ± 1.112
2.416AspGln: 2.416 ± 1.237
3.625AspArg: 3.625 ± 2.286
2.416AspSer: 2.416 ± 2.195
3.625AspThr: 3.625 ± 1.856
0.403AspVal: 0.403 ± 0.206
1.611AspTrp: 1.611 ± 0.956
2.014AspTyr: 2.014 ± 0.981
0.403AspXaa: 0.403 ± 0.206
Glu
6.847GluAla: 6.847 ± 1.998
0.805GluCys: 0.805 ± 0.412
6.041GluAsp: 6.041 ± 1.124
16.512GluGlu: 16.512 ± 1.75
2.819GluPhe: 2.819 ± 0.895
4.43GluGly: 4.43 ± 0.913
1.611GluHis: 1.611 ± 0.825
7.249GluIle: 7.249 ± 4.425
8.055GluLys: 8.055 ± 5.067
6.041GluLeu: 6.041 ± 2.711
1.611GluMet: 1.611 ± 0.825
4.027GluAsn: 4.027 ± 1.205
1.611GluPro: 1.611 ± 0.824
3.222GluGln: 3.222 ± 0.966
2.819GluArg: 2.819 ± 0.883
5.638GluSer: 5.638 ± 0.515
3.625GluThr: 3.625 ± 1.224
6.041GluVal: 6.041 ± 1.216
0.805GluTrp: 0.805 ± 1.018
1.611GluTyr: 1.611 ± 1.02
0.0GluXaa: 0.0 ± 0.0
Phe
0.805PheAla: 0.805 ± 1.19
0.805PheCys: 0.805 ± 0.412
1.611PheAsp: 1.611 ± 0.956
1.208PheGlu: 1.208 ± 1.889
1.208PhePhe: 1.208 ± 0.619
0.805PheGly: 0.805 ± 0.412
1.611PheHis: 1.611 ± 0.956
2.014PheIle: 2.014 ± 1.558
3.222PheLys: 3.222 ± 2.37
1.611PheLeu: 1.611 ± 0.825
0.805PheMet: 0.805 ± 0.412
1.611PheAsn: 1.611 ± 0.825
1.611PhePro: 1.611 ± 0.825
1.611PheGln: 1.611 ± 0.824
2.014PheArg: 2.014 ± 1.031
2.014PheSer: 2.014 ± 0.792
2.014PheThr: 2.014 ± 1.031
0.805PheVal: 0.805 ± 0.412
0.403PheTrp: 0.403 ± 0.206
0.805PheTyr: 0.805 ± 0.412
0.0PheXaa: 0.0 ± 0.0
Gly
4.027GlyAla: 4.027 ± 0.914
0.805GlyCys: 0.805 ± 0.412
2.416GlyAsp: 2.416 ± 0.813
4.43GlyGlu: 4.43 ± 1.451
1.611GlyPhe: 1.611 ± 1.02
3.222GlyGly: 3.222 ± 1.046
0.403GlyHis: 0.403 ± 0.206
4.43GlyIle: 4.43 ± 1.509
2.819GlyLys: 2.819 ± 1.03
4.027GlyLeu: 4.027 ± 1.205
1.611GlyMet: 1.611 ± 0.858
2.819GlyAsn: 2.819 ± 0.883
2.416GlyPro: 2.416 ± 0.984
0.805GlyGln: 0.805 ± 2.317
3.625GlyArg: 3.625 ± 1.856
2.819GlySer: 2.819 ± 0.883
3.222GlyThr: 3.222 ± 1.65
1.611GlyVal: 1.611 ± 0.824
0.805GlyTrp: 0.805 ± 0.412
2.416GlyTyr: 2.416 ± 1.237
0.805GlyXaa: 0.805 ± 0.412
His
0.403HisAla: 0.403 ± 0.206
0.403HisCys: 0.403 ± 0.206
0.805HisAsp: 0.805 ± 0.412
1.208HisGlu: 1.208 ± 1.059
0.805HisPhe: 0.805 ± 0.412
0.805HisGly: 0.805 ± 0.412
0.0HisHis: 0.0 ± 0.0
2.014HisIle: 2.014 ± 1.031
1.611HisLys: 1.611 ± 2.379
2.014HisLeu: 2.014 ± 1.54
0.0HisMet: 0.0 ± 0.0
1.208HisAsn: 1.208 ± 0.619
0.0HisPro: 0.0 ± 0.0
2.014HisGln: 2.014 ± 1.031
1.208HisArg: 1.208 ± 0.619
1.611HisSer: 1.611 ± 0.956
0.403HisThr: 0.403 ± 0.206
0.805HisVal: 0.805 ± 0.412
0.805HisTrp: 0.805 ± 1.018
1.611HisTyr: 1.611 ± 0.824
0.0HisXaa: 0.0 ± 0.0
Ile
3.222IleAla: 3.222 ± 1.089
2.819IleCys: 2.819 ± 1.443
5.638IleAsp: 5.638 ± 2.06
5.236IleGlu: 5.236 ± 1.817
2.416IlePhe: 2.416 ± 1.388
4.43IleGly: 4.43 ± 1.356
2.014IleHis: 2.014 ± 0.89
4.833IleIle: 4.833 ± 1.777
7.652IleLys: 7.652 ± 1.845
2.819IleLeu: 2.819 ± 1.443
1.208IleMet: 1.208 ± 0.75
4.43IleAsn: 4.43 ± 1.356
4.43IlePro: 4.43 ± 2.268
3.222IleGln: 3.222 ± 0.966
5.236IleArg: 5.236 ± 3.214
4.833IleSer: 4.833 ± 6.128
5.236IleThr: 5.236 ± 2.681
2.819IleVal: 2.819 ± 1.443
0.805IleTrp: 0.805 ± 0.412
2.014IleTyr: 2.014 ± 2.598
0.0IleXaa: 0.0 ± 0.0
Lys
4.833LysAla: 4.833 ± 0.957
2.819LysCys: 2.819 ± 1.229
4.027LysAsp: 4.027 ± 2.057
8.458LysGlu: 8.458 ± 3.09
3.222LysPhe: 3.222 ± 1.65
2.819LysGly: 2.819 ± 0.895
0.805LysHis: 0.805 ± 0.412
8.055LysIle: 8.055 ± 0.971
6.041LysLys: 6.041 ± 1.294
6.847LysLeu: 6.847 ± 1.458
1.208LysMet: 1.208 ± 0.619
3.625LysAsn: 3.625 ± 2.213
4.43LysPro: 4.43 ± 1.356
3.625LysGln: 3.625 ± 4.965
5.638LysArg: 5.638 ± 4.462
4.833LysSer: 4.833 ± 1.889
3.625LysThr: 3.625 ± 0.97
4.027LysVal: 4.027 ± 1.779
2.014LysTrp: 2.014 ± 1.031
2.014LysTyr: 2.014 ± 1.031
0.403LysXaa: 0.403 ± 0.206
Leu
3.222LeuAla: 3.222 ± 1.046
3.222LeuCys: 3.222 ± 0.992
2.819LeuAsp: 2.819 ± 0.883
9.263LeuGlu: 9.263 ± 7.178
1.208LeuPhe: 1.208 ± 1.059
5.236LeuGly: 5.236 ± 1.041
1.611LeuHis: 1.611 ± 0.956
3.625LeuIle: 3.625 ± 0.97
8.458LeuLys: 8.458 ± 2.156
4.833LeuLeu: 4.833 ± 3.235
0.805LeuMet: 0.805 ± 0.384
3.625LeuAsn: 3.625 ± 0.97
4.027LeuPro: 4.027 ± 1.283
3.625LeuGln: 3.625 ± 1.542
5.638LeuArg: 5.638 ± 1.088
6.444LeuSer: 6.444 ± 2.598
4.027LeuThr: 4.027 ± 4.426
3.625LeuVal: 3.625 ± 1.836
1.611LeuTrp: 1.611 ± 0.825
3.222LeuTyr: 3.222 ± 1.046
0.0LeuXaa: 0.0 ± 0.0
Met
1.611MetAla: 1.611 ± 0.956
0.403MetCys: 0.403 ± 0.206
1.208MetAsp: 1.208 ± 0.619
2.014MetGlu: 2.014 ± 1.031
1.611MetPhe: 1.611 ± 0.825
0.805MetGly: 0.805 ± 0.412
0.403MetHis: 0.403 ± 0.206
2.416MetIle: 2.416 ± 1.237
1.208MetLys: 1.208 ± 0.619
1.611MetLeu: 1.611 ± 0.825
0.403MetMet: 0.403 ± 0.206
1.208MetAsn: 1.208 ± 0.619
1.208MetPro: 1.208 ± 0.619
0.403MetGln: 0.403 ± 0.206
0.805MetArg: 0.805 ± 1.018
1.611MetSer: 1.611 ± 1.02
0.805MetThr: 0.805 ± 0.412
2.014MetVal: 2.014 ± 1.031
0.403MetTrp: 0.403 ± 0.206
0.805MetTyr: 0.805 ± 1.018
0.0MetXaa: 0.0 ± 0.0
Asn
1.611AsnAla: 1.611 ± 0.825
0.0AsnCys: 0.0 ± 0.0
1.208AsnAsp: 1.208 ± 0.619
2.819AsnGlu: 2.819 ± 0.895
1.208AsnPhe: 1.208 ± 1.059
2.819AsnGly: 2.819 ± 1.443
0.403AsnHis: 0.403 ± 0.206
1.611AsnIle: 1.611 ± 1.711
5.638AsnLys: 5.638 ± 2.021
5.236AsnLeu: 5.236 ± 3.152
0.805AsnMet: 0.805 ± 0.412
3.222AsnAsn: 3.222 ± 1.748
3.625AsnPro: 3.625 ± 1.542
4.833AsnGln: 4.833 ± 2.304
1.611AsnArg: 1.611 ± 0.825
4.43AsnSer: 4.43 ± 1.356
2.819AsnThr: 2.819 ± 1.163
2.819AsnVal: 2.819 ± 0.883
0.0AsnTrp: 0.0 ± 0.0
1.611AsnTyr: 1.611 ± 0.824
0.0AsnXaa: 0.0 ± 0.0
Pro
4.833ProAla: 4.833 ± 1.672
0.0ProCys: 0.0 ± 0.0
3.222ProAsp: 3.222 ± 0.992
1.611ProGlu: 1.611 ± 0.825
0.403ProPhe: 0.403 ± 0.206
4.027ProGly: 4.027 ± 1.584
1.611ProHis: 1.611 ± 0.825
2.819ProIle: 2.819 ± 2.261
3.625ProLys: 3.625 ± 1.542
4.833ProLeu: 4.833 ± 1.777
0.805ProMet: 0.805 ± 0.412
2.416ProAsn: 2.416 ± 1.302
4.833ProPro: 4.833 ± 2.475
1.611ProGln: 1.611 ± 0.825
2.819ProArg: 2.819 ± 1.443
2.819ProSer: 2.819 ± 1.03
2.416ProThr: 2.416 ± 1.237
0.805ProVal: 0.805 ± 1.205
0.805ProTrp: 0.805 ± 0.412
1.611ProTyr: 1.611 ± 0.825
0.0ProXaa: 0.0 ± 0.0
Gln
4.43GlnAla: 4.43 ± 3.581
0.0GlnCys: 0.0 ± 0.0
1.611GlnAsp: 1.611 ± 0.825
5.638GlnGlu: 5.638 ± 2.021
1.208GlnPhe: 1.208 ± 0.903
1.611GlnGly: 1.611 ± 1.02
1.208GlnHis: 1.208 ± 0.619
6.444GlnIle: 6.444 ± 1.362
3.222GlnLys: 3.222 ± 2.418
2.416GlnLeu: 2.416 ± 0.813
0.403GlnMet: 0.403 ± 0.206
2.819GlnAsn: 2.819 ± 1.954
3.222GlnPro: 3.222 ± 1.089
5.638GlnGln: 5.638 ± 2.459
2.819GlnArg: 2.819 ± 1.229
4.027GlnSer: 4.027 ± 1.779
2.416GlnThr: 2.416 ± 0.868
3.222GlnVal: 3.222 ± 2.226
1.208GlnTrp: 1.208 ± 0.619
1.208GlnTyr: 1.208 ± 0.903
0.0GlnXaa: 0.0 ± 0.0
Arg
2.416ArgAla: 2.416 ± 0.813
0.0ArgCys: 0.0 ± 0.0
3.222ArgAsp: 3.222 ± 0.992
4.43ArgGlu: 4.43 ± 1.69
1.611ArgPhe: 1.611 ± 0.956
2.819ArgGly: 2.819 ± 0.883
2.014ArgHis: 2.014 ± 1.913
4.833ArgIle: 4.833 ± 1.672
4.43ArgLys: 4.43 ± 2.915
5.638ArgLeu: 5.638 ± 3.018
2.819ArgMet: 2.819 ± 1.127
3.222ArgAsn: 3.222 ± 1.046
1.611ArgPro: 1.611 ± 1.02
3.625ArgGln: 3.625 ± 1.856
5.638ArgArg: 5.638 ± 1.435
4.833ArgSer: 4.833 ± 1.628
3.222ArgThr: 3.222 ± 1.647
2.819ArgVal: 2.819 ± 1.443
1.208ArgTrp: 1.208 ± 0.619
1.611ArgTyr: 1.611 ± 0.825
0.0ArgXaa: 0.0 ± 0.0
Ser
2.416SerAla: 2.416 ± 2.672
0.805SerCys: 0.805 ± 1.19
3.625SerAsp: 3.625 ± 0.961
5.638SerGlu: 5.638 ± 3.018
2.416SerPhe: 2.416 ± 0.813
3.222SerGly: 3.222 ± 1.65
1.208SerHis: 1.208 ± 1.059
6.444SerIle: 6.444 ± 3.928
6.444SerLys: 6.444 ± 1.932
6.444SerLeu: 6.444 ± 5.79
1.208SerMet: 1.208 ± 0.619
2.819SerAsn: 2.819 ± 0.895
2.014SerPro: 2.014 ± 0.792
3.222SerGln: 3.222 ± 0.966
4.43SerArg: 4.43 ± 2.268
4.833SerSer: 4.833 ± 4.698
2.416SerThr: 2.416 ± 1.237
1.611SerVal: 1.611 ± 0.825
1.611SerTrp: 1.611 ± 0.825
2.819SerTyr: 2.819 ± 1.03
0.0SerXaa: 0.0 ± 0.0
Thr
2.819ThrAla: 2.819 ± 1.443
1.208ThrCys: 1.208 ± 0.619
2.416ThrAsp: 2.416 ± 1.237
6.041ThrGlu: 6.041 ± 1.478
0.0ThrPhe: 0.0 ± 0.0
4.027ThrGly: 4.027 ± 1.359
1.208ThrHis: 1.208 ± 0.619
4.027ThrIle: 4.027 ± 2.062
3.222ThrLys: 3.222 ± 2.04
4.027ThrLeu: 4.027 ± 1.962
2.416ThrMet: 2.416 ± 0.813
2.416ThrAsn: 2.416 ± 1.237
2.416ThrPro: 2.416 ± 1.302
3.222ThrGln: 3.222 ± 0.966
1.611ThrArg: 1.611 ± 0.824
3.625ThrSer: 3.625 ± 0.97
3.222ThrThr: 3.222 ± 1.046
3.222ThrVal: 3.222 ± 1.65
0.0ThrTrp: 0.0 ± 0.0
1.208ThrTyr: 1.208 ± 0.619
0.403ThrXaa: 0.403 ± 0.206
Val
1.208ValAla: 1.208 ± 0.903
0.805ValCys: 0.805 ± 1.018
2.014ValAsp: 2.014 ± 1.031
4.833ValGlu: 4.833 ± 1.777
1.611ValPhe: 1.611 ± 1.02
1.611ValGly: 1.611 ± 1.02
0.0ValHis: 0.0 ± 0.0
1.611ValIle: 1.611 ± 0.825
3.222ValLys: 3.222 ± 2.226
5.638ValLeu: 5.638 ± 1.088
0.805ValMet: 0.805 ± 0.412
1.208ValAsn: 1.208 ± 1.059
2.819ValPro: 2.819 ± 0.883
2.819ValGln: 2.819 ± 0.895
3.222ValArg: 3.222 ± 0.992
2.416ValSer: 2.416 ± 1.237
2.014ValThr: 2.014 ± 1.031
0.805ValVal: 0.805 ± 0.412
0.0ValTrp: 0.0 ± 0.0
2.819ValTyr: 2.819 ± 1.443
0.0ValXaa: 0.0 ± 0.0
Trp
1.208TrpAla: 1.208 ± 0.619
0.0TrpCys: 0.0 ± 0.0
1.611TrpAsp: 1.611 ± 0.825
0.403TrpGlu: 0.403 ± 0.206
0.0TrpPhe: 0.0 ± 0.0
1.611TrpGly: 1.611 ± 0.824
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
2.014TrpLys: 2.014 ± 1.031
1.611TrpLeu: 1.611 ± 0.824
0.0TrpMet: 0.0 ± 0.0
0.805TrpAsn: 0.805 ± 0.412
0.403TrpPro: 0.403 ± 1.159
1.611TrpGln: 1.611 ± 0.825
1.208TrpArg: 1.208 ± 0.619
0.805TrpSer: 0.805 ± 0.412
1.208TrpThr: 1.208 ± 0.619
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.403TrpTyr: 0.403 ± 1.339
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.222TyrAla: 3.222 ± 0.992
1.208TyrCys: 1.208 ± 0.619
1.611TyrAsp: 1.611 ± 0.956
3.625TyrGlu: 3.625 ± 0.961
0.0TyrPhe: 0.0 ± 0.0
1.208TyrGly: 1.208 ± 0.619
0.403TyrHis: 0.403 ± 0.206
2.014TyrIle: 2.014 ± 1.031
2.416TyrLys: 2.416 ± 0.984
1.611TyrLeu: 1.611 ± 1.02
0.403TyrMet: 0.403 ± 0.206
2.416TyrAsn: 2.416 ± 1.237
2.014TyrPro: 2.014 ± 1.031
2.416TyrGln: 2.416 ± 1.302
3.625TyrArg: 3.625 ± 2.708
2.014TyrSer: 2.014 ± 1.031
0.805TyrThr: 0.805 ± 0.412
0.805TyrVal: 0.805 ± 1.205
0.403TyrTrp: 0.403 ± 1.159
1.208TyrTyr: 1.208 ± 0.619
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.805XaaIle: 0.805 ± 0.412
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.403XaaMet: 0.403 ± 0.206
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.403XaaArg: 0.403 ± 0.206
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.403XaaTyr: 0.403 ± 0.206
0.403XaaXaa: 0.403 ± 0.206
Statistics based on 4 proteins (2484 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
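Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported. The page does not state which spread measure is used, so taking the standard deviation across replicates is an assumption here, as are the function names and the toy sequences.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Fraction of overlapping residue pairs in `sequences` equal to `pair`."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    hits = sum(
        1 for s in sequences for i in range(len(s) - 1) if s[i:i + 2] == pair
    )
    return hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the pair frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.pstdev(estimates)


# Four made-up sequences standing in for the proteins of a small proteome.
proteins = ["MKEEL", "AEEKEE", "GGSGG", "EELLK"]
error = bootstrap_error(proteins, "EE")
print(pair_frequency(proteins, "EE"), error)
```

With only a handful of proteins, as in this proteome, each replicate can differ substantially from the others, which is consistent with the large errors relative to the values seen in the table.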

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski