Amino acid dipeptide frequency for Foxtail mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
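As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the Proteome-pI implementation: it assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and normalized by the total number of pairs, and it uses one-letter amino acid codes. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"
N_POSSIBLE = len(ALPHABET) ** 2  # 20 * 20 possible dipeptides


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only (not from Foxtail mosaic virus).
toy = ["MALLA", "ALA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Dipeptides that never occur are simply absent from the returned dictionary; they correspond to the 0.0 entries in the table.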

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
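The unit conversion and the comparison with the random expectation can be sketched as follows. This assumes that "expected" means the uniform value of 1/400 (2.5‰) described above; the page does not state the exact threshold used for the colouring. The helper names are hypothetical, and the two sample values (AlaLeu and CysGlu) are taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def enrichment(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation.

    Assumes every one of n_dipeptides pairs is equally likely, so the
    expected frequency is 1000 / n_dipeptides per mille.
    """
    expected = 1000.0 / n_dipeptides
    return value_per_mille / expected


ala_leu = 11.711  # AlaLeu, per mille, from the table
cys_glu = 0.509   # CysGlu, per mille, from the table

print(per_mille_to_fraction(ala_leu))   # fraction of all dipeptides
print(round(enrichment(ala_leu), 2))    # above 1: more frequent than expected
print(round(enrichment(cys_glu), 2))    # below 1: underrepresented
```

Passing `n_dipeptides=441` gives the corresponding ratio when Xaa is counted as a 21st amino acid.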

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide (first residue, second residue): frequency ± error, in ‰
Ala
AlaAla: 7.128 ± 3.748
AlaCys: 1.018 ± 0.897
AlaAsp: 4.073 ± 0.792
AlaGlu: 2.037 ± 0.578
AlaPhe: 4.582 ± 1.099
AlaGly: 6.11 ± 3.162
AlaHis: 3.564 ± 1.857
AlaIle: 4.073 ± 2.747
AlaLys: 2.037 ± 1.092
AlaLeu: 11.711 ± 2.627
AlaMet: 2.037 ± 0.676
AlaAsn: 4.073 ± 1.592
AlaPro: 6.619 ± 3.415
AlaGln: 2.546 ± 0.74
AlaArg: 3.564 ± 0.632
AlaSer: 5.601 ± 1.7
AlaThr: 6.619 ± 1.503
AlaVal: 3.564 ± 1.233
AlaTrp: 1.018 ± 0.546
AlaTyr: 4.073 ± 1.069
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.037 ± 0.672
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.509 ± 0.273
CysPhe: 0.509 ± 1.054
CysGly: 0.509 ± 0.273
CysHis: 1.018 ± 1.677
CysIle: 0.509 ± 0.273
CysLys: 0.509 ± 0.767
CysLeu: 1.018 ± 1.445
CysMet: 0.0 ± 0.0
CysAsn: 2.037 ± 0.672
CysPro: 1.018 ± 0.546
CysGln: 1.018 ± 0.546
CysArg: 0.509 ± 0.273
CysSer: 0.0 ± 0.0
CysThr: 1.527 ± 0.804
CysVal: 0.509 ± 1.054
CysTrp: 0.509 ± 1.535
CysTyr: 0.509 ± 0.767
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.564 ± 1.947
AspCys: 1.018 ± 0.546
AspAsp: 2.037 ± 1.092
AspGlu: 5.601 ± 1.157
AspPhe: 1.527 ± 0.819
AspGly: 2.037 ± 0.798
AspHis: 0.0 ± 0.0
AspIle: 2.546 ± 0.831
AspLys: 3.055 ± 0.954
AspLeu: 3.564 ± 1.911
AspMet: 0.509 ± 0.273
AspAsn: 3.055 ± 2.251
AspPro: 3.564 ± 0.632
AspGln: 1.527 ± 0.819
AspArg: 3.055 ± 0.565
AspSer: 3.055 ± 0.565
AspThr: 5.092 ± 1.238
AspVal: 3.055 ± 0.954
AspTrp: 1.018 ± 0.546
AspTyr: 1.527 ± 0.986
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.092 ± 1.962
GluCys: 0.509 ± 0.273
GluAsp: 2.546 ± 0.623
GluGlu: 3.564 ± 0.632
GluPhe: 2.037 ± 0.672
GluGly: 2.546 ± 0.831
GluHis: 2.037 ± 0.672
GluIle: 7.637 ± 1.934
GluLys: 3.055 ± 1.638
GluLeu: 3.564 ± 0.632
GluMet: 0.509 ± 0.273
GluAsn: 2.546 ± 0.74
GluPro: 7.128 ± 1.749
GluGln: 1.018 ± 0.595
GluArg: 2.546 ± 0.831
GluSer: 2.037 ± 1.414
GluThr: 4.582 ± 1.043
GluVal: 6.11 ± 2.077
GluTrp: 0.509 ± 0.273
GluTyr: 1.018 ± 0.648
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.527 ± 2.028
PheCys: 0.509 ± 0.273
PheAsp: 5.092 ± 1.245
PheGlu: 4.073 ± 2.184
PhePhe: 2.546 ± 1.681
PheGly: 2.037 ± 1.092
PheHis: 0.0 ± 0.0
PheIle: 1.527 ± 0.52
PheLys: 1.527 ± 0.52
PheLeu: 4.073 ± 1.446
PheMet: 2.546 ± 0.881
PheAsn: 0.509 ± 0.273
PhePro: 0.509 ± 0.767
PheGln: 1.527 ± 0.601
PheArg: 2.546 ± 1.365
PheSer: 2.546 ± 1.365
PheThr: 3.055 ± 1.225
PheVal: 2.546 ± 1.314
PheTrp: 0.509 ± 0.273
PheTyr: 2.546 ± 0.623
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.582 ± 2.885
GlyCys: 0.509 ± 0.273
GlyAsp: 4.073 ± 1.446
GlyGlu: 4.582 ± 1.486
GlyPhe: 2.546 ± 2.061
GlyGly: 3.055 ± 0.722
GlyHis: 1.527 ± 1.504
GlyIle: 2.037 ± 0.798
GlyLys: 4.073 ± 1.069
GlyLeu: 2.546 ± 1.482
GlyMet: 0.0 ± 0.0
GlyAsn: 1.527 ± 1.403
GlyPro: 1.018 ± 0.546
GlyGln: 3.055 ± 1.302
GlyArg: 2.037 ± 0.578
GlySer: 2.037 ± 0.798
GlyThr: 3.564 ± 2.022
GlyVal: 2.546 ± 0.623
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.527 ± 0.819
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.546 ± 0.831
HisCys: 0.509 ± 0.273
HisAsp: 2.037 ± 0.672
HisGlu: 1.018 ± 1.445
HisPhe: 1.527 ± 0.601
HisGly: 3.055 ± 2.969
HisHis: 1.018 ± 0.546
HisIle: 1.527 ± 0.601
HisLys: 2.546 ± 0.623
HisLeu: 3.564 ± 1.444
HisMet: 1.018 ± 1.381
HisAsn: 1.018 ± 0.546
HisPro: 1.018 ± 0.546
HisGln: 1.018 ± 0.546
HisArg: 2.546 ± 1.219
HisSer: 1.527 ± 3.567
HisThr: 1.527 ± 1.182
HisVal: 1.527 ± 1.978
HisTrp: 0.0 ± 0.0
HisTyr: 1.018 ± 0.648
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.582 ± 2.515
IleCys: 0.0 ± 0.0
IleAsp: 2.037 ± 0.798
IleGlu: 3.055 ± 0.954
IlePhe: 2.546 ± 1.084
IleGly: 2.546 ± 0.881
IleHis: 2.037 ± 0.978
IleIle: 2.037 ± 1.574
IleLys: 2.546 ± 1.365
IleLeu: 6.619 ± 1.866
IleMet: 1.527 ± 0.767
IleAsn: 3.055 ± 1.945
IlePro: 2.546 ± 1.365
IleGln: 2.037 ± 0.578
IleArg: 1.527 ± 0.601
IleSer: 2.037 ± 0.978
IleThr: 5.601 ± 1.196
IleVal: 2.037 ± 1.83
IleTrp: 0.0 ± 0.0
IleTyr: 0.509 ± 0.273
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.092 ± 1.681
LysCys: 0.0 ± 0.0
LysAsp: 5.092 ± 0.942
LysGlu: 2.546 ± 0.831
LysPhe: 2.546 ± 1.365
LysGly: 0.509 ± 0.767
LysHis: 1.527 ± 0.819
LysIle: 2.546 ± 0.831
LysLys: 3.055 ± 1.039
LysLeu: 6.619 ± 2.002
LysMet: 1.018 ± 0.546
LysAsn: 1.527 ± 0.819
LysPro: 4.073 ± 1.156
LysGln: 1.527 ± 0.819
LysArg: 2.546 ± 0.74
LysSer: 3.564 ± 0.799
LysThr: 5.601 ± 1.908
LysVal: 4.073 ± 2.184
LysTrp: 1.527 ± 0.52
LysTyr: 2.037 ± 0.778
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.165 ± 3.594
LeuCys: 2.037 ± 0.578
LeuAsp: 4.073 ± 1.069
LeuGlu: 4.073 ± 2.59
LeuPhe: 3.564 ± 1.226
LeuGly: 5.092 ± 1.088
LeuHis: 2.546 ± 1.365
LeuIle: 3.055 ± 1.038
LeuLys: 6.11 ± 3.275
LeuLeu: 8.656 ± 4.242
LeuMet: 1.018 ± 0.546
LeuAsn: 3.564 ± 1.192
LeuPro: 10.692 ± 2.448
LeuGln: 4.073 ± 1.517
LeuArg: 5.601 ± 0.802
LeuSer: 5.092 ± 1.814
LeuThr: 7.128 ± 4.636
LeuVal: 5.092 ± 1.407
LeuTrp: 1.527 ± 1.403
LeuTyr: 4.073 ± 1.343
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.527 ± 0.52
MetCys: 0.0 ± 0.0
MetAsp: 2.037 ± 0.778
MetGlu: 0.509 ± 0.273
MetPhe: 1.018 ± 0.546
MetGly: 0.509 ± 0.273
MetHis: 0.509 ± 1.535
MetIle: 0.0 ± 0.0
MetLys: 1.527 ± 0.52
MetLeu: 1.527 ± 0.804
MetMet: 0.0 ± 0.0
MetAsn: 0.509 ± 0.273
MetPro: 1.018 ± 0.546
MetGln: 1.018 ± 0.546
MetArg: 2.037 ± 1.092
MetSer: 1.018 ± 0.897
MetThr: 1.018 ± 0.546
MetVal: 2.546 ± 0.831
MetTrp: 0.0 ± 0.0
MetTyr: 1.527 ± 0.819
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.601 ± 1.685
AsnCys: 2.037 ± 1.414
AsnAsp: 0.509 ± 0.273
AsnGlu: 2.546 ± 1.084
AsnPhe: 0.0 ± 0.0
AsnGly: 2.037 ± 2.212
AsnHis: 1.527 ± 0.804
AsnIle: 2.037 ± 0.978
AsnLys: 2.546 ± 1.084
AsnLeu: 4.582 ± 2.185
AsnMet: 0.0 ± 0.0
AsnAsn: 1.527 ± 1.269
AsnPro: 2.037 ± 1.092
AsnGln: 2.037 ± 1.191
AsnArg: 1.527 ± 0.986
AsnSer: 2.546 ± 1.365
AsnThr: 2.037 ± 1.069
AsnVal: 2.546 ± 0.623
AsnTrp: 2.037 ± 0.672
AsnTyr: 2.546 ± 0.881
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.637 ± 3.526
ProCys: 0.0 ± 0.0
ProAsp: 3.564 ± 1.226
ProGlu: 4.073 ± 1.443
ProPhe: 0.509 ± 0.273
ProGly: 3.055 ± 1.038
ProHis: 3.055 ± 2.519
ProIle: 3.564 ± 1.226
ProLys: 4.582 ± 1.7
ProLeu: 4.073 ± 1.116
ProMet: 2.546 ± 0.74
ProAsn: 2.037 ± 1.687
ProPro: 6.619 ± 4.271
ProGln: 1.527 ± 0.52
ProArg: 5.601 ± 2.226
ProSer: 5.601 ± 1.797
ProThr: 5.092 ± 1.238
ProVal: 3.055 ± 1.442
ProTrp: 1.018 ± 0.648
ProTyr: 4.073 ± 1.178
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.601 ± 1.394
GlnCys: 0.0 ± 0.0
GlnAsp: 1.018 ± 0.546
GlnGlu: 2.546 ± 0.831
GlnPhe: 1.527 ± 0.601
GlnGly: 0.509 ± 0.273
GlnHis: 1.018 ± 0.546
GlnIle: 1.527 ± 0.819
GlnLys: 3.564 ± 1.065
GlnLeu: 2.546 ± 0.623
GlnMet: 1.527 ± 0.657
GlnAsn: 1.018 ± 0.595
GlnPro: 4.582 ± 2.232
GlnGln: 1.018 ± 0.546
GlnArg: 2.037 ± 1.414
GlnSer: 2.037 ± 1.092
GlnThr: 3.564 ± 1.271
GlnVal: 2.546 ± 0.623
GlnTrp: 0.509 ± 0.273
GlnTyr: 1.527 ± 0.986
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.073 ± 1.517
ArgCys: 3.564 ± 2.022
ArgAsp: 4.073 ± 2.184
ArgGlu: 2.546 ± 1.365
ArgPhe: 1.527 ± 0.819
ArgGly: 4.073 ± 0.946
ArgHis: 1.527 ± 2.376
ArgIle: 2.037 ± 0.578
ArgLys: 1.527 ± 0.819
ArgLeu: 4.582 ± 1.043
ArgMet: 0.509 ± 0.273
ArgAsn: 3.564 ± 0.799
ArgPro: 4.073 ± 3.694
ArgGln: 3.564 ± 1.192
ArgArg: 4.582 ± 1.168
ArgSer: 3.055 ± 1.887
ArgThr: 2.037 ± 1.295
ArgVal: 2.037 ± 0.778
ArgTrp: 0.509 ± 0.767
ArgTyr: 2.546 ± 0.881
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.527 ± 1.421
SerCys: 0.509 ± 1.535
SerAsp: 2.037 ± 0.578
SerGlu: 2.546 ± 0.831
SerPhe: 2.037 ± 1.092
SerGly: 1.527 ± 0.601
SerHis: 2.037 ± 2.081
SerIle: 3.564 ± 1.809
SerLys: 3.055 ± 1.638
SerLeu: 7.637 ± 2.687
SerMet: 0.0 ± 0.0
SerAsn: 3.055 ± 0.811
SerPro: 2.037 ± 1.092
SerGln: 3.055 ± 1.302
SerArg: 3.564 ± 1.729
SerSer: 4.073 ± 0.876
SerThr: 5.092 ± 3.057
SerVal: 2.546 ± 0.74
SerTrp: 0.509 ± 0.273
SerTyr: 1.527 ± 0.804
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.055 ± 1.201
ThrCys: 1.527 ± 1.421
ThrAsp: 2.546 ± 1.935
ThrGlu: 6.11 ± 1.742
ThrPhe: 4.582 ± 0.904
ThrGly: 5.092 ± 1.407
ThrHis: 3.564 ± 1.244
ThrIle: 3.564 ± 1.309
ThrLys: 5.601 ± 1.73
ThrLeu: 6.11 ± 1.672
ThrMet: 2.546 ± 0.837
ThrAsn: 2.546 ± 0.909
ThrPro: 6.619 ± 1.409
ThrGln: 4.582 ± 2.782
ThrArg: 5.092 ± 2.853
ThrSer: 2.546 ± 2.061
ThrThr: 5.601 ± 1.908
ThrVal: 3.564 ± 1.244
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.564 ± 1.911
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.092 ± 1.675
ValCys: 0.0 ± 0.0
ValAsp: 1.527 ± 0.601
ValGlu: 6.11 ± 1.547
ValPhe: 4.582 ± 0.47
ValGly: 1.527 ± 1.182
ValHis: 1.527 ± 1.269
ValIle: 3.055 ± 2.324
ValLys: 2.546 ± 1.219
ValLeu: 8.147 ± 2.7
ValMet: 1.527 ± 0.819
ValAsn: 1.527 ± 0.52
ValPro: 3.564 ± 2.527
ValGln: 3.055 ± 1.438
ValArg: 2.546 ± 1.314
ValSer: 1.018 ± 0.546
ValThr: 3.055 ± 1.438
ValVal: 3.055 ± 1.945
ValTrp: 0.509 ± 0.273
ValTyr: 2.037 ± 0.672
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.037 ± 0.578
TrpCys: 0.509 ± 1.535
TrpAsp: 0.509 ± 0.273
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.509 ± 0.273
TrpGly: 0.0 ± 0.0
TrpHis: 0.509 ± 1.535
TrpIle: 0.0 ± 0.0
TrpLys: 1.018 ± 0.546
TrpLeu: 1.527 ± 0.819
TrpMet: 0.0 ± 0.0
TrpAsn: 1.527 ± 0.601
TrpPro: 0.509 ± 0.273
TrpGln: 0.0 ± 0.0
TrpArg: 1.018 ± 0.648
TrpSer: 0.0 ± 0.0
TrpThr: 1.018 ± 0.595
TrpVal: 1.018 ± 0.546
TrpTrp: 0.509 ± 0.273
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.582 ± 1.7
TyrCys: 0.0 ± 0.0
TyrAsp: 1.527 ± 0.52
TyrGlu: 2.546 ± 1.365
TyrPhe: 1.527 ± 1.346
TyrGly: 1.018 ± 1.584
TyrHis: 1.018 ± 0.546
TyrIle: 2.037 ± 0.978
TyrLys: 2.546 ± 0.623
TyrLeu: 3.564 ± 1.226
TyrMet: 0.509 ± 0.273
TyrAsn: 2.037 ± 0.778
TyrPro: 2.037 ± 1.801
TyrGln: 1.527 ± 0.819
TyrArg: 1.527 ± 0.804
TyrSer: 2.546 ± 1.365
TyrThr: 5.601 ± 1.618
TyrVal: 2.037 ± 0.778
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1965 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
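A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions, not the Proteome-pI code: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 resamples, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration only (not from Foxtail mosaic virus).
toy = ["MALLA", "ALA", "MKKLA"]
err = bootstrap_error(toy, "LA")
absent = bootstrap_error(toy, "WW")
print(round(err, 3), absent)
```

A dipeptide that never occurs in any protein has a frequency of zero in every resample, so its error is also zero, which matches the 0.0 ± 0.0 entries in the table.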

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski