Amino acid dipepetide frequency for Peanut chlorotic streak virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.528AlaAla: 0.528 ± 0.505
0.0AlaCys: 0.0 ± 0.0
2.111AlaAsp: 2.111 ± 0.741
5.277AlaGlu: 5.277 ± 1.129
2.639AlaPhe: 2.639 ± 1.339
1.055AlaGly: 1.055 ± 0.753
0.0AlaHis: 0.0 ± 0.0
3.694AlaIle: 3.694 ± 0.977
6.86AlaLys: 6.86 ± 0.893
2.111AlaLeu: 2.111 ± 0.314
0.0AlaMet: 0.0 ± 0.0
1.583AlaAsn: 1.583 ± 0.941
2.111AlaPro: 2.111 ± 1.101
2.639AlaGln: 2.639 ± 0.911
3.166AlaArg: 3.166 ± 0.76
1.055AlaSer: 1.055 ± 0.463
2.111AlaThr: 2.111 ± 1.066
2.639AlaVal: 2.639 ± 1.339
0.528AlaTrp: 0.528 ± 0.514
3.166AlaTyr: 3.166 ± 1.219
0.0AlaXaa: 0.0 ± 0.0
Cys
0.528CysAla: 0.528 ± 0.514
0.528CysCys: 0.528 ± 0.514
0.528CysAsp: 0.528 ± 0.514
1.055CysGlu: 1.055 ± 0.741
0.0CysPhe: 0.0 ± 0.0
1.055CysGly: 1.055 ± 0.741
0.0CysHis: 0.0 ± 0.0
0.528CysIle: 0.528 ± 0.558
1.055CysLys: 1.055 ± 0.499
2.111CysLeu: 2.111 ± 0.481
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.111CysPro: 2.111 ± 0.999
2.111CysGln: 2.111 ± 1.168
0.528CysArg: 0.528 ± 0.514
1.583CysSer: 1.583 ± 0.941
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.528CysTrp: 0.528 ± 0.514
1.055CysTyr: 1.055 ± 0.62
0.0CysXaa: 0.0 ± 0.0
Asp
2.639AspAla: 2.639 ± 1.381
0.0AspCys: 0.0 ± 0.0
3.694AspAsp: 3.694 ± 1.676
6.332AspGlu: 6.332 ± 1.967
2.639AspPhe: 2.639 ± 1.26
0.528AspGly: 0.528 ± 0.514
1.055AspHis: 1.055 ± 1.009
5.277AspIle: 5.277 ± 0.68
4.749AspLys: 4.749 ± 1.526
5.805AspLeu: 5.805 ± 1.983
0.0AspMet: 0.0 ± 0.0
3.694AspAsn: 3.694 ± 1.229
2.639AspPro: 2.639 ± 1.416
1.583AspGln: 1.583 ± 1.043
4.222AspArg: 4.222 ± 1.305
2.111AspSer: 2.111 ± 1.168
1.583AspThr: 1.583 ± 1.129
1.055AspVal: 1.055 ± 0.584
0.528AspTrp: 0.528 ± 0.514
2.111AspTyr: 2.111 ± 0.776
0.0AspXaa: 0.0 ± 0.0
Glu
4.222GluAla: 4.222 ± 1.807
0.528GluCys: 0.528 ± 0.514
6.86GluAsp: 6.86 ± 1.741
10.554GluGlu: 10.554 ± 1.578
5.277GluPhe: 5.277 ± 0.627
5.805GluGly: 5.805 ± 1.84
0.528GluHis: 0.528 ± 0.558
4.222GluIle: 4.222 ± 1.161
12.665GluLys: 12.665 ± 2.844
3.694GluLeu: 3.694 ± 0.527
1.055GluMet: 1.055 ± 0.486
4.222GluAsn: 4.222 ± 1.051
1.583GluPro: 1.583 ± 0.38
3.166GluGln: 3.166 ± 2.277
2.111GluArg: 2.111 ± 0.314
4.222GluSer: 4.222 ± 1.842
6.332GluThr: 6.332 ± 2.404
2.639GluVal: 2.639 ± 1.329
1.055GluTrp: 1.055 ± 0.463
1.583GluTyr: 1.583 ± 0.941
0.0GluXaa: 0.0 ± 0.0
Phe
0.528PheAla: 0.528 ± 0.514
0.528PheCys: 0.528 ± 0.514
1.583PheAsp: 1.583 ± 0.38
1.055PheGlu: 1.055 ± 0.499
1.055PhePhe: 1.055 ± 0.499
0.528PheGly: 0.528 ± 0.376
0.0PheHis: 0.0 ± 0.0
5.277PheIle: 5.277 ± 1.446
2.639PheLys: 2.639 ± 0.673
6.86PheLeu: 6.86 ± 2.471
1.055PheMet: 1.055 ± 0.753
3.166PheAsn: 3.166 ± 1.023
2.639PhePro: 2.639 ± 0.187
2.639PheGln: 2.639 ± 0.861
3.694PheArg: 3.694 ± 1.165
3.694PheSer: 3.694 ± 0.527
0.528PheThr: 0.528 ± 0.376
1.583PheVal: 1.583 ± 0.72
0.0PheTrp: 0.0 ± 0.0
0.528PheTyr: 0.528 ± 0.376
0.0PheXaa: 0.0 ± 0.0
Gly
1.583GlyAla: 1.583 ± 0.676
1.055GlyCys: 1.055 ± 0.499
2.639GlyAsp: 2.639 ± 0.822
3.166GlyGlu: 3.166 ± 0.679
3.694GlyPhe: 3.694 ± 1.227
1.055GlyGly: 1.055 ± 0.463
1.055GlyHis: 1.055 ± 0.62
5.805GlyIle: 5.805 ± 1.808
4.222GlyLys: 4.222 ± 0.99
4.222GlyLeu: 4.222 ± 1.327
0.528GlyMet: 0.528 ± 0.505
2.111GlyAsn: 2.111 ± 0.999
1.583GlyPro: 1.583 ± 0.977
0.528GlyGln: 0.528 ± 0.558
1.583GlyArg: 1.583 ± 0.676
3.166GlySer: 3.166 ± 0.679
1.055GlyThr: 1.055 ± 0.753
3.694GlyVal: 3.694 ± 0.806
0.0GlyTrp: 0.0 ± 0.0
2.111GlyTyr: 2.111 ± 1.454
0.0GlyXaa: 0.0 ± 0.0
His
0.528HisAla: 0.528 ± 0.505
0.528HisCys: 0.528 ± 0.514
0.528HisAsp: 0.528 ± 0.376
1.055HisGlu: 1.055 ± 0.55
0.0HisPhe: 0.0 ± 0.0
1.583HisGly: 1.583 ± 0.72
0.0HisHis: 0.0 ± 0.0
2.111HisIle: 2.111 ± 0.991
0.528HisLys: 0.528 ± 0.505
2.639HisLeu: 2.639 ± 1.273
0.0HisMet: 0.0 ± 0.0
1.055HisAsn: 1.055 ± 0.463
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.055HisSer: 1.055 ± 0.55
0.0HisThr: 0.0 ± 0.0
1.055HisVal: 1.055 ± 0.741
0.0HisTrp: 0.0 ± 0.0
0.528HisTyr: 0.528 ± 0.514
0.0HisXaa: 0.0 ± 0.0
Ile
3.166IleAla: 3.166 ± 1.761
2.639IleCys: 2.639 ± 0.187
4.749IleAsp: 4.749 ± 1.758
7.388IleGlu: 7.388 ± 1.253
2.639IlePhe: 2.639 ± 0.697
3.694IleGly: 3.694 ± 1.229
2.111IleHis: 2.111 ± 0.314
4.222IleIle: 4.222 ± 1.114
6.332IleLys: 6.332 ± 1.371
8.443IleLeu: 8.443 ± 0.805
0.528IleMet: 0.528 ± 1.026
3.694IleAsn: 3.694 ± 1.165
3.166IlePro: 3.166 ± 1.369
5.805IleGln: 5.805 ± 1.373
5.277IleArg: 5.277 ± 2.659
4.222IleSer: 4.222 ± 1.473
2.639IleThr: 2.639 ± 1.034
3.166IleVal: 3.166 ± 1.017
1.055IleTrp: 1.055 ± 0.463
5.277IleTyr: 5.277 ± 2.481
0.0IleXaa: 0.0 ± 0.0
Lys
5.805LysAla: 5.805 ± 2.374
0.528LysCys: 0.528 ± 0.514
2.111LysAsp: 2.111 ± 0.582
11.082LysGlu: 11.082 ± 3.117
3.166LysPhe: 3.166 ± 1.138
4.222LysGly: 4.222 ± 1.499
1.055LysHis: 1.055 ± 0.55
9.499LysIle: 9.499 ± 3.285
8.443LysLys: 8.443 ± 2.635
7.388LysLeu: 7.388 ± 1.3
3.166LysMet: 3.166 ± 0.864
4.749LysAsn: 4.749 ± 1.278
4.749LysPro: 4.749 ± 1.124
8.443LysGln: 8.443 ± 1.456
5.805LysArg: 5.805 ± 0.543
5.277LysSer: 5.277 ± 1.237
3.694LysThr: 3.694 ± 1.109
4.749LysVal: 4.749 ± 2.396
1.583LysTrp: 1.583 ± 0.72
4.222LysTyr: 4.222 ± 1.552
0.0LysXaa: 0.0 ± 0.0
Leu
6.332LeuAla: 6.332 ± 2.497
2.111LeuCys: 2.111 ± 1.168
7.388LeuAsp: 7.388 ± 1.756
6.86LeuGlu: 6.86 ± 1.645
1.583LeuPhe: 1.583 ± 0.676
6.332LeuGly: 6.332 ± 0.641
1.055LeuHis: 1.055 ± 0.463
4.749LeuIle: 4.749 ± 1.421
8.971LeuLys: 8.971 ± 2.864
5.805LeuLeu: 5.805 ± 1.318
1.055LeuMet: 1.055 ± 0.55
2.111LeuAsn: 2.111 ± 0.999
3.166LeuPro: 3.166 ± 1.699
7.388LeuGln: 7.388 ± 1.643
5.277LeuArg: 5.277 ± 1.171
5.805LeuSer: 5.805 ± 1.107
6.86LeuThr: 6.86 ± 1.376
4.222LeuVal: 4.222 ± 1.597
0.0LeuTrp: 0.0 ± 0.0
4.222LeuTyr: 4.222 ± 0.941
0.0LeuXaa: 0.0 ± 0.0
Met
1.055MetAla: 1.055 ± 0.499
0.528MetCys: 0.528 ± 0.558
1.055MetAsp: 1.055 ± 0.499
1.055MetGlu: 1.055 ± 0.584
1.055MetPhe: 1.055 ± 0.463
1.055MetGly: 1.055 ± 0.463
1.055MetHis: 1.055 ± 0.753
1.583MetIle: 1.583 ± 0.676
1.583MetLys: 1.583 ± 0.667
2.111MetLeu: 2.111 ± 0.741
0.0MetMet: 0.0 ± 0.0
1.055MetAsn: 1.055 ± 0.499
1.055MetPro: 1.055 ± 0.499
0.0MetGln: 0.0 ± 0.0
0.528MetArg: 0.528 ± 0.505
1.583MetSer: 1.583 ± 1.043
0.0MetThr: 0.0 ± 0.0
1.055MetVal: 1.055 ± 0.584
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.055AsnAla: 1.055 ± 0.753
1.055AsnCys: 1.055 ± 0.62
1.055AsnAsp: 1.055 ± 0.499
3.694AsnGlu: 3.694 ± 0.972
2.111AsnPhe: 2.111 ± 0.314
2.111AsnGly: 2.111 ± 1.061
1.583AsnHis: 1.583 ± 0.72
2.639AsnIle: 2.639 ± 0.605
4.222AsnLys: 4.222 ± 0.941
4.749AsnLeu: 4.749 ± 1.305
2.111AsnMet: 2.111 ± 1.448
3.694AsnAsn: 3.694 ± 1.253
1.583AsnPro: 1.583 ± 0.676
2.111AsnGln: 2.111 ± 0.816
5.277AsnArg: 5.277 ± 1.434
2.639AsnSer: 2.639 ± 0.909
1.055AsnThr: 1.055 ± 0.499
2.639AsnVal: 2.639 ± 0.885
0.528AsnTrp: 0.528 ± 0.376
4.749AsnTyr: 4.749 ± 1.055
0.0AsnXaa: 0.0 ± 0.0
Pro
1.583ProAla: 1.583 ± 0.676
0.528ProCys: 0.528 ± 0.514
1.055ProAsp: 1.055 ± 0.499
2.639ProGlu: 2.639 ± 1.867
2.111ProPhe: 2.111 ± 1.431
0.528ProGly: 0.528 ± 0.505
0.528ProHis: 0.528 ± 0.376
3.166ProIle: 3.166 ± 0.679
2.111ProLys: 2.111 ± 1.448
3.166ProLeu: 3.166 ± 1.52
1.055ProMet: 1.055 ± 0.499
2.111ProAsn: 2.111 ± 0.999
2.111ProPro: 2.111 ± 0.816
3.694ProGln: 3.694 ± 1.115
1.583ProArg: 1.583 ± 0.38
5.805ProSer: 5.805 ± 0.814
2.111ProThr: 2.111 ± 0.902
2.639ProVal: 2.639 ± 1.294
0.528ProTrp: 0.528 ± 0.376
1.583ProTyr: 1.583 ± 0.941
0.0ProXaa: 0.0 ± 0.0
Gln
3.166GlnAla: 3.166 ± 0.679
0.528GlnCys: 0.528 ± 0.505
3.166GlnAsp: 3.166 ± 1.198
4.749GlnGlu: 4.749 ± 2.261
1.055GlnPhe: 1.055 ± 0.753
2.639GlnGly: 2.639 ± 0.187
0.0GlnHis: 0.0 ± 0.0
6.86GlnIle: 6.86 ± 1.358
6.86GlnLys: 6.86 ± 0.816
5.277GlnLeu: 5.277 ± 0.818
0.528GlnMet: 0.528 ± 0.376
2.639GlnAsn: 2.639 ± 1.28
2.111GlnPro: 2.111 ± 0.741
3.694GlnGln: 3.694 ± 1.115
2.111GlnArg: 2.111 ± 0.961
4.749GlnSer: 4.749 ± 1.361
4.222GlnThr: 4.222 ± 0.99
3.694GlnVal: 3.694 ± 1.229
0.0GlnTrp: 0.0 ± 0.0
2.111GlnTyr: 2.111 ± 1.241
0.0GlnXaa: 0.0 ± 0.0
Arg
1.055ArgAla: 1.055 ± 1.009
1.055ArgCys: 1.055 ± 0.499
1.055ArgAsp: 1.055 ± 0.584
4.222ArgGlu: 4.222 ± 2.024
2.111ArgPhe: 2.111 ± 0.582
3.166ArgGly: 3.166 ± 1.498
0.528ArgHis: 0.528 ± 0.505
5.277ArgIle: 5.277 ± 1.147
7.916ArgLys: 7.916 ± 3.081
7.388ArgLeu: 7.388 ± 1.018
2.111ArgMet: 2.111 ± 0.839
2.111ArgAsn: 2.111 ± 1.431
1.583ArgPro: 1.583 ± 0.72
1.583ArgGln: 1.583 ± 0.72
2.639ArgArg: 2.639 ± 1.934
2.111ArgSer: 2.111 ± 1.061
2.639ArgThr: 2.639 ± 1.123
1.055ArgVal: 1.055 ± 0.55
0.528ArgTrp: 0.528 ± 0.376
3.694ArgTyr: 3.694 ± 1.093
0.0ArgXaa: 0.0 ± 0.0
Ser
2.639SerAla: 2.639 ± 1.413
0.528SerCys: 0.528 ± 0.376
3.694SerAsp: 3.694 ± 0.78
6.332SerGlu: 6.332 ± 2.159
3.694SerPhe: 3.694 ± 0.459
3.694SerGly: 3.694 ± 0.417
1.055SerHis: 1.055 ± 0.55
1.055SerIle: 1.055 ± 1.009
5.277SerLys: 5.277 ± 1.129
8.443SerLeu: 8.443 ± 1.786
1.583SerMet: 1.583 ± 0.76
2.639SerAsn: 2.639 ± 0.697
2.111SerPro: 2.111 ± 0.741
6.86SerGln: 6.86 ± 1.358
3.694SerArg: 3.694 ± 0.527
7.388SerSer: 7.388 ± 2.456
3.166SerThr: 3.166 ± 1.746
3.166SerVal: 3.166 ± 0.505
0.528SerTrp: 0.528 ± 0.376
1.583SerTyr: 1.583 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
2.111ThrAla: 2.111 ± 0.314
1.055ThrCys: 1.055 ± 1.027
4.749ThrAsp: 4.749 ± 1.564
2.111ThrGlu: 2.111 ± 1.007
2.639ThrPhe: 2.639 ± 0.861
2.639ThrGly: 2.639 ± 1.26
0.0ThrHis: 0.0 ± 0.0
3.166ThrIle: 3.166 ± 0.49
5.277ThrLys: 5.277 ± 2.153
3.166ThrLeu: 3.166 ± 1.195
0.0ThrMet: 0.0 ± 0.0
2.111ThrAsn: 2.111 ± 1.061
1.055ThrPro: 1.055 ± 0.62
2.639ThrGln: 2.639 ± 1.416
2.111ThrArg: 2.111 ± 0.902
4.749ThrSer: 4.749 ± 0.353
3.166ThrThr: 3.166 ± 0.935
1.583ThrVal: 1.583 ± 0.599
0.528ThrTrp: 0.528 ± 0.376
2.639ThrTyr: 2.639 ± 0.911
0.0ThrXaa: 0.0 ± 0.0
Val
1.055ValAla: 1.055 ± 0.753
1.583ValCys: 1.583 ± 0.599
2.639ValAsp: 2.639 ± 0.697
2.111ValGlu: 2.111 ± 0.816
0.528ValPhe: 0.528 ± 0.376
0.528ValGly: 0.528 ± 0.514
1.055ValHis: 1.055 ± 0.753
5.805ValIle: 5.805 ± 2.179
3.694ValLys: 3.694 ± 1.318
3.694ValLeu: 3.694 ± 1.115
1.055ValMet: 1.055 ± 0.463
3.694ValAsn: 3.694 ± 1.343
3.166ValPro: 3.166 ± 0.935
3.694ValGln: 3.694 ± 1.493
1.583ValArg: 1.583 ± 0.941
2.111ValSer: 2.111 ± 0.961
4.222ValThr: 4.222 ± 1.481
3.166ValVal: 3.166 ± 2.422
0.0ValTrp: 0.0 ± 0.0
1.055ValTyr: 1.055 ± 0.463
0.0ValXaa: 0.0 ± 0.0
Trp
0.528TrpAla: 0.528 ± 0.514
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.528TrpPhe: 0.528 ± 0.376
1.055TrpGly: 1.055 ± 0.463
0.0TrpHis: 0.0 ± 0.0
1.055TrpIle: 1.055 ± 1.027
2.111TrpLys: 2.111 ± 1.035
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.528TrpAsn: 0.528 ± 0.376
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.583TrpSer: 1.583 ± 0.676
0.0TrpThr: 0.0 ± 0.0
0.528TrpVal: 0.528 ± 0.376
0.0TrpTrp: 0.0 ± 0.0
0.528TrpTyr: 0.528 ± 0.376
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.166TyrAla: 3.166 ± 0.383
0.0TyrCys: 0.0 ± 0.0
1.583TyrAsp: 1.583 ± 1.514
1.583TyrGlu: 1.583 ± 0.941
1.055TyrPhe: 1.055 ± 1.009
1.583TyrGly: 1.583 ± 0.993
0.528TyrHis: 0.528 ± 0.376
4.749TyrIle: 4.749 ± 1.545
4.222TyrLys: 4.222 ± 1.631
4.222TyrLeu: 4.222 ± 0.833
1.055TyrMet: 1.055 ± 1.009
3.694TyrAsn: 3.694 ± 1.559
2.111TyrPro: 2.111 ± 1.007
1.583TyrGln: 1.583 ± 0.38
2.639TyrArg: 2.639 ± 1.942
4.222TyrSer: 4.222 ± 1.971
2.111TyrThr: 2.111 ± 0.672
2.111TyrVal: 2.111 ± 0.481
0.528TyrTrp: 0.528 ± 0.376
1.055TyrTyr: 1.055 ± 0.584
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1896 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski