Amino acid dipeptide frequency for Cacao swollen shoot Ghana N virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
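The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the one-letter alphabet, the use of overlapping adjacent pairs within each protein, and the normalisation by the total number of pairs are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping adjacent pairs inside each protein,
    and any letter outside the 20 standard ones is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"  # 21 symbols, hence 441 possible pairs
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%)
expected_uniform = 1000.0 / len(STANDARD) ** 2

# Toy input (hypothetical sequences, not taken from this proteome)
freqs = dipeptide_per_mille(["MKAAE", "AAEZ"])
```

Comparing each entry of `freqs` with `expected_uniform` gives the over- or under-representation that the colouring in the table refers to.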

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
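As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation of 2.5‰. The helper names and the simple greater-than / less-than rule are assumptions; the page does not state the exact threshold used for the red and blue marking.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, the uniform expectation


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: GluGlu (11.399) and AlaCys (0.438)
glu_glu_fraction = per_mille_to_fraction(11.399)  # about 0.011399, i.e. 1.14%
glu_glu_label = classify(11.399)
ala_cys_label = classify(0.438)
```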

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell shows frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.699 ± 3.799 | 0.438 ± 0.221 | 3.069 ± 3.284 | 4.822 ± 2.557 | 3.507 ± 1.226 | 2.192 ± 1.046 | 0.877 ± 1.203 | 4.822 ± 2.429 | 4.384 ± 4.391 | 5.261 ± 2.499 | 2.192 ± 1.104 | 1.315 ± 1.081 | 3.069 ± 1.126 | 3.946 ± 1.236 | 3.507 ± 1.361 | 4.822 ± 1.371 | 4.384 ± 4.77 | 3.507 ± 1.361 | 0.438 ± 0.221 | 3.946 ± 1.236 | 0.0 ± 0.0 |
| Cys | 0.877 ± 0.442 | 0.438 ± 0.221 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.877 ± 0.442 | 0.877 ± 1.249 | 0.0 ± 0.0 | 0.877 ± 0.442 | 2.192 ± 1.104 | 0.0 ± 0.0 | 0.877 ± 0.442 | 1.754 ± 0.883 | 1.754 ± 1.073 | 1.315 ± 0.662 | 0.438 ± 0.221 | 0.0 ± 0.0 | 0.877 ± 0.442 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.315 ± 0.662 | 0.0 ± 0.0 |
| Asp | 2.192 ± 1.104 | 0.877 ± 0.442 | 3.946 ± 1.987 | 3.507 ± 1.766 | 0.877 ± 0.442 | 3.507 ± 1.226 | 1.315 ± 1.081 | 2.63 ± 1.325 | 1.315 ± 0.662 | 6.138 ± 2.06 | 0.0 ± 0.0 | 3.507 ± 1.226 | 3.069 ± 2.874 | 3.069 ± 1.252 | 3.507 ± 4.741 | 1.754 ± 1.073 | 3.946 ± 2.107 | 2.63 ± 1.325 | 0.877 ± 0.442 | 3.507 ± 1.226 | 0.0 ± 0.0 |
| Glu | 3.507 ± 2.603 | 1.315 ± 0.662 | 5.699 ± 1.102 | 11.399 ± 1.865 | 1.315 ± 0.662 | 5.261 ± 2.52 | 2.192 ± 1.104 | 3.507 ± 1.262 | 6.576 ± 5.67 | 3.946 ± 1.349 | 1.315 ± 0.97 | 3.069 ± 1.011 | 1.754 ± 2.029 | 3.946 ± 1.239 | 4.384 ± 1.392 | 8.33 ± 3.928 | 5.699 ± 2.87 | 6.576 ± 1.242 | 1.754 ± 0.993 | 1.315 ± 1.081 | 0.0 ± 0.0 |
| Phe | 3.069 ± 1.126 | 0.877 ± 0.442 | 1.315 ± 1.081 | 1.315 ± 0.662 | 0.0 ± 0.0 | 1.315 ± 0.662 | 1.754 ± 0.883 | 2.63 ± 1.325 | 4.822 ± 3.157 | 1.754 ± 0.883 | 0.877 ± 0.442 | 1.315 ± 0.662 | 1.315 ± 0.662 | 0.877 ± 0.442 | 1.754 ± 0.883 | 2.192 ± 1.104 | 0.877 ± 0.442 | 1.315 ± 0.662 | 0.438 ± 0.221 | 1.754 ± 0.883 | 0.0 ± 0.0 |
| Gly | 2.192 ± 1.046 | 0.877 ± 0.442 | 1.754 ± 0.883 | 3.946 ± 1.236 | 1.754 ± 3.252 | 1.754 ± 0.883 | 2.192 ± 1.104 | 6.576 ± 2.863 | 5.261 ± 1.711 | 3.069 ± 1.126 | 1.754 ± 0.883 | 2.63 ± 0.955 | 1.754 ± 0.883 | 0.438 ± 0.221 | 6.138 ± 2.253 | 2.63 ± 1.26 | 4.822 ± 1.089 | 1.754 ± 0.883 | 0.877 ± 0.442 | 3.069 ± 1.545 | 0.0 ± 0.0 |
| His | 2.63 ± 1.064 | 0.438 ± 0.221 | 0.877 ± 0.442 | 1.315 ± 0.662 | 1.315 ± 0.662 | 2.63 ± 1.325 | 1.315 ± 0.662 | 2.63 ± 1.325 | 0.438 ± 0.221 | 2.192 ± 1.66 | 0.438 ± 0.221 | 1.754 ± 1.827 | 0.438 ± 0.221 | 1.315 ± 0.662 | 2.192 ± 1.046 | 0.438 ± 0.221 | 1.315 ± 1.081 | 0.438 ± 0.221 | 1.754 ± 0.993 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 4.822 ± 1.089 | 1.315 ± 0.662 | 2.63 ± 1.325 | 5.699 ± 1.102 | 2.192 ± 0.949 | 3.507 ± 1.766 | 1.754 ± 0.883 | 4.384 ± 1.446 | 6.576 ± 2.278 | 7.014 ± 2.152 | 0.438 ± 0.221 | 1.315 ± 0.662 | 5.261 ± 1.848 | 5.261 ± 3.347 | 1.754 ± 1.073 | 3.507 ± 1.393 | 4.822 ± 1.569 | 3.069 ± 1.011 | 0.0 ± 0.0 | 1.315 ± 0.662 | 0.0 ± 0.0 |
| Lys | 4.384 ± 4.366 | 0.877 ± 0.442 | 2.192 ± 1.306 | 4.384 ± 2.873 | 1.754 ± 0.883 | 3.946 ± 1.236 | 1.754 ± 0.883 | 3.069 ± 2.794 | 3.946 ± 1.236 | 7.891 ± 4.884 | 2.63 ± 0.997 | 2.63 ± 0.955 | 2.63 ± 1.712 | 4.822 ± 3.526 | 5.699 ± 1.912 | 6.138 ± 1.857 | 2.63 ± 1.26 | 3.069 ± 2.794 | 0.877 ± 0.442 | 1.754 ± 0.883 | 0.0 ± 0.0 |
| Leu | 6.138 ± 6.409 | 1.315 ± 1.143 | 3.069 ± 4.953 | 10.96 ± 3.887 | 1.754 ± 0.883 | 4.822 ± 1.561 | 0.877 ± 1.249 | 3.507 ± 3.188 | 5.261 ± 1.91 | 6.576 ± 6.202 | 0.438 ± 0.221 | 2.192 ± 1.104 | 2.63 ± 1.064 | 5.261 ± 3.427 | 6.576 ± 1.757 | 7.014 ± 1.607 | 3.507 ± 1.766 | 4.822 ± 1.561 | 0.438 ± 0.221 | 2.192 ± 1.306 | 0.0 ± 0.0 |
| Met | 2.63 ± 1.064 | 0.438 ± 0.221 | 0.877 ± 0.442 | 2.63 ± 1.064 | 1.754 ± 0.883 | 0.0 ± 0.0 | 0.438 ± 0.221 | 0.438 ± 0.221 | 0.877 ± 0.442 | 2.192 ± 1.104 | 0.438 ± 0.221 | 0.877 ± 1.626 | 1.754 ± 0.883 | 1.754 ± 0.883 | 0.877 ± 0.442 | 0.877 ± 1.203 | 2.192 ± 1.104 | 1.315 ± 0.662 | 0.0 ± 0.0 | 0.438 ± 0.221 | 0.0 ± 0.0 |
| Asn | 1.315 ± 1.143 | 0.438 ± 0.221 | 1.315 ± 0.662 | 2.192 ± 1.104 | 0.438 ± 0.221 | 1.315 ± 0.662 | 0.438 ± 0.221 | 2.192 ± 1.104 | 3.069 ± 1.011 | 3.507 ± 3.354 | 1.315 ± 0.662 | 1.754 ± 1.386 | 2.63 ± 1.325 | 2.192 ± 0.949 | 1.315 ± 1.081 | 1.754 ± 1.386 | 3.069 ± 1.545 | 1.754 ± 1.386 | 0.877 ± 0.442 | 3.946 ± 1.354 | 0.0 ± 0.0 |
| Pro | 6.576 ± 3.042 | 0.0 ± 0.0 | 2.63 ± 1.325 | 3.507 ± 1.361 | 0.877 ± 0.442 | 0.877 ± 1.626 | 1.754 ± 0.883 | 2.192 ± 1.104 | 3.507 ± 1.393 | 2.192 ± 0.949 | 1.315 ± 0.881 | 2.192 ± 1.104 | 3.946 ± 1.349 | 1.754 ± 1.073 | 2.192 ± 1.104 | 3.946 ± 1.354 | 2.192 ± 1.104 | 2.63 ± 1.325 | 0.877 ± 0.442 | 1.754 ± 2.029 | 0.0 ± 0.0 |
| Gln | 2.192 ± 1.89 | 0.438 ± 0.221 | 1.315 ± 0.662 | 7.453 ± 1.578 | 1.315 ± 1.081 | 4.384 ± 1.142 | 1.754 ± 1.073 | 5.261 ± 2.471 | 3.946 ± 4.245 | 5.261 ± 2.499 | 0.438 ± 0.221 | 3.946 ± 1.349 | 2.63 ± 0.955 | 5.261 ± 1.711 | 3.069 ± 1.126 | 3.507 ± 1.282 | 3.946 ± 1.354 | 3.507 ± 1.766 | 1.315 ± 0.662 | 0.877 ± 0.442 | 0.0 ± 0.0 |
| Arg | 3.507 ± 1.361 | 0.877 ± 0.442 | 5.261 ± 3.219 | 2.63 ± 1.26 | 2.192 ± 1.046 | 3.946 ± 1.354 | 0.877 ± 1.203 | 4.822 ± 1.67 | 3.507 ± 1.393 | 7.891 ± 1.736 | 1.754 ± 0.883 | 1.754 ± 0.883 | 3.507 ± 1.226 | 3.069 ± 1.252 | 3.507 ± 2.146 | 6.576 ± 3.137 | 2.192 ± 1.104 | 3.507 ± 3.354 | 1.754 ± 0.993 | 1.315 ± 1.143 | 0.0 ± 0.0 |
| Ser | 1.754 ± 2.029 | 1.754 ± 1.073 | 4.384 ± 1.143 | 5.261 ± 2.978 | 2.192 ± 1.104 | 3.507 ± 1.766 | 2.192 ± 1.66 | 6.138 ± 1.144 | 4.822 ± 3.6 | 5.261 ± 4.936 | 1.754 ± 1.073 | 1.754 ± 0.883 | 1.754 ± 1.073 | 6.138 ± 0.715 | 5.261 ± 1.071 | 2.63 ± 2.161 | 2.192 ± 1.89 | 3.507 ± 2.615 | 0.438 ± 0.221 | 0.877 ± 0.442 | 0.0 ± 0.0 |
| Thr | 5.699 ± 2.05 | 0.438 ± 0.221 | 4.384 ± 2.208 | 4.822 ± 1.561 | 2.63 ± 1.325 | 5.699 ± 1.102 | 2.192 ± 1.104 | 4.384 ± 1.446 | 1.754 ± 0.883 | 2.192 ± 1.104 | 2.63 ± 1.325 | 0.438 ± 1.351 | 3.507 ± 1.766 | 1.754 ± 1.073 | 4.822 ± 1.67 | 2.192 ± 3.637 | 7.014 ± 2.402 | 2.63 ± 1.325 | 0.877 ± 0.442 | 2.192 ± 1.046 | 0.0 ± 0.0 |
| Val | 2.63 ± 1.325 | 0.438 ± 0.221 | 4.384 ± 1.446 | 2.63 ± 2.161 | 3.507 ± 1.766 | 3.507 ± 1.393 | 2.192 ± 1.046 | 3.507 ± 1.109 | 2.192 ± 1.104 | 2.192 ± 0.949 | 0.877 ± 0.442 | 0.0 ± 0.0 | 2.192 ± 1.306 | 5.261 ± 1.062 | 4.384 ± 1.392 | 2.192 ± 6.065 | 3.946 ± 1.987 | 1.754 ± 0.883 | 0.438 ± 0.221 | 1.754 ± 0.883 | 0.0 ± 0.0 |
| Trp | 0.438 ± 0.221 | 0.0 ± 0.0 | 0.877 ± 0.442 | 3.069 ± 1.011 | 0.0 ± 0.0 | 0.438 ± 0.221 | 0.0 ± 0.0 | 0.877 ± 0.442 | 0.438 ± 0.221 | 1.754 ± 0.883 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.438 ± 0.221 | 1.315 ± 0.662 | 1.315 ± 0.662 | 0.438 ± 0.221 | 0.877 ± 0.442 | 1.754 ± 0.993 | 0.877 ± 0.442 | 0.438 ± 1.351 | 0.0 ± 0.0 |
| Tyr | 3.946 ± 1.354 | 0.877 ± 0.442 | 2.192 ± 2.385 | 1.315 ± 1.081 | 1.315 ± 0.662 | 1.754 ± 1.386 | 0.0 ± 0.0 | 2.192 ± 1.104 | 2.192 ± 1.306 | 3.069 ± 1.011 | 0.877 ± 0.442 | 2.63 ± 1.325 | 1.315 ± 1.143 | 3.069 ± 1.507 | 1.754 ± 0.883 | 2.63 ± 1.064 | 1.754 ± 0.883 | 0.438 ± 0.221 | 0.438 ± 0.221 | 0.877 ± 0.442 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 4 proteins (2282 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
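One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is taken as the error. This is an assumed procedure for illustration only; the function names, the use of the population standard deviation, and the toy sequences are not taken from the database.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over `rounds` protein-level resamples (assumed method)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of four short sequences
toy = ["MKAAE", "AAEAA", "KKKK", "AEAE"]
err = bootstrap_error(toy, "AA")
```

With only four proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with the large errors relative to the frequencies seen in many cells of the table.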

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski