Amino acid dipeptide frequency for Pelargonium chlorotic ring pattern virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
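The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the one-letter residue codes, the toy sequences, and the choice to count overlapping pairs only within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy = ["MKVLAAGLK", "MAAGXK"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy)

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille for each standard dipeptide.
uniform = 1000.0 / 400
over = sorted(pair for pair, f in freqs.items() if f > uniform)
```

With sequences this short, every observed pair exceeds the uniform expectation, so the comparison only becomes informative on a full proteome.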

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
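As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation. The helper names are hypothetical, and using the uniform 1/400 (or 1/441) value as the baseline is an assumption based on the text above; the page does not state exactly which expectation drives the red and blue marking.

```python
UNIFORM_400 = 1 / 400  # 20 x 20 standard dipeptides
UNIFORM_441 = 1 / 441  # 21 x 21 when Xaa is included


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def enrichment(per_mille, baseline=UNIFORM_400):
    """Ratio of the observed frequency to the assumed uniform baseline."""
    return per_mille_to_fraction(per_mille) / baseline


arg_arg = 10.224  # ArgArg value taken from the table
ratio = enrichment(arg_arg)  # about 4.09 times the uniform expectation
```

A ratio above 1 corresponds to an over-represented dipeptide under this baseline, and a ratio below 1 to an under-represented one.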

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.473 ± 1.698
AlaCys: 0.639 ± 0.406
AlaAsp: 1.917 ± 1.916
AlaGlu: 4.473 ± 0.845
AlaPhe: 6.39 ± 2.248
AlaGly: 0.639 ± 0.406
AlaHis: 2.556 ± 0.973
AlaIle: 3.195 ± 0.728
AlaLys: 6.39 ± 1.902
AlaLeu: 8.307 ± 1.687
AlaMet: 1.917 ± 1.218
AlaAsn: 3.834 ± 2.026
AlaPro: 3.195 ± 0.91
AlaGln: 5.112 ± 1.953
AlaArg: 6.39 ± 1.087
AlaSer: 6.39 ± 2.416
AlaThr: 4.473 ± 1.698
AlaVal: 4.473 ± 1.073
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.917 ± 0.74
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.278 ± 0.67
CysCys: 1.278 ± 0.67
CysAsp: 0.0 ± 0.0
CysGlu: 2.556 ± 0.959
CysPhe: 1.917 ± 1.287
CysGly: 1.917 ± 0.74
CysHis: 0.639 ± 0.406
CysIle: 1.917 ± 0.74
CysLys: 0.0 ± 0.0
CysLeu: 1.278 ± 0.812
CysMet: 0.0 ± 0.0
CysAsn: 1.278 ± 0.67
CysPro: 0.639 ± 0.406
CysGln: 0.639 ± 0.406
CysArg: 0.639 ± 0.406
CysSer: 3.834 ± 0.881
CysThr: 0.0 ± 0.0
CysVal: 1.278 ± 0.812
CysTrp: 1.917 ± 0.721
CysTyr: 0.639 ± 0.406
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.639 ± 0.406
AspCys: 1.917 ± 0.74
AspAsp: 1.917 ± 1.096
AspGlu: 1.278 ± 0.67
AspPhe: 0.639 ± 0.406
AspGly: 3.195 ± 1.238
AspHis: 1.278 ± 0.67
AspIle: 2.556 ± 0.973
AspLys: 5.112 ± 1.256
AspLeu: 4.473 ± 1.703
AspMet: 1.917 ± 0.74
AspAsn: 0.0 ± 0.0
AspPro: 3.195 ± 0.978
AspGln: 1.917 ± 1.218
AspArg: 0.639 ± 0.838
AspSer: 5.112 ± 1.938
AspThr: 1.278 ± 0.548
AspVal: 3.834 ± 1.85
AspTrp: 1.917 ± 0.857
AspTyr: 2.556 ± 0.377
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.307 ± 1.187
GluCys: 0.0 ± 0.0
GluAsp: 3.195 ± 1.309
GluGlu: 4.473 ± 0.82
GluPhe: 4.473 ± 1.784
GluGly: 3.195 ± 1.331
GluHis: 1.917 ± 1.218
GluIle: 0.639 ± 0.406
GluLys: 1.917 ± 1.218
GluLeu: 3.834 ± 0.882
GluMet: 0.0 ± 0.0
GluAsn: 2.556 ± 1.233
GluPro: 2.556 ± 1.694
GluGln: 0.0 ± 0.0
GluArg: 3.834 ± 1.646
GluSer: 0.639 ± 0.406
GluThr: 1.917 ± 1.453
GluVal: 6.39 ± 2.147
GluTrp: 1.278 ± 0.812
GluTyr: 3.195 ± 1.331
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.39 ± 2.248
PheCys: 1.278 ± 0.812
PheAsp: 3.195 ± 1.444
PheGlu: 3.195 ± 0.543
PhePhe: 0.639 ± 0.838
PheGly: 3.195 ± 1.331
PheHis: 0.639 ± 0.406
PheIle: 2.556 ± 2.673
PheLys: 0.639 ± 0.406
PheLeu: 4.473 ± 1.494
PheMet: 1.278 ± 1.112
PheAsn: 0.639 ± 0.838
PhePro: 3.195 ± 2.019
PheGln: 1.278 ± 0.67
PheArg: 2.556 ± 0.959
PheSer: 8.946 ± 0.908
PheThr: 3.834 ± 1.105
PheVal: 3.195 ± 1.425
PheTrp: 0.0 ± 0.0
PheTyr: 1.917 ± 1.218
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.39 ± 1.652
GlyCys: 1.278 ± 0.812
GlyAsp: 5.751 ± 0.87
GlyGlu: 1.917 ± 0.74
GlyPhe: 1.278 ± 0.812
GlyGly: 7.668 ± 1.323
GlyHis: 0.639 ± 0.406
GlyIle: 1.917 ± 0.564
GlyLys: 7.668 ± 1.122
GlyLeu: 3.195 ± 1.444
GlyMet: 1.917 ± 0.564
GlyAsn: 3.834 ± 0.661
GlyPro: 2.556 ± 1.096
GlyGln: 1.917 ± 0.721
GlyArg: 3.195 ± 1.619
GlySer: 1.278 ± 0.812
GlyThr: 4.473 ± 1.32
GlyVal: 7.029 ± 1.998
GlyTrp: 1.917 ± 0.74
GlyTyr: 1.917 ± 0.74
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.639 ± 0.406
HisGlu: 0.639 ± 0.406
HisPhe: 2.556 ± 0.377
HisGly: 0.639 ± 0.619
HisHis: 0.639 ± 0.406
HisIle: 0.0 ± 0.0
HisLys: 2.556 ± 0.959
HisLeu: 1.917 ± 1.055
HisMet: 1.278 ± 0.67
HisAsn: 1.917 ± 1.218
HisPro: 0.639 ± 0.406
HisGln: 0.0 ± 0.0
HisArg: 0.639 ± 0.406
HisSer: 4.473 ± 1.058
HisThr: 0.639 ± 0.619
HisVal: 0.639 ± 0.406
HisTrp: 0.639 ± 0.406
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.112 ± 1.587
IleCys: 0.0 ± 0.0
IleAsp: 1.278 ± 0.548
IleGlu: 3.834 ± 1.441
IlePhe: 1.917 ± 0.564
IleGly: 2.556 ± 1.096
IleHis: 0.0 ± 0.0
IleIle: 1.917 ± 0.721
IleLys: 1.278 ± 0.548
IleLeu: 1.278 ± 0.548
IleMet: 2.556 ± 1.103
IleAsn: 0.639 ± 0.406
IlePro: 4.473 ± 0.87
IleGln: 0.639 ± 0.406
IleArg: 0.0 ± 0.0
IleSer: 4.473 ± 1.347
IleThr: 7.668 ± 3.614
IleVal: 2.556 ± 0.977
IleTrp: 0.0 ± 0.0
IleTyr: 1.278 ± 0.812
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.112 ± 3.478
LysCys: 3.195 ± 0.543
LysAsp: 3.195 ± 0.543
LysGlu: 1.278 ± 0.812
LysPhe: 5.112 ± 1.256
LysGly: 2.556 ± 1.061
LysHis: 0.0 ± 0.0
LysIle: 3.195 ± 1.425
LysLys: 1.278 ± 0.812
LysLeu: 7.668 ± 1.422
LysMet: 1.278 ± 0.925
LysAsn: 2.556 ± 1.438
LysPro: 3.195 ± 1.087
LysGln: 1.917 ± 1.096
LysArg: 4.473 ± 1.058
LysSer: 3.834 ± 1.298
LysThr: 3.195 ± 1.073
LysVal: 6.39 ± 1.516
LysTrp: 1.278 ± 0.812
LysTyr: 2.556 ± 1.096
LysXaa: 0.639 ± 0.406
Leu
LeuAla: 9.585 ± 0.978
LeuCys: 1.278 ± 0.812
LeuAsp: 3.834 ± 0.935
LeuGlu: 5.112 ± 1.653
LeuPhe: 3.195 ± 0.987
LeuGly: 4.473 ± 1.766
LeuHis: 1.917 ± 0.564
LeuIle: 3.195 ± 1.331
LeuLys: 2.556 ± 1.624
LeuLeu: 5.751 ± 2.126
LeuMet: 1.278 ± 0.812
LeuAsn: 4.473 ± 1.655
LeuPro: 1.917 ± 0.564
LeuGln: 3.195 ± 1.534
LeuArg: 5.112 ± 1.094
LeuSer: 5.751 ± 0.977
LeuThr: 6.39 ± 2.388
LeuVal: 7.029 ± 2.852
LeuTrp: 1.278 ± 0.973
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.639 ± 0.619
MetCys: 0.0 ± 0.0
MetAsp: 2.556 ± 1.17
MetGlu: 1.278 ± 0.812
MetPhe: 1.278 ± 0.67
MetGly: 1.278 ± 1.028
MetHis: 1.278 ± 0.548
MetIle: 1.278 ± 0.67
MetLys: 1.917 ± 0.721
MetLeu: 0.0 ± 0.0
MetMet: 0.639 ± 0.565
MetAsn: 0.639 ± 0.406
MetPro: 1.278 ± 0.67
MetGln: 1.278 ± 0.812
MetArg: 0.639 ± 0.406
MetSer: 1.278 ± 0.812
MetThr: 1.917 ± 1.096
MetVal: 1.278 ± 0.548
MetTrp: 0.0 ± 0.0
MetTyr: 0.639 ± 0.619
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.278 ± 0.548
AsnCys: 3.195 ± 1.309
AsnAsp: 1.917 ± 0.721
AsnGlu: 2.556 ± 0.959
AsnPhe: 5.751 ± 4.498
AsnGly: 3.195 ± 1.284
AsnHis: 1.917 ± 0.721
AsnIle: 0.639 ± 0.619
AsnLys: 3.195 ± 0.91
AsnLeu: 3.834 ± 0.596
AsnMet: 0.0 ± 0.0
AsnAsn: 1.917 ± 0.74
AsnPro: 3.195 ± 0.91
AsnGln: 1.278 ± 1.268
AsnArg: 2.556 ± 1.17
AsnSer: 5.112 ± 2.201
AsnThr: 0.639 ± 0.406
AsnVal: 2.556 ± 1.694
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.473 ± 1.798
ProCys: 1.917 ± 0.721
ProAsp: 2.556 ± 0.959
ProGlu: 1.917 ± 1.287
ProPhe: 1.278 ± 0.812
ProGly: 1.917 ± 0.564
ProHis: 0.0 ± 0.0
ProIle: 1.917 ± 1.096
ProLys: 2.556 ± 0.377
ProLeu: 3.834 ± 1.868
ProMet: 0.0 ± 0.0
ProAsn: 2.556 ± 2.185
ProPro: 4.473 ± 2.595
ProGln: 0.639 ± 0.619
ProArg: 5.112 ± 1.585
ProSer: 4.473 ± 3.203
ProThr: 3.834 ± 0.661
ProVal: 5.112 ± 1.653
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.917 ± 1.024
GlnCys: 1.278 ± 0.812
GlnAsp: 1.278 ± 0.67
GlnGlu: 3.195 ± 0.543
GlnPhe: 2.556 ± 1.34
GlnGly: 2.556 ± 0.377
GlnHis: 1.278 ± 0.812
GlnIle: 1.278 ± 1.237
GlnLys: 2.556 ± 1.839
GlnLeu: 1.278 ± 0.548
GlnMet: 1.278 ± 0.548
GlnAsn: 2.556 ± 0.973
GlnPro: 1.278 ± 0.812
GlnGln: 2.556 ± 1.17
GlnArg: 0.0 ± 0.0
GlnSer: 1.278 ± 1.237
GlnThr: 3.195 ± 0.728
GlnVal: 1.278 ± 1.237
GlnTrp: 0.639 ± 0.619
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.834 ± 1.441
ArgCys: 1.278 ± 0.67
ArgAsp: 1.917 ± 0.74
ArgGlu: 1.917 ± 0.74
ArgPhe: 1.917 ± 1.111
ArgGly: 5.112 ± 1.184
ArgHis: 1.278 ± 0.548
ArgIle: 1.278 ± 0.67
ArgLys: 2.556 ± 1.361
ArgLeu: 5.751 ± 0.83
ArgMet: 1.278 ± 0.548
ArgAsn: 3.834 ± 0.661
ArgPro: 2.556 ± 0.959
ArgGln: 1.917 ± 0.564
ArgArg: 10.224 ± 2.655
ArgSer: 5.112 ± 1.161
ArgThr: 1.917 ± 0.721
ArgVal: 7.029 ± 1.075
ArgTrp: 1.278 ± 0.67
ArgTyr: 2.556 ± 1.329
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.112 ± 1.733
SerCys: 1.917 ± 1.287
SerAsp: 5.112 ± 2.573
SerGlu: 3.834 ± 1.504
SerPhe: 1.917 ± 0.74
SerGly: 5.112 ± 1.906
SerHis: 1.278 ± 0.548
SerIle: 3.195 ± 1.238
SerLys: 9.585 ± 2.536
SerLeu: 5.112 ± 1.653
SerMet: 1.278 ± 0.967
SerAsn: 3.195 ± 1.434
SerPro: 7.029 ± 3.478
SerGln: 1.278 ± 1.237
SerArg: 5.751 ± 0.977
SerSer: 3.195 ± 2.347
SerThr: 5.112 ± 2.23
SerVal: 7.029 ± 2.766
SerTrp: 0.0 ± 0.0
SerTyr: 4.473 ± 1.433
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.834 ± 0.596
ThrCys: 1.278 ± 0.812
ThrAsp: 0.639 ± 0.619
ThrGlu: 3.195 ± 0.728
ThrPhe: 2.556 ± 1.329
ThrGly: 2.556 ± 0.377
ThrHis: 0.0 ± 0.0
ThrIle: 6.39 ± 1.456
ThrLys: 5.112 ± 1.736
ThrLeu: 2.556 ± 0.377
ThrMet: 0.639 ± 0.406
ThrAsn: 0.639 ± 0.406
ThrPro: 1.278 ± 0.973
ThrGln: 2.556 ± 1.839
ThrArg: 7.029 ± 3.006
ThrSer: 8.307 ± 2.138
ThrThr: 3.834 ± 1.298
ThrVal: 6.39 ± 3.031
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.278 ± 1.237
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.029 ± 3.034
ValCys: 1.917 ± 0.564
ValAsp: 3.195 ± 1.619
ValGlu: 7.029 ± 1.04
ValPhe: 7.029 ± 1.522
ValGly: 10.863 ± 1.854
ValHis: 3.195 ± 1.054
ValIle: 5.751 ± 0.981
ValLys: 5.112 ± 1.113
ValLeu: 7.668 ± 1.458
ValMet: 1.917 ± 0.721
ValAsn: 3.195 ± 0.978
ValPro: 1.278 ± 0.812
ValGln: 1.917 ± 0.721
ValArg: 2.556 ± 1.233
ValSer: 3.195 ± 1.791
ValThr: 3.195 ± 0.728
ValVal: 4.473 ± 1.647
ValTrp: 0.0 ± 0.0
ValTyr: 0.639 ± 0.406
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.639 ± 0.619
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.639 ± 0.406
TrpPhe: 0.0 ± 0.0
TrpGly: 1.917 ± 0.857
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 2.556 ± 0.959
TrpMet: 0.0 ± 0.0
TrpAsn: 1.278 ± 0.973
TrpPro: 0.0 ± 0.0
TrpGln: 0.639 ± 0.406
TrpArg: 1.917 ± 0.564
TrpSer: 0.639 ± 0.406
TrpThr: 0.639 ± 0.406
TrpVal: 1.278 ± 0.812
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.639 ± 0.406
TyrCys: 0.0 ± 0.0
TyrAsp: 1.278 ± 0.67
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.639 ± 0.406
TyrGly: 2.556 ± 1.061
TyrHis: 0.0 ± 0.0
TyrIle: 0.639 ± 0.406
TyrLys: 2.556 ± 1.096
TyrLeu: 2.556 ± 0.377
TyrMet: 0.0 ± 0.0
TyrAsn: 3.195 ± 0.728
TyrPro: 0.639 ± 0.619
TyrGln: 2.556 ± 1.116
TyrArg: 1.278 ± 0.812
TyrSer: 3.195 ± 0.718
TyrThr: 1.917 ± 1.218
TyrVal: 1.917 ± 1.096
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.917 ± 1.532
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.639 ± 0.406
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1566 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
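One way such an error estimate could be obtained is sketched below. The page gives no details beyond the note above, so the whole procedure is an assumption: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation of the replicates is reported as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MKVLAAGLK", "MAAGXK", "AAAA"]  # hypothetical sequences
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Under this scheme a dipeptide that never occurs gets an error of exactly zero, which would be consistent with the 0.0 ± 0.0 entries in the table.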

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski