Amino acid dipeptide frequency for Prunus geminivirus A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
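
As an illustration of the idea, here is a minimal Python sketch of how such frequencies could be computed. The function name `dipeptide_per_mille` and the counting convention (overlapping pairs, counted within each protein and never across protein boundaries, with any non-standard residue mapped to X for Xaa) are assumptions made for this example; the exact convention used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counts overlapping adjacent pairs inside each
    protein and normalises by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 along the sequence.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral sequences.
freqs = dipeptide_per_mille(["MARRS", "RRK"])
```

In this toy input, "MARRS" contributes four pairs and "RRK" two, so `freqs["RR"]` is two out of six pairs expressed in per mille, and no pair is formed from the last residue of one protein and the first residue of the next.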

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
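
The arithmetic behind the expected value and the per mille unit can be sketched in a few lines of Python. The helper names below are hypothetical, and the assumption that the red/blue marking simply compares each value with the uniform expectation for 400 dipeptides is inferred from the text above, not confirmed by the source.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


expected = expected_per_mille(20)           # 400 possible dipeptides
expected_with_xaa = expected_per_mille(21)  # 441 possible dipeptides

# Two values taken from the table on this page.
arg_arg = classify(17.986, expected)
ala_ala = classify(1.799, expected)
```

With 400 possible dipeptides the expectation is 2.5‰, so ArgArg (17.986‰) falls above it and AlaAla (1.799‰) below it.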

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.799 ± 0.857
AlaCys: 1.799 ± 0.857
AlaAsp: 0.899 ± 0.825
AlaGlu: 4.496 ± 1.751
AlaPhe: 3.597 ± 2.113
AlaGly: 0.899 ± 0.734
AlaHis: 0.0 ± 0.0
AlaIle: 2.698 ± 1.809
AlaLys: 0.0 ± 0.0
AlaLeu: 5.396 ± 2.541
AlaMet: 0.0 ± 0.0
AlaAsn: 2.698 ± 1.045
AlaPro: 0.0 ± 0.0
AlaGln: 0.899 ± 0.734
AlaArg: 2.698 ± 1.207
AlaSer: 3.597 ± 1.018
AlaThr: 4.496 ± 2.338
AlaVal: 1.799 ± 0.857
AlaTrp: 0.899 ± 0.734
AlaTyr: 0.899 ± 0.798
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.899 ± 0.703
CysCys: 0.899 ± 0.825
CysAsp: 0.0 ± 0.0
CysGlu: 0.899 ± 0.825
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.899 ± 0.734
CysLys: 0.899 ± 0.912
CysLeu: 2.698 ± 1.438
CysMet: 0.0 ± 0.0
CysAsn: 0.899 ± 0.912
CysPro: 0.899 ± 0.734
CysGln: 1.799 ± 1.196
CysArg: 0.0 ± 0.0
CysSer: 0.899 ± 0.893
CysThr: 1.799 ± 1.65
CysVal: 1.799 ± 1.161
CysTrp: 0.0 ± 0.0
CysTyr: 0.899 ± 0.825
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.698 ± 1.728
AspCys: 0.0 ± 0.0
AspAsp: 6.295 ± 2.442
AspGlu: 4.496 ± 3.073
AspPhe: 2.698 ± 1.547
AspGly: 5.396 ± 1.514
AspHis: 0.899 ± 0.734
AspIle: 5.396 ± 1.614
AspLys: 0.899 ± 0.912
AspLeu: 6.295 ± 3.407
AspMet: 1.799 ± 1.379
AspAsn: 2.698 ± 1.207
AspPro: 2.698 ± 1.014
AspGln: 1.799 ± 1.073
AspArg: 2.698 ± 1.365
AspSer: 3.597 ± 2.019
AspThr: 3.597 ± 1.308
AspVal: 7.194 ± 1.8
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.799 ± 1.65
GluCys: 0.0 ± 0.0
GluAsp: 4.496 ± 2.134
GluGlu: 4.496 ± 2.453
GluPhe: 3.597 ± 1.453
GluGly: 1.799 ± 1.109
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 6.295 ± 2.213
GluLeu: 1.799 ± 0.926
GluMet: 0.899 ± 0.893
GluAsn: 1.799 ± 1.785
GluPro: 1.799 ± 1.469
GluGln: 0.0 ± 0.0
GluArg: 4.496 ± 1.177
GluSer: 4.496 ± 0.79
GluThr: 7.194 ± 1.536
GluVal: 3.597 ± 2.129
GluTrp: 1.799 ± 1.65
GluTyr: 7.194 ± 3.799
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.799 ± 1.469
PheCys: 0.899 ± 0.825
PheAsp: 1.799 ± 0.918
PheGlu: 0.899 ± 0.825
PhePhe: 4.496 ± 2.003
PheGly: 2.698 ± 1.492
PheHis: 2.698 ± 1.13
PheIle: 0.0 ± 0.0
PheLys: 2.698 ± 1.207
PheLeu: 3.597 ± 1.406
PheMet: 0.899 ± 0.734
PheAsn: 2.698 ± 1.492
PhePro: 0.899 ± 0.703
PheGln: 1.799 ± 1.196
PheArg: 2.698 ± 1.446
PheSer: 4.496 ± 1.492
PheThr: 4.496 ± 1.352
PheVal: 2.698 ± 1.254
PheTrp: 0.899 ± 0.703
PheTyr: 0.899 ± 0.703
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.899 ± 0.703
GlyCys: 1.799 ± 1.109
GlyAsp: 2.698 ± 1.468
GlyGlu: 3.597 ± 2.129
GlyPhe: 0.0 ± 0.0
GlyGly: 6.295 ± 2.578
GlyHis: 1.799 ± 0.857
GlyIle: 4.496 ± 1.651
GlyLys: 1.799 ± 1.113
GlyLeu: 4.496 ± 1.742
GlyMet: 1.799 ± 1.595
GlyAsn: 2.698 ± 1.334
GlyPro: 1.799 ± 1.824
GlyGln: 1.799 ± 1.191
GlyArg: 1.799 ± 1.406
GlySer: 8.094 ± 2.672
GlyThr: 1.799 ± 1.406
GlyVal: 2.698 ± 2.109
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.799 ± 1.595
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.799 ± 1.036
HisCys: 0.899 ± 0.825
HisAsp: 0.899 ± 0.703
HisGlu: 2.698 ± 1.443
HisPhe: 0.899 ± 0.734
HisGly: 1.799 ± 1.016
HisHis: 0.899 ± 0.734
HisIle: 0.899 ± 0.734
HisLys: 0.0 ± 0.0
HisLeu: 0.899 ± 0.734
HisMet: 0.0 ± 0.0
HisAsn: 1.799 ± 1.824
HisPro: 0.899 ± 0.734
HisGln: 1.799 ± 0.919
HisArg: 2.698 ± 2.393
HisSer: 1.799 ± 0.906
HisThr: 3.597 ± 1.582
HisVal: 1.799 ± 0.918
HisTrp: 0.0 ± 0.0
HisTyr: 0.899 ± 0.734
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.597 ± 1.167
IleCys: 0.0 ± 0.0
IleAsp: 3.597 ± 1.06
IleGlu: 4.496 ± 1.329
IlePhe: 4.496 ± 2.279
IleGly: 0.899 ± 0.703
IleHis: 0.0 ± 0.0
IleIle: 2.698 ± 1.207
IleLys: 2.698 ± 1.045
IleLeu: 6.295 ± 1.294
IleMet: 1.799 ± 0.926
IleAsn: 0.899 ± 0.912
IlePro: 3.597 ± 1.825
IleGln: 1.799 ± 1.785
IleArg: 2.698 ± 1.583
IleSer: 4.496 ± 2.438
IleThr: 4.496 ± 1.683
IleVal: 1.799 ± 1.196
IleTrp: 1.799 ± 1.595
IleTyr: 2.698 ± 1.502
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.698 ± 1.809
LysCys: 0.899 ± 0.825
LysAsp: 4.496 ± 1.959
LysGlu: 2.698 ± 1.85
LysPhe: 2.698 ± 1.041
LysGly: 2.698 ± 1.256
LysHis: 2.698 ± 1.404
LysIle: 2.698 ± 0.994
LysLys: 4.496 ± 3.176
LysLeu: 0.899 ± 0.703
LysMet: 0.899 ± 0.825
LysAsn: 2.698 ± 0.93
LysPro: 4.496 ± 1.314
LysGln: 2.698 ± 1.045
LysArg: 6.295 ± 2.55
LysSer: 3.597 ± 1.58
LysThr: 3.597 ± 1.348
LysVal: 0.0 ± 0.0
LysTrp: 0.899 ± 0.734
LysTyr: 0.899 ± 0.912
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.899 ± 0.825
LeuCys: 0.899 ± 0.912
LeuAsp: 4.496 ± 1.383
LeuGlu: 6.295 ± 2.748
LeuPhe: 3.597 ± 0.811
LeuGly: 6.295 ± 2.228
LeuHis: 2.698 ± 1.492
LeuIle: 5.396 ± 2.165
LeuLys: 5.396 ± 2.147
LeuLeu: 6.295 ± 2.193
LeuMet: 0.899 ± 0.798
LeuAsn: 3.597 ± 1.369
LeuPro: 7.194 ± 1.857
LeuGln: 2.698 ± 1.404
LeuArg: 0.899 ± 0.734
LeuSer: 4.496 ± 2.976
LeuThr: 5.396 ± 2.235
LeuVal: 6.295 ± 1.691
LeuTrp: 0.899 ± 0.825
LeuTyr: 4.496 ± 1.751
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.799 ± 0.906
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 2.698 ± 1.656
MetPhe: 0.899 ± 0.798
MetGly: 0.899 ± 0.798
MetHis: 0.899 ± 0.734
MetIle: 0.899 ± 0.703
MetLys: 0.0 ± 0.0
MetLeu: 0.899 ± 0.798
MetMet: 2.698 ± 2.317
MetAsn: 0.0 ± 0.0
MetPro: 1.799 ± 0.918
MetGln: 0.0 ± 0.0
MetArg: 0.899 ± 0.825
MetSer: 1.799 ± 1.785
MetThr: 0.899 ± 0.798
MetVal: 5.396 ± 2.217
MetTrp: 0.899 ± 0.912
MetTyr: 1.799 ± 1.196
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.899 ± 0.703
AsnCys: 0.0 ± 0.0
AsnAsp: 4.496 ± 1.759
AsnGlu: 0.899 ± 0.912
AsnPhe: 1.799 ± 0.906
AsnGly: 3.597 ± 2.113
AsnHis: 0.0 ± 0.0
AsnIle: 4.496 ± 1.329
AsnLys: 0.0 ± 0.0
AsnLeu: 4.496 ± 2.827
AsnMet: 0.0 ± 0.0
AsnAsn: 0.899 ± 0.734
AsnPro: 1.799 ± 1.036
AsnGln: 2.698 ± 0.994
AsnArg: 4.496 ± 2.654
AsnSer: 2.698 ± 1.8
AsnThr: 5.396 ± 3.312
AsnVal: 4.496 ± 2.179
AsnTrp: 0.899 ± 0.912
AsnTyr: 2.698 ± 1.809
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.698 ± 1.207
ProCys: 2.698 ± 1.256
ProAsp: 4.496 ± 2.469
ProGlu: 2.698 ± 1.656
ProPhe: 0.899 ± 0.912
ProGly: 0.899 ± 0.825
ProHis: 3.597 ± 1.601
ProIle: 2.698 ± 1.13
ProLys: 1.799 ± 0.918
ProLeu: 0.899 ± 0.825
ProMet: 1.799 ± 0.919
ProAsn: 0.899 ± 0.734
ProPro: 2.698 ± 1.404
ProGln: 2.698 ± 0.827
ProArg: 4.496 ± 1.177
ProSer: 4.496 ± 1.406
ProThr: 4.496 ± 2.465
ProVal: 1.799 ± 0.918
ProTrp: 0.899 ± 0.912
ProTyr: 1.799 ± 0.919
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.899 ± 0.734
GlnCys: 2.698 ± 1.404
GlnAsp: 1.799 ± 0.919
GlnGlu: 3.597 ± 1.055
GlnPhe: 1.799 ± 0.906
GlnGly: 0.899 ± 0.825
GlnHis: 1.799 ± 1.073
GlnIle: 0.0 ± 0.0
GlnLys: 0.899 ± 0.893
GlnLeu: 1.799 ± 1.189
GlnMet: 0.899 ± 0.77
GlnAsn: 3.597 ± 2.497
GlnPro: 0.899 ± 0.893
GlnGln: 0.899 ± 0.734
GlnArg: 4.496 ± 1.404
GlnSer: 1.799 ± 1.109
GlnThr: 2.698 ± 1.254
GlnVal: 0.899 ± 0.798
GlnTrp: 1.799 ± 1.036
GlnTyr: 2.698 ± 1.045
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.698 ± 2.109
ArgCys: 1.799 ± 1.595
ArgAsp: 1.799 ± 0.918
ArgGlu: 2.698 ± 2.203
ArgPhe: 1.799 ± 1.406
ArgGly: 0.899 ± 0.798
ArgHis: 2.698 ± 1.85
ArgIle: 2.698 ± 1.334
ArgLys: 8.094 ± 2.684
ArgLeu: 9.892 ± 1.446
ArgMet: 0.899 ± 0.893
ArgAsn: 4.496 ± 2.308
ArgPro: 4.496 ± 2.178
ArgGln: 0.899 ± 0.734
ArgArg: 17.986 ± 7.152
ArgSer: 6.295 ± 2.105
ArgThr: 1.799 ± 1.196
ArgVal: 3.597 ± 2.549
ArgTrp: 0.899 ± 0.893
ArgTyr: 1.799 ± 0.857
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.799 ± 1.016
SerCys: 0.899 ± 0.825
SerAsp: 7.194 ± 1.961
SerGlu: 3.597 ± 2.381
SerPhe: 4.496 ± 1.404
SerGly: 7.194 ± 2.438
SerHis: 1.799 ± 0.919
SerIle: 3.597 ± 2.019
SerLys: 6.295 ± 2.583
SerLeu: 4.496 ± 2.772
SerMet: 2.698 ± 2.266
SerAsn: 1.799 ± 0.926
SerPro: 3.597 ± 2.497
SerGln: 6.295 ± 2.539
SerArg: 6.295 ± 1.336
SerSer: 15.288 ± 8.254
SerThr: 6.295 ± 3.263
SerVal: 4.496 ± 1.383
SerTrp: 0.899 ± 0.893
SerTyr: 3.597 ± 1.06
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.799 ± 0.918
ThrCys: 0.0 ± 0.0
ThrAsp: 3.597 ± 2.085
ThrGlu: 2.698 ± 1.583
ThrPhe: 1.799 ± 1.785
ThrGly: 6.295 ± 3.451
ThrHis: 1.799 ± 0.919
ThrIle: 1.799 ± 1.109
ThrLys: 4.496 ± 1.096
ThrLeu: 6.295 ± 2.398
ThrMet: 0.899 ± 0.759
ThrAsn: 3.597 ± 1.672
ThrPro: 3.597 ± 2.031
ThrGln: 4.496 ± 2.268
ThrArg: 3.597 ± 1.811
ThrSer: 11.691 ± 7.556
ThrThr: 1.799 ± 1.785
ThrVal: 2.698 ± 1.207
ThrTrp: 0.899 ± 0.734
ThrTyr: 3.597 ± 1.354
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.597 ± 1.308
ValCys: 0.0 ± 0.0
ValAsp: 3.597 ± 1.44
ValGlu: 0.899 ± 0.703
ValPhe: 0.899 ± 0.734
ValGly: 1.799 ± 1.595
ValHis: 0.899 ± 0.734
ValIle: 7.194 ± 1.408
ValLys: 5.396 ± 1.752
ValLeu: 6.295 ± 1.344
ValMet: 1.799 ± 1.595
ValAsn: 4.496 ± 1.054
ValPro: 2.698 ± 1.529
ValGln: 0.0 ± 0.0
ValArg: 3.597 ± 1.836
ValSer: 8.993 ± 1.925
ValThr: 2.698 ± 1.492
ValVal: 7.194 ± 3.101
ValTrp: 0.0 ± 0.0
ValTyr: 0.899 ± 0.798
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.698 ± 2.737
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.899 ± 0.825
TrpPhe: 1.799 ± 1.65
TrpGly: 0.0 ± 0.0
TrpHis: 0.899 ± 0.912
TrpIle: 1.799 ± 1.016
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.899 ± 0.734
TrpPro: 0.899 ± 0.734
TrpGln: 0.899 ± 0.912
TrpArg: 1.799 ± 1.595
TrpSer: 0.899 ± 0.734
TrpThr: 0.0 ± 0.0
TrpVal: 0.899 ± 0.734
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.899 ± 0.893
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.799 ± 0.926
TyrCys: 0.0 ± 0.0
TyrAsp: 4.496 ± 3.121
TyrGlu: 1.799 ± 1.036
TyrPhe: 1.799 ± 0.906
TyrGly: 0.899 ± 0.825
TyrHis: 0.899 ± 0.734
TyrIle: 4.496 ± 1.74
TyrLys: 0.899 ± 0.825
TyrLeu: 5.396 ± 1.119
TyrMet: 3.597 ± 0.972
TyrAsn: 3.597 ± 1.055
TyrPro: 2.698 ± 1.443
TyrGln: 0.899 ± 0.734
TyrArg: 3.597 ± 1.213
TyrSer: 0.0 ± 0.0
TyrThr: 0.899 ± 0.893
TyrVal: 1.799 ± 0.919
TyrTrp: 0.899 ± 0.912
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1113 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
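
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the reported error is taken to be the sample standard deviation across replicates. The page does not say which spread statistic or counting convention the database uses, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of a dipeptide frequency under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the sample standard deviation.
    return statistics.stdev(estimates)


# Toy sequences, not the actual proteome.
toy = ["MARRS", "RRK", "MKVLA"]
rr_error = bootstrap_error(toy, "RR")
ww_error = bootstrap_error(toy, "WW")
```

Under this scheme, a dipeptide that is absent from every protein has a frequency of zero in every replicate and therefore an error of zero, which matches the `0.0 ± 0.0` entries in the table.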

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski