Amino acid dipeptide frequency for Sweet potato leaf curl Spain virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
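The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the pipeline used by Proteome-pI: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be in one-letter code, pairs are assumed never to span two proteins, and the toy sequences are invented.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs inside one protein (assumption: pairs
        # are not counted across protein boundaries).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 pairs, including those that never occur (0.0).
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq, expected=1000.0 / 400):
    """Compare a per-mille frequency with the uniform expectation (2.5 per mille)."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Toy sequences, invented for illustration only.
toy = ["MAAKL", "AAKX"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

With the real proteome one would pass the six protein sequences instead of `toy`; the red/blue marking in the table then corresponds to the "over"/"under" labels.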

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
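As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table (9.919‰) into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


ala_ala = 9.919                 # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400          # uniform expectation over 400 dipeptides: 2.5 per mille

fraction = per_mille_to_fraction(ala_ala)   # about 0.009919
percent = per_mille_to_percent(ala_ala)     # about 0.9919 %
ratio = ala_ala / uniform                   # about 4 times the uniform expectation
print(fraction, percent, ratio)
```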

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.919 ± 3.036
AlaCys: 0.902 ± 0.912
AlaAsp: 0.902 ± 1.276
AlaGlu: 4.509 ± 2.053
AlaPhe: 1.803 ± 1.087
AlaGly: 1.803 ± 1.311
AlaHis: 0.902 ± 0.656
AlaIle: 1.803 ± 1.021
AlaLys: 5.41 ± 1.94
AlaLeu: 6.312 ± 2.291
AlaMet: 0.0 ± 0.0
AlaAsn: 1.803 ± 1.039
AlaPro: 2.705 ± 1.791
AlaGln: 3.607 ± 1.015
AlaArg: 6.312 ± 1.71
AlaSer: 5.41 ± 2.408
AlaThr: 0.0 ± 0.0
AlaVal: 4.509 ± 1.989
AlaTrp: 0.902 ± 0.656
AlaTyr: 1.803 ± 1.404
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.803 ± 1.894
CysAsp: 0.902 ± 0.656
CysGlu: 0.0 ± 0.0
CysPhe: 0.902 ± 1.276
CysGly: 2.705 ± 1.644
CysHis: 0.0 ± 0.0
CysIle: 0.902 ± 0.753
CysLys: 3.607 ± 0.935
CysLeu: 0.902 ± 0.92
CysMet: 0.902 ± 0.947
CysAsn: 2.705 ± 1.389
CysPro: 4.509 ± 3.362
CysGln: 0.0 ± 0.0
CysArg: 1.803 ± 0.796
CysSer: 2.705 ± 1.17
CysThr: 2.705 ± 1.644
CysVal: 1.803 ± 1.505
CysTrp: 0.0 ± 0.0
CysTyr: 0.902 ± 0.912
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.705 ± 1.967
AspCys: 1.803 ± 1.363
AspAsp: 3.607 ± 0.935
AspGlu: 0.902 ± 0.947
AspPhe: 1.803 ± 1.008
AspGly: 3.607 ± 1.829
AspHis: 2.705 ± 1.86
AspIle: 1.803 ± 0.88
AspLys: 1.803 ± 1.311
AspLeu: 4.509 ± 1.392
AspMet: 0.0 ± 0.0
AspAsn: 1.803 ± 1.505
AspPro: 1.803 ± 1.008
AspGln: 0.0 ± 0.0
AspArg: 5.41 ± 2.202
AspSer: 4.509 ± 1.591
AspThr: 0.902 ± 0.656
AspVal: 4.509 ± 1.521
AspTrp: 3.607 ± 1.715
AspTyr: 1.803 ± 1.087
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.41 ± 1.728
GluCys: 0.902 ± 0.947
GluAsp: 0.902 ± 0.656
GluGlu: 7.214 ± 1.846
GluPhe: 3.607 ± 1.015
GluGly: 4.509 ± 1.805
GluHis: 0.0 ± 0.0
GluIle: 3.607 ± 2.662
GluLys: 2.705 ± 1.456
GluLeu: 3.607 ± 2.157
GluMet: 0.0 ± 0.0
GluAsn: 4.509 ± 1.654
GluPro: 4.509 ± 1.625
GluGln: 3.607 ± 1.572
GluArg: 0.902 ± 0.656
GluSer: 2.705 ± 2.13
GluThr: 3.607 ± 1.649
GluVal: 0.902 ± 0.656
GluTrp: 0.902 ± 0.656
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.902 ± 0.753
PheCys: 1.803 ± 1.253
PheAsp: 0.902 ± 0.656
PheGlu: 2.705 ± 0.914
PhePhe: 1.803 ± 1.021
PheGly: 0.0 ± 0.0
PheHis: 2.705 ± 0.779
PheIle: 2.705 ± 1.224
PheLys: 4.509 ± 1.392
PheLeu: 2.705 ± 1.25
PheMet: 0.902 ± 0.656
PheAsn: 2.705 ± 1.249
PhePro: 0.0 ± 0.0
PheGln: 5.41 ± 1.933
PheArg: 0.902 ± 0.92
PheSer: 3.607 ± 1.015
PheThr: 3.607 ± 1.717
PheVal: 0.902 ± 0.753
PheTrp: 0.902 ± 0.656
PheTyr: 0.902 ± 0.753
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.803 ± 0.88
GlyCys: 2.705 ± 1.86
GlyAsp: 3.607 ± 1.313
GlyGlu: 3.607 ± 1.493
GlyPhe: 2.705 ± 1.664
GlyGly: 4.509 ± 1.989
GlyHis: 0.902 ± 0.656
GlyIle: 2.705 ± 0.779
GlyLys: 5.41 ± 1.793
GlyLeu: 1.803 ± 1.008
GlyMet: 0.902 ± 0.884
GlyAsn: 3.607 ± 2.58
GlyPro: 3.607 ± 1.591
GlyGln: 4.509 ± 1.841
GlyArg: 2.705 ± 1.249
GlySer: 2.705 ± 1.224
GlyThr: 4.509 ± 1.935
GlyVal: 0.902 ± 0.912
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.803 ± 1.363
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.705 ± 1.356
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.803 ± 0.857
HisPhe: 1.803 ± 1.311
HisGly: 0.902 ± 0.912
HisHis: 0.0 ± 0.0
HisIle: 0.902 ± 1.276
HisLys: 2.705 ± 2.258
HisLeu: 4.509 ± 1.482
HisMet: 0.902 ± 1.015
HisAsn: 2.705 ± 1.224
HisPro: 1.803 ± 0.796
HisGln: 0.902 ± 0.912
HisArg: 2.705 ± 1.356
HisSer: 1.803 ± 0.796
HisThr: 1.803 ± 1.021
HisVal: 2.705 ± 0.849
HisTrp: 1.803 ± 1.588
HisTyr: 0.902 ± 0.947
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.902 ± 0.753
IleCys: 1.803 ± 0.88
IleAsp: 3.607 ± 1.508
IleGlu: 0.902 ± 0.656
IlePhe: 2.705 ± 1.967
IleGly: 1.803 ± 1.84
IleHis: 0.902 ± 0.912
IleIle: 4.509 ± 1.006
IleLys: 3.607 ± 1.015
IleLeu: 3.607 ± 1.486
IleMet: 0.902 ± 0.92
IleAsn: 0.902 ± 0.656
IlePro: 7.214 ± 3.159
IleGln: 1.803 ± 1.311
IleArg: 5.41 ± 1.436
IleSer: 9.017 ± 2.72
IleThr: 3.607 ± 2.046
IleVal: 2.705 ± 0.849
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.902 ± 0.912
LysAsp: 5.41 ± 1.237
LysGlu: 5.41 ± 1.496
LysPhe: 5.41 ± 1.664
LysGly: 4.509 ± 1.496
LysHis: 1.803 ± 1.253
LysIle: 1.803 ± 0.796
LysLys: 4.509 ± 1.699
LysLeu: 3.607 ± 1.411
LysMet: 0.902 ± 0.753
LysAsn: 3.607 ± 1.829
LysPro: 2.705 ± 1.54
LysGln: 0.0 ± 0.0
LysArg: 3.607 ± 1.628
LysSer: 4.509 ± 1.531
LysThr: 3.607 ± 1.386
LysVal: 4.509 ± 1.694
LysTrp: 0.902 ± 0.92
LysTyr: 3.607 ± 1.682
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.803 ± 0.88
LeuCys: 3.607 ± 2.077
LeuAsp: 4.509 ± 1.625
LeuGlu: 2.705 ± 1.245
LeuPhe: 1.803 ± 0.88
LeuGly: 3.607 ± 0.915
LeuHis: 3.607 ± 1.372
LeuIle: 2.705 ± 1.57
LeuLys: 5.41 ± 0.96
LeuLeu: 4.509 ± 2.901
LeuMet: 1.803 ± 1.021
LeuAsn: 4.509 ± 1.215
LeuPro: 2.705 ± 1.205
LeuGln: 5.41 ± 2.409
LeuArg: 5.41 ± 2.801
LeuSer: 2.705 ± 1.205
LeuThr: 4.509 ± 2.571
LeuVal: 3.607 ± 1.015
LeuTrp: 2.705 ± 1.205
LeuTyr: 3.607 ± 1.55
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 5.41 ± 1.734
MetGlu: 0.902 ± 1.276
MetPhe: 0.0 ± 0.0
MetGly: 3.607 ± 1.41
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.803 ± 1.021
MetLeu: 1.803 ± 1.008
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.902 ± 0.656
MetGln: 0.902 ± 0.753
MetArg: 0.902 ± 0.753
MetSer: 1.803 ± 1.363
MetThr: 0.902 ± 0.753
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.803 ± 1.021
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 8.115 ± 1.265
AsnCys: 1.803 ± 1.039
AsnAsp: 0.902 ± 0.656
AsnGlu: 0.902 ± 0.753
AsnPhe: 0.902 ± 0.753
AsnGly: 0.902 ± 1.276
AsnHis: 4.509 ± 2.904
AsnIle: 3.607 ± 1.829
AsnLys: 1.803 ± 0.796
AsnLeu: 4.509 ± 1.945
AsnMet: 0.902 ± 0.709
AsnAsn: 2.705 ± 1.122
AsnPro: 5.41 ± 1.206
AsnGln: 0.902 ± 0.656
AsnArg: 1.803 ± 1.84
AsnSer: 6.312 ± 3.483
AsnThr: 0.902 ± 0.656
AsnVal: 4.509 ± 1.542
AsnTrp: 2.705 ± 1.122
AsnTyr: 1.803 ± 1.311
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.803 ± 1.404
ProCys: 0.902 ± 0.753
ProAsp: 3.607 ± 1.948
ProGlu: 7.214 ± 2.811
ProPhe: 2.705 ± 1.25
ProGly: 2.705 ± 1.488
ProHis: 2.705 ± 1.249
ProIle: 4.509 ± 1.837
ProLys: 2.705 ± 0.914
ProLeu: 2.705 ± 1.806
ProMet: 1.803 ± 1.293
ProAsn: 3.607 ± 0.915
ProPro: 3.607 ± 1.715
ProGln: 2.705 ± 1.25
ProArg: 2.705 ± 1.644
ProSer: 3.607 ± 1.715
ProThr: 5.41 ± 2.154
ProVal: 1.803 ± 0.796
ProTrp: 0.0 ± 0.0
ProTyr: 3.607 ± 2.297
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.803 ± 1.505
GlnCys: 2.705 ± 1.25
GlnAsp: 0.902 ± 0.753
GlnGlu: 4.509 ± 2.009
GlnPhe: 1.803 ± 0.88
GlnGly: 0.0 ± 0.0
GlnHis: 1.803 ± 1.471
GlnIle: 3.607 ± 1.791
GlnLys: 2.705 ± 1.389
GlnLeu: 1.803 ± 1.363
GlnMet: 0.902 ± 0.753
GlnAsn: 0.902 ± 0.947
GlnPro: 2.705 ± 1.644
GlnGln: 3.607 ± 2.518
GlnArg: 2.705 ± 1.224
GlnSer: 3.607 ± 1.822
GlnThr: 3.607 ± 1.015
GlnVal: 2.705 ± 0.779
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.803 ± 1.311
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.41 ± 1.67
ArgCys: 1.803 ± 1.008
ArgAsp: 2.705 ± 1.506
ArgGlu: 2.705 ± 1.25
ArgPhe: 1.803 ± 0.796
ArgGly: 4.509 ± 1.369
ArgHis: 1.803 ± 1.331
ArgIle: 6.312 ± 2.249
ArgLys: 2.705 ± 1.86
ArgLeu: 7.214 ± 2.592
ArgMet: 4.509 ± 2.215
ArgAsn: 0.902 ± 0.92
ArgPro: 3.607 ± 1.23
ArgGln: 0.0 ± 0.0
ArgArg: 6.312 ± 4.361
ArgSer: 5.41 ± 1.916
ArgThr: 2.705 ± 1.791
ArgVal: 2.705 ± 1.506
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.803 ± 1.894
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.214 ± 1.542
SerCys: 0.902 ± 0.912
SerAsp: 4.509 ± 1.531
SerGlu: 0.902 ± 0.947
SerPhe: 1.803 ± 1.311
SerGly: 4.509 ± 1.256
SerHis: 4.509 ± 1.274
SerIle: 5.41 ± 2.756
SerLys: 4.509 ± 1.629
SerLeu: 6.312 ± 3.416
SerMet: 0.902 ± 0.793
SerAsn: 4.509 ± 1.521
SerPro: 5.41 ± 2.232
SerGln: 4.509 ± 2.426
SerArg: 5.41 ± 2.325
SerSer: 9.919 ± 2.508
SerThr: 4.509 ± 4.025
SerVal: 3.607 ± 1.98
SerTrp: 0.0 ± 0.0
SerTyr: 1.803 ± 0.796
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.312 ± 1.041
ThrCys: 0.902 ± 1.276
ThrAsp: 0.902 ± 0.912
ThrGlu: 0.902 ± 0.947
ThrPhe: 2.705 ± 1.857
ThrGly: 4.509 ± 2.503
ThrHis: 2.705 ± 1.513
ThrIle: 3.607 ± 1.846
ThrLys: 0.902 ± 0.912
ThrLeu: 3.607 ± 1.682
ThrMet: 0.902 ± 0.753
ThrAsn: 7.214 ± 1.891
ThrPro: 1.803 ± 1.087
ThrGln: 1.803 ± 1.471
ThrArg: 6.312 ± 1.46
ThrSer: 1.803 ± 2.551
ThrThr: 3.607 ± 1.722
ThrVal: 1.803 ± 0.796
ThrTrp: 0.902 ± 1.276
ThrTyr: 2.705 ± 1.403
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.902 ± 0.753
ValCys: 3.607 ± 2.173
ValAsp: 2.705 ± 1.967
ValGlu: 0.902 ± 0.92
ValPhe: 0.902 ± 0.656
ValGly: 1.803 ± 1.021
ValHis: 0.902 ± 0.92
ValIle: 3.607 ± 1.76
ValLys: 2.705 ± 1.456
ValLeu: 2.705 ± 0.779
ValMet: 0.902 ± 0.656
ValAsn: 2.705 ± 1.25
ValPro: 4.509 ± 2.836
ValGln: 2.705 ± 0.779
ValArg: 1.803 ± 1.008
ValSer: 4.509 ± 1.989
ValThr: 3.607 ± 2.297
ValVal: 0.0 ± 0.0
ValTrp: 3.607 ± 0.952
ValTyr: 3.607 ± 1.015
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.705 ± 1.967
TrpCys: 0.902 ± 0.912
TrpAsp: 0.902 ± 0.947
TrpGlu: 0.902 ± 0.947
TrpPhe: 0.0 ± 0.0
TrpGly: 1.803 ± 1.039
TrpHis: 0.0 ± 0.0
TrpIle: 1.803 ± 0.857
TrpLys: 0.902 ± 0.912
TrpLeu: 1.803 ± 0.796
TrpMet: 0.902 ± 0.753
TrpAsn: 0.902 ± 1.276
TrpPro: 0.0 ± 0.0
TrpGln: 0.902 ± 0.656
TrpArg: 0.902 ± 0.92
TrpSer: 0.902 ± 1.276
TrpThr: 0.902 ± 0.92
TrpVal: 0.902 ± 0.92
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.803 ± 0.857
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.902 ± 0.947
TyrCys: 0.0 ± 0.0
TyrAsp: 2.705 ± 1.506
TyrGlu: 4.509 ± 2.166
TyrPhe: 3.607 ± 0.935
TyrGly: 2.705 ± 0.914
TyrHis: 0.902 ± 0.656
TyrIle: 0.0 ± 0.0
TyrLys: 0.902 ± 0.912
TyrLeu: 2.705 ± 1.456
TyrMet: 0.902 ± 0.864
TyrAsn: 3.607 ± 2.107
TyrPro: 0.902 ± 0.656
TyrGln: 0.902 ± 0.753
TyrArg: 0.902 ± 0.947
TyrSer: 4.509 ± 1.119
TyrThr: 0.902 ± 0.753
TyrVal: 3.607 ± 1.508
TyrTrp: 0.902 ± 0.753
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1110 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
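A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: the page does not say which spread statistic is reported, so the sample standard deviation over 100 resamples is an assumption, as are the function names, the fixed seed, and the invented toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the resampling unit is
    the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)          # fixed seed for reproducibility (assumption)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, invented for illustration only.
toy = ["MAAKL", "AAKX", "MKLV"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f}")
```

With only a handful of proteins, as here, the resampled sets differ a lot from one another, which is consistent with the relatively large errors in the table.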

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski