Amino acid dipeptide frequency for Sweet potato leaf curl Georgia virus-[16]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
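The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the choice to count overlapping pairs within each protein only (never across protein boundaries), and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Expected frequency of each of the 400 standard dipeptides
# if all were equally likely: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each sequence,
    and any letter outside the 20 standard ones is treated as "X".
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1

    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the proteome above.
    freqs = dipeptide_per_mille(["AAC", "AC"])
    print(len(freqs), round(freqs["AC"], 3), UNIFORM_EXPECTED_PER_MILLE)
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table below, where absent dipeptides are listed as 0.0.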

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
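As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and percentages and compares them with the uniform baseline of 2.5‰. The helper names are hypothetical, and the simple above/below comparison is an assumption: the exact rule the page uses for its red and blue highlighting is not stated in the text.

```python
# Uniform baseline for the 400 standard dipeptides: 1/400 = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected baseline.

    Assumption: a plain greater-than / less-than comparison is used here.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Two values taken from the table: AlaAla (8.108) and AlaAsp (0.901).
    for name, value in [("AlaAla", 8.108), ("AlaAsp", 0.901)]:
        print(name, per_mille_to_fraction(value), classify(value))
```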

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.108AlaAla: 8.108 ± 2.269
0.0AlaCys: 0.0 ± 0.0
0.901AlaAsp: 0.901 ± 0.849
6.306AlaGlu: 6.306 ± 3.034
1.802AlaPhe: 1.802 ± 1.065
0.901AlaGly: 0.901 ± 0.659
0.901AlaHis: 0.901 ± 0.659
2.703AlaIle: 2.703 ± 1.146
6.306AlaLys: 6.306 ± 1.947
7.207AlaLeu: 7.207 ± 2.711
0.0AlaMet: 0.0 ± 0.0
1.802AlaAsn: 1.802 ± 1.081
3.604AlaPro: 3.604 ± 2.254
3.604AlaGln: 3.604 ± 1.01
3.604AlaArg: 3.604 ± 1.724
3.604AlaSer: 3.604 ± 1.819
0.901AlaThr: 0.901 ± 0.849
1.802AlaVal: 1.802 ± 0.725
0.901AlaTrp: 0.901 ± 0.659
0.901AlaTyr: 0.901 ± 0.778
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.802CysCys: 1.802 ± 1.961
0.901CysAsp: 0.901 ± 0.659
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.703CysGly: 2.703 ± 1.514
0.0CysHis: 0.0 ± 0.0
0.901CysIle: 0.901 ± 0.778
3.604CysLys: 3.604 ± 0.949
0.0CysLeu: 0.0 ± 0.0
0.901CysMet: 0.901 ± 0.981
2.703CysAsn: 2.703 ± 1.176
5.405CysPro: 5.405 ± 3.057
0.0CysGln: 0.0 ± 0.0
0.901CysArg: 0.901 ± 0.778
3.604CysSer: 3.604 ± 1.649
1.802CysThr: 1.802 ± 1.349
1.802CysVal: 1.802 ± 1.555
0.0CysTrp: 0.0 ± 0.0
0.901CysTyr: 0.901 ± 0.849
0.0CysXaa: 0.0 ± 0.0
Asp
0.901AspAla: 0.901 ± 0.659
2.703AspCys: 2.703 ± 2.03
2.703AspAsp: 2.703 ± 0.811
0.901AspGlu: 0.901 ± 0.981
1.802AspPhe: 1.802 ± 0.725
3.604AspGly: 3.604 ± 1.724
0.901AspHis: 0.901 ± 0.778
3.604AspIle: 3.604 ± 1.37
1.802AspLys: 1.802 ± 1.318
3.604AspLeu: 3.604 ± 1.253
0.0AspMet: 0.0 ± 0.0
2.703AspAsn: 2.703 ± 1.661
1.802AspPro: 1.802 ± 1.122
0.0AspGln: 0.0 ± 0.0
5.405AspArg: 5.405 ± 1.92
4.505AspSer: 4.505 ± 0.972
0.901AspThr: 0.901 ± 0.981
4.505AspVal: 4.505 ± 1.5
4.505AspTrp: 4.505 ± 1.929
2.703AspTyr: 2.703 ± 0.737
0.0AspXaa: 0.0 ± 0.0
Glu
4.505GluAla: 4.505 ± 1.716
0.901GluCys: 0.901 ± 0.981
1.802GluAsp: 1.802 ± 1.349
7.207GluGlu: 7.207 ± 1.847
2.703GluPhe: 2.703 ± 0.811
5.405GluGly: 5.405 ± 1.922
0.901GluHis: 0.901 ± 0.659
2.703GluIle: 2.703 ± 1.893
2.703GluLys: 2.703 ± 1.498
5.405GluLeu: 5.405 ± 3.988
0.0GluMet: 0.0 ± 0.0
2.703GluAsn: 2.703 ± 1.351
3.604GluPro: 3.604 ± 1.857
2.703GluGln: 2.703 ± 0.737
0.0GluArg: 0.0 ± 0.0
3.604GluSer: 3.604 ± 1.846
2.703GluThr: 2.703 ± 0.976
1.802GluVal: 1.802 ± 1.103
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.703PheAla: 2.703 ± 1.278
0.901PheCys: 0.901 ± 0.659
0.901PheAsp: 0.901 ± 0.659
1.802PheGlu: 1.802 ± 1.122
2.703PhePhe: 2.703 ± 1.819
0.0PheGly: 0.0 ± 0.0
2.703PheHis: 2.703 ± 0.811
2.703PheIle: 2.703 ± 1.198
7.207PheLys: 7.207 ± 2.054
4.505PheLeu: 4.505 ± 2.125
0.901PheMet: 0.901 ± 0.659
1.802PheAsn: 1.802 ± 0.894
0.0PhePro: 0.0 ± 0.0
2.703PheGln: 2.703 ± 1.977
3.604PheArg: 3.604 ± 1.909
2.703PheSer: 2.703 ± 0.811
2.703PheThr: 2.703 ± 1.661
0.901PheVal: 0.901 ± 0.778
1.802PheTrp: 1.802 ± 1.318
0.901PheTyr: 0.901 ± 0.778
0.0PheXaa: 0.0 ± 0.0
Gly
1.802GlyAla: 1.802 ± 1.318
2.703GlyCys: 2.703 ± 1.762
1.802GlyAsp: 1.802 ± 0.725
3.604GlyGlu: 3.604 ± 1.381
6.306GlyPhe: 6.306 ± 3.045
3.604GlyGly: 3.604 ± 1.45
1.802GlyHis: 1.802 ± 0.802
4.505GlyIle: 4.505 ± 0.948
5.405GlyLys: 5.405 ± 1.706
2.703GlyLeu: 2.703 ± 1.707
0.0GlyMet: 0.0 ± 0.866
1.802GlyAsn: 1.802 ± 1.065
3.604GlyPro: 3.604 ± 1.45
1.802GlyGln: 1.802 ± 0.894
2.703GlyArg: 2.703 ± 1.351
4.505GlySer: 4.505 ± 2.371
4.505GlyThr: 4.505 ± 1.413
2.703GlyVal: 2.703 ± 1.316
0.0GlyTrp: 0.0 ± 0.0
0.901GlyTyr: 0.901 ± 0.849
0.0GlyXaa: 0.0 ± 0.0
His
2.703HisAla: 2.703 ± 0.811
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.802HisPhe: 1.802 ± 1.318
0.901HisGly: 0.901 ± 0.849
0.901HisHis: 0.901 ± 1.019
0.901HisIle: 0.901 ± 0.89
3.604HisLys: 3.604 ± 2.13
4.505HisLeu: 4.505 ± 1.52
0.0HisMet: 0.0 ± 0.0
2.703HisAsn: 2.703 ± 1.146
2.703HisPro: 2.703 ± 1.146
2.703HisGln: 2.703 ± 1.219
2.703HisArg: 2.703 ± 1.385
1.802HisSer: 1.802 ± 1.318
3.604HisThr: 3.604 ± 1.819
1.802HisVal: 1.802 ± 1.065
0.901HisTrp: 0.901 ± 0.981
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.901IleAla: 0.901 ± 0.778
0.901IleCys: 0.901 ± 0.89
4.505IleAsp: 4.505 ± 1.52
2.703IleGlu: 2.703 ± 1.294
5.405IlePhe: 5.405 ± 2.41
1.802IleGly: 1.802 ± 0.894
0.0IleHis: 0.0 ± 0.0
2.703IleIle: 2.703 ± 1.146
3.604IleLys: 3.604 ± 1.541
3.604IleLeu: 3.604 ± 2.144
0.0IleMet: 0.0 ± 0.0
2.703IleAsn: 2.703 ± 1.659
7.207IlePro: 7.207 ± 3.252
3.604IleGln: 3.604 ± 1.422
4.505IleArg: 4.505 ± 0.972
4.505IleSer: 4.505 ± 2.341
3.604IleThr: 3.604 ± 1.998
3.604IleVal: 3.604 ± 0.949
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.604LysAla: 3.604 ± 1.604
1.802LysCys: 1.802 ± 0.802
2.703LysAsp: 2.703 ± 1.294
5.405LysGlu: 5.405 ± 1.076
2.703LysPhe: 2.703 ± 1.351
4.505LysGly: 4.505 ± 1.191
1.802LysHis: 1.802 ± 0.802
2.703LysIle: 2.703 ± 1.819
3.604LysLys: 3.604 ± 1.464
3.604LysLeu: 3.604 ± 1.381
0.901LysMet: 0.901 ± 0.778
3.604LysAsn: 3.604 ± 2.635
2.703LysPro: 2.703 ± 1.504
0.0LysGln: 0.0 ± 0.0
6.306LysArg: 6.306 ± 3.233
4.505LysSer: 4.505 ± 1.066
3.604LysThr: 3.604 ± 2.217
3.604LysVal: 3.604 ± 1.314
0.901LysTrp: 0.901 ± 0.89
4.505LysTyr: 4.505 ± 1.736
0.0LysXaa: 0.0 ± 0.0
Leu
0.901LeuAla: 0.901 ± 0.89
4.505LeuCys: 4.505 ± 2.304
3.604LeuAsp: 3.604 ± 1.759
2.703LeuGlu: 2.703 ± 1.018
1.802LeuPhe: 1.802 ± 1.103
3.604LeuGly: 3.604 ± 0.9
3.604LeuHis: 3.604 ± 1.337
2.703LeuIle: 2.703 ± 1.122
7.207LeuLys: 7.207 ± 1.478
3.604LeuLeu: 3.604 ± 1.77
1.802LeuMet: 1.802 ± 1.172
2.703LeuAsn: 2.703 ± 1.018
2.703LeuPro: 2.703 ± 1.122
8.108LeuGln: 8.108 ± 3.468
4.505LeuArg: 4.505 ± 1.253
5.405LeuSer: 5.405 ± 1.883
3.604LeuThr: 3.604 ± 1.314
2.703LeuVal: 2.703 ± 1.657
1.802LeuTrp: 1.802 ± 1.961
4.505LeuTyr: 4.505 ± 2.111
0.0LeuXaa: 0.0 ± 0.0
Met
0.901MetAla: 0.901 ± 0.981
0.0MetCys: 0.0 ± 0.0
5.405MetAsp: 5.405 ± 2.737
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.703MetGly: 2.703 ± 1.018
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.901MetLys: 0.901 ± 0.778
2.703MetLeu: 2.703 ± 1.442
0.901MetMet: 0.901 ± 1.019
0.0MetAsn: 0.0 ± 0.0
0.901MetPro: 0.901 ± 0.659
1.802MetGln: 1.802 ± 1.172
0.0MetArg: 0.0 ± 0.0
2.703MetSer: 2.703 ± 1.624
0.901MetThr: 0.901 ± 0.778
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.901MetTyr: 0.901 ± 0.778
0.0MetXaa: 0.0 ± 0.0
Asn
3.604AsnAla: 3.604 ± 1.45
2.703AsnCys: 2.703 ± 1.498
1.802AsnAsp: 1.802 ± 0.802
0.901AsnGlu: 0.901 ± 0.778
1.802AsnPhe: 1.802 ± 0.725
0.0AsnGly: 0.0 ± 0.0
5.405AsnHis: 5.405 ± 3.048
1.802AsnIle: 1.802 ± 1.079
0.0AsnLys: 0.0 ± 0.0
3.604AsnLeu: 3.604 ± 1.414
1.802AsnMet: 1.802 ± 1.484
3.604AsnAsn: 3.604 ± 1.541
6.306AsnPro: 6.306 ± 1.007
0.901AsnGln: 0.901 ± 0.659
0.901AsnArg: 0.901 ± 0.89
5.405AsnSer: 5.405 ± 1.454
1.802AsnThr: 1.802 ± 1.277
4.505AsnVal: 4.505 ± 2.344
1.802AsnTrp: 1.802 ± 0.802
0.901AsnTyr: 0.901 ± 0.659
0.0AsnXaa: 0.0 ± 0.0
Pro
0.901ProAla: 0.901 ± 0.849
0.901ProCys: 0.901 ± 0.778
4.505ProAsp: 4.505 ± 2.304
3.604ProGlu: 3.604 ± 1.763
2.703ProPhe: 2.703 ± 1.294
3.604ProGly: 3.604 ± 2.163
2.703ProHis: 2.703 ± 0.737
3.604ProIle: 3.604 ± 1.415
3.604ProLys: 3.604 ± 1.231
3.604ProLeu: 3.604 ± 2.014
2.703ProMet: 2.703 ± 1.452
5.405ProAsn: 5.405 ± 1.111
2.703ProPro: 2.703 ± 1.198
1.802ProGln: 1.802 ± 0.802
3.604ProArg: 3.604 ± 1.759
5.405ProSer: 5.405 ± 1.918
4.505ProThr: 4.505 ± 2.88
4.505ProVal: 4.505 ± 2.839
0.0ProTrp: 0.0 ± 0.0
3.604ProTyr: 3.604 ± 2.365
0.0ProXaa: 0.0 ± 0.0
Gln
1.802GlnAla: 1.802 ± 1.555
0.0GlnCys: 0.0 ± 0.0
1.802GlnAsp: 1.802 ± 0.725
6.306GlnGlu: 6.306 ± 2.701
3.604GlnPhe: 3.604 ± 1.849
1.802GlnGly: 1.802 ± 1.081
2.703GlnHis: 2.703 ± 1.514
2.703GlnIle: 2.703 ± 1.504
0.901GlnLys: 0.901 ± 0.659
3.604GlnLeu: 3.604 ± 2.245
0.901GlnMet: 0.901 ± 1.019
0.901GlnAsn: 0.901 ± 0.981
2.703GlnPro: 2.703 ± 1.514
4.505GlnGln: 4.505 ± 2.486
1.802GlnArg: 1.802 ± 1.277
0.901GlnSer: 0.901 ± 0.659
3.604GlnThr: 3.604 ± 1.541
3.604GlnVal: 3.604 ± 0.949
0.0GlnTrp: 0.0 ± 0.0
2.703GlnTyr: 2.703 ± 1.498
0.0GlnXaa: 0.0 ± 0.0
Arg
3.604ArgAla: 3.604 ± 1.961
1.802ArgCys: 1.802 ± 1.122
4.505ArgAsp: 4.505 ± 2.037
2.703ArgGlu: 2.703 ± 1.176
1.802ArgPhe: 1.802 ± 1.122
4.505ArgGly: 4.505 ± 1.993
1.802ArgHis: 1.802 ± 0.894
7.207ArgIle: 7.207 ± 1.185
4.505ArgLys: 4.505 ± 2.27
3.604ArgLeu: 3.604 ± 2.158
2.703ArgMet: 2.703 ± 2.333
0.901ArgAsn: 0.901 ± 0.89
5.405ArgPro: 5.405 ± 1.425
2.703ArgGln: 2.703 ± 1.416
5.405ArgArg: 5.405 ± 2.999
3.604ArgSer: 3.604 ± 1.724
1.802ArgThr: 1.802 ± 1.318
4.505ArgVal: 4.505 ± 1.35
0.901ArgTrp: 0.901 ± 0.89
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
7.207SerAla: 7.207 ± 3.258
0.901SerCys: 0.901 ± 0.849
5.405SerAsp: 5.405 ± 1.551
0.901SerGlu: 0.901 ± 0.981
1.802SerPhe: 1.802 ± 1.318
3.604SerGly: 3.604 ± 0.906
3.604SerHis: 3.604 ± 1.314
2.703SerIle: 2.703 ± 1.219
0.901SerLys: 0.901 ± 0.659
7.207SerLeu: 7.207 ± 3.263
2.703SerMet: 2.703 ± 1.011
4.505SerAsn: 4.505 ± 2.344
4.505SerPro: 4.505 ± 2.638
3.604SerGln: 3.604 ± 1.604
9.009SerArg: 9.009 ± 3.814
13.514SerSer: 13.514 ± 4.63
4.505SerThr: 4.505 ± 3.549
4.505SerVal: 4.505 ± 1.501
0.901SerTrp: 0.901 ± 0.89
2.703SerTyr: 2.703 ± 0.976
0.0SerXaa: 0.0 ± 0.0
Thr
7.207ThrAla: 7.207 ± 1.414
0.901ThrCys: 0.901 ± 1.019
0.901ThrAsp: 0.901 ± 0.849
1.802ThrGlu: 1.802 ± 1.433
2.703ThrPhe: 2.703 ± 1.351
8.108ThrGly: 8.108 ± 3.511
1.802ThrHis: 1.802 ± 1.065
0.901ThrIle: 0.901 ± 0.659
1.802ThrLys: 1.802 ± 1.318
2.703ThrLeu: 2.703 ± 1.959
0.901ThrMet: 0.901 ± 0.778
3.604ThrAsn: 3.604 ± 1.596
2.703ThrPro: 2.703 ± 2.236
0.901ThrGln: 0.901 ± 0.89
2.703ThrArg: 2.703 ± 1.385
3.604ThrSer: 3.604 ± 3.003
2.703ThrThr: 2.703 ± 1.819
3.604ThrVal: 3.604 ± 1.056
0.0ThrTrp: 0.0 ± 0.0
2.703ThrTyr: 2.703 ± 1.351
0.0ThrXaa: 0.0 ± 0.0
Val
1.802ValAla: 1.802 ± 1.065
3.604ValCys: 3.604 ± 2.13
1.802ValAsp: 1.802 ± 1.318
0.0ValGlu: 0.0 ± 0.0
0.901ValPhe: 0.901 ± 0.659
1.802ValGly: 1.802 ± 1.079
0.901ValHis: 0.901 ± 0.89
6.306ValIle: 6.306 ± 3.275
3.604ValLys: 3.604 ± 1.68
1.802ValLeu: 1.802 ± 1.172
0.901ValMet: 0.901 ± 0.659
1.802ValAsn: 1.802 ± 0.894
4.505ValPro: 4.505 ± 2.839
1.802ValGln: 1.802 ± 0.725
5.405ValArg: 5.405 ± 2.999
7.207ValSer: 7.207 ± 2.57
2.703ValThr: 2.703 ± 1.657
1.802ValVal: 1.802 ± 1.698
2.703ValTrp: 2.703 ± 0.737
3.604ValTyr: 3.604 ± 1.01
0.0ValXaa: 0.0 ± 0.0
Trp
2.703TrpAla: 2.703 ± 1.977
0.901TrpCys: 0.901 ± 0.849
0.901TrpAsp: 0.901 ± 0.981
0.901TrpGlu: 0.901 ± 0.981
0.0TrpPhe: 0.0 ± 0.0
1.802TrpGly: 1.802 ± 1.081
0.901TrpHis: 0.901 ± 0.89
2.703TrpIle: 2.703 ± 1.11
0.901TrpLys: 0.901 ± 0.849
1.802TrpLeu: 1.802 ± 0.725
0.901TrpMet: 0.901 ± 0.778
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.901TrpGln: 0.901 ± 0.659
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.901TrpThr: 0.901 ± 0.89
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.802TrpTyr: 1.802 ± 0.802
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
2.703TyrAsp: 2.703 ± 1.663
3.604TyrGlu: 3.604 ± 1.718
1.802TyrPhe: 1.802 ± 1.065
2.703TyrGly: 2.703 ± 0.976
0.901TyrHis: 0.901 ± 0.659
1.802TyrIle: 1.802 ± 1.318
0.901TyrLys: 0.901 ± 0.849
2.703TyrLeu: 2.703 ± 1.498
0.901TyrMet: 0.901 ± 0.875
3.604TyrAsn: 3.604 ± 2.083
0.901TyrPro: 0.901 ± 0.659
2.703TyrGln: 2.703 ± 1.762
0.901TyrArg: 0.901 ± 0.981
3.604TyrSer: 3.604 ± 0.906
0.901TyrThr: 0.901 ± 0.778
2.703TyrVal: 2.703 ± 1.819
0.901TyrTrp: 0.901 ± 0.778
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1111 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
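Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic on each resampled set. The sketch below illustrates the idea under stated assumptions: the function names, the fixed random seed, and the use of the population standard deviation of the replicate estimates as the reported error are choices made for this example, not details confirmed by the source.

```python
import random
import statistics


def dipeptide_freq_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs
    inside each sequence (one-letter codes)."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) whole proteins with replacement.
    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_freq_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Hypothetical toy proteome, not the sequences behind the table above.
    toy = ["MAAKL", "MSSRA", "MKLAA"]
    print(dipeptide_freq_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only a handful of proteins, as in the six-protein proteome here, each replicate can differ substantially from the original set, which is consistent with the relatively large errors reported in the table.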

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski