Amino acid dipeptide frequency for Calla lily chlorotic spot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
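As a minimal sketch of how such a frequency could be computed (this is not the Proteome-pI code; the function name `dipeptide_frequencies` and the toy sequences are hypothetical), overlapping pairs of adjacent residues are counted and normalised by the total number of pairs, expressed in per mille (parts per thousand), the unit used in the table. The sketch assumes pairs are counted within each protein only; the database's exact normalisation may differ.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale each count to parts per thousand of all observed pairs
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter code, not the real proteome
freqs = dipeptide_frequencies(["MKKLS", "MKS"])
```

By construction the returned values sum to 1000‰, which is a quick sanity check for any table built this way.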

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
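The expectation and the colour coding can be illustrated with a short sketch. It assumes the threshold is the uniform expectation for a 20-letter alphabet (1/400, i.e. 2.5‰) and that any value above or below it is flagged; the function names are hypothetical and the database may apply a different rule.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked in red
    if freq_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"

expected = expected_per_mille(20)  # 400 possible dipeptides
labels = {
    "LeuSer": classify(10.889, expected),  # value taken from the table
    "AlaAla": classify(1.584, expected),   # value taken from the table
}

# Converting a per mille value to a plain fraction
leu_ser_fraction = 10.889 * 1e-3
```

With Xaa included, `expected_per_mille(21)` gives the corresponding expectation for 441 possibilities.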

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.584AlaAla: 1.584 ± 1.074
2.178AlaCys: 2.178 ± 0.585
1.782AlaAsp: 1.782 ± 0.427
2.376AlaGlu: 2.376 ± 0.233
1.98AlaPhe: 1.98 ± 0.926
2.574AlaGly: 2.574 ± 0.722
0.594AlaHis: 0.594 ± 0.419
3.168AlaIle: 3.168 ± 0.818
2.376AlaLys: 2.376 ± 1.06
3.366AlaLeu: 3.366 ± 1.482
1.584AlaMet: 1.584 ± 0.463
1.782AlaAsn: 1.782 ± 0.6
1.386AlaPro: 1.386 ± 0.527
1.386AlaGln: 1.386 ± 0.563
1.386AlaArg: 1.386 ± 1.071
3.564AlaSer: 3.564 ± 0.829
2.376AlaThr: 2.376 ± 1.197
2.574AlaVal: 2.574 ± 0.752
0.396AlaTrp: 0.396 ± 0.234
1.782AlaTyr: 1.782 ± 0.656
0.0AlaXaa: 0.0 ± 0.0
Cys
1.386CysAla: 1.386 ± 0.536
0.396CysCys: 0.396 ± 0.234
0.99CysAsp: 0.99 ± 0.739
0.594CysGlu: 0.594 ± 0.19
1.782CysPhe: 1.782 ± 0.671
0.99CysGly: 0.99 ± 1.012
0.792CysHis: 0.792 ± 0.255
2.178CysIle: 2.178 ± 0.517
2.178CysLys: 2.178 ± 0.842
2.178CysLeu: 2.178 ± 0.8
0.99CysMet: 0.99 ± 0.904
0.792CysAsn: 0.792 ± 0.371
0.594CysPro: 0.594 ± 0.351
0.198CysGln: 0.198 ± 0.245
1.782CysArg: 1.782 ± 0.508
2.772CysSer: 2.772 ± 0.899
0.99CysThr: 0.99 ± 0.599
1.386CysVal: 1.386 ± 1.086
0.396CysTrp: 0.396 ± 0.297
0.792CysTyr: 0.792 ± 0.406
0.0CysXaa: 0.0 ± 0.0
Asp
0.792AspAla: 0.792 ± 0.468
1.782AspCys: 1.782 ± 0.611
3.366AspAsp: 3.366 ± 0.743
4.554AspGlu: 4.554 ± 0.741
4.356AspPhe: 4.356 ± 0.183
1.584AspGly: 1.584 ± 0.742
0.792AspHis: 0.792 ± 0.202
6.137AspIle: 6.137 ± 0.999
4.752AspLys: 4.752 ± 1.339
4.752AspLeu: 4.752 ± 0.585
2.178AspMet: 2.178 ± 0.8
2.772AspAsn: 2.772 ± 1.122
1.386AspPro: 1.386 ± 0.348
1.98AspGln: 1.98 ± 0.525
2.772AspArg: 2.772 ± 0.917
6.335AspSer: 6.335 ± 1.794
1.98AspThr: 1.98 ± 0.652
3.366AspVal: 3.366 ± 0.853
0.594AspTrp: 0.594 ± 0.36
1.782AspTyr: 1.782 ± 0.361
0.0AspXaa: 0.0 ± 0.0
Glu
2.772GluAla: 2.772 ± 0.725
1.386GluCys: 1.386 ± 0.358
3.168GluAsp: 3.168 ± 0.484
4.158GluGlu: 4.158 ± 0.812
4.752GluPhe: 4.752 ± 1.229
1.782GluGly: 1.782 ± 0.471
0.99GluHis: 0.99 ± 0.286
5.939GluIle: 5.939 ± 0.766
5.741GluLys: 5.741 ± 0.745
6.137GluLeu: 6.137 ± 1.339
4.356GluMet: 4.356 ± 1.163
4.95GluAsn: 4.95 ± 0.694
1.386GluPro: 1.386 ± 0.229
1.386GluGln: 1.386 ± 1.126
1.386GluArg: 1.386 ± 0.513
4.95GluSer: 4.95 ± 0.639
2.97GluThr: 2.97 ± 0.387
2.772GluVal: 2.772 ± 0.648
0.396GluTrp: 0.396 ± 0.186
2.97GluTyr: 2.97 ± 1.046
0.0GluXaa: 0.0 ± 0.0
Phe
1.98PheAla: 1.98 ± 0.3
1.584PheCys: 1.584 ± 0.484
2.772PheAsp: 2.772 ± 0.648
2.574PheGlu: 2.574 ± 0.624
2.574PhePhe: 2.574 ± 1.336
1.782PheGly: 1.782 ± 0.567
0.594PheHis: 0.594 ± 0.325
2.574PheIle: 2.574 ± 0.683
4.752PheLys: 4.752 ± 1.207
5.345PheLeu: 5.345 ± 1.39
1.188PheMet: 1.188 ± 0.294
3.762PheAsn: 3.762 ± 0.956
2.376PhePro: 2.376 ± 0.482
1.98PheGln: 1.98 ± 0.407
1.386PheArg: 1.386 ± 0.818
5.147PheSer: 5.147 ± 1.639
2.178PheThr: 2.178 ± 0.665
2.178PheVal: 2.178 ± 0.902
0.0PheTrp: 0.0 ± 0.0
2.178PheTyr: 2.178 ± 1.212
0.0PheXaa: 0.0 ± 0.0
Gly
1.386GlyAla: 1.386 ± 0.668
1.584GlyCys: 1.584 ± 0.999
2.574GlyAsp: 2.574 ± 0.97
2.772GlyGlu: 2.772 ± 0.948
1.584GlyPhe: 1.584 ± 0.674
1.188GlyGly: 1.188 ± 0.757
0.99GlyHis: 0.99 ± 0.357
3.168GlyIle: 3.168 ± 0.806
3.366GlyLys: 3.366 ± 1.247
4.554GlyLeu: 4.554 ± 1.047
0.792GlyMet: 0.792 ± 0.32
3.762GlyAsn: 3.762 ± 0.676
1.188GlyPro: 1.188 ± 0.822
1.386GlyGln: 1.386 ± 0.378
0.99GlyArg: 0.99 ± 0.606
3.366GlySer: 3.366 ± 0.355
2.574GlyThr: 2.574 ± 0.97
1.584GlyVal: 1.584 ± 0.497
0.396GlyTrp: 0.396 ± 0.297
1.584GlyTyr: 1.584 ± 0.541
0.0GlyXaa: 0.0 ± 0.0
His
0.594HisAla: 0.594 ± 0.351
0.198HisCys: 0.198 ± 0.245
2.772HisAsp: 2.772 ± 0.491
0.594HisGlu: 0.594 ± 0.351
1.98HisPhe: 1.98 ± 0.431
0.396HisGly: 0.396 ± 0.234
0.198HisHis: 0.198 ± 0.245
1.386HisIle: 1.386 ± 0.644
0.99HisLys: 0.99 ± 0.412
0.792HisLeu: 0.792 ± 0.636
0.198HisMet: 0.198 ± 0.117
1.386HisAsn: 1.386 ± 0.358
0.99HisPro: 0.99 ± 0.349
0.396HisGln: 0.396 ± 0.365
0.594HisArg: 0.594 ± 0.19
2.376HisSer: 2.376 ± 0.521
1.782HisThr: 1.782 ± 0.631
0.792HisVal: 0.792 ± 0.255
0.198HisTrp: 0.198 ± 0.117
0.594HisTyr: 0.594 ± 0.419
0.0HisXaa: 0.0 ± 0.0
Ile
2.574IleAla: 2.574 ± 0.522
2.97IleCys: 2.97 ± 0.823
5.147IleAsp: 5.147 ± 0.601
4.356IleGlu: 4.356 ± 1.215
2.376IlePhe: 2.376 ± 0.311
3.96IleGly: 3.96 ± 0.827
1.584IleHis: 1.584 ± 0.674
4.554IleIle: 4.554 ± 1.177
10.295IleLys: 10.295 ± 0.54
5.147IleLeu: 5.147 ± 0.65
1.782IleMet: 1.782 ± 0.235
5.741IleAsn: 5.741 ± 0.275
4.752IlePro: 4.752 ± 1.09
3.168IleGln: 3.168 ± 0.527
2.574IleArg: 2.574 ± 0.733
8.909IleSer: 8.909 ± 2.138
3.762IleThr: 3.762 ± 1.186
3.366IleVal: 3.366 ± 0.98
0.594IleTrp: 0.594 ± 0.19
2.772IleTyr: 2.772 ± 1.033
0.0IleXaa: 0.0 ± 0.0
Lys
3.762LysAla: 3.762 ± 0.536
1.584LysCys: 1.584 ± 0.523
4.356LysAsp: 4.356 ± 0.323
4.95LysGlu: 4.95 ± 0.67
3.762LysPhe: 3.762 ± 0.56
3.168LysGly: 3.168 ± 0.728
2.178LysHis: 2.178 ± 0.317
8.513LysIle: 8.513 ± 0.92
8.711LysLys: 8.711 ± 1.244
7.127LysLeu: 7.127 ± 1.468
3.366LysMet: 3.366 ± 1.028
5.939LysAsn: 5.939 ± 1.108
2.376LysPro: 2.376 ± 0.395
2.772LysGln: 2.772 ± 2.202
2.376LysArg: 2.376 ± 0.493
8.513LysSer: 8.513 ± 0.578
8.513LysThr: 8.513 ± 1.565
3.168LysVal: 3.168 ± 1.467
0.792LysTrp: 0.792 ± 0.255
2.97LysTyr: 2.97 ± 0.565
0.0LysXaa: 0.0 ± 0.0
Leu
4.158LeuAla: 4.158 ± 1.791
0.792LeuCys: 0.792 ± 0.514
3.96LeuAsp: 3.96 ± 0.462
5.543LeuGlu: 5.543 ± 0.975
3.564LeuPhe: 3.564 ± 0.823
3.168LeuGly: 3.168 ± 0.502
1.782LeuHis: 1.782 ± 0.461
5.939LeuIle: 5.939 ± 0.995
7.721LeuLys: 7.721 ± 1.7
6.731LeuLeu: 6.731 ± 1.247
5.543LeuMet: 5.543 ± 1.251
4.95LeuAsn: 4.95 ± 1.372
2.97LeuPro: 2.97 ± 1.161
2.376LeuGln: 2.376 ± 0.827
2.772LeuArg: 2.772 ± 0.387
10.889LeuSer: 10.889 ± 1.122
5.147LeuThr: 5.147 ± 0.436
4.158LeuVal: 4.158 ± 1.791
0.594LeuTrp: 0.594 ± 0.419
2.97LeuTyr: 2.97 ± 1.046
0.0LeuXaa: 0.0 ± 0.0
Met
1.386MetAla: 1.386 ± 0.435
0.198MetCys: 0.198 ± 0.117
1.98MetAsp: 1.98 ± 0.363
2.178MetGlu: 2.178 ± 0.658
0.792MetPhe: 0.792 ± 0.255
1.782MetGly: 1.782 ± 0.569
0.99MetHis: 0.99 ± 0.657
3.564MetIle: 3.564 ± 0.451
3.366MetLys: 3.366 ± 1.462
1.386MetLeu: 1.386 ± 0.378
2.376MetMet: 2.376 ± 0.906
2.772MetAsn: 2.772 ± 1.364
0.792MetPro: 0.792 ± 0.541
0.594MetGln: 0.594 ± 0.319
0.99MetArg: 0.99 ± 0.585
5.147MetSer: 5.147 ± 0.669
1.98MetThr: 1.98 ± 0.509
2.97MetVal: 2.97 ± 0.969
0.198MetTrp: 0.198 ± 0.117
1.188MetTyr: 1.188 ± 0.577
0.0MetXaa: 0.0 ± 0.0
Asn
2.376AsnAla: 2.376 ± 0.673
1.584AsnCys: 1.584 ± 0.68
4.752AsnAsp: 4.752 ± 1.02
4.95AsnGlu: 4.95 ± 0.697
4.158AsnPhe: 4.158 ± 0.868
1.782AsnGly: 1.782 ± 0.617
0.792AsnHis: 0.792 ± 0.255
5.741AsnIle: 5.741 ± 0.922
3.762AsnLys: 3.762 ± 0.345
6.731AsnLeu: 6.731 ± 1.917
0.594AsnMet: 0.594 ± 0.351
2.574AsnAsn: 2.574 ± 0.168
2.178AsnPro: 2.178 ± 0.35
3.168AsnGln: 3.168 ± 0.197
2.178AsnArg: 2.178 ± 0.511
5.147AsnSer: 5.147 ± 2.05
3.366AsnThr: 3.366 ± 0.853
3.96AsnVal: 3.96 ± 1.308
0.99AsnTrp: 0.99 ± 0.811
3.564AsnTyr: 3.564 ± 1.137
0.0AsnXaa: 0.0 ± 0.0
Pro
1.386ProAla: 1.386 ± 0.802
0.0ProCys: 0.0 ± 0.0
1.98ProAsp: 1.98 ± 0.616
1.782ProGlu: 1.782 ± 0.675
1.386ProPhe: 1.386 ± 0.358
1.98ProGly: 1.98 ± 1.306
0.198ProHis: 0.198 ± 0.245
2.376ProIle: 2.376 ± 1.352
3.168ProLys: 3.168 ± 0.946
2.97ProLeu: 2.97 ± 1.209
1.188ProMet: 1.188 ± 0.25
1.98ProAsn: 1.98 ± 0.999
0.594ProPro: 0.594 ± 0.735
1.386ProGln: 1.386 ± 0.435
0.396ProArg: 0.396 ± 0.49
2.178ProSer: 2.178 ± 0.687
1.386ProThr: 1.386 ± 1.06
3.168ProVal: 3.168 ± 0.993
0.0ProTrp: 0.0 ± 0.0
1.386ProTyr: 1.386 ± 0.435
0.0ProXaa: 0.0 ± 0.0
Gln
0.594GlnAla: 0.594 ± 0.37
0.792GlnCys: 0.792 ± 0.842
1.584GlnAsp: 1.584 ± 0.461
1.782GlnGlu: 1.782 ± 0.788
0.594GlnPhe: 0.594 ± 0.735
0.594GlnGly: 0.594 ± 0.367
0.396GlnHis: 0.396 ± 0.297
3.762GlnIle: 3.762 ± 0.94
2.772GlnLys: 2.772 ± 0.589
1.98GlnLeu: 1.98 ± 0.796
1.98GlnMet: 1.98 ± 0.932
2.178GlnAsn: 2.178 ± 0.77
0.396GlnPro: 0.396 ± 0.234
0.396GlnGln: 0.396 ± 0.49
1.188GlnArg: 1.188 ± 0.554
3.762GlnSer: 3.762 ± 1.183
1.386GlnThr: 1.386 ± 0.378
0.99GlnVal: 0.99 ± 0.683
0.0GlnTrp: 0.0 ± 0.0
1.188GlnTyr: 1.188 ± 0.728
0.0GlnXaa: 0.0 ± 0.0
Arg
1.782ArgAla: 1.782 ± 0.632
0.198ArgCys: 0.198 ± 0.245
1.386ArgAsp: 1.386 ± 0.435
1.386ArgGlu: 1.386 ± 0.69
0.594ArgPhe: 0.594 ± 0.351
1.386ArgGly: 1.386 ± 0.609
0.99ArgHis: 0.99 ± 0.585
2.97ArgIle: 2.97 ± 0.579
2.574ArgLys: 2.574 ± 0.374
4.752ArgLeu: 4.752 ± 0.695
0.198ArgMet: 0.198 ± 0.117
1.98ArgAsn: 1.98 ± 0.363
0.396ArgPro: 0.396 ± 0.234
1.782ArgGln: 1.782 ± 0.235
0.396ArgArg: 0.396 ± 0.318
2.97ArgSer: 2.97 ± 0.607
2.574ArgThr: 2.574 ± 0.726
1.782ArgVal: 1.782 ± 0.569
0.396ArgTrp: 0.396 ± 0.234
1.584ArgTyr: 1.584 ± 0.541
0.0ArgXaa: 0.0 ± 0.0
Ser
5.939SerAla: 5.939 ± 1.066
1.386SerCys: 1.386 ± 0.782
6.533SerAsp: 6.533 ± 1.48
9.107SerGlu: 9.107 ± 1.021
4.356SerPhe: 4.356 ± 1.46
5.147SerGly: 5.147 ± 1.191
1.584SerHis: 1.584 ± 0.461
8.513SerIle: 8.513 ± 1.045
8.711SerLys: 8.711 ± 1.248
8.909SerLeu: 8.909 ± 0.897
3.168SerMet: 3.168 ± 0.25
6.137SerAsn: 6.137 ± 1.478
2.178SerPro: 2.178 ± 0.835
0.594SerGln: 0.594 ± 0.319
3.96SerArg: 3.96 ± 1.391
7.325SerSer: 7.325 ± 1.988
5.345SerThr: 5.345 ± 1.16
5.543SerVal: 5.543 ± 0.871
0.594SerTrp: 0.594 ± 0.628
3.564SerTyr: 3.564 ± 0.73
0.0SerXaa: 0.0 ± 0.0
Thr
2.376ThrAla: 2.376 ± 0.599
1.98ThrCys: 1.98 ± 0.525
2.376ThrAsp: 2.376 ± 0.904
4.158ThrGlu: 4.158 ± 0.725
3.564ThrPhe: 3.564 ± 1.061
2.97ThrGly: 2.97 ± 0.603
1.782ThrHis: 1.782 ± 0.604
3.366ThrIle: 3.366 ± 1.011
4.158ThrLys: 4.158 ± 0.637
3.366ThrLeu: 3.366 ± 0.766
1.386ThrMet: 1.386 ± 0.818
4.752ThrAsn: 4.752 ± 1.409
1.386ThrPro: 1.386 ± 0.682
0.594ThrGln: 0.594 ± 0.367
1.188ThrArg: 1.188 ± 0.528
5.543ThrSer: 5.543 ± 1.41
3.168ThrThr: 3.168 ± 0.663
4.554ThrVal: 4.554 ± 0.443
0.594ThrTrp: 0.594 ± 0.37
2.376ThrTyr: 2.376 ± 0.453
0.0ThrXaa: 0.0 ± 0.0
Val
1.782ValAla: 1.782 ± 1.307
1.782ValCys: 1.782 ± 0.367
4.356ValAsp: 4.356 ± 0.747
3.96ValGlu: 3.96 ± 0.899
2.772ValPhe: 2.772 ± 0.376
2.376ValGly: 2.376 ± 0.584
1.386ValHis: 1.386 ± 0.656
3.168ValIle: 3.168 ± 0.911
3.762ValLys: 3.762 ± 1.319
4.95ValLeu: 4.95 ± 0.697
1.782ValMet: 1.782 ± 0.446
2.772ValAsn: 2.772 ± 0.658
2.772ValPro: 2.772 ± 1.061
1.188ValGln: 1.188 ± 0.453
2.376ValArg: 2.376 ± 0.285
4.95ValSer: 4.95 ± 0.56
2.574ValThr: 2.574 ± 0.434
3.564ValVal: 3.564 ± 1.291
0.396ValTrp: 0.396 ± 0.446
2.574ValTyr: 2.574 ± 0.752
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.198TrpCys: 0.198 ± 0.319
0.594TrpAsp: 0.594 ± 0.37
0.396TrpGlu: 0.396 ± 0.234
0.198TrpPhe: 0.198 ± 0.117
0.198TrpGly: 0.198 ± 0.245
0.0TrpHis: 0.0 ± 0.0
0.396TrpIle: 0.396 ± 0.446
0.99TrpLys: 0.99 ± 0.348
0.99TrpLeu: 0.99 ± 0.357
0.396TrpMet: 0.396 ± 0.234
0.396TrpAsn: 0.396 ± 0.186
0.198TrpPro: 0.198 ± 0.245
0.0TrpGln: 0.0 ± 0.0
0.198TrpArg: 0.198 ± 0.319
1.188TrpSer: 1.188 ± 0.339
0.396TrpThr: 0.396 ± 0.365
0.792TrpVal: 0.792 ± 0.541
0.198TrpTrp: 0.198 ± 0.245
0.198TrpTyr: 0.198 ± 0.245
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.584TyrAla: 1.584 ± 0.592
1.584TyrCys: 1.584 ± 0.264
1.584TyrAsp: 1.584 ± 0.541
2.376TyrGlu: 2.376 ± 1.488
1.782TyrPhe: 1.782 ± 0.687
1.98TyrGly: 1.98 ± 0.522
0.594TyrHis: 0.594 ± 0.351
2.97TyrIle: 2.97 ± 0.725
4.752TyrLys: 4.752 ± 1.644
3.564TyrLeu: 3.564 ± 0.946
1.386TyrMet: 1.386 ± 0.536
2.97TyrAsn: 2.97 ± 0.659
0.594TyrPro: 0.594 ± 0.419
1.584TyrGln: 1.584 ± 1.326
1.188TyrArg: 1.188 ± 0.268
3.564TyrSer: 3.564 ± 0.54
1.188TyrThr: 1.188 ± 0.38
2.574TyrVal: 2.574 ± 0.347
0.198TyrTrp: 0.198 ± 0.319
1.782TyrTyr: 1.782 ± 0.6
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (5052 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
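A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the error is taken as the standard deviation of those estimates. The page does not say which statistic is reported, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille across the given proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

toy = ["MKKLS", "MKS", "LSLSK"]  # toy one-letter sequences, not the real proteome
err = bootstrap_error(toy, "LS")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so with only 5 proteins the resulting errors can be large relative to the values themselves.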

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski