Amino acid dipeptide frequency for Citrus chlorotic spot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
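As an illustration of what such a frequency is, the sketch below counts overlapping adjacent residue pairs in a list of sequences and reports them in per mille. It is only a minimal sketch under stated assumptions: the function name dipeptide_frequencies and the toy sequences are hypothetical, pairs are not counted across protein boundaries, and the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies (per mille) for a list of sequences.

    Assumption: every non-standard or unknown residue is mapped to 'X' (Xaa),
    and pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described here.
freqs = dipeptide_frequencies(["MASSLA", "ASSB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The table on this page uses three-letter codes (AlaSer and so on); the one-letter codes above are used only to keep the sketch short.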

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
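The expected frequency and the per mille convention described above can be sketched in a few lines. This is a hedged illustration only: the helper names are hypothetical, and it assumes the colour threshold is the uniform expectation over 400 dipeptides (with 441 possibilities the expectation would be about 2.27‰ instead). The three observed values are copied from the table on this page.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide as over- or under-represented (assumed threshold)."""
    if value_per_mille > expected:
        return "over"   # would be shown in red
    if value_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"


# Values taken from the table: SerLeu, AlaAla and TrpTrp.
observed = {"SerLeu": 10.797, "AlaAla": 4.43, "TrpTrp": 0.277}
labels = {pair: classify(v) for pair, v in observed.items()}

for pair, v in observed.items():
    print(pair, per_mille_to_fraction(v), labels[pair])
```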

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.43 ± 1.967 | 1.661 ± 0.705 | 3.876 ± 1.244 | 3.599 ± 0.92 | 1.661 ± 0.642 | 1.384 ± 1.176 | 1.107 ± 0.518 | 2.769 ± 0.649 | 1.938 ± 0.485 | 4.983 ± 0.669 | 0.831 ± 0.471 | 3.045 ± 0.675 | 2.492 ± 1.106 | 1.384 ± 0.596 | 3.876 ± 1.051 | 7.752 ± 1.289 | 2.492 ± 1.108 | 5.26 ± 1.669 | 0.831 ± 0.716 | 2.215 ± 0.398 | 0.0 ± 0.0 |
| Cys | 1.107 ± 0.34 | 0.554 ± 0.598 | 1.938 ± 0.5 | 1.938 ± 0.476 | 0.277 ± 0.304 | 0.554 ± 0.609 | 0.554 ± 0.609 | 1.384 ± 0.558 | 1.107 ± 0.626 | 2.492 ± 0.794 | 0.554 ± 0.314 | 1.661 ± 0.705 | 0.554 ± 0.274 | 1.107 ± 0.345 | 1.938 ± 0.519 | 1.938 ± 0.485 | 0.554 ± 0.308 | 0.554 ± 0.298 | 0.0 ± 0.0 | 1.107 ± 0.598 | 0.0 ± 0.0 |
| Asp | 4.707 ± 0.415 | 1.107 ± 0.345 | 3.599 ± 1.078 | 4.153 ± 1.446 | 1.661 ± 0.477 | 2.492 ± 0.992 | 4.43 ± 0.44 | 5.26 ± 1.501 | 3.322 ± 0.859 | 5.26 ± 1.24 | 2.492 ± 1.135 | 2.769 ± 0.501 | 3.045 ± 0.85 | 1.384 ± 0.698 | 3.045 ± 0.753 | 3.599 ± 0.69 | 2.492 ± 0.94 | 2.215 ± 0.592 | 1.938 ± 0.795 | 1.384 ± 0.59 | 0.0 ± 0.0 |
| Glu | 3.322 ± 0.85 | 0.554 ± 0.314 | 2.769 ± 0.482 | 3.876 ± 0.505 | 1.384 ± 0.888 | 4.983 ± 1.014 | 2.215 ± 0.745 | 2.769 ± 0.887 | 4.43 ± 1.129 | 3.322 ± 0.926 | 2.492 ± 1.103 | 1.384 ± 0.581 | 0.277 ± 0.157 | 0.277 ± 0.433 | 1.938 ± 0.485 | 3.876 ± 1.447 | 3.045 ± 0.814 | 2.769 ± 0.873 | 1.107 ± 0.393 | 2.492 ± 1.14 | 0.0 ± 0.0 |
| Phe | 1.107 ± 0.433 | 0.831 ± 0.616 | 1.107 ± 0.407 | 1.107 ± 0.451 | 1.107 ± 0.261 | 1.661 ± 0.851 | 0.831 ± 0.326 | 1.938 ± 0.464 | 3.599 ± 1.236 | 4.153 ± 0.51 | 1.384 ± 0.616 | 2.215 ± 0.49 | 1.938 ± 0.573 | 1.661 ± 0.696 | 1.107 ± 0.433 | 0.831 ± 0.438 | 0.831 ± 0.386 | 0.0 ± 0.0 | 0.277 ± 0.337 | 0.831 ± 0.326 | 0.0 ± 0.0 |
| Gly | 3.322 ± 0.95 | 1.384 ± 0.616 | 4.983 ± 0.911 | 3.045 ± 1.071 | 1.107 ± 0.595 | 5.26 ± 1.082 | 1.384 ± 0.373 | 3.045 ± 0.71 | 2.492 ± 0.571 | 5.537 ± 1.248 | 2.215 ± 0.515 | 1.938 ± 0.436 | 4.153 ± 1.565 | 1.107 ± 0.433 | 3.876 ± 0.785 | 3.322 ± 0.789 | 3.322 ± 0.906 | 4.707 ± 0.925 | 0.554 ± 0.314 | 0.554 ± 0.274 | 0.0 ± 0.0 |
| His | 1.107 ± 0.762 | 0.277 ± 0.337 | 3.322 ± 1.081 | 0.831 ± 0.471 | 1.107 ± 0.557 | 2.215 ± 0.431 | 1.661 ± 1.048 | 1.938 ± 0.485 | 1.107 ± 0.433 | 4.707 ± 1.418 | 0.831 ± 0.293 | 2.215 ± 0.375 | 1.661 ± 0.457 | 0.0 ± 0.0 | 1.384 ± 0.566 | 0.554 ± 0.298 | 1.938 ± 0.853 | 2.769 ± 1.032 | 0.554 ± 0.314 | 1.938 ± 0.808 | 0.0 ± 0.0 |
| Ile | 3.045 ± 0.746 | 1.938 ± 0.558 | 3.322 ± 0.805 | 2.492 ± 0.411 | 2.215 ± 0.79 | 2.769 ± 0.642 | 2.492 ± 0.721 | 6.921 ± 1.561 | 4.43 ± 1.826 | 6.091 ± 2.041 | 3.045 ± 0.47 | 4.43 ± 0.61 | 2.492 ± 0.814 | 2.215 ± 1.486 | 2.492 ± 0.721 | 6.645 ± 0.261 | 4.43 ± 0.71 | 3.876 ± 1.066 | 1.107 ± 0.345 | 3.322 ± 0.471 | 0.0 ± 0.0 |
| Lys | 4.707 ± 1.244 | 1.107 ± 0.547 | 4.983 ± 1.033 | 2.492 ± 0.84 | 1.661 ± 0.943 | 3.045 ± 0.821 | 1.938 ± 0.592 | 3.599 ± 0.225 | 4.43 ± 1.458 | 4.153 ± 0.757 | 2.215 ± 0.629 | 1.661 ± 0.546 | 3.322 ± 1.678 | 2.492 ± 0.689 | 4.153 ± 0.799 | 3.322 ± 0.891 | 2.769 ± 0.871 | 3.599 ± 0.63 | 1.107 ± 0.445 | 1.661 ± 0.477 | 0.0 ± 0.0 |
| Leu | 5.26 ± 1.28 | 3.322 ± 0.545 | 4.43 ± 1.445 | 5.537 ± 1.723 | 3.045 ± 0.707 | 5.537 ± 1.127 | 2.215 ± 0.568 | 5.26 ± 1.405 | 3.599 ± 0.582 | 6.091 ± 1.642 | 3.322 ± 0.989 | 4.153 ± 0.651 | 2.769 ± 0.292 | 2.215 ± 0.398 | 4.43 ± 0.662 | 8.859 ± 0.91 | 4.983 ± 1.009 | 6.921 ± 1.586 | 0.554 ± 0.314 | 2.492 ± 0.419 | 0.0 ± 0.0 |
| Met | 3.599 ± 0.709 | 0.831 ± 0.384 | 2.769 ± 0.739 | 1.938 ± 0.573 | 1.384 ± 0.527 | 2.215 ± 0.568 | 0.277 ± 0.304 | 3.599 ± 0.802 | 3.045 ± 1.082 | 3.322 ± 0.767 | 1.938 ± 0.578 | 2.492 ± 0.552 | 0.831 ± 0.389 | 0.554 ± 0.314 | 1.938 ± 0.266 | 5.537 ± 1.159 | 1.384 ± 0.438 | 3.322 ± 0.663 | 1.107 ± 0.433 | 1.107 ± 0.433 | 0.0 ± 0.0 |
| Asn | 1.661 ± 0.622 | 0.0 ± 0.0 | 4.153 ± 1.164 | 1.661 ± 0.457 | 0.831 ± 0.471 | 1.107 ± 0.433 | 2.492 ± 0.524 | 5.26 ± 0.489 | 4.153 ± 0.724 | 2.215 ± 0.64 | 2.492 ± 0.95 | 2.215 ± 0.527 | 2.769 ± 0.835 | 1.938 ± 0.498 | 2.215 ± 0.793 | 3.045 ± 0.604 | 1.938 ± 0.592 | 3.599 ± 0.887 | 0.277 ± 0.157 | 0.831 ± 0.337 | 0.0 ± 0.0 |
| Pro | 1.661 ± 0.426 | 0.277 ± 0.157 | 3.322 ± 0.981 | 1.384 ± 0.615 | 1.384 ± 0.438 | 2.215 ± 1.333 | 2.215 ± 0.703 | 3.045 ± 1.171 | 2.492 ± 0.346 | 5.26 ± 0.861 | 1.938 ± 0.557 | 0.831 ± 0.471 | 4.153 ± 2.182 | 1.661 ± 1.0 | 2.215 ± 0.981 | 4.983 ± 0.837 | 3.045 ± 0.961 | 3.876 ± 0.674 | 0.554 ± 0.298 | 1.107 ± 0.261 | 0.0 ± 0.0 |
| Gln | 2.215 ± 0.788 | 1.384 ± 0.867 | 2.215 ± 0.752 | 1.938 ± 0.582 | 0.554 ± 0.314 | 1.384 ± 0.36 | 0.554 ± 0.298 | 0.554 ± 0.314 | 1.384 ± 0.373 | 1.938 ± 0.871 | 1.661 ± 0.895 | 0.831 ± 0.32 | 1.107 ± 0.947 | 1.661 ± 0.514 | 1.107 ± 0.816 | 3.045 ± 0.531 | 1.107 ± 0.34 | 3.322 ± 0.859 | 0.277 ± 0.304 | 0.554 ± 0.589 | 0.0 ± 0.0 |
| Arg | 3.599 ± 0.639 | 1.107 ± 0.762 | 3.045 ± 1.059 | 2.769 ± 0.528 | 1.661 ± 1.209 | 4.43 ± 1.223 | 1.661 ± 0.561 | 4.707 ± 1.255 | 3.876 ± 1.089 | 1.938 ± 0.84 | 2.769 ± 0.642 | 1.384 ± 0.601 | 2.215 ± 0.975 | 0.831 ± 0.328 | 3.322 ± 0.735 | 3.876 ± 1.198 | 3.045 ± 0.944 | 4.707 ± 0.773 | 0.277 ± 0.157 | 2.492 ± 0.694 | 0.0 ± 0.0 |
| Ser | 4.153 ± 0.745 | 1.938 ± 0.905 | 2.769 ± 0.631 | 3.599 ± 1.396 | 2.769 ± 0.804 | 7.475 ± 2.325 | 1.938 ± 0.693 | 4.707 ± 0.964 | 4.983 ± 1.474 | 10.797 ± 1.734 | 4.707 ± 0.766 | 3.876 ± 0.747 | 3.876 ± 0.612 | 3.045 ± 0.987 | 5.537 ± 1.281 | 10.52 ± 2.093 | 4.43 ± 0.749 | 6.645 ± 1.224 | 1.107 ± 0.451 | 3.045 ± 0.909 | 0.0 ± 0.0 |
| Thr | 3.045 ± 0.764 | 0.277 ± 0.157 | 2.492 ± 0.714 | 3.876 ± 1.049 | 2.215 ± 0.689 | 1.107 ± 0.407 | 1.384 ± 0.36 | 4.43 ± 0.783 | 2.215 ± 0.516 | 5.537 ± 1.451 | 2.492 ± 0.573 | 1.938 ± 0.807 | 3.876 ± 1.886 | 0.554 ± 0.298 | 1.938 ± 0.84 | 6.091 ± 1.009 | 1.107 ± 0.629 | 4.43 ± 0.683 | 1.661 ± 0.346 | 1.661 ± 0.224 | 0.0 ± 0.0 |
| Val | 3.876 ± 1.691 | 1.938 ± 0.571 | 3.876 ± 1.188 | 2.215 ± 0.689 | 0.831 ± 0.471 | 4.153 ± 0.726 | 1.384 ± 0.36 | 3.876 ± 0.744 | 3.322 ± 0.944 | 4.983 ± 0.817 | 3.876 ± 1.241 | 3.876 ± 0.933 | 3.599 ± 0.661 | 3.045 ± 1.154 | 4.153 ± 0.651 | 8.306 ± 2.309 | 5.26 ± 1.247 | 3.876 ± 1.084 | 1.938 ± 0.937 | 1.661 ± 0.224 | 0.0 ± 0.0 |
| Trp | 0.554 ± 0.274 | 0.554 ± 0.314 | 0.277 ± 0.157 | 0.277 ± 0.157 | 0.277 ± 0.157 | 1.107 ± 0.433 | 0.277 ± 0.157 | 2.769 ± 1.032 | 0.277 ± 0.353 | 1.107 ± 0.494 | 0.277 ± 0.157 | 0.831 ± 0.438 | 0.831 ± 0.438 | 0.831 ± 0.5 | 1.384 ± 0.315 | 2.215 ± 0.607 | 1.384 ± 0.408 | 0.554 ± 0.274 | 0.277 ± 0.304 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 1.107 ± 0.518 | 0.831 ± 0.389 | 1.384 ± 0.615 | 0.554 ± 0.298 | 1.384 ± 0.786 | 2.215 ± 0.375 | 1.107 ± 0.433 | 1.938 ± 0.978 | 2.215 ± 0.634 | 1.107 ± 0.629 | 1.384 ± 0.373 | 1.107 ± 0.433 | 1.661 ± 0.679 | 0.831 ± 0.326 | 1.661 ± 0.705 | 3.599 ± 1.084 | 2.769 ± 0.692 | 3.045 ± 0.804 | 0.554 ± 0.274 | 0.831 ± 0.471 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 6 proteins (3613 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
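One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an assumption about the procedure, not the database's confirmed implementation; the function names, the use of the sample standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not taken from the proteome described here.
toy = ["ASSA", "SS", "MKT"]
print(pair_frequency(toy, "SS"), "±", bootstrap_error(toy, "SS"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why the error can be large for a proteome of only a few proteins.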

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski