Amino acid dipeptide frequency for Cherry mottle leaf virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
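As an illustration of the calculation described above, the following is a minimal sketch of how overlapping dipeptide frequencies could be counted. It is not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the one-letter alphabet, the mapping of every non-standard residue to `X` (Xaa), and the choice not to count pairs that span two proteins are all assumptions made for this example, and the input sequences are toy data.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return overlapping dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are mapped to X before counting.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins here.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not the real Cherry mottle leaf virus proteome.
freqs = dipeptide_frequencies(["MKLLSA", "MKBLS"])

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
uniform_expectation = 1000.0 / 400
```

With this convention the 441 values always sum to 1000‰, so each entry can be compared directly against the uniform expectation of 2.5‰.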

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
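The unit conversion can be sketched in a few lines. The helper names below are hypothetical, and comparing against exactly 2.5‰ (the uniform 0.25% expectation) is an assumption; the page does not state the precise threshold used for the red and blue marking. The example values are taken from the table (LeuSer, AlaAla, AlaTyr).

```python
# Uniform expectation for 400 dipeptides, expressed in per mille (0.25 %).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def relative_to_uniform(value):
    """Classify a per mille value against the assumed uniform baseline."""
    if value > UNIFORM_PER_MILLE:
        return "over"
    if value < UNIFORM_PER_MILLE:
        return "under"
    return "equal"


# Values copied from the table below.
leu_ser = per_mille_to_fraction(10.243)   # fraction of all dipeptides
ala_ala = per_mille_to_percent(2.656)     # percent of all dipeptides
ala_tyr_class = relative_to_uniform(0.379)
```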

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.656 ± 1.016
AlaCys: 1.138 ± 0.567
AlaAsp: 1.517 ± 0.405
AlaGlu: 3.414 ± 3.046
AlaPhe: 2.656 ± 0.532
AlaGly: 3.035 ± 1.126
AlaHis: 1.897 ± 0.514
AlaIle: 4.932 ± 0.35
AlaLys: 3.794 ± 1.496
AlaLeu: 6.07 ± 1.054
AlaMet: 1.897 ± 0.826
AlaAsn: 3.035 ± 2.522
AlaPro: 1.897 ± 1.088
AlaGln: 0.759 ± 0.378
AlaArg: 3.794 ± 2.462
AlaSer: 2.656 ± 0.857
AlaThr: 1.897 ± 0.766
AlaVal: 1.897 ± 0.748
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.379 ± 0.189
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.759 ± 0.832
CysCys: 0.379 ± 0.189
CysAsp: 0.759 ± 0.378
CysGlu: 0.379 ± 0.985
CysPhe: 1.517 ± 0.745
CysGly: 1.897 ± 0.762
CysHis: 0.379 ± 0.189
CysIle: 0.759 ± 0.378
CysLys: 1.138 ± 0.367
CysLeu: 3.414 ± 1.448
CysMet: 0.0 ± 0.0
CysAsn: 1.138 ± 0.367
CysPro: 0.379 ± 0.189
CysGln: 0.759 ± 0.378
CysArg: 1.897 ± 0.514
CysSer: 3.035 ± 0.742
CysThr: 1.897 ± 0.826
CysVal: 2.656 ± 1.38
CysTrp: 0.379 ± 0.539
CysTyr: 0.759 ± 0.832
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.276 ± 0.734
AspCys: 2.656 ± 0.75
AspAsp: 4.552 ± 1.035
AspGlu: 6.07 ± 0.441
AspPhe: 2.656 ± 0.885
AspGly: 4.552 ± 1.322
AspHis: 0.759 ± 0.378
AspIle: 4.173 ± 1.169
AspLys: 2.276 ± 0.734
AspLeu: 5.311 ± 1.158
AspMet: 1.138 ± 0.789
AspAsn: 1.517 ± 0.756
AspPro: 1.897 ± 0.766
AspGln: 0.759 ± 0.378
AspArg: 1.897 ± 0.826
AspSer: 5.69 ± 2.281
AspThr: 0.759 ± 0.421
AspVal: 3.794 ± 1.109
AspTrp: 1.138 ± 0.567
AspTyr: 2.656 ± 1.322
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.794 ± 2.496
GluCys: 0.379 ± 0.189
GluAsp: 4.932 ± 0.685
GluGlu: 4.932 ± 1.482
GluPhe: 4.932 ± 0.686
GluGly: 2.276 ± 0.734
GluHis: 1.517 ± 0.847
GluIle: 6.07 ± 0.847
GluLys: 7.587 ± 2.227
GluLeu: 5.311 ± 1.287
GluMet: 1.138 ± 0.785
GluAsn: 2.276 ± 0.597
GluPro: 2.276 ± 0.797
GluGln: 2.276 ± 0.597
GluArg: 4.552 ± 1.322
GluSer: 5.311 ± 1.891
GluThr: 1.517 ± 0.783
GluVal: 4.932 ± 1.472
GluTrp: 0.759 ± 0.832
GluTyr: 2.276 ± 1.577
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.552 ± 2.24
PheCys: 1.138 ± 0.785
PheAsp: 4.173 ± 0.522
PheGlu: 5.311 ± 1.602
PhePhe: 1.517 ± 0.405
PheGly: 2.656 ± 1.181
PheHis: 1.517 ± 0.756
PheIle: 3.794 ± 1.286
PheLys: 4.173 ± 1.148
PheLeu: 4.173 ± 0.839
PheMet: 0.379 ± 0.356
PheAsn: 3.794 ± 2.255
PhePro: 1.517 ± 0.756
PheGln: 1.897 ± 0.944
PheArg: 5.311 ± 3.751
PheSer: 5.311 ± 1.556
PheThr: 2.656 ± 1.322
PheVal: 1.517 ± 0.756
PheTrp: 0.0 ± 0.0
PheTyr: 1.138 ± 0.567
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.794 ± 1.01
GlyCys: 1.517 ± 0.405
GlyAsp: 5.311 ± 1.499
GlyGlu: 2.656 ± 0.857
GlyPhe: 2.656 ± 0.75
GlyGly: 3.035 ± 1.439
GlyHis: 0.379 ± 0.189
GlyIle: 3.794 ± 1.651
GlyLys: 7.208 ± 1.31
GlyLeu: 5.69 ± 1.978
GlyMet: 1.517 ± 0.431
GlyAsn: 3.794 ± 1.029
GlyPro: 1.138 ± 0.567
GlyGln: 1.138 ± 0.567
GlyArg: 3.035 ± 0.742
GlySer: 5.311 ± 2.361
GlyThr: 1.517 ± 0.745
GlyVal: 5.69 ± 1.805
GlyTrp: 1.517 ± 0.756
GlyTyr: 2.656 ± 0.7
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.379 ± 0.189
HisCys: 0.759 ± 1.078
HisAsp: 1.138 ± 0.567
HisGlu: 0.759 ± 0.378
HisPhe: 1.517 ± 0.847
HisGly: 0.379 ± 0.189
HisHis: 0.379 ± 0.189
HisIle: 2.276 ± 0.906
HisLys: 0.759 ± 0.421
HisLeu: 1.138 ± 0.567
HisMet: 0.379 ± 0.189
HisAsn: 0.759 ± 0.378
HisPro: 0.759 ± 0.378
HisGln: 1.517 ± 0.745
HisArg: 1.517 ± 0.405
HisSer: 1.897 ± 0.762
HisThr: 1.138 ± 0.789
HisVal: 1.138 ± 0.567
HisTrp: 0.759 ± 1.078
HisTyr: 0.379 ± 0.189
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.414 ± 3.503
IleCys: 3.414 ± 1.621
IleAsp: 3.794 ± 1.01
IleGlu: 5.311 ± 1.133
IlePhe: 3.794 ± 0.614
IleGly: 3.794 ± 1.356
IleHis: 2.276 ± 0.597
IleIle: 1.138 ± 0.567
IleLys: 5.311 ± 1.952
IleLeu: 6.829 ± 2.046
IleMet: 1.897 ± 0.514
IleAsn: 2.276 ± 2.495
IlePro: 1.517 ± 1.342
IleGln: 1.897 ± 0.748
IleArg: 1.897 ± 0.514
IleSer: 4.932 ± 1.867
IleThr: 2.656 ± 0.532
IleVal: 1.517 ± 0.745
IleTrp: 0.379 ± 0.189
IleTyr: 2.276 ± 1.133
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.414 ± 1.655
LysCys: 1.517 ± 0.756
LysAsp: 6.829 ± 1.754
LysGlu: 2.656 ± 0.825
LysPhe: 2.656 ± 1.322
LysGly: 7.208 ± 0.954
LysHis: 0.379 ± 0.189
LysIle: 4.552 ± 0.796
LysLys: 5.311 ± 0.822
LysLeu: 6.07 ± 2.169
LysMet: 1.517 ± 0.756
LysAsn: 5.69 ± 0.942
LysPro: 2.656 ± 1.322
LysGln: 2.276 ± 0.734
LysArg: 6.449 ± 1.709
LysSer: 8.725 ± 2.655
LysThr: 3.035 ± 0.997
LysVal: 4.932 ± 1.373
LysTrp: 0.379 ± 0.189
LysTyr: 1.517 ± 1.197
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.69 ± 1.43
LeuCys: 4.552 ± 2.012
LeuAsp: 4.932 ± 0.686
LeuGlu: 4.932 ± 1.908
LeuPhe: 4.932 ± 0.852
LeuGly: 5.311 ± 1.33
LeuHis: 0.379 ± 0.985
LeuIle: 6.07 ± 2.981
LeuLys: 6.07 ± 1.778
LeuLeu: 7.208 ± 1.81
LeuMet: 2.656 ± 0.75
LeuAsn: 6.449 ± 1.837
LeuPro: 3.035 ± 1.511
LeuGln: 2.656 ± 0.857
LeuArg: 6.07 ± 1.638
LeuSer: 10.243 ± 1.489
LeuThr: 3.794 ± 0.701
LeuVal: 3.414 ± 1.136
LeuTrp: 0.379 ± 0.189
LeuTyr: 1.897 ± 0.748
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.138 ± 0.567
MetCys: 0.759 ± 0.421
MetAsp: 1.138 ± 0.567
MetGlu: 1.897 ± 0.944
MetPhe: 0.759 ± 0.378
MetGly: 0.759 ± 0.378
MetHis: 0.379 ± 0.189
MetIle: 2.656 ± 1.38
MetLys: 1.897 ± 0.826
MetLeu: 0.0 ± 0.0
MetMet: 1.138 ± 1.499
MetAsn: 1.138 ± 0.785
MetPro: 1.517 ± 0.841
MetGln: 0.379 ± 0.539
MetArg: 3.414 ± 0.7
MetSer: 3.035 ± 1.816
MetThr: 0.379 ± 0.539
MetVal: 1.897 ± 0.944
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.517 ± 0.783
AsnCys: 1.138 ± 0.978
AsnAsp: 1.138 ± 0.948
AsnGlu: 3.414 ± 1.982
AsnPhe: 4.173 ± 0.89
AsnGly: 4.173 ± 3.045
AsnHis: 1.897 ± 0.762
AsnIle: 1.517 ± 1.744
AsnLys: 5.69 ± 1.164
AsnLeu: 7.208 ± 1.708
AsnMet: 1.897 ± 1.285
AsnAsn: 1.517 ± 1.197
AsnPro: 1.517 ± 1.939
AsnGln: 1.517 ± 0.745
AsnArg: 2.276 ± 1.004
AsnSer: 3.414 ± 1.291
AsnThr: 2.276 ± 0.734
AsnVal: 1.897 ± 0.514
AsnTrp: 0.379 ± 0.189
AsnTyr: 1.897 ± 0.514
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.379 ± 0.189
ProCys: 0.379 ± 0.189
ProAsp: 3.035 ± 0.997
ProGlu: 3.035 ± 1.49
ProPhe: 2.276 ± 0.734
ProGly: 1.138 ± 0.367
ProHis: 0.379 ± 0.189
ProIle: 3.414 ± 0.827
ProLys: 1.517 ± 0.856
ProLeu: 3.035 ± 1.001
ProMet: 1.138 ± 0.567
ProAsn: 2.276 ± 1.062
ProPro: 0.759 ± 0.378
ProGln: 0.759 ± 0.378
ProArg: 3.414 ± 0.6
ProSer: 2.276 ± 1.12
ProThr: 1.517 ± 0.847
ProVal: 1.517 ± 0.841
ProTrp: 0.759 ± 0.378
ProTyr: 1.138 ± 0.567
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.138 ± 0.789
GlnCys: 0.379 ± 0.189
GlnAsp: 1.897 ± 0.766
GlnGlu: 2.276 ± 1.133
GlnPhe: 0.759 ± 0.378
GlnGly: 2.276 ± 0.797
GlnHis: 0.759 ± 0.421
GlnIle: 1.138 ± 0.789
GlnLys: 1.897 ± 0.944
GlnLeu: 1.897 ± 0.944
GlnMet: 0.0 ± 0.0
GlnAsn: 0.379 ± 0.189
GlnPro: 1.138 ± 0.567
GlnGln: 1.138 ± 0.567
GlnArg: 2.276 ± 2.616
GlnSer: 4.932 ± 1.306
GlnThr: 2.276 ± 0.597
GlnVal: 2.276 ± 0.797
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.517 ± 0.405
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.656 ± 1.502
ArgCys: 1.517 ± 1.847
ArgAsp: 1.897 ± 0.514
ArgGlu: 4.932 ± 0.686
ArgPhe: 5.311 ± 1.133
ArgGly: 4.932 ± 1.472
ArgHis: 1.138 ± 0.785
ArgIle: 1.897 ± 0.826
ArgLys: 4.173 ± 1.242
ArgLeu: 9.105 ± 0.633
ArgMet: 1.138 ± 0.567
ArgAsn: 2.276 ± 2.495
ArgPro: 1.138 ± 1.851
ArgGln: 2.276 ± 0.597
ArgArg: 4.552 ± 1.667
ArgSer: 4.932 ± 2.126
ArgThr: 4.173 ± 1.092
ArgVal: 4.932 ± 1.426
ArgTrp: 0.379 ± 0.539
ArgTyr: 1.138 ± 0.567
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.311 ± 1.065
SerCys: 0.759 ± 0.378
SerAsp: 4.552 ± 1.372
SerGlu: 6.829 ± 2.007
SerPhe: 4.552 ± 1.413
SerGly: 6.07 ± 1.734
SerHis: 2.276 ± 1.133
SerIle: 6.829 ± 1.411
SerLys: 8.346 ± 2.4
SerLeu: 5.69 ± 3.455
SerMet: 1.138 ± 1.509
SerAsn: 3.035 ± 1.682
SerPro: 1.897 ± 0.514
SerGln: 4.932 ± 1.159
SerArg: 4.552 ± 1.201
SerSer: 7.587 ± 2.51
SerThr: 5.69 ± 3.945
SerVal: 4.173 ± 2.017
SerTrp: 0.759 ± 0.378
SerTyr: 4.552 ± 0.579
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.897 ± 0.944
ThrCys: 0.379 ± 0.189
ThrAsp: 1.517 ± 0.756
ThrGlu: 2.656 ± 2.645
ThrPhe: 4.552 ± 0.579
ThrGly: 3.035 ± 0.531
ThrHis: 1.138 ± 0.367
ThrIle: 1.897 ± 0.748
ThrLys: 3.035 ± 1.511
ThrLeu: 5.69 ± 2.833
ThrMet: 1.517 ± 0.841
ThrAsn: 3.414 ± 4.782
ThrPro: 3.035 ± 0.859
ThrGln: 1.517 ± 0.756
ThrArg: 2.276 ± 0.661
ThrSer: 1.897 ± 1.285
ThrThr: 1.897 ± 0.748
ThrVal: 1.517 ± 0.847
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.759 ± 1.078
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.035 ± 1.511
ValCys: 0.759 ± 0.378
ValAsp: 1.897 ± 1.364
ValGlu: 5.311 ± 1.649
ValPhe: 3.414 ± 0.906
ValGly: 2.656 ± 1.461
ValHis: 1.517 ± 0.841
ValIle: 1.517 ± 1.197
ValLys: 5.311 ± 1.399
ValLeu: 2.276 ± 0.706
ValMet: 2.276 ± 1.747
ValAsn: 4.173 ± 0.831
ValPro: 3.035 ± 0.81
ValGln: 1.517 ± 0.745
ValArg: 2.656 ± 0.947
ValSer: 4.932 ± 1.375
ValThr: 3.035 ± 0.81
ValVal: 3.794 ± 2.103
ValTrp: 0.379 ± 0.189
ValTyr: 3.035 ± 1.126
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.379 ± 0.189
TrpAsp: 0.379 ± 0.189
TrpGlu: 0.379 ± 0.539
TrpPhe: 0.759 ± 0.378
TrpGly: 0.759 ± 0.421
TrpHis: 0.0 ± 0.0
TrpIle: 0.379 ± 0.189
TrpLys: 0.379 ± 0.189
TrpLeu: 0.759 ± 0.378
TrpMet: 0.0 ± 0.0
TrpAsn: 0.379 ± 0.539
TrpPro: 0.759 ± 0.378
TrpGln: 0.0 ± 0.0
TrpArg: 0.759 ± 0.378
TrpSer: 1.517 ± 0.756
TrpThr: 0.0 ± 0.0
TrpVal: 1.138 ± 0.978
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.517 ± 0.745
TyrCys: 0.379 ± 0.189
TyrAsp: 1.138 ± 0.567
TyrGlu: 1.897 ± 0.944
TyrPhe: 1.517 ± 0.405
TyrGly: 3.414 ± 0.807
TyrHis: 0.379 ± 0.189
TyrIle: 1.517 ± 0.405
TyrLys: 1.897 ± 0.514
TyrLeu: 3.794 ± 1.356
TyrMet: 1.138 ± 0.567
TyrAsn: 1.138 ± 0.789
TyrPro: 2.276 ± 0.597
TyrGln: 0.379 ± 0.189
TyrArg: 1.897 ± 2.216
TyrSer: 1.897 ± 0.514
TyrThr: 1.517 ± 0.745
TyrVal: 1.897 ± 0.514
TyrTrp: 0.379 ± 0.189
TyrTyr: 0.379 ± 0.189
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2637 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
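One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the database's actual code; the function names, the fixed seed, and the use of the population standard deviation are all choices made for this example, and the sequences are toy data.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resampling unit is the protein, not the individual residue or dipeptide.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Toy sequences, not the real proteome.
toy_proteins = ["MKLLSA", "MKALS", "LLLLLL", "ASASAS"]
error_ll = bootstrap_error(toy_proteins, "LL")
error_ww = bootstrap_error(toy_proteins, "WW")
```

A dipeptide that occurs in no protein has a frequency of zero in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.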

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski