Amino acid dipeptide frequency for Cherry necrotic rusty mottle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
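The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_fractions` and `label` are hypothetical, overlapping pairs are counted within each protein only (an assumption), and the red/blue decision is approximated by a plain comparison with the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes, same order as the table columns
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_fractions(sequences):
    """Return {dipeptide: fraction} for all 441 possible pairs (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Map every residue outside the 20 standard ones to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; assumption: pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


EXPECTED = 1 / 400  # uniform expectation over the 20 x 20 standard pairs (0.25%)


def label(fraction):
    """Assumed colouring rule: compare a fraction with the uniform expectation."""
    if fraction > EXPECTED:
        return "over"   # would be shown in red
    if fraction < EXPECTED:
        return "under"  # would be shown in blue
    return "expected"


if __name__ == "__main__":
    toy = ["MKLLSA", "ALLKB"]  # toy sequences, not taken from the proteome
    fractions = dipeptide_fractions(toy)
    print(fractions["LL"], label(fractions["LL"]))
```

The dictionary always contains all 441 keys, so dipeptides that never occur (such as every Xaa pair in this proteome) show up with a fraction of 0.0 rather than being missing.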

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
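As a small worked example of the unit conversion, the snippet below converts one table value (LeuLeu, 10.724‰) to a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name `permille_to_fraction` is only illustrative.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


EXPECTED_PERMILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%

leu_leu_permille = 10.724  # LeuLeu value from the table
leu_leu_fraction = permille_to_fraction(leu_leu_permille)  # about 0.010724
leu_leu_percent = leu_leu_fraction * 100                   # about 1.0724 %
fold_over_expected = leu_leu_permille / EXPECTED_PERMILLE  # about 4.29

if __name__ == "__main__":
    print(leu_leu_fraction, leu_leu_percent, fold_over_expected)
```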

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.362 ± 2.349
AlaCys: 1.005 ± 0.591
AlaAsp: 2.681 ± 1.032
AlaGlu: 4.357 ± 1.48
AlaPhe: 3.686 ± 2.363
AlaGly: 3.686 ± 1.06
AlaHis: 1.005 ± 1.113
AlaIle: 3.686 ± 1.237
AlaLys: 4.692 ± 1.393
AlaLeu: 5.362 ± 1.008
AlaMet: 0.0 ± 0.0
AlaAsn: 4.357 ± 0.799
AlaPro: 2.681 ± 1.063
AlaGln: 2.681 ± 0.703
AlaArg: 2.011 ± 1.093
AlaSer: 4.357 ± 1.674
AlaThr: 3.351 ± 1.28
AlaVal: 4.357 ± 1.725
AlaTrp: 0.335 ± 0.182
AlaTyr: 1.34 ± 0.692
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.005 ± 1.326
CysCys: 0.0 ± 0.0
CysAsp: 1.005 ± 0.546
CysGlu: 0.67 ± 1.127
CysPhe: 2.681 ± 0.741
CysGly: 1.005 ± 0.493
CysHis: 0.0 ± 0.0
CysIle: 1.676 ± 0.618
CysLys: 0.67 ± 0.52
CysLeu: 2.011 ± 0.961
CysMet: 0.0 ± 0.0
CysAsn: 0.67 ± 0.896
CysPro: 0.67 ± 0.364
CysGln: 0.335 ± 0.604
CysArg: 0.335 ± 0.182
CysSer: 2.681 ± 1.009
CysThr: 1.676 ± 0.733
CysVal: 2.346 ± 1.827
CysTrp: 0.0 ± 0.0
CysTyr: 0.67 ± 0.482
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.346 ± 0.902
AspCys: 1.34 ± 0.531
AspAsp: 4.357 ± 1.651
AspGlu: 4.357 ± 1.409
AspPhe: 3.686 ± 1.365
AspGly: 2.011 ± 0.681
AspHis: 1.005 ± 0.493
AspIle: 1.34 ± 0.964
AspLys: 2.681 ± 1.032
AspLeu: 5.697 ± 1.448
AspMet: 0.335 ± 0.182
AspAsn: 2.681 ± 1.062
AspPro: 3.351 ± 0.849
AspGln: 0.67 ± 0.689
AspArg: 2.011 ± 1.093
AspSer: 4.692 ± 2.034
AspThr: 2.346 ± 0.646
AspVal: 1.005 ± 0.465
AspTrp: 1.34 ± 0.516
AspTyr: 2.346 ± 0.916
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.357 ± 1.465
GluCys: 0.67 ± 0.364
GluAsp: 2.346 ± 0.892
GluGlu: 6.032 ± 1.342
GluPhe: 2.346 ± 1.035
GluGly: 3.686 ± 0.633
GluHis: 1.005 ± 0.465
GluIle: 5.362 ± 3.065
GluLys: 5.697 ± 2.15
GluLeu: 5.362 ± 1.394
GluMet: 2.346 ± 0.933
GluAsn: 2.346 ± 0.944
GluPro: 2.346 ± 1.229
GluGln: 2.681 ± 1.457
GluArg: 3.351 ± 1.088
GluSer: 5.362 ± 1.514
GluThr: 1.676 ± 0.611
GluVal: 5.362 ± 1.976
GluTrp: 0.0 ± 0.0
GluTyr: 1.34 ± 0.671
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.011 ± 0.93
PheCys: 1.34 ± 0.516
PheAsp: 6.032 ± 1.679
PheGlu: 3.686 ± 1.088
PhePhe: 2.346 ± 0.902
PheGly: 3.016 ± 1.479
PheHis: 1.34 ± 0.531
PheIle: 5.027 ± 1.585
PheLys: 4.692 ± 1.625
PheLeu: 8.378 ± 2.563
PheMet: 1.34 ± 0.589
PheAsn: 3.351 ± 0.728
PhePro: 2.346 ± 1.472
PheGln: 1.005 ± 0.546
PheArg: 2.011 ± 1.112
PheSer: 9.383 ± 1.823
PheThr: 3.016 ± 1.219
PheVal: 2.681 ± 1.253
PheTrp: 0.335 ± 0.182
PheTyr: 0.335 ± 0.182
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.011 ± 1.824
GlyCys: 1.34 ± 0.896
GlyAsp: 4.357 ± 1.461
GlyGlu: 3.351 ± 1.276
GlyPhe: 3.351 ± 1.57
GlyGly: 3.351 ± 1.554
GlyHis: 0.67 ± 0.364
GlyIle: 1.34 ± 0.729
GlyLys: 4.357 ± 1.636
GlyLeu: 5.027 ± 1.834
GlyMet: 1.005 ± 0.627
GlyAsn: 2.011 ± 1.093
GlyPro: 2.681 ± 1.216
GlyGln: 3.016 ± 0.754
GlyArg: 4.021 ± 1.153
GlySer: 4.021 ± 1.287
GlyThr: 3.686 ± 0.884
GlyVal: 3.686 ± 2.25
GlyTrp: 0.67 ± 0.364
GlyTyr: 2.011 ± 1.093
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.005 ± 0.546
HisCys: 0.67 ± 0.364
HisAsp: 0.67 ± 0.364
HisGlu: 1.34 ± 0.729
HisPhe: 1.676 ± 0.722
HisGly: 1.676 ± 0.721
HisHis: 1.34 ± 0.79
HisIle: 0.0 ± 0.0
HisLys: 1.676 ± 0.911
HisLeu: 2.346 ± 1.025
HisMet: 0.0 ± 0.0
HisAsn: 1.676 ± 0.618
HisPro: 0.67 ± 0.52
HisGln: 0.67 ± 0.52
HisArg: 1.34 ± 1.254
HisSer: 3.351 ± 1.985
HisThr: 0.335 ± 0.182
HisVal: 0.67 ± 0.52
HisTrp: 0.0 ± 0.0
HisTyr: 1.676 ± 0.679
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.681 ± 1.455
IleCys: 2.011 ± 1.915
IleAsp: 1.676 ± 0.911
IleGlu: 4.357 ± 1.651
IlePhe: 4.357 ± 1.465
IleGly: 3.351 ± 2.444
IleHis: 0.67 ± 0.364
IleIle: 1.34 ± 0.896
IleLys: 4.357 ± 1.465
IleLeu: 6.032 ± 1.097
IleMet: 1.005 ± 0.582
IleAsn: 4.357 ± 0.966
IlePro: 2.681 ± 0.901
IleGln: 1.676 ± 0.653
IleArg: 3.686 ± 1.323
IleSer: 6.702 ± 1.778
IleThr: 3.016 ± 2.209
IleVal: 3.351 ± 0.873
IleTrp: 0.335 ± 0.561
IleTyr: 1.005 ± 0.891
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.032 ± 1.594
LysCys: 1.005 ± 0.868
LysAsp: 3.016 ± 1.23
LysGlu: 5.697 ± 1.831
LysPhe: 3.686 ± 0.816
LysGly: 3.686 ± 2.004
LysHis: 1.34 ± 0.531
LysIle: 3.686 ± 1.152
LysLys: 7.373 ± 2.535
LysLeu: 7.038 ± 2.228
LysMet: 0.67 ± 0.364
LysAsn: 3.016 ± 1.23
LysPro: 1.34 ± 0.896
LysGln: 2.011 ± 1.254
LysArg: 4.021 ± 1.736
LysSer: 4.357 ± 1.477
LysThr: 4.021 ± 1.504
LysVal: 5.027 ± 1.531
LysTrp: 0.0 ± 0.0
LysTyr: 2.011 ± 0.752
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.043 ± 1.391
LeuCys: 1.34 ± 0.61
LeuAsp: 4.021 ± 0.828
LeuGlu: 5.697 ± 2.066
LeuPhe: 4.357 ± 1.478
LeuGly: 7.038 ± 1.629
LeuHis: 3.351 ± 1.628
LeuIle: 8.378 ± 4.25
LeuLys: 8.378 ± 1.827
LeuLeu: 10.724 ± 5.798
LeuMet: 3.351 ± 1.306
LeuAsn: 5.362 ± 1.401
LeuPro: 5.362 ± 1.879
LeuGln: 4.021 ± 3.437
LeuArg: 5.027 ± 1.022
LeuSer: 11.729 ± 3.632
LeuThr: 4.357 ± 4.455
LeuVal: 5.697 ± 2.313
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.681 ± 2.614
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.011 ± 0.752
MetCys: 0.335 ± 0.182
MetAsp: 0.335 ± 0.182
MetGlu: 2.011 ± 0.668
MetPhe: 0.335 ± 0.182
MetGly: 1.005 ± 0.546
MetHis: 0.0 ± 0.0
MetIle: 1.005 ± 0.465
MetLys: 1.34 ± 0.615
MetLeu: 2.011 ± 1.68
MetMet: 0.67 ± 1.577
MetAsn: 2.011 ± 1.093
MetPro: 0.335 ± 0.182
MetGln: 1.005 ± 0.546
MetArg: 1.34 ± 0.825
MetSer: 1.676 ± 0.805
MetThr: 1.676 ± 0.653
MetVal: 0.67 ± 0.364
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.346 ± 0.892
AsnCys: 1.34 ± 0.763
AsnAsp: 2.346 ± 0.965
AsnGlu: 2.011 ± 0.752
AsnPhe: 5.362 ± 1.223
AsnGly: 0.335 ± 0.709
AsnHis: 1.34 ± 0.703
AsnIle: 2.011 ± 0.93
AsnLys: 2.681 ± 0.735
AsnLeu: 8.043 ± 1.392
AsnMet: 1.005 ± 0.61
AsnAsn: 1.34 ± 0.516
AsnPro: 2.681 ± 0.921
AsnGln: 1.34 ± 0.703
AsnArg: 1.34 ± 0.729
AsnSer: 4.021 ± 2.048
AsnThr: 1.34 ± 0.815
AsnVal: 3.016 ± 0.565
AsnTrp: 1.005 ± 0.868
AsnTyr: 2.681 ± 1.062
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.681 ± 1.063
ProCys: 1.34 ± 0.531
ProAsp: 3.016 ± 0.828
ProGlu: 0.67 ± 1.061
ProPhe: 1.34 ± 0.729
ProGly: 2.681 ± 1.028
ProHis: 1.005 ± 1.104
ProIle: 2.011 ± 1.295
ProLys: 2.011 ± 0.93
ProLeu: 3.016 ± 2.628
ProMet: 1.005 ± 0.465
ProAsn: 0.67 ± 0.689
ProPro: 1.676 ± 1.587
ProGln: 1.005 ± 1.211
ProArg: 2.346 ± 0.902
ProSer: 3.351 ± 1.554
ProThr: 5.027 ± 2.333
ProVal: 2.681 ± 0.952
ProTrp: 0.67 ± 0.364
ProTyr: 1.005 ± 0.493
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.34 ± 0.615
GlnCys: 0.335 ± 0.182
GlnAsp: 1.676 ± 0.622
GlnGlu: 2.346 ± 1.503
GlnPhe: 2.346 ± 0.714
GlnGly: 1.005 ± 0.546
GlnHis: 1.005 ± 0.591
GlnIle: 1.005 ± 0.703
GlnLys: 1.676 ± 0.911
GlnLeu: 4.021 ± 2.034
GlnMet: 1.34 ± 0.589
GlnAsn: 2.346 ± 0.661
GlnPro: 2.346 ± 2.419
GlnGln: 0.335 ± 0.182
GlnArg: 2.346 ± 0.604
GlnSer: 2.681 ± 1.049
GlnThr: 1.005 ± 0.627
GlnVal: 1.34 ± 1.587
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.005 ± 0.546
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.351 ± 1.794
ArgCys: 1.005 ± 0.546
ArgAsp: 3.016 ± 0.884
ArgGlu: 2.346 ± 0.902
ArgPhe: 4.021 ± 1.224
ArgGly: 3.351 ± 1.476
ArgHis: 0.67 ± 0.364
ArgIle: 3.351 ± 0.989
ArgLys: 3.686 ± 1.23
ArgLeu: 5.697 ± 1.999
ArgMet: 0.67 ± 0.364
ArgAsn: 3.351 ± 0.885
ArgPro: 1.676 ± 2.204
ArgGln: 1.005 ± 0.627
ArgArg: 4.692 ± 3.709
ArgSer: 4.021 ± 2.237
ArgThr: 1.676 ± 0.975
ArgVal: 2.011 ± 0.961
ArgTrp: 0.335 ± 0.182
ArgTyr: 2.346 ± 1.275
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.021 ± 0.974
SerCys: 1.34 ± 0.896
SerAsp: 4.692 ± 1.201
SerGlu: 5.362 ± 2.458
SerPhe: 5.697 ± 1.392
SerGly: 6.032 ± 1.245
SerHis: 3.016 ± 1.198
SerIle: 5.697 ± 1.649
SerLys: 5.027 ± 1.171
SerLeu: 12.399 ± 6.667
SerMet: 2.681 ± 1.113
SerAsn: 5.027 ± 1.524
SerPro: 2.681 ± 1.052
SerGln: 2.011 ± 0.786
SerArg: 5.362 ± 1.434
SerSer: 8.713 ± 2.321
SerThr: 3.016 ± 2.091
SerVal: 4.357 ± 1.75
SerTrp: 1.34 ± 0.615
SerTyr: 4.357 ± 1.364
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.021 ± 2.133
ThrCys: 1.34 ± 0.729
ThrAsp: 1.005 ± 0.861
ThrGlu: 3.016 ± 0.853
ThrPhe: 8.043 ± 0.849
ThrGly: 3.351 ± 1.913
ThrHis: 1.005 ± 0.591
ThrIle: 3.016 ± 1.164
ThrLys: 3.016 ± 0.822
ThrLeu: 5.362 ± 1.187
ThrMet: 0.335 ± 0.182
ThrAsn: 0.67 ± 0.52
ThrPro: 1.34 ± 0.825
ThrGln: 1.34 ± 0.615
ThrArg: 2.346 ± 1.784
ThrSer: 3.686 ± 2.268
ThrThr: 2.011 ± 2.569
ThrVal: 2.346 ± 2.39
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.005 ± 0.493
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.346 ± 2.244
ValCys: 1.005 ± 0.891
ValAsp: 1.676 ± 0.733
ValGlu: 2.346 ± 1.433
ValPhe: 3.686 ± 0.96
ValGly: 4.357 ± 1.17
ValHis: 2.011 ± 0.694
ValIle: 7.038 ± 1.859
ValLys: 3.351 ± 1.4
ValLeu: 5.362 ± 3.034
ValMet: 1.34 ± 0.615
ValAsn: 1.005 ± 1.154
ValPro: 1.34 ± 0.671
ValGln: 3.351 ± 1.01
ValArg: 3.016 ± 0.828
ValSer: 5.362 ± 1.233
ValThr: 3.686 ± 1.906
ValVal: 4.021 ± 1.525
ValTrp: 0.67 ± 0.482
ValTyr: 1.005 ± 0.546
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.005 ± 0.627
TrpCys: 0.0 ± 0.0
TrpAsp: 0.335 ± 0.561
TrpGlu: 0.335 ± 0.182
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.335 ± 0.182
TrpIle: 0.0 ± 0.0
TrpLys: 0.335 ± 0.182
TrpLeu: 1.005 ± 0.546
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.335 ± 0.182
TrpGln: 0.67 ± 1.009
TrpArg: 0.335 ± 0.182
TrpSer: 0.67 ± 0.364
TrpThr: 0.0 ± 0.0
TrpVal: 1.676 ± 0.618
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.335 ± 0.561
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.016 ± 0.884
TyrCys: 1.34 ± 1.312
TyrAsp: 1.34 ± 0.729
TyrGlu: 3.686 ± 1.142
TyrPhe: 0.67 ± 0.364
TyrGly: 1.34 ± 0.531
TyrHis: 0.67 ± 0.482
TyrIle: 1.676 ± 0.911
TyrLys: 1.34 ± 0.729
TyrLeu: 4.021 ± 0.975
TyrMet: 0.335 ± 0.182
TyrAsn: 1.34 ± 1.912
TyrPro: 0.67 ± 0.364
TyrGln: 0.67 ± 0.52
TyrArg: 1.34 ± 0.79
TyrSer: 2.346 ± 0.714
TyrThr: 1.34 ± 0.692
TyrVal: 1.676 ± 0.722
TyrTrp: 0.335 ± 0.182
TyrTyr: 0.67 ± 0.364
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2985 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
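One way to read this note is sketched below in Python. It is only an assumption about the procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is reported as the error. The function names and the use of the standard deviation are illustrative choices, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def frequency_permille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; assumption: pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Assumed error estimate: standard deviation over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(frequency_permille(resample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKLLSA", "ALLKLL", "SLSLAA"]  # toy sequences, not taken from the proteome
    print(frequency_permille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a small proteome with a few dissimilar proteins can show large errors relative to the values themselves.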

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski