Amino acid dipeptide frequency for Melon yellowing-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be expected at a frequency of 0.25% (1/400). As this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones in blue, for better visibility.
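To make the counting concrete, here is a minimal Python sketch of how dipeptide frequencies of this kind could be computed. It is an illustration under stated assumptions, not the database's actual pipeline: the function name `dipeptide_per_mille`, the toy sequences, the use of one-letter codes (with `X` standing for Xaa), and the choice not to count pairs that span two proteins are all assumptions made here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X plays the role of Xaa (assumption)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins here.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences too short?)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy_proteins = ["MKVLAAGK", "MAAGKXV"]  # hypothetical toy sequences
freqs = dipeptide_per_mille(toy_proteins)
overrepresented = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PER_MILLE)
print(len(freqs), overrepresented)
```

With real proteomes the frequencies would be far smaller than in this toy example, because the counts are spread over thousands of residue pairs.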

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
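As a small worked example of the unit conversion, the sketch below turns two per-mille values from the table into fractions and percentages and compares them with the uniform expectation of 0.25% (2.5‰) mentioned above. The helper names are hypothetical, and the simple threshold rule is an assumption; the colour coding on the original page may use a different criterion.

```python
UNIFORM_EXPECTATION_PER_MILLE = 2.5  # 0.25 % = 1/400, as stated in the text


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


def classify(value, expected=UNIFORM_EXPECTATION_PER_MILLE):
    """Label a per-mille value relative to the expected one (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the Ala row of the table.
for name, value in [("AlaAla", 3.012), ("AlaCys", 1.004)]:
    print(name, per_mille_to_fraction(value), per_mille_to_percent(value), classify(value))
```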

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.012 ± 1.347
AlaCys: 1.004 ± 0.422
AlaAsp: 2.008 ± 0.942
AlaGlu: 2.008 ± 0.54
AlaPhe: 2.343 ± 0.835
AlaGly: 2.343 ± 0.526
AlaHis: 1.004 ± 0.996
AlaIle: 5.02 ± 1.412
AlaLys: 6.359 ± 2.09
AlaLeu: 4.685 ± 1.407
AlaMet: 1.004 ± 0.427
AlaAsn: 4.016 ± 1.452
AlaPro: 1.339 ± 0.913
AlaGln: 3.012 ± 1.008
AlaArg: 3.012 ± 1.532
AlaSer: 2.677 ± 0.719
AlaThr: 2.677 ± 0.921
AlaVal: 1.004 ± 0.541
AlaTrp: 0.335 ± 0.18
AlaTyr: 1.339 ± 1.354
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.004 ± 0.806
CysCys: 0.335 ± 0.18
CysAsp: 1.339 ± 1.147
CysGlu: 2.008 ± 0.959
CysPhe: 1.004 ± 0.541
CysGly: 2.677 ± 1.083
CysHis: 0.0 ± 0.0
CysIle: 2.343 ± 1.262
CysLys: 1.339 ± 0.721
CysLeu: 2.008 ± 0.825
CysMet: 0.0 ± 0.0
CysAsn: 1.339 ± 0.599
CysPro: 0.669 ± 0.824
CysGln: 1.339 ± 0.599
CysArg: 0.669 ± 1.002
CysSer: 1.339 ± 0.461
CysThr: 1.339 ± 1.096
CysVal: 3.012 ± 2.499
CysTrp: 0.0 ± 0.0
CysTyr: 1.339 ± 0.721
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.343 ± 1.116
AspCys: 2.677 ± 1.442
AspAsp: 2.343 ± 1.262
AspGlu: 5.355 ± 1.75
AspPhe: 5.02 ± 1.162
AspGly: 2.343 ± 0.917
AspHis: 1.339 ± 1.266
AspIle: 5.689 ± 2.047
AspLys: 3.347 ± 2.016
AspLeu: 4.351 ± 1.554
AspMet: 0.669 ± 0.36
AspAsn: 2.343 ± 1.262
AspPro: 1.339 ± 0.719
AspGln: 0.0 ± 0.0
AspArg: 3.012 ± 0.974
AspSer: 3.681 ± 1.073
AspThr: 2.677 ± 0.921
AspVal: 3.347 ± 1.519
AspTrp: 1.004 ± 0.796
AspTyr: 3.012 ± 1.136
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.347 ± 1.335
GluCys: 1.339 ± 0.599
GluAsp: 4.685 ± 1.052
GluGlu: 6.693 ± 1.571
GluPhe: 3.012 ± 1.166
GluGly: 5.355 ± 1.798
GluHis: 1.673 ± 0.637
GluIle: 6.024 ± 2.741
GluLys: 4.016 ± 1.091
GluLeu: 5.355 ± 1.089
GluMet: 2.008 ± 0.915
GluAsn: 3.347 ± 0.975
GluPro: 2.343 ± 0.526
GluGln: 1.339 ± 0.913
GluArg: 3.681 ± 0.776
GluSer: 5.355 ± 1.546
GluThr: 1.339 ± 0.922
GluVal: 4.685 ± 1.773
GluTrp: 0.335 ± 0.551
GluTyr: 2.677 ± 1.198
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.681 ± 1.242
PheCys: 1.004 ± 0.613
PheAsp: 4.685 ± 1.102
PheGlu: 5.02 ± 1.207
PhePhe: 2.343 ± 1.262
PheGly: 4.685 ± 1.669
PheHis: 0.335 ± 0.18
PheIle: 2.677 ± 1.198
PheLys: 5.02 ± 1.083
PheLeu: 5.689 ± 1.707
PheMet: 1.673 ± 0.875
PheAsn: 1.004 ± 0.541
PhePro: 2.008 ± 0.54
PheGln: 3.012 ± 0.948
PheArg: 3.681 ± 1.507
PheSer: 5.355 ± 1.205
PheThr: 1.339 ± 0.721
PheVal: 5.355 ± 1.645
PheTrp: 0.335 ± 0.778
PheTyr: 1.004 ± 0.541
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.673 ± 0.901
GlyCys: 1.339 ± 0.922
GlyAsp: 4.685 ± 1.102
GlyGlu: 4.685 ± 1.052
GlyPhe: 2.008 ± 0.69
GlyGly: 3.347 ± 1.059
GlyHis: 0.335 ± 0.18
GlyIle: 3.012 ± 1.651
GlyLys: 3.012 ± 1.322
GlyLeu: 6.693 ± 1.941
GlyMet: 1.339 ± 0.721
GlyAsn: 3.681 ± 1.105
GlyPro: 1.004 ± 0.541
GlyGln: 2.008 ± 0.844
GlyArg: 2.677 ± 0.571
GlySer: 7.028 ± 3.112
GlyThr: 3.347 ± 1.686
GlyVal: 4.351 ± 1.843
GlyTrp: 1.673 ± 0.843
GlyTyr: 2.677 ± 0.969
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.339 ± 0.599
HisCys: 0.335 ± 0.922
HisAsp: 1.339 ± 0.461
HisGlu: 0.335 ± 0.18
HisPhe: 1.339 ± 0.461
HisGly: 0.669 ± 0.824
HisHis: 1.004 ± 0.541
HisIle: 1.339 ± 0.599
HisLys: 1.004 ± 0.422
HisLeu: 2.008 ± 0.72
HisMet: 0.0 ± 0.0
HisAsn: 1.004 ± 0.853
HisPro: 0.669 ± 0.36
HisGln: 0.335 ± 0.18
HisArg: 0.669 ± 0.677
HisSer: 2.343 ± 1.116
HisThr: 1.339 ± 0.599
HisVal: 2.343 ± 1.801
HisTrp: 0.0 ± 0.0
HisTyr: 0.335 ± 0.18
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.016 ± 1.804
IleCys: 3.012 ± 1.224
IleAsp: 4.016 ± 1.083
IleGlu: 4.016 ± 0.706
IlePhe: 3.347 ± 1.045
IleGly: 3.347 ± 0.945
IleHis: 1.673 ± 0.901
IleIle: 2.343 ± 0.865
IleLys: 5.689 ± 0.843
IleLeu: 5.355 ± 1.393
IleMet: 1.004 ± 0.541
IleAsn: 3.347 ± 1.511
IlePro: 1.673 ± 0.859
IleGln: 2.343 ± 1.262
IleArg: 4.351 ± 1.379
IleSer: 5.02 ± 1.262
IleThr: 3.012 ± 0.687
IleVal: 3.681 ± 1.961
IleTrp: 0.335 ± 0.18
IleTyr: 2.343 ± 1.819
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.685 ± 1.563
LysCys: 0.335 ± 0.922
LysAsp: 3.681 ± 1.242
LysGlu: 4.016 ± 1.091
LysPhe: 4.351 ± 1.716
LysGly: 5.355 ± 2.251
LysHis: 2.008 ± 0.72
LysIle: 2.677 ± 0.752
LysLys: 4.351 ± 1.081
LysLeu: 6.693 ± 2.441
LysMet: 2.008 ± 0.851
LysAsn: 3.681 ± 1.983
LysPro: 3.347 ± 0.636
LysGln: 1.339 ± 0.812
LysArg: 3.681 ± 1.618
LysSer: 6.359 ± 2.304
LysThr: 5.02 ± 1.475
LysVal: 6.359 ± 0.6
LysTrp: 0.669 ± 0.457
LysTyr: 4.351 ± 1.516
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.02 ± 1.073
LeuCys: 0.669 ± 0.824
LeuAsp: 7.028 ± 1.556
LeuGlu: 4.351 ± 1.464
LeuPhe: 3.681 ± 0.93
LeuGly: 4.351 ± 1.884
LeuHis: 0.669 ± 0.36
LeuIle: 9.036 ± 4.117
LeuLys: 7.028 ± 1.689
LeuLeu: 5.355 ± 0.919
LeuMet: 2.343 ± 0.73
LeuAsn: 6.359 ± 1.445
LeuPro: 3.012 ± 1.57
LeuGln: 3.012 ± 1.202
LeuArg: 4.351 ± 1.244
LeuSer: 8.701 ± 1.635
LeuThr: 7.028 ± 2.825
LeuVal: 6.693 ± 0.931
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.677 ± 1.068
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.669 ± 0.457
MetCys: 0.335 ± 0.18
MetAsp: 1.004 ± 1.203
MetGlu: 2.008 ± 0.69
MetPhe: 1.004 ± 0.422
MetGly: 2.008 ± 1.596
MetHis: 0.0 ± 0.0
MetIle: 2.008 ± 0.83
MetLys: 2.343 ± 0.84
MetLeu: 1.339 ± 0.721
MetMet: 0.0 ± 0.0
MetAsn: 0.335 ± 0.18
MetPro: 0.669 ± 0.36
MetGln: 1.339 ± 0.461
MetArg: 0.669 ± 0.457
MetSer: 0.669 ± 0.824
MetThr: 1.339 ± 0.721
MetVal: 2.008 ± 1.081
MetTrp: 0.335 ± 0.767
MetTyr: 0.335 ± 0.18
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.339 ± 0.461
AsnCys: 2.008 ± 1.613
AsnAsp: 2.008 ± 0.69
AsnGlu: 3.681 ± 1.354
AsnPhe: 4.685 ± 1.939
AsnGly: 4.016 ± 2.222
AsnHis: 0.669 ± 0.457
AsnIle: 1.004 ± 0.422
AsnLys: 2.008 ± 0.54
AsnLeu: 7.697 ± 1.451
AsnMet: 1.004 ± 0.422
AsnAsn: 2.343 ± 1.099
AsnPro: 1.673 ± 0.86
AsnGln: 1.004 ± 0.541
AsnArg: 5.02 ± 2.209
AsnSer: 3.012 ± 2.153
AsnThr: 4.016 ± 1.094
AsnVal: 2.677 ± 1.186
AsnTrp: 0.669 ± 0.457
AsnTyr: 1.673 ± 0.843
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.677 ± 1.173
ProCys: 1.004 ± 0.541
ProAsp: 2.677 ± 0.719
ProGlu: 3.012 ± 0.751
ProPhe: 2.343 ± 0.526
ProGly: 1.004 ± 0.613
ProHis: 1.339 ± 2.003
ProIle: 2.343 ± 0.84
ProLys: 1.339 ± 0.812
ProLeu: 3.012 ± 1.044
ProMet: 0.669 ± 0.36
ProAsn: 2.008 ± 0.95
ProPro: 1.339 ± 0.916
ProGln: 1.673 ± 0.88
ProArg: 2.343 ± 1.026
ProSer: 3.012 ± 1.166
ProThr: 1.004 ± 0.422
ProVal: 0.669 ± 1.002
ProTrp: 0.335 ± 0.18
ProTyr: 0.669 ± 0.36
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.004 ± 0.996
GlnCys: 0.335 ± 0.18
GlnAsp: 1.004 ± 0.806
GlnGlu: 1.673 ± 0.901
GlnPhe: 0.669 ± 0.677
GlnGly: 2.677 ± 1.011
GlnHis: 0.0 ± 0.0
GlnIle: 1.004 ± 1.411
GlnLys: 2.677 ± 1.0
GlnLeu: 2.677 ± 1.0
GlnMet: 0.669 ± 0.457
GlnAsn: 2.343 ± 1.218
GlnPro: 1.004 ± 0.541
GlnGln: 0.669 ± 1.002
GlnArg: 1.339 ± 0.808
GlnSer: 1.673 ± 0.86
GlnThr: 2.008 ± 0.69
GlnVal: 3.681 ± 0.647
GlnTrp: 0.335 ± 0.18
GlnTyr: 0.335 ± 0.888
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.347 ± 0.797
ArgCys: 1.004 ± 0.796
ArgAsp: 3.012 ± 1.265
ArgGlu: 5.02 ± 1.076
ArgPhe: 4.685 ± 1.43
ArgGly: 2.343 ± 0.806
ArgHis: 0.669 ± 0.677
ArgIle: 3.681 ± 1.983
ArgLys: 5.355 ± 0.856
ArgLeu: 5.02 ± 1.475
ArgMet: 0.669 ± 0.847
ArgAsn: 2.677 ± 1.0
ArgPro: 2.677 ± 2.67
ArgGln: 0.335 ± 0.551
ArgArg: 2.008 ± 1.596
ArgSer: 4.351 ± 1.549
ArgThr: 1.004 ± 0.806
ArgVal: 1.673 ± 0.86
ArgTrp: 1.004 ± 0.541
ArgTyr: 2.343 ± 0.922
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.008 ± 0.998
SerCys: 2.008 ± 0.717
SerAsp: 3.681 ± 0.844
SerGlu: 4.685 ± 1.518
SerPhe: 5.689 ± 2.054
SerGly: 6.359 ± 2.731
SerHis: 2.343 ± 1.354
SerIle: 5.689 ± 1.368
SerLys: 7.697 ± 0.701
SerLeu: 8.032 ± 3.156
SerMet: 2.343 ± 1.03
SerAsn: 3.681 ± 1.751
SerPro: 2.008 ± 0.83
SerGln: 3.012 ± 1.142
SerArg: 3.347 ± 1.912
SerSer: 5.689 ± 3.235
SerThr: 6.024 ± 1.215
SerVal: 4.016 ± 0.788
SerTrp: 0.335 ± 0.18
SerTyr: 2.343 ± 1.335
SerXaa: 0.335 ± 0.18
Thr
ThrAla: 2.677 ± 1.135
ThrCys: 1.673 ± 0.825
ThrAsp: 1.339 ± 0.913
ThrGlu: 2.677 ± 1.83
ThrPhe: 9.036 ± 1.163
ThrGly: 3.012 ± 1.076
ThrHis: 1.339 ± 0.805
ThrIle: 2.677 ± 1.617
ThrLys: 4.016 ± 1.6
ThrLeu: 3.347 ± 1.711
ThrMet: 1.339 ± 0.721
ThrAsn: 1.673 ± 1.72
ThrPro: 2.343 ± 1.026
ThrGln: 0.0 ± 0.0
ThrArg: 3.681 ± 1.385
ThrSer: 4.351 ± 1.938
ThrThr: 2.008 ± 0.72
ThrVal: 4.016 ± 1.34
ThrTrp: 0.335 ± 0.18
ThrTyr: 1.339 ± 1.169
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.685 ± 0.958
ValCys: 3.012 ± 0.912
ValAsp: 3.012 ± 1.117
ValGlu: 4.685 ± 1.07
ValPhe: 1.673 ± 0.558
ValGly: 3.012 ± 1.031
ValHis: 3.012 ± 0.664
ValIle: 2.343 ± 2.039
ValLys: 4.351 ± 2.562
ValLeu: 7.028 ± 1.579
ValMet: 0.669 ± 0.457
ValAsn: 4.351 ± 1.924
ValPro: 4.016 ± 1.424
ValGln: 1.004 ± 0.541
ValArg: 3.012 ± 1.271
ValSer: 6.693 ± 2.508
ValThr: 5.02 ± 1.073
ValVal: 2.677 ± 1.298
ValTrp: 0.335 ± 0.551
ValTyr: 1.004 ± 0.806
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.669 ± 0.684
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.335 ± 0.18
TrpPhe: 0.335 ± 0.18
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.335 ± 0.18
TrpLys: 1.004 ± 0.613
TrpLeu: 1.339 ± 1.169
TrpMet: 0.0 ± 0.0
TrpAsn: 0.669 ± 0.457
TrpPro: 0.0 ± 0.0
TrpGln: 0.335 ± 0.551
TrpArg: 0.335 ± 0.18
TrpSer: 1.339 ± 0.461
TrpThr: 0.335 ± 0.18
TrpVal: 1.004 ± 0.541
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.669 ± 0.457
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.673 ± 0.558
TyrCys: 1.004 ± 0.796
TyrAsp: 2.343 ± 1.6
TyrGlu: 2.343 ± 1.015
TyrPhe: 1.339 ± 0.65
TyrGly: 1.673 ± 0.859
TyrHis: 0.335 ± 0.18
TyrIle: 2.677 ± 1.888
TyrLys: 3.012 ± 1.117
TyrLeu: 4.016 ± 1.25
TyrMet: 0.669 ± 0.457
TyrAsn: 2.008 ± 1.596
TyrPro: 1.339 ± 0.721
TyrGln: 0.669 ± 0.677
TyrArg: 1.339 ± 0.461
TyrSer: 2.677 ± 2.485
TyrThr: 1.004 ± 1.968
TyrVal: 2.008 ± 1.612
TyrTrp: 0.335 ± 0.18
TyrTyr: 0.669 ± 0.36
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.335 ± 0.18
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2989 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
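The exact bootstrap procedure is not described on this page. As a rough illustration only, a protein-level bootstrap could look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the error measure are assumptions made here.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping within-protein pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteins = ["MKAAGL", "AAGKVL", "MKVLSS"]  # hypothetical toy sequences
point_estimate = dipeptide_frequency(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point_estimate:.3f} ± {error:.3f} per mille")
```

Resampling at the protein level keeps each sequence intact, so the estimated error reflects variation between proteins rather than between individual residue pairs.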

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski