Amino acid dipeptide frequency for Watermelon virus A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
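The counting step can be sketched in a few lines of Python. This is only an illustration, not the code used to build this page: the function name `dipeptide_permille` is hypothetical, sequences are assumed to be given in one-letter code (the table uses three-letter codes), and pairs are assumed to be counted within each protein only, which the page does not state.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: dipeptides are overlapping windows of length 2 and
    never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the numbers are comparable to the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the Watermelon virus A proteome.
freqs = dipeptide_permille(["MKKLE", "AKK"])
```

In the toy input there are six dipeptides in total and `KK` occurs twice, so its frequency is one third of 1000‰.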

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
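The uniform expectation and the over/under labelling can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and the rule of comparing directly against the uniform expectation (without taking the error into account) is an assumption, since the page does not specify the exact threshold used for the colouring.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: 'over' corresponds to the red cells and 'under'
    to the blue cells described in the text.
    """
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

# Values taken from the table: GluLys and CysLys.
glu_lys = classify(13.477)
cys_lys = classify(0.385)
```

With 20 amino acids the expectation is 2.5‰; with Xaa included as a 21st symbol it drops to roughly 2.27‰.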

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
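As a worked conversion, using the GluLys entry from the table on this page:

```latex
\[
f_{\mathrm{GluLys}} = 13.477\,\text{‰} = 13.477 \times 10^{-3} = 0.013477 \approx 1.35\,\%
\]
```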

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 3.466 ± 2.032
AlaCys: 1.155 ± 0.6
AlaAsp: 1.54 ± 0.5
AlaGlu: 3.851 ± 1.103
AlaPhe: 3.08 ± 1.264
AlaGly: 2.31 ± 0.804
AlaHis: 1.54 ± 0.443
AlaIle: 3.08 ± 0.576
AlaLys: 3.851 ± 0.935
AlaLeu: 6.161 ± 1.31
AlaMet: 0.385 ± 0.595
AlaAsn: 1.925 ± 0.552
AlaPro: 2.31 ± 1.923
AlaGln: 1.925 ± 0.592
AlaArg: 3.08 ± 0.749
AlaSer: 3.08 ± 2.837
AlaThr: 2.31 ± 1.034
AlaVal: 2.31 ± 0.729
AlaTrp: 0.77 ± 0.47
AlaTyr: 2.695 ± 0.89
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.54 ± 1.341
CysCys: 0.0 ± 0.0
CysAsp: 1.155 ± 0.91
CysGlu: 1.54 ± 0.674
CysPhe: 0.77 ± 0.763
CysGly: 0.77 ± 0.4
CysHis: 0.77 ± 0.4
CysIle: 1.54 ± 1.074
CysLys: 0.385 ± 0.2
CysLeu: 2.31 ± 1.2
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.77 ± 0.4
CysGln: 0.385 ± 0.2
CysArg: 1.54 ± 1.074
CysSer: 0.385 ± 0.2
CysThr: 1.925 ± 0.552
CysVal: 1.155 ± 0.6
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.236 ± 1.423
AspCys: 0.77 ± 0.4
AspAsp: 2.695 ± 1.4
AspGlu: 6.546 ± 1.547
AspPhe: 2.695 ± 0.42
AspGly: 3.466 ± 1.22
AspHis: 1.155 ± 0.411
AspIle: 5.776 ± 1.756
AspLys: 5.006 ± 3.479
AspLeu: 5.006 ± 0.607
AspMet: 2.31 ± 1.474
AspAsn: 1.155 ± 0.6
AspPro: 0.385 ± 0.2
AspGln: 1.925 ± 0.554
AspArg: 2.695 ± 0.704
AspSer: 2.31 ± 0.804
AspThr: 0.77 ± 1.0
AspVal: 3.08 ± 0.461
AspTrp: 1.54 ± 0.8
AspTyr: 1.54 ± 0.443
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.851 ± 1.433
GluCys: 2.695 ± 1.272
GluAsp: 5.391 ± 1.726
GluGlu: 11.167 ± 1.907
GluPhe: 3.466 ± 1.238
GluGly: 6.161 ± 0.971
GluHis: 1.925 ± 1.0
GluIle: 4.621 ± 0.97
GluLys: 13.477 ± 0.919
GluLeu: 7.701 ± 1.974
GluMet: 2.695 ± 1.35
GluAsn: 6.931 ± 1.337
GluPro: 2.31 ± 1.2
GluGln: 3.08 ± 1.116
GluArg: 1.925 ± 0.714
GluSer: 5.776 ± 1.651
GluThr: 3.466 ± 1.318
GluVal: 7.701 ± 0.731
GluTrp: 1.155 ± 0.6
GluTyr: 0.385 ± 0.2
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.695 ± 1.594
PheCys: 1.925 ± 0.86
PheAsp: 5.006 ± 1.248
PheGlu: 8.471 ± 1.893
PhePhe: 1.925 ± 1.0
PheGly: 2.31 ± 0.468
PheHis: 1.54 ± 0.443
PheIle: 4.621 ± 1.286
PheLys: 4.621 ± 1.177
PheLeu: 5.776 ± 3.591
PheMet: 2.695 ± 1.554
PheAsn: 4.236 ± 1.072
PhePro: 1.54 ± 0.5
PheGln: 1.54 ± 0.8
PheArg: 2.695 ± 1.4
PheSer: 3.466 ± 0.574
PheThr: 3.851 ± 1.17
PheVal: 0.77 ± 0.4
PheTrp: 0.385 ± 0.2
PheTyr: 0.77 ± 0.763
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.925 ± 0.554
GlyCys: 1.54 ± 0.5
GlyAsp: 3.08 ± 1.026
GlyGlu: 4.236 ± 1.623
GlyPhe: 3.466 ± 1.079
GlyGly: 2.31 ± 0.729
GlyHis: 1.155 ± 0.479
GlyIle: 3.466 ± 1.246
GlyLys: 6.161 ± 0.908
GlyLeu: 3.466 ± 1.079
GlyMet: 1.54 ± 0.5
GlyAsn: 3.466 ± 2.271
GlyPro: 0.77 ± 1.308
GlyGln: 1.54 ± 2.502
GlyArg: 2.695 ± 1.529
GlySer: 4.621 ± 1.038
GlyThr: 2.31 ± 1.827
GlyVal: 2.31 ± 0.701
GlyTrp: 0.77 ± 0.47
GlyTyr: 2.31 ± 1.245
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.77 ± 0.4
HisGlu: 1.54 ± 0.674
HisPhe: 2.31 ± 0.804
HisGly: 0.77 ± 0.47
HisHis: 0.77 ± 0.4
HisIle: 1.54 ± 0.677
HisLys: 1.155 ± 0.691
HisLeu: 2.31 ± 1.382
HisMet: 0.385 ± 0.2
HisAsn: 0.77 ± 0.4
HisPro: 1.155 ± 0.411
HisGln: 0.77 ± 0.4
HisArg: 1.155 ± 1.053
HisSer: 4.236 ± 1.038
HisThr: 0.0 ± 0.0
HisVal: 1.54 ± 0.5
HisTrp: 0.77 ± 0.4
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.695 ± 1.987
IleCys: 1.155 ± 0.479
IleAsp: 3.851 ± 1.974
IleGlu: 6.161 ± 0.573
IlePhe: 3.466 ± 0.961
IleGly: 2.31 ± 1.897
IleHis: 2.31 ± 1.245
IleIle: 2.695 ± 0.42
IleLys: 8.086 ± 0.682
IleLeu: 5.391 ± 1.6
IleMet: 3.08 ± 0.745
IleAsn: 4.621 ± 1.772
IlePro: 2.31 ± 1.508
IleGln: 2.31 ± 1.611
IleArg: 2.695 ± 1.987
IleSer: 3.466 ± 0.814
IleThr: 1.54 ± 0.5
IleVal: 4.236 ± 0.904
IleTrp: 0.0 ± 0.0
IleTyr: 1.155 ± 0.6
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.621 ± 1.041
LysCys: 1.54 ± 0.8
LysAsp: 3.851 ± 0.758
LysGlu: 12.322 ± 1.969
LysPhe: 5.006 ± 3.721
LysGly: 8.086 ± 4.11
LysHis: 0.77 ± 0.763
LysIle: 5.776 ± 3.074
LysLys: 11.167 ± 5.172
LysLeu: 6.931 ± 1.8
LysMet: 3.851 ± 1.184
LysAsn: 4.621 ± 0.898
LysPro: 2.695 ± 0.679
LysGln: 1.54 ± 0.8
LysArg: 5.006 ± 1.034
LysSer: 4.621 ± 1.32
LysThr: 5.776 ± 1.325
LysVal: 5.006 ± 2.009
LysTrp: 0.77 ± 0.47
LysTyr: 2.31 ± 2.605
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.316 ± 2.061
LeuCys: 1.155 ± 0.6
LeuAsp: 4.621 ± 1.815
LeuGlu: 6.931 ± 3.541
LeuPhe: 5.006 ± 0.806
LeuGly: 3.466 ± 1.238
LeuHis: 1.54 ± 0.739
LeuIle: 6.161 ± 1.11
LeuLys: 7.316 ± 1.242
LeuLeu: 7.316 ± 2.678
LeuMet: 2.695 ± 0.89
LeuAsn: 6.931 ± 1.882
LeuPro: 1.54 ± 0.739
LeuGln: 2.31 ± 0.485
LeuArg: 4.621 ± 1.077
LeuSer: 7.316 ± 1.921
LeuThr: 6.546 ± 1.069
LeuVal: 5.006 ± 2.054
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.695 ± 0.42
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.851 ± 0.758
MetCys: 0.385 ± 0.2
MetAsp: 0.77 ± 0.47
MetGlu: 2.695 ± 0.704
MetPhe: 1.54 ± 0.8
MetGly: 2.31 ± 0.468
MetHis: 0.0 ± 0.0
MetIle: 2.31 ± 1.557
MetLys: 4.236 ± 0.612
MetLeu: 2.31 ± 0.729
MetMet: 0.385 ± 0.2
MetAsn: 1.925 ± 0.592
MetPro: 1.925 ± 0.552
MetGln: 0.77 ± 0.537
MetArg: 1.925 ± 0.714
MetSer: 1.155 ± 0.6
MetThr: 2.31 ± 1.611
MetVal: 1.155 ± 0.6
MetTrp: 0.0 ± 0.0
MetTyr: 0.77 ± 0.4
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.695 ± 0.704
AsnCys: 0.77 ± 0.4
AsnAsp: 3.08 ± 1.599
AsnGlu: 3.466 ± 0.707
AsnPhe: 3.466 ± 1.799
AsnGly: 3.08 ± 1.064
AsnHis: 1.155 ± 0.6
AsnIle: 0.77 ± 0.47
AsnLys: 5.006 ± 1.547
AsnLeu: 6.161 ± 0.74
AsnMet: 1.925 ± 0.552
AsnAsn: 3.08 ± 1.052
AsnPro: 0.77 ± 0.47
AsnGln: 1.54 ± 0.443
AsnArg: 1.925 ± 1.0
AsnSer: 6.161 ± 2.206
AsnThr: 2.31 ± 1.611
AsnVal: 5.391 ± 0.532
AsnTrp: 0.385 ± 0.654
AsnTyr: 2.695 ± 1.4
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.385 ± 0.2
ProCys: 0.385 ± 0.2
ProAsp: 3.466 ± 1.36
ProGlu: 1.54 ± 0.8
ProPhe: 1.925 ± 0.805
ProGly: 0.385 ± 0.2
ProHis: 0.385 ± 0.595
ProIle: 1.155 ± 1.18
ProLys: 1.925 ± 1.519
ProLeu: 0.77 ± 0.4
ProMet: 1.54 ± 0.443
ProAsn: 1.155 ± 0.411
ProPro: 0.77 ± 0.4
ProGln: 0.385 ± 0.595
ProArg: 1.925 ± 0.714
ProSer: 2.695 ± 2.067
ProThr: 0.385 ± 0.2
ProVal: 1.155 ± 0.479
ProTrp: 0.385 ± 0.2
ProTyr: 0.77 ± 0.537
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.925 ± 0.552
GlnCys: 1.155 ± 0.83
GlnAsp: 1.155 ± 0.6
GlnGlu: 1.925 ± 0.592
GlnPhe: 1.925 ± 0.554
GlnGly: 1.54 ± 1.074
GlnHis: 0.385 ± 0.2
GlnIle: 1.54 ± 0.739
GlnLys: 2.31 ± 1.2
GlnLeu: 1.925 ± 0.554
GlnMet: 0.0 ± 0.0
GlnAsn: 1.54 ± 0.941
GlnPro: 0.0 ± 0.0
GlnGln: 0.77 ± 0.537
GlnArg: 3.851 ± 1.107
GlnSer: 2.31 ± 1.82
GlnThr: 0.77 ± 0.4
GlnVal: 0.77 ± 1.75
GlnTrp: 0.385 ± 0.2
GlnTyr: 0.385 ± 0.2
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 0.77 ± 0.4
ArgCys: 1.54 ± 0.739
ArgAsp: 0.77 ± 0.763
ArgGlu: 4.236 ± 1.015
ArgPhe: 4.621 ± 1.123
ArgGly: 3.466 ± 1.239
ArgHis: 1.925 ± 1.0
ArgIle: 4.621 ± 1.725
ArgLys: 2.31 ± 1.402
ArgLeu: 5.391 ± 2.203
ArgMet: 4.236 ± 0.904
ArgAsn: 1.54 ± 0.443
ArgPro: 0.385 ± 0.2
ArgGln: 2.31 ± 1.034
ArgArg: 3.08 ± 0.576
ArgSer: 3.466 ± 1.921
ArgThr: 1.925 ± 0.592
ArgVal: 1.925 ± 0.714
ArgTrp: 0.77 ± 0.4
ArgTyr: 1.925 ± 1.0
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.236 ± 1.361
SerCys: 0.0 ± 0.0
SerAsp: 6.161 ± 1.468
SerGlu: 6.161 ± 1.595
SerPhe: 4.621 ± 1.177
SerGly: 4.236 ± 1.559
SerHis: 1.155 ± 0.6
SerIle: 4.236 ± 2.18
SerLys: 6.161 ± 1.45
SerLeu: 7.701 ± 2.454
SerMet: 1.925 ± 1.545
SerAsn: 2.695 ± 1.01
SerPro: 0.77 ± 0.763
SerGln: 1.925 ± 0.805
SerArg: 4.236 ± 1.452
SerSer: 6.931 ± 1.405
SerThr: 2.31 ± 1.611
SerVal: 3.466 ± 1.52
SerTrp: 0.385 ± 0.2
SerTyr: 1.925 ± 0.714
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.155 ± 0.479
ThrCys: 0.385 ± 0.654
ThrAsp: 1.925 ± 0.554
ThrGlu: 3.08 ± 1.642
ThrPhe: 6.161 ± 1.692
ThrGly: 2.695 ± 0.958
ThrHis: 1.155 ± 0.91
ThrIle: 4.236 ± 0.807
ThrLys: 3.466 ± 1.011
ThrLeu: 3.466 ± 0.574
ThrMet: 1.155 ± 0.6
ThrAsn: 2.695 ± 1.638
ThrPro: 0.0 ± 0.0
ThrGln: 1.155 ± 0.479
ThrArg: 2.31 ± 0.804
ThrSer: 3.851 ± 2.151
ThrThr: 0.0 ± 0.0
ThrVal: 1.925 ± 2.047
ThrTrp: 1.155 ± 0.83
ThrTyr: 1.155 ± 0.6
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.54 ± 0.677
ValCys: 0.0 ± 0.0
ValAsp: 3.08 ± 1.484
ValGlu: 6.546 ± 1.38
ValPhe: 5.006 ± 0.806
ValGly: 1.54 ± 1.389
ValHis: 1.54 ± 0.674
ValIle: 2.695 ± 0.493
ValLys: 7.316 ± 1.351
ValLeu: 5.776 ± 1.952
ValMet: 0.385 ± 0.875
ValAsn: 5.006 ± 0.906
ValPro: 1.155 ± 0.411
ValGln: 0.0 ± 0.0
ValArg: 2.31 ± 1.2
ValSer: 3.466 ± 0.648
ValThr: 1.54 ± 0.677
ValVal: 4.236 ± 1.038
ValTrp: 0.0 ± 0.0
ValTyr: 1.925 ± 0.714
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.77 ± 1.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.77 ± 1.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.385 ± 0.2
TrpHis: 0.385 ± 0.2
TrpIle: 0.77 ± 0.4
TrpLys: 0.77 ± 0.47
TrpLeu: 1.155 ± 0.6
TrpMet: 0.0 ± 0.0
TrpAsn: 0.77 ± 0.4
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.155 ± 0.6
TrpSer: 0.77 ± 0.4
TrpThr: 0.77 ± 0.47
TrpVal: 0.77 ± 0.4
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.385 ± 0.2
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.155 ± 0.6
TyrCys: 0.385 ± 0.2
TyrAsp: 1.925 ± 1.0
TyrGlu: 3.466 ± 2.074
TyrPhe: 0.77 ± 0.763
TyrGly: 1.54 ± 0.739
TyrHis: 0.385 ± 0.2
TyrIle: 1.925 ± 1.0
TyrLys: 1.54 ± 0.8
TyrLeu: 3.466 ± 1.246
TyrMet: 1.155 ± 0.411
TyrAsn: 0.77 ± 0.4
TyrPro: 1.925 ± 0.552
TyrGln: 0.385 ± 0.2
TyrArg: 0.385 ± 0.2
TyrSer: 1.155 ± 1.058
TyrThr: 2.31 ± 0.87
TyrVal: 1.155 ± 0.691
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.385 ± 0.2
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (2598 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
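A protein-level bootstrap of this kind can be sketched as below. This is a minimal illustration under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is assumed to be the standard deviation of the resampled frequencies (the page does not say which statistic is used).

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(sequences):
    """Per mille frequency of each dipeptide, counted within proteins."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of one dipeptide frequency under protein-level resampling.

    Assumption: the error is the population standard deviation of the
    frequency over the resampled proteomes.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)

# Toy sequences in one-letter code, not the real proteome.
toy = ["MKKLE", "AKK", "GGKK", "LEKKA"]
err = bootstrap_error(toy, "KK")
```

With only 4 proteins, as on this page, each resample can differ a lot from the original set, which is consistent with the fairly large errors relative to the values in the table.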

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski