Amino acid dipepetide frequency for Lycianthes yellow mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.895AlaAla: 3.895 ± 1.845
1.113AlaCys: 1.113 ± 0.638
0.556AlaAsp: 0.556 ± 0.562
3.339AlaGlu: 3.339 ± 1.586
1.113AlaPhe: 1.113 ± 0.695
0.556AlaGly: 0.556 ± 0.476
1.113AlaHis: 1.113 ± 0.702
2.782AlaIle: 2.782 ± 0.956
3.895AlaLys: 3.895 ± 1.167
5.565AlaLeu: 5.565 ± 1.946
0.556AlaMet: 0.556 ± 0.615
1.669AlaAsn: 1.669 ± 0.977
2.226AlaPro: 2.226 ± 1.098
1.669AlaGln: 1.669 ± 0.786
3.339AlaArg: 3.339 ± 1.515
3.895AlaSer: 3.895 ± 2.101
4.452AlaThr: 4.452 ± 0.89
1.113AlaVal: 1.113 ± 0.798
0.0AlaTrp: 0.0 ± 0.0
2.226AlaTyr: 2.226 ± 1.173
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
2.226CysCys: 2.226 ± 1.793
0.556CysAsp: 0.556 ± 0.461
1.113CysGlu: 1.113 ± 0.826
0.556CysPhe: 0.556 ± 0.581
1.669CysGly: 1.669 ± 1.159
0.556CysHis: 0.556 ± 0.55
1.113CysIle: 1.113 ± 0.761
1.113CysLys: 1.113 ± 0.718
1.113CysLeu: 1.113 ± 0.754
1.113CysMet: 1.113 ± 0.754
3.339CysAsn: 3.339 ± 0.968
1.113CysPro: 1.113 ± 0.691
1.113CysGln: 1.113 ± 0.58
1.113CysArg: 1.113 ± 1.156
2.782CysSer: 2.782 ± 1.104
0.556CysThr: 0.556 ± 0.476
2.226CysVal: 2.226 ± 0.779
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.669AspAla: 1.669 ± 0.977
1.669AspCys: 1.669 ± 0.98
2.226AspAsp: 2.226 ± 1.099
0.556AspGlu: 0.556 ± 0.497
2.226AspPhe: 2.226 ± 0.692
3.339AspGly: 3.339 ± 1.918
1.669AspHis: 1.669 ± 0.908
2.226AspIle: 2.226 ± 1.0
2.782AspLys: 2.782 ± 0.799
5.008AspLeu: 5.008 ± 1.164
0.0AspMet: 0.0 ± 0.0
2.782AspAsn: 2.782 ± 1.088
2.782AspPro: 2.782 ± 1.05
1.669AspGln: 1.669 ± 1.106
2.226AspArg: 2.226 ± 1.043
3.339AspSer: 3.339 ± 1.289
3.339AspThr: 3.339 ± 1.158
4.452AspVal: 4.452 ± 1.247
1.113AspTrp: 1.113 ± 0.995
1.669AspTyr: 1.669 ± 0.749
0.0AspXaa: 0.0 ± 0.0
Glu
3.895GluAla: 3.895 ± 1.71
0.0GluCys: 0.0 ± 0.0
3.339GluAsp: 3.339 ± 1.223
2.226GluGlu: 2.226 ± 1.14
2.782GluPhe: 2.782 ± 1.05
2.226GluGly: 2.226 ± 1.099
2.226GluHis: 2.226 ± 1.08
4.452GluIle: 4.452 ± 1.155
3.339GluLys: 3.339 ± 1.789
1.113GluLeu: 1.113 ± 0.648
1.113GluMet: 1.113 ± 0.846
2.226GluAsn: 2.226 ± 1.066
1.669GluPro: 1.669 ± 0.626
1.113GluGln: 1.113 ± 0.951
1.113GluArg: 1.113 ± 0.58
2.782GluSer: 2.782 ± 1.536
1.113GluThr: 1.113 ± 0.897
2.226GluVal: 2.226 ± 1.218
1.113GluTrp: 1.113 ± 0.702
1.669GluTyr: 1.669 ± 0.755
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
2.226PheCys: 2.226 ± 0.577
1.669PheAsp: 1.669 ± 0.922
0.556PheGlu: 0.556 ± 0.476
0.556PhePhe: 0.556 ± 0.476
4.452PheGly: 4.452 ± 2.09
0.556PheHis: 0.556 ± 0.497
2.226PheIle: 2.226 ± 1.501
4.452PheLys: 4.452 ± 1.537
5.565PheLeu: 5.565 ± 2.382
0.556PheMet: 0.556 ± 0.497
4.452PheAsn: 4.452 ± 1.693
1.669PhePro: 1.669 ± 0.794
3.895PheGln: 3.895 ± 1.571
3.895PheArg: 3.895 ± 1.411
5.008PheSer: 5.008 ± 2.021
2.226PheThr: 2.226 ± 0.796
0.0PheVal: 0.0 ± 0.0
1.113PheTrp: 1.113 ± 0.701
1.113PheTyr: 1.113 ± 0.951
0.0PheXaa: 0.0 ± 0.0
Gly
3.339GlyAla: 3.339 ± 1.305
2.782GlyCys: 2.782 ± 1.398
2.782GlyAsp: 2.782 ± 1.123
1.669GlyGlu: 1.669 ± 0.933
1.669GlyPhe: 1.669 ± 1.049
3.895GlyGly: 3.895 ± 1.768
2.226GlyHis: 2.226 ± 0.951
3.339GlyIle: 3.339 ± 1.003
5.565GlyLys: 5.565 ± 2.199
2.226GlyLeu: 2.226 ± 1.844
2.226GlyMet: 2.226 ± 1.079
2.226GlyAsn: 2.226 ± 1.278
1.669GlyPro: 1.669 ± 0.661
2.226GlyGln: 2.226 ± 1.043
2.226GlyArg: 2.226 ± 0.935
5.565GlySer: 5.565 ± 1.2
1.113GlyThr: 1.113 ± 0.578
3.339GlyVal: 3.339 ± 1.077
0.556GlyTrp: 0.556 ± 0.497
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.782HisAla: 2.782 ± 1.503
1.669HisCys: 1.669 ± 0.957
2.226HisAsp: 2.226 ± 0.692
0.556HisGlu: 0.556 ± 0.578
1.669HisPhe: 1.669 ± 1.0
2.226HisGly: 2.226 ± 1.074
1.113HisHis: 1.113 ± 0.702
1.113HisIle: 1.113 ± 0.819
1.113HisLys: 1.113 ± 0.747
1.113HisLeu: 1.113 ± 0.702
1.113HisMet: 1.113 ± 1.365
2.782HisAsn: 2.782 ± 1.008
2.782HisPro: 2.782 ± 1.573
1.669HisGln: 1.669 ± 0.734
2.782HisArg: 2.782 ± 1.082
1.669HisSer: 1.669 ± 0.774
2.782HisThr: 2.782 ± 1.484
4.452HisVal: 4.452 ± 1.625
0.0HisTrp: 0.0 ± 0.0
2.226HisTyr: 2.226 ± 0.581
0.0HisXaa: 0.0 ± 0.0
Ile
1.669IleAla: 1.669 ± 0.797
0.556IleCys: 0.556 ± 0.562
3.895IleAsp: 3.895 ± 1.648
2.782IleGlu: 2.782 ± 1.148
5.008IlePhe: 5.008 ± 2.248
1.669IleGly: 1.669 ± 0.837
5.008IleHis: 5.008 ± 1.339
2.782IleIle: 2.782 ± 1.147
5.565IleLys: 5.565 ± 1.237
6.678IleLeu: 6.678 ± 3.231
0.556IleMet: 0.556 ± 0.508
1.669IleAsn: 1.669 ± 0.889
2.226IlePro: 2.226 ± 0.836
3.895IleGln: 3.895 ± 1.211
4.452IleArg: 4.452 ± 1.254
4.452IleSer: 4.452 ± 3.145
3.895IleThr: 3.895 ± 1.352
4.452IleVal: 4.452 ± 2.007
1.113IleTrp: 1.113 ± 1.162
1.113IleTyr: 1.113 ± 0.674
0.0IleXaa: 0.0 ± 0.0
Lys
1.669LysAla: 1.669 ± 1.0
1.669LysCys: 1.669 ± 0.851
3.339LysAsp: 3.339 ± 1.048
3.895LysGlu: 3.895 ± 2.419
1.669LysPhe: 1.669 ± 0.755
1.669LysGly: 1.669 ± 0.977
2.782LysHis: 2.782 ± 1.489
3.339LysIle: 3.339 ± 1.278
2.782LysLys: 2.782 ± 0.964
3.895LysLeu: 3.895 ± 1.392
0.556LysMet: 0.556 ± 1.121
3.895LysAsn: 3.895 ± 1.445
4.452LysPro: 4.452 ± 1.41
1.669LysGln: 1.669 ± 1.122
3.339LysArg: 3.339 ± 1.942
3.895LysSer: 3.895 ± 1.423
2.782LysThr: 2.782 ± 1.627
4.452LysVal: 4.452 ± 1.362
0.0LysTrp: 0.0 ± 0.0
3.339LysTyr: 3.339 ± 1.215
0.0LysXaa: 0.0 ± 0.0
Leu
1.669LeuAla: 1.669 ± 1.049
3.339LeuCys: 3.339 ± 0.92
5.008LeuAsp: 5.008 ± 2.326
3.895LeuGlu: 3.895 ± 1.088
4.452LeuPhe: 4.452 ± 1.387
3.895LeuGly: 3.895 ± 1.472
3.895LeuHis: 3.895 ± 1.296
3.339LeuIle: 3.339 ± 1.201
4.452LeuLys: 4.452 ± 1.124
3.895LeuLeu: 3.895 ± 1.428
0.556LeuMet: 0.556 ± 0.682
4.452LeuAsn: 4.452 ± 1.143
5.008LeuPro: 5.008 ± 0.977
3.339LeuGln: 3.339 ± 1.729
7.234LeuArg: 7.234 ± 2.186
5.565LeuSer: 5.565 ± 2.099
2.782LeuThr: 2.782 ± 1.425
2.226LeuVal: 2.226 ± 1.029
1.669LeuTrp: 1.669 ± 0.749
2.782LeuTyr: 2.782 ± 1.28
0.0LeuXaa: 0.0 ± 0.0
Met
1.113MetAla: 1.113 ± 0.674
0.0MetCys: 0.0 ± 0.0
0.556MetAsp: 0.556 ± 0.476
1.669MetGlu: 1.669 ± 1.278
0.556MetPhe: 0.556 ± 0.476
1.669MetGly: 1.669 ± 0.85
0.556MetHis: 0.556 ± 0.578
0.556MetIle: 0.556 ± 0.703
0.556MetLys: 0.556 ± 0.682
2.782MetLeu: 2.782 ± 1.59
0.556MetMet: 0.556 ± 0.682
0.556MetAsn: 0.556 ± 0.703
2.226MetPro: 2.226 ± 0.816
1.113MetGln: 1.113 ± 0.846
2.226MetArg: 2.226 ± 1.074
2.782MetSer: 2.782 ± 1.011
0.556MetThr: 0.556 ± 0.578
0.0MetVal: 0.0 ± 0.0
0.556MetTrp: 0.556 ± 0.562
1.669MetTyr: 1.669 ± 1.049
0.0MetXaa: 0.0 ± 0.0
Asn
4.452AsnAla: 4.452 ± 1.069
1.113AsnCys: 1.113 ± 0.755
2.226AsnAsp: 2.226 ± 0.701
2.782AsnGlu: 2.782 ± 1.22
0.556AsnPhe: 0.556 ± 0.497
2.782AsnGly: 2.782 ± 1.204
3.339AsnHis: 3.339 ± 1.657
5.565AsnIle: 5.565 ± 1.968
0.0AsnLys: 0.0 ± 0.0
5.008AsnLeu: 5.008 ± 1.634
2.782AsnMet: 2.782 ± 0.983
5.565AsnAsn: 5.565 ± 1.604
3.339AsnPro: 3.339 ± 0.877
1.113AsnGln: 1.113 ± 1.156
2.782AsnArg: 2.782 ± 1.54
4.452AsnSer: 4.452 ± 1.741
3.895AsnThr: 3.895 ± 1.652
10.017AsnVal: 10.017 ± 1.819
0.0AsnTrp: 0.0 ± 0.0
2.782AsnTyr: 2.782 ± 1.043
0.0AsnXaa: 0.0 ± 0.0
Pro
2.226ProAla: 2.226 ± 0.796
1.669ProCys: 1.669 ± 0.842
0.556ProAsp: 0.556 ± 0.562
2.226ProGlu: 2.226 ± 1.205
3.339ProPhe: 3.339 ± 1.097
2.226ProGly: 2.226 ± 1.099
2.226ProHis: 2.226 ± 1.059
2.782ProIle: 2.782 ± 1.115
3.895ProLys: 3.895 ± 1.333
4.452ProLeu: 4.452 ± 1.373
2.226ProMet: 2.226 ± 0.949
4.452ProAsn: 4.452 ± 1.536
0.0ProPro: 0.0 ± 0.0
3.339ProGln: 3.339 ± 1.265
4.452ProArg: 4.452 ± 1.152
3.895ProSer: 3.895 ± 1.49
1.113ProThr: 1.113 ± 0.995
2.226ProVal: 2.226 ± 1.246
1.113ProTrp: 1.113 ± 0.648
1.669ProTyr: 1.669 ± 0.714
0.0ProXaa: 0.0 ± 0.0
Gln
2.782GlnAla: 2.782 ± 1.289
0.556GlnCys: 0.556 ± 0.497
2.782GlnAsp: 2.782 ± 0.806
3.339GlnGlu: 3.339 ± 0.787
2.226GlnPhe: 2.226 ± 0.731
2.226GlnGly: 2.226 ± 0.782
1.669GlnHis: 1.669 ± 0.936
2.226GlnIle: 2.226 ± 1.453
2.226GlnLys: 2.226 ± 0.812
1.113GlnLeu: 1.113 ± 0.62
0.556GlnMet: 0.556 ± 0.682
1.669GlnAsn: 1.669 ± 1.012
1.113GlnPro: 1.113 ± 1.1
2.782GlnGln: 2.782 ± 1.759
2.782GlnArg: 2.782 ± 0.967
3.895GlnSer: 3.895 ± 1.797
2.782GlnThr: 2.782 ± 1.242
3.895GlnVal: 3.895 ± 1.605
0.556GlnTrp: 0.556 ± 0.497
2.226GlnTyr: 2.226 ± 1.156
0.0GlnXaa: 0.0 ± 0.0
Arg
0.556ArgAla: 0.556 ± 0.581
1.113ArgCys: 1.113 ± 1.125
3.339ArgAsp: 3.339 ± 1.493
1.113ArgGlu: 1.113 ± 0.691
6.121ArgPhe: 6.121 ± 2.783
2.226ArgGly: 2.226 ± 1.489
2.782ArgHis: 2.782 ± 1.133
4.452ArgIle: 4.452 ± 1.095
1.113ArgLys: 1.113 ± 0.701
3.895ArgLeu: 3.895 ± 1.606
1.669ArgMet: 1.669 ± 0.97
3.339ArgAsn: 3.339 ± 1.352
5.008ArgPro: 5.008 ± 1.404
2.782ArgGln: 2.782 ± 1.928
10.017ArgArg: 10.017 ± 3.532
8.904ArgSer: 8.904 ± 2.023
3.339ArgThr: 3.339 ± 0.941
5.565ArgVal: 5.565 ± 1.415
0.0ArgTrp: 0.0 ± 0.0
2.226ArgTyr: 2.226 ± 0.879
0.0ArgXaa: 0.0 ± 0.0
Ser
3.895SerAla: 3.895 ± 1.859
1.113SerCys: 1.113 ± 0.648
5.008SerAsp: 5.008 ± 0.758
5.565SerGlu: 5.565 ± 1.181
3.339SerPhe: 3.339 ± 1.059
4.452SerGly: 4.452 ± 1.967
1.669SerHis: 1.669 ± 1.085
5.008SerIle: 5.008 ± 1.9
6.678SerLys: 6.678 ± 1.468
5.565SerLeu: 5.565 ± 1.845
2.226SerMet: 2.226 ± 1.582
5.008SerAsn: 5.008 ± 1.925
5.565SerPro: 5.565 ± 1.783
3.339SerGln: 3.339 ± 1.4
5.008SerArg: 5.008 ± 1.403
12.799SerSer: 12.799 ± 2.975
3.895SerThr: 3.895 ± 1.565
6.121SerVal: 6.121 ± 2.053
0.556SerTrp: 0.556 ± 0.476
4.452SerTyr: 4.452 ± 1.555
0.0SerXaa: 0.0 ± 0.0
Thr
2.782ThrAla: 2.782 ± 1.484
0.556ThrCys: 0.556 ± 0.703
0.556ThrAsp: 0.556 ± 0.476
2.782ThrGlu: 2.782 ± 1.195
2.226ThrPhe: 2.226 ± 1.035
3.895ThrGly: 3.895 ± 0.889
1.669ThrHis: 1.669 ± 1.081
5.008ThrIle: 5.008 ± 1.749
1.669ThrLys: 1.669 ± 0.797
3.895ThrLeu: 3.895 ± 1.031
0.556ThrMet: 0.556 ± 0.703
5.008ThrAsn: 5.008 ± 1.528
2.226ThrPro: 2.226 ± 0.855
1.113ThrGln: 1.113 ± 0.659
3.339ThrArg: 3.339 ± 1.279
5.008ThrSer: 5.008 ± 1.163
4.452ThrThr: 4.452 ± 1.899
3.339ThrVal: 3.339 ± 1.696
1.113ThrTrp: 1.113 ± 0.798
1.113ThrTyr: 1.113 ± 0.659
0.0ThrXaa: 0.0 ± 0.0
Val
2.226ValAla: 2.226 ± 0.712
0.0ValCys: 0.0 ± 0.0
3.895ValAsp: 3.895 ± 1.336
1.113ValGlu: 1.113 ± 0.762
3.895ValPhe: 3.895 ± 1.573
2.226ValGly: 2.226 ± 1.131
2.226ValHis: 2.226 ± 1.1
8.347ValIle: 8.347 ± 1.504
3.339ValLys: 3.339 ± 0.921
4.452ValLeu: 4.452 ± 1.958
1.113ValMet: 1.113 ± 0.701
5.008ValAsn: 5.008 ± 1.555
3.339ValPro: 3.339 ± 1.098
3.895ValGln: 3.895 ± 1.669
1.669ValArg: 1.669 ± 1.007
7.234ValSer: 7.234 ± 1.874
3.895ValThr: 3.895 ± 1.461
6.678ValVal: 6.678 ± 2.552
1.113ValTrp: 1.113 ± 0.638
5.565ValTyr: 5.565 ± 1.9
0.0ValXaa: 0.0 ± 0.0
Trp
1.669TrpAla: 1.669 ± 1.0
0.0TrpCys: 0.0 ± 0.0
0.556TrpAsp: 0.556 ± 0.562
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.556TrpGly: 0.556 ± 0.497
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.113TrpLeu: 1.113 ± 0.638
0.556TrpMet: 0.556 ± 0.476
0.556TrpAsn: 0.556 ± 0.581
0.0TrpPro: 0.0 ± 0.0
0.556TrpGln: 0.556 ± 0.497
2.226TrpArg: 2.226 ± 0.817
0.0TrpSer: 0.0 ± 0.0
1.669TrpThr: 1.669 ± 1.27
1.669TrpVal: 1.669 ± 0.626
0.0TrpTrp: 0.0 ± 0.0
0.556TrpTyr: 0.556 ± 0.497
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.782TyrAla: 2.782 ± 1.35
0.0TyrCys: 0.0 ± 0.0
1.669TyrAsp: 1.669 ± 1.027
0.556TyrGlu: 0.556 ± 0.476
2.226TyrPhe: 2.226 ± 0.86
2.782TyrGly: 2.782 ± 1.364
0.556TyrHis: 0.556 ± 0.497
2.782TyrIle: 2.782 ± 1.362
1.113TyrLys: 1.113 ± 0.578
5.008TyrLeu: 5.008 ± 1.573
1.113TyrMet: 1.113 ± 0.653
3.895TyrAsn: 3.895 ± 0.838
1.669TyrPro: 1.669 ± 0.922
1.113TyrGln: 1.113 ± 0.58
3.339TyrArg: 3.339 ± 1.782
3.339TyrSer: 3.339 ± 1.11
1.669TyrThr: 1.669 ± 1.155
2.782TyrVal: 2.782 ± 1.113
0.0TyrTrp: 0.0 ± 0.0
1.669TyrTyr: 1.669 ± 0.74
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1798 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski