Amino acid dipepetide frequency for Gan Gan virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.597AlaAla: 2.597 ± 1.498
0.779AlaCys: 0.779 ± 0.381
2.597AlaAsp: 2.597 ± 0.434
2.856AlaGlu: 2.856 ± 2.502
1.818AlaPhe: 1.818 ± 0.607
1.298AlaGly: 1.298 ± 0.765
0.26AlaHis: 0.26 ± 0.167
3.895AlaIle: 3.895 ± 0.67
5.453AlaLys: 5.453 ± 0.827
5.193AlaLeu: 5.193 ± 1.357
1.298AlaMet: 1.298 ± 0.323
3.635AlaAsn: 3.635 ± 0.927
1.558AlaPro: 1.558 ± 0.67
1.558AlaGln: 1.558 ± 1.792
2.856AlaArg: 2.856 ± 1.479
1.558AlaSer: 1.558 ± 0.409
2.856AlaThr: 2.856 ± 0.762
2.077AlaVal: 2.077 ± 0.774
0.519AlaTrp: 0.519 ± 0.334
1.298AlaTyr: 1.298 ± 0.323
0.0AlaXaa: 0.0 ± 0.0
Cys
1.039CysAla: 1.039 ± 0.311
0.519CysCys: 0.519 ± 0.499
0.0CysAsp: 0.0 ± 0.0
1.298CysGlu: 1.298 ± 1.248
1.298CysPhe: 1.298 ± 1.248
2.337CysGly: 2.337 ± 1.868
0.779CysHis: 0.779 ± 0.381
1.558CysIle: 1.558 ± 0.762
3.635CysLys: 3.635 ± 2.37
3.116CysLeu: 3.116 ± 1.2
0.26CysMet: 0.26 ± 0.167
1.818CysAsn: 1.818 ± 1.005
0.779CysPro: 0.779 ± 0.381
1.039CysGln: 1.039 ± 0.339
0.26CysArg: 0.26 ± 0.167
1.558CysSer: 1.558 ± 1.121
2.077CysThr: 2.077 ± 1.998
0.519CysVal: 0.519 ± 0.499
0.0CysTrp: 0.0 ± 0.0
1.039CysTyr: 1.039 ± 0.311
0.0CysXaa: 0.0 ± 0.0
Asp
1.298AspAla: 1.298 ± 0.835
1.558AspCys: 1.558 ± 1.121
4.414AspAsp: 4.414 ± 1.736
2.856AspGlu: 2.856 ± 0.872
4.155AspPhe: 4.155 ± 1.466
2.077AspGly: 2.077 ± 0.983
0.26AspHis: 0.26 ± 0.25
6.751AspIle: 6.751 ± 0.831
4.155AspLys: 4.155 ± 1.035
6.492AspLeu: 6.492 ± 0.736
1.039AspMet: 1.039 ± 0.893
4.414AspAsn: 4.414 ± 1.148
2.856AspPro: 2.856 ± 1.652
1.298AspGln: 1.298 ± 0.323
2.077AspArg: 2.077 ± 0.679
2.337AspSer: 2.337 ± 0.915
2.856AspThr: 2.856 ± 0.776
2.856AspVal: 2.856 ± 0.776
0.0AspTrp: 0.0 ± 0.0
3.116AspTyr: 3.116 ± 1.31
0.0AspXaa: 0.0 ± 0.0
Glu
2.337GluAla: 2.337 ± 1.63
1.558GluCys: 1.558 ± 1.498
3.635GluAsp: 3.635 ± 1.325
4.155GluGlu: 4.155 ± 1.172
3.895GluPhe: 3.895 ± 1.483
2.077GluGly: 2.077 ± 0.622
2.077GluHis: 2.077 ± 0.586
8.05GluIle: 8.05 ± 1.177
4.414GluLys: 4.414 ± 1.122
6.232GluLeu: 6.232 ± 1.866
2.856GluMet: 2.856 ± 0.457
3.376GluAsn: 3.376 ± 0.838
2.597GluPro: 2.597 ± 0.735
2.597GluGln: 2.597 ± 1.022
2.597GluArg: 2.597 ± 0.735
4.155GluSer: 4.155 ± 1.074
3.116GluThr: 3.116 ± 0.818
3.116GluVal: 3.116 ± 1.478
0.26GluTrp: 0.26 ± 0.167
1.818GluTyr: 1.818 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
1.558PheAla: 1.558 ± 0.67
1.298PheCys: 1.298 ± 0.323
3.895PheAsp: 3.895 ± 1.586
3.376PheGlu: 3.376 ± 0.838
2.597PhePhe: 2.597 ± 1.022
2.077PheGly: 2.077 ± 0.561
0.26PheHis: 0.26 ± 0.167
4.155PheIle: 4.155 ± 0.281
4.934PheLys: 4.934 ± 0.918
4.674PheLeu: 4.674 ± 0.607
1.558PheMet: 1.558 ± 1.776
3.116PheAsn: 3.116 ± 1.31
1.039PhePro: 1.039 ± 0.893
1.298PheGln: 1.298 ± 0.873
2.337PheArg: 2.337 ± 1.27
3.895PheSer: 3.895 ± 0.529
4.674PheThr: 4.674 ± 1.407
1.298PheVal: 1.298 ± 0.494
0.26PheTrp: 0.26 ± 0.167
1.298PheTyr: 1.298 ± 0.765
0.0PheXaa: 0.0 ± 0.0
Gly
1.298GlyAla: 1.298 ± 0.765
2.077GlyCys: 2.077 ± 0.905
2.077GlyAsp: 2.077 ± 0.774
2.856GlyGlu: 2.856 ± 0.872
1.298GlyPhe: 1.298 ± 0.323
1.039GlyGly: 1.039 ± 0.625
0.779GlyHis: 0.779 ± 0.871
3.116GlyIle: 3.116 ± 1.886
2.337GlyLys: 2.337 ± 0.611
4.674GlyLeu: 4.674 ± 1.159
1.298GlyMet: 1.298 ± 0.873
3.635GlyAsn: 3.635 ± 0.449
1.558GlyPro: 1.558 ± 0.467
1.818GlyGln: 1.818 ± 0.679
1.039GlyArg: 1.039 ± 0.339
3.116GlySer: 3.116 ± 2.242
1.818GlyThr: 1.818 ± 1.005
1.558GlyVal: 1.558 ± 1.742
0.519GlyTrp: 0.519 ± 0.156
1.298GlyTyr: 1.298 ± 2.846
0.0GlyXaa: 0.0 ± 0.0
His
1.558HisAla: 1.558 ± 0.409
0.26HisCys: 0.26 ± 0.25
0.26HisAsp: 0.26 ± 0.167
0.779HisGlu: 0.779 ± 0.381
1.039HisPhe: 1.039 ± 0.339
2.077HisGly: 2.077 ± 1.59
0.779HisHis: 0.779 ± 0.204
0.519HisIle: 0.519 ± 0.156
1.818HisLys: 1.818 ± 0.675
2.856HisLeu: 2.856 ± 0.457
0.26HisMet: 0.26 ± 0.25
1.818HisAsn: 1.818 ± 0.535
0.26HisPro: 0.26 ± 0.25
0.26HisGln: 0.26 ± 0.167
0.779HisArg: 0.779 ± 1.013
1.818HisSer: 1.818 ± 0.463
1.298HisThr: 1.298 ± 0.873
1.039HisVal: 1.039 ± 0.311
0.0HisTrp: 0.0 ± 0.0
1.039HisTyr: 1.039 ± 0.668
0.0HisXaa: 0.0 ± 0.0
Ile
6.492IleAla: 6.492 ± 1.234
2.337IleCys: 2.337 ± 1.497
6.751IleAsp: 6.751 ± 1.675
8.31IleGlu: 8.31 ± 0.171
4.155IlePhe: 4.155 ± 1.077
4.155IleGly: 4.155 ± 1.697
1.818IleHis: 1.818 ± 0.535
7.011IleIle: 7.011 ± 2.217
8.31IleLys: 8.31 ± 1.009
9.089IleLeu: 9.089 ± 2.346
1.039IleMet: 1.039 ± 0.718
6.751IleAsn: 6.751 ± 1.555
3.376IlePro: 3.376 ± 0.838
3.635IleGln: 3.635 ± 0.927
2.077IleArg: 2.077 ± 0.622
7.79IleSer: 7.79 ± 2.0
5.453IleThr: 5.453 ± 2.038
2.597IleVal: 2.597 ± 0.478
1.298IleTrp: 1.298 ± 0.922
2.856IleTyr: 2.856 ± 0.357
0.0IleXaa: 0.0 ± 0.0
Lys
3.895LysAla: 3.895 ± 1.705
1.818LysCys: 1.818 ± 1.005
4.934LysAsp: 4.934 ± 0.892
7.531LysGlu: 7.531 ± 2.078
3.635LysPhe: 3.635 ± 0.397
4.934LysGly: 4.934 ± 0.387
2.077LysHis: 2.077 ± 0.679
7.011LysIle: 7.011 ± 1.419
6.751LysLys: 6.751 ± 2.564
6.492LysLeu: 6.492 ± 0.638
3.635LysMet: 3.635 ± 1.07
4.674LysAsn: 4.674 ± 1.243
1.558LysPro: 1.558 ± 0.467
3.376LysGln: 3.376 ± 2.393
3.376LysArg: 3.376 ± 1.17
7.531LysSer: 7.531 ± 0.92
4.674LysThr: 4.674 ± 1.223
4.155LysVal: 4.155 ± 0.511
0.519LysTrp: 0.519 ± 0.156
3.116LysTyr: 3.116 ± 0.783
0.0LysXaa: 0.0 ± 0.0
Leu
4.414LeuAla: 4.414 ± 1.157
1.818LeuCys: 1.818 ± 1.37
7.79LeuAsp: 7.79 ± 3.276
5.453LeuGlu: 5.453 ± 1.431
6.232LeuPhe: 6.232 ± 1.636
3.895LeuGly: 3.895 ± 1.288
2.337LeuHis: 2.337 ± 0.611
9.868LeuIle: 9.868 ± 2.329
9.348LeuLys: 9.348 ± 0.441
12.464LeuLeu: 12.464 ± 3.527
2.597LeuMet: 2.597 ± 0.735
4.414LeuAsn: 4.414 ± 1.105
5.713LeuPro: 5.713 ± 0.714
2.337LeuGln: 2.337 ± 0.611
3.635LeuArg: 3.635 ± 0.449
6.232LeuSer: 6.232 ± 2.753
5.713LeuThr: 5.713 ± 1.641
3.635LeuVal: 3.635 ± 1.07
0.26LeuTrp: 0.26 ± 0.25
2.856LeuTyr: 2.856 ± 1.149
0.0LeuXaa: 0.0 ± 0.0
Met
0.779MetAla: 0.779 ± 1.912
0.519MetCys: 0.519 ± 0.156
2.077MetAsp: 2.077 ± 0.586
1.039MetGlu: 1.039 ± 0.339
1.039MetPhe: 1.039 ± 0.795
0.26MetGly: 0.26 ± 0.25
1.298MetHis: 1.298 ± 0.526
2.856MetIle: 2.856 ± 0.599
3.376MetLys: 3.376 ± 1.473
2.856MetLeu: 2.856 ± 1.149
1.558MetMet: 1.558 ± 0.655
1.558MetAsn: 1.558 ± 0.409
0.779MetPro: 0.779 ± 0.204
1.039MetGln: 1.039 ± 1.858
1.818MetArg: 1.818 ± 1.058
3.116MetSer: 3.116 ± 2.411
0.779MetThr: 0.779 ± 0.204
2.077MetVal: 2.077 ± 0.622
0.26MetTrp: 0.26 ± 0.167
2.077MetTyr: 2.077 ± 0.622
0.0MetXaa: 0.0 ± 0.0
Asn
3.116AsnAla: 3.116 ± 1.355
0.779AsnCys: 0.779 ± 0.749
3.895AsnAsp: 3.895 ± 0.153
2.077AsnGlu: 2.077 ± 0.517
4.155AsnPhe: 4.155 ± 1.077
2.597AsnGly: 2.597 ± 1.492
1.298AsnHis: 1.298 ± 0.323
5.972AsnIle: 5.972 ± 1.481
3.376AsnLys: 3.376 ± 0.303
6.492AsnLeu: 6.492 ± 1.039
1.558AsnMet: 1.558 ± 0.655
4.155AsnAsn: 4.155 ± 2.142
2.856AsnPro: 2.856 ± 0.457
2.337AsnGln: 2.337 ± 1.807
2.597AsnArg: 2.597 ± 0.778
3.635AsnSer: 3.635 ± 0.397
2.856AsnThr: 2.856 ± 0.95
2.077AsnVal: 2.077 ± 0.622
1.039AsnTrp: 1.039 ± 0.311
3.895AsnTyr: 3.895 ± 1.483
0.0AsnXaa: 0.0 ± 0.0
Pro
1.298ProAla: 1.298 ± 0.494
0.26ProCys: 0.26 ± 0.25
1.818ProAsp: 1.818 ± 0.463
3.635ProGlu: 3.635 ± 1.168
2.337ProPhe: 2.337 ± 0.613
2.337ProGly: 2.337 ± 1.536
0.519ProHis: 0.519 ± 0.156
3.895ProIle: 3.895 ± 1.853
1.298ProLys: 1.298 ± 0.494
2.597ProLeu: 2.597 ± 0.763
1.039ProMet: 1.039 ± 0.339
2.077ProAsn: 2.077 ± 0.827
0.519ProPro: 0.519 ± 0.334
0.519ProGln: 0.519 ± 0.156
0.26ProArg: 0.26 ± 0.167
2.077ProSer: 2.077 ± 1.157
2.856ProThr: 2.856 ± 1.993
2.077ProVal: 2.077 ± 0.517
0.26ProTrp: 0.26 ± 0.99
1.558ProTyr: 1.558 ± 0.655
0.0ProXaa: 0.0 ± 0.0
Gln
2.077GlnAla: 2.077 ± 2.651
0.26GlnCys: 0.26 ± 0.25
1.818GlnAsp: 1.818 ± 1.058
1.818GlnGlu: 1.818 ± 0.463
2.077GlnPhe: 2.077 ± 1.644
1.298GlnGly: 1.298 ± 0.323
0.779GlnHis: 0.779 ± 0.381
4.674GlnIle: 4.674 ± 1.966
3.635GlnLys: 3.635 ± 2.374
2.856GlnLeu: 2.856 ± 1.326
1.298GlnMet: 1.298 ± 1.783
1.818GlnAsn: 1.818 ± 0.766
0.779GlnPro: 0.779 ± 0.381
1.558GlnGln: 1.558 ± 1.722
2.597GlnArg: 2.597 ± 0.678
1.818GlnSer: 1.818 ± 0.535
1.818GlnThr: 1.818 ± 0.819
0.779GlnVal: 0.779 ± 0.204
0.26GlnTrp: 0.26 ± 0.167
1.298GlnTyr: 1.298 ± 0.749
0.0GlnXaa: 0.0 ± 0.0
Arg
1.818ArgAla: 1.818 ± 1.058
1.298ArgCys: 1.298 ± 0.526
1.818ArgAsp: 1.818 ± 0.819
3.635ArgGlu: 3.635 ± 0.927
1.298ArgPhe: 1.298 ± 0.526
0.26ArgGly: 0.26 ± 0.167
1.298ArgHis: 1.298 ± 0.323
3.895ArgIle: 3.895 ± 1.483
2.597ArgLys: 2.597 ± 0.989
5.193ArgLeu: 5.193 ± 0.868
0.519ArgMet: 0.519 ± 0.164
1.818ArgAsn: 1.818 ± 0.819
0.519ArgPro: 0.519 ± 0.156
1.558ArgGln: 1.558 ± 1.792
1.818ArgArg: 1.818 ± 0.819
2.077ArgSer: 2.077 ± 0.679
2.077ArgThr: 2.077 ± 0.679
1.039ArgVal: 1.039 ± 0.668
0.519ArgTrp: 0.519 ± 0.156
2.856ArgTyr: 2.856 ± 1.399
0.0ArgXaa: 0.0 ± 0.0
Ser
1.818SerAla: 1.818 ± 0.463
3.116SerCys: 3.116 ± 2.242
3.635SerAsp: 3.635 ± 0.927
4.414SerGlu: 4.414 ± 1.186
2.337SerPhe: 2.337 ± 1.643
2.597SerGly: 2.597 ± 2.6
0.779SerHis: 0.779 ± 0.896
7.271SerIle: 7.271 ± 1.839
6.492SerLys: 6.492 ± 0.831
6.751SerLeu: 6.751 ± 2.34
2.597SerMet: 2.597 ± 0.434
2.597SerAsn: 2.597 ± 1.498
1.558SerPro: 1.558 ± 0.467
2.597SerGln: 2.597 ± 0.989
3.635SerArg: 3.635 ± 1.325
3.635SerSer: 3.635 ± 0.777
5.453SerThr: 5.453 ± 0.329
4.155SerVal: 4.155 ± 0.511
0.0SerTrp: 0.0 ± 0.0
2.337SerTyr: 2.337 ± 0.611
0.0SerXaa: 0.0 ± 0.0
Thr
3.116ThrAla: 3.116 ± 0.818
2.856ThrCys: 2.856 ± 1.63
2.077ThrAsp: 2.077 ± 0.827
3.116ThrGlu: 3.116 ± 1.018
2.077ThrPhe: 2.077 ± 1.157
2.337ThrGly: 2.337 ± 1.497
0.519ThrHis: 0.519 ± 0.156
5.453ThrIle: 5.453 ± 1.39
5.193ThrLys: 5.193 ± 1.357
2.856ThrLeu: 2.856 ± 0.762
2.597ThrMet: 2.597 ± 0.478
4.155ThrAsn: 4.155 ± 0.511
2.077ThrPro: 2.077 ± 0.679
1.558ThrGln: 1.558 ± 0.409
1.818ThrArg: 1.818 ± 0.463
4.155ThrSer: 4.155 ± 1.244
3.376ThrThr: 3.376 ± 0.996
3.116ThrVal: 3.116 ± 1.63
1.039ThrTrp: 1.039 ± 0.882
4.934ThrTyr: 4.934 ± 1.377
0.0ThrXaa: 0.0 ± 0.0
Val
3.116ValAla: 3.116 ± 0.783
1.298ValCys: 1.298 ± 0.323
2.077ValAsp: 2.077 ± 1.25
2.856ValGlu: 2.856 ± 1.561
1.818ValPhe: 1.818 ± 0.463
0.779ValGly: 0.779 ± 0.381
1.818ValHis: 1.818 ± 0.931
2.856ValIle: 2.856 ± 1.399
2.856ValLys: 2.856 ± 0.357
4.414ValLeu: 4.414 ± 1.726
1.558ValMet: 1.558 ± 0.655
1.818ValAsn: 1.818 ± 1.659
1.298ValPro: 1.298 ± 0.526
2.337ValGln: 2.337 ± 0.611
0.779ValArg: 0.779 ± 0.501
4.414ValSer: 4.414 ± 2.016
1.818ValThr: 1.818 ± 0.607
1.298ValVal: 1.298 ± 0.494
0.26ValTrp: 0.26 ± 0.99
2.337ValTyr: 2.337 ± 0.611
0.0ValXaa: 0.0 ± 0.0
Trp
0.519TrpAla: 0.519 ± 0.334
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.519TrpGlu: 0.519 ± 0.929
0.519TrpPhe: 0.519 ± 0.156
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.519TrpIle: 0.519 ± 0.156
0.519TrpLys: 0.519 ± 0.97
1.558TrpLeu: 1.558 ± 0.467
0.519TrpMet: 0.519 ± 0.97
0.519TrpAsn: 0.519 ± 0.334
0.0TrpPro: 0.0 ± 0.0
0.26TrpGln: 0.26 ± 0.167
0.26TrpArg: 0.26 ± 0.167
1.039TrpSer: 1.039 ± 0.668
0.779TrpThr: 0.779 ± 0.871
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.818TyrAla: 1.818 ± 0.675
0.519TyrCys: 0.519 ± 0.156
1.298TyrAsp: 1.298 ± 0.494
2.337TyrGlu: 2.337 ± 0.831
1.818TyrPhe: 1.818 ± 0.766
0.779TyrGly: 0.779 ± 0.381
0.779TyrHis: 0.779 ± 0.381
6.232TyrIle: 6.232 ± 1.636
5.193TyrLys: 5.193 ± 0.724
4.155TyrLeu: 4.155 ± 1.654
1.818TyrMet: 1.818 ± 0.607
2.597TyrAsn: 2.597 ± 0.478
1.558TyrPro: 1.558 ± 0.67
2.337TyrGln: 2.337 ± 0.473
1.298TyrArg: 1.298 ± 0.494
1.818TyrSer: 1.818 ± 0.679
2.077TyrThr: 2.077 ± 0.983
2.337TyrVal: 2.337 ± 0.621
0.26TyrTrp: 0.26 ± 0.167
1.039TyrTyr: 1.039 ± 0.625
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3852 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski