Amino acid dipeptide frequency for Gordil virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
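As an illustration of the quantity described above, the following is a minimal sketch of how dipeptide frequencies in per mille could be computed from a set of protein sequences. The function name `dipeptide_permille`, the counting of overlapping pairs within each protein, and the normalisation by the total number of dipeptides are assumptions made for this example; the exact procedure used by the database is not stated here.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected value if all 400 standard dipeptides were equally likely: 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        # Map any non-standard or unknown residue to X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Pairs never span the boundary between two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real Gordil virus proteins); "B" is treated as Xaa.
toy = ["ACAC", "AAB"]
freqs = dipeptide_permille(toy)
over = {p for p, f in freqs.items() if f > UNIFORM_PERMILLE}
```

In this toy input every observed dipeptide exceeds the uniform expectation simply because the sequences are tiny; with a real proteome the same comparison separates overrepresented from underrepresented pairs.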

For more information, see the sequence space article on Wikipedia.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.618 ± 1.745 | 1.63 ± 0.327 | 1.902 ± 0.658 | 3.532 ± 1.091 | 2.988 ± 0.779 | 2.445 ± 0.743 | 1.358 ± 0.31 | 4.89 ± 0.731 | 4.075 ± 1.775 | 4.347 ± 0.381 | 3.532 ± 1.331 | 2.988 ± 0.811 | 2.717 ± 0.761 | 0.272 ± 0.154 | 4.075 ± 1.241 | 4.618 ± 1.681 | 3.532 ± 0.949 | 2.988 ± 0.425 | 0.815 ± 0.427 | 0.815 ± 0.392 | 0.0 ± 0.0 |
| Cys | 0.543 ± 0.551 | 0.272 ± 0.154 | 1.087 ± 0.7 | 2.445 ± 0.584 | 1.63 ± 0.855 | 1.63 ± 0.327 | 1.087 ± 0.7 | 1.902 ± 0.837 | 2.445 ± 1.282 | 2.173 ± 1.399 | 0.272 ± 0.154 | 1.087 ± 0.7 | 0.815 ± 0.164 | 1.087 ± 0.272 | 0.815 ± 0.427 | 2.988 ± 1.077 | 1.63 ± 0.497 | 0.815 ± 0.649 | 0.0 ± 0.0 | 0.543 ± 0.166 | 0.0 ± 0.0 |
| Asp | 2.988 ± 0.672 | 1.902 ± 0.847 | 4.075 ± 1.633 | 4.618 ± 1.22 | 3.26 ± 1.41 | 2.445 ± 0.4 | 1.087 ± 0.507 | 5.162 ± 1.247 | 2.988 ± 1.203 | 5.162 ± 0.944 | 1.902 ± 1.081 | 1.087 ± 0.272 | 2.717 ± 0.576 | 1.087 ± 0.294 | 1.63 ± 0.279 | 4.075 ± 0.963 | 1.358 ± 0.44 | 1.902 ± 0.306 | 0.815 ± 0.424 | 1.358 ± 0.38 | 0.0 ± 0.0 |
| Glu | 5.705 ± 0.466 | 0.815 ± 0.826 | 3.803 ± 1.761 | 5.705 ± 1.142 | 3.803 ± 0.954 | 3.532 ± 0.64 | 0.815 ± 0.463 | 5.977 ± 1.374 | 5.433 ± 0.908 | 5.705 ± 1.564 | 2.717 ± 1.088 | 2.445 ± 0.746 | 1.63 ± 0.557 | 0.815 ± 0.359 | 3.532 ± 1.265 | 4.89 ± 1.551 | 4.075 ± 0.496 | 4.075 ± 0.69 | 0.815 ± 0.164 | 2.717 ± 0.475 | 0.0 ± 0.0 |
| Phe | 2.717 ± 1.302 | 1.902 ± 1.249 | 3.803 ± 0.739 | 3.532 ± 0.249 | 2.717 ± 0.576 | 1.902 ± 0.708 | 0.543 ± 0.166 | 3.26 ± 0.976 | 5.162 ± 1.498 | 4.075 ± 1.883 | 1.087 ± 0.571 | 3.26 ± 0.674 | 1.902 ± 1.064 | 0.543 ± 0.166 | 2.173 ± 0.627 | 3.803 ± 0.806 | 2.445 ± 1.127 | 3.26 ± 0.72 | 0.543 ± 0.166 | 0.815 ± 0.826 | 0.0 ± 0.0 |
| Gly | 2.717 ± 0.619 | 1.63 ± 0.497 | 2.717 ± 0.761 | 2.445 ± 0.973 | 3.26 ± 0.324 | 3.803 ± 1.498 | 1.63 ± 0.926 | 4.347 ± 0.381 | 2.717 ± 1.165 | 4.89 ± 0.615 | 1.358 ± 0.888 | 2.173 ± 1.026 | 2.445 ± 0.584 | 2.445 ± 0.677 | 1.902 ± 0.697 | 5.162 ± 1.554 | 1.902 ± 0.464 | 3.532 ± 0.73 | 0.543 ± 0.551 | 1.902 ± 0.242 | 0.0 ± 0.0 |
| His | 0.543 ± 0.166 | 0.815 ± 0.164 | 1.358 ± 0.41 | 0.543 ± 0.309 | 2.173 ± 0.721 | 2.717 ± 0.932 | 0.272 ± 0.154 | 1.63 ± 0.557 | 1.087 ± 0.272 | 1.902 ± 0.749 | 0.272 ± 0.275 | 0.815 ± 0.164 | 0.815 ± 0.392 | 0.815 ± 0.424 | 0.543 ± 0.166 | 2.445 ± 0.679 | 0.543 ± 0.166 | 1.902 ± 0.921 | 0.0 ± 0.0 | 1.087 ± 0.618 | 0.0 ± 0.0 |
| Ile | 5.162 ± 1.196 | 1.358 ± 0.303 | 4.347 ± 0.316 | 5.433 ± 0.982 | 2.445 ± 0.679 | 4.075 ± 1.545 | 1.358 ± 0.38 | 5.705 ± 1.142 | 6.52 ± 2.289 | 6.52 ± 1.092 | 2.445 ± 0.608 | 4.347 ± 0.786 | 2.445 ± 0.971 | 2.717 ± 0.475 | 5.162 ± 0.936 | 6.52 ± 1.298 | 2.445 ± 0.677 | 4.347 ± 1.018 | 0.272 ± 0.154 | 2.988 ± 0.669 | 0.0 ± 0.0 |
| Lys | 4.075 ± 1.08 | 1.63 ± 0.855 | 4.347 ± 0.302 | 3.803 ± 1.105 | 3.26 ± 1.108 | 3.803 ± 0.6 | 2.445 ± 0.604 | 5.433 ± 0.777 | 5.433 ± 1.173 | 4.89 ± 0.721 | 2.717 ± 0.98 | 2.445 ± 0.491 | 3.803 ± 0.797 | 2.988 ± 0.779 | 3.803 ± 1.897 | 4.075 ± 0.172 | 5.977 ± 0.558 | 4.618 ± 0.628 | 2.445 ± 0.622 | 1.902 ± 0.881 | 0.0 ± 0.0 |
| Leu | 3.803 ± 1.054 | 2.173 ± 0.442 | 2.445 ± 1.053 | 7.335 ± 0.946 | 5.433 ± 0.926 | 3.532 ± 0.895 | 1.902 ± 0.708 | 4.347 ± 0.302 | 6.52 ± 1.032 | 5.433 ± 1.41 | 2.988 ± 0.126 | 3.803 ± 0.084 | 2.445 ± 0.706 | 3.532 ± 0.618 | 5.433 ± 1.487 | 8.422 ± 1.189 | 4.618 ± 0.48 | 4.618 ± 0.228 | 0.815 ± 0.826 | 2.445 ± 0.677 | 0.0 ± 0.0 |
| Met | 3.532 ± 1.151 | 0.543 ± 0.166 | 1.902 ± 0.859 | 2.445 ± 0.679 | 1.63 ± 0.847 | 2.445 ± 0.491 | 0.815 ± 0.392 | 3.26 ± 1.252 | 0.272 ± 0.154 | 2.717 ± 1.165 | 4.075 ± 1.439 | 0.815 ± 0.164 | 0.543 ± 0.166 | 1.63 ± 0.705 | 1.087 ± 0.571 | 1.902 ± 0.478 | 2.717 ± 0.373 | 1.087 ± 0.318 | 0.0 ± 0.0 | 0.543 ± 0.309 | 0.0 ± 0.0 |
| Asn | 0.815 ± 0.164 | 0.815 ± 0.826 | 2.988 ± 0.779 | 2.988 ± 0.766 | 2.173 ± 0.447 | 1.902 ± 0.242 | 1.358 ± 0.736 | 1.63 ± 0.857 | 5.162 ± 0.815 | 5.433 ± 0.576 | 1.087 ± 0.991 | 1.087 ± 0.331 | 3.26 ± 0.593 | 1.087 ± 0.618 | 2.173 ± 0.543 | 3.803 ± 0.66 | 1.087 ± 0.618 | 0.815 ± 0.164 | 0.543 ± 0.366 | 2.445 ± 0.233 | 0.0 ± 0.0 |
| Pro | 1.902 ± 0.881 | 0.272 ± 0.275 | 0.543 ± 0.387 | 4.075 ± 1.08 | 1.358 ± 0.291 | 2.445 ± 0.54 | 0.543 ± 0.468 | 2.173 ± 0.295 | 2.717 ± 0.96 | 2.173 ± 0.721 | 1.087 ± 0.23 | 2.717 ± 0.373 | 0.543 ± 0.309 | 0.272 ± 0.275 | 1.63 ± 0.372 | 3.803 ± 0.503 | 2.988 ± 0.951 | 3.532 ± 1.9 | 1.63 ± 0.372 | 1.358 ± 0.618 | 0.0 ± 0.0 |
| Gln | 1.902 ± 0.637 | 1.087 ± 0.331 | 1.358 ± 0.448 | 2.445 ± 0.743 | 1.358 ± 0.291 | 2.173 ± 0.334 | 1.63 ± 0.926 | 3.803 ± 1.105 | 1.902 ± 1.064 | 1.358 ± 0.44 | 0.815 ± 0.424 | 0.272 ± 0.275 | 1.358 ± 0.587 | 1.63 ± 0.988 | 2.173 ± 0.636 | 2.445 ± 0.604 | 1.358 ± 0.291 | 0.815 ± 0.164 | 0.272 ± 0.275 | 1.087 ± 0.272 | 0.0 ± 0.0 |
| Arg | 3.532 ± 0.488 | 1.63 ± 0.279 | 3.532 ± 1.331 | 3.532 ± 0.649 | 1.087 ± 0.752 | 2.717 ± 1.206 | 0.543 ± 0.551 | 4.618 ± 0.662 | 2.445 ± 0.65 | 4.89 ± 1.93 | 0.815 ± 0.463 | 2.445 ± 0.935 | 1.087 ± 0.774 | 1.902 ± 0.658 | 2.445 ± 0.584 | 3.532 ± 0.249 | 2.717 ± 0.628 | 2.988 ± 1.042 | 1.358 ± 0.303 | 2.173 ± 0.353 | 0.0 ± 0.0 |
| Ser | 5.977 ± 1.325 | 2.988 ± 1.826 | 4.347 ± 0.97 | 4.075 ± 0.333 | 3.532 ± 0.719 | 4.89 ± 1.443 | 1.087 ± 0.272 | 6.52 ± 0.579 | 6.792 ± 1.177 | 6.248 ± 0.738 | 2.717 ± 0.582 | 3.803 ± 0.759 | 2.717 ± 0.578 | 3.26 ± 0.732 | 4.075 ± 0.333 | 6.792 ± 1.386 | 4.347 ± 1.661 | 7.063 ± 0.765 | 1.358 ± 0.303 | 2.445 ± 0.584 | 0.0 ± 0.0 |
| Thr | 2.173 ± 0.353 | 1.63 ± 1.248 | 3.26 ± 1.115 | 3.803 ± 0.76 | 1.63 ± 0.988 | 3.803 ± 0.503 | 1.087 ± 0.331 | 5.433 ± 0.821 | 3.26 ± 0.521 | 4.347 ± 0.699 | 0.815 ± 0.463 | 2.717 ± 0.576 | 2.988 ± 0.126 | 1.902 ± 0.421 | 2.173 ± 1.022 | 5.977 ± 0.895 | 4.89 ± 1.208 | 2.173 ± 1.013 | 0.0 ± 0.0 | 1.358 ± 1.163 | 0.0 ± 0.0 |
| Val | 3.803 ± 0.614 | 1.902 ± 0.749 | 2.173 ± 0.859 | 4.347 ± 1.162 | 2.988 ± 1.073 | 0.543 ± 0.309 | 1.358 ± 0.41 | 4.075 ± 1.737 | 4.618 ± 1.195 | 4.075 ± 1.813 | 1.902 ± 0.708 | 2.445 ± 0.491 | 1.358 ± 0.31 | 1.63 ± 0.279 | 3.26 ± 2.12 | 6.248 ± 0.738 | 3.803 ± 0.396 | 3.532 ± 0.405 | 0.815 ± 0.392 | 2.173 ± 0.686 | 0.0 ± 0.0 |
| Trp | 0.272 ± 0.403 | 0.272 ± 0.154 | 0.0 ± 0.0 | 0.815 ± 0.427 | 0.543 ± 0.309 | 0.815 ± 0.427 | 0.0 ± 0.0 | 0.543 ± 0.166 | 1.902 ± 0.55 | 1.902 ± 0.242 | 0.815 ± 0.164 | 0.815 ± 0.359 | 0.815 ± 0.781 | 0.272 ± 0.275 | 0.543 ± 0.309 | 0.543 ± 0.387 | 0.815 ± 0.392 | 0.815 ± 0.427 | 0.543 ± 0.166 | 0.815 ± 0.463 | 0.0 ± 0.0 |
| Tyr | 1.358 ± 0.41 | 0.0 ± 0.0 | 1.63 ± 0.988 | 1.087 ± 0.318 | 2.173 ± 0.353 | 2.173 ± 0.295 | 1.358 ± 0.303 | 1.902 ± 0.675 | 2.445 ± 0.404 | 3.803 ± 0.872 | 0.272 ± 0.154 | 1.087 ± 0.618 | 1.358 ± 0.448 | 1.358 ± 0.448 | 1.358 ± 0.828 | 2.988 ± 0.723 | 2.173 ± 0.295 | 2.173 ± 0.859 | 0.272 ± 0.154 | 0.272 ± 0.154 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 4 proteins (3682 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
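The note above can be read as resampling whole proteins with replacement and recomputing the frequency each time. The following is a minimal sketch under that reading; the helper names `pair_permille` and `bootstrap_sd`, the fixed seed, and the use of the standard deviation of the replicates as the reported error are assumptions for illustration, not the database's documented procedure.

```python
import random
from statistics import pstdev


def pair_permille(sequences, pair):
    """Frequency of one dipeptide (e.g. "LS") in per mille, overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Each round draws len(proteins) whole proteins with replacement,
    recomputes the frequency, and the spread of the replicates is returned.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return pstdev(estimates)


# Toy proteins (hypothetical, not the Gordil virus proteome).
toy_proteins = ["MLSLSA", "MKKLSV", "MAAAAG", "MLLLLS"]
error = bootstrap_sd(toy_proteins, "LS")
```

With only four proteins, as in this proteome, the replicates differ mainly in which proteins happen to be drawn twice or left out, which is why the reported errors can be large relative to the means.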

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski