Amino acid dipepetide frequency for Changjiang hepe-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.386AlaAla: 8.386 ± 0.705
2.096AlaCys: 2.096 ± 0.618
4.792AlaAsp: 4.792 ± 0.948
4.492AlaGlu: 4.492 ± 1.343
3.594AlaPhe: 3.594 ± 1.109
5.391AlaGly: 5.391 ± 0.798
2.396AlaHis: 2.396 ± 0.936
8.086AlaIle: 8.086 ± 1.179
2.695AlaLys: 2.695 ± 0.982
8.086AlaLeu: 8.086 ± 1.881
1.198AlaMet: 1.198 ± 0.62
6.888AlaAsn: 6.888 ± 1.368
3.893AlaPro: 3.893 ± 1.369
3.594AlaGln: 3.594 ± 0.993
5.391AlaArg: 5.391 ± 2.055
7.188AlaSer: 7.188 ± 1.823
6.589AlaThr: 6.589 ± 1.131
5.391AlaVal: 5.391 ± 1.106
1.198AlaTrp: 1.198 ± 0.566
3.594AlaTyr: 3.594 ± 1.263
0.0AlaXaa: 0.0 ± 0.0
Cys
2.096CysAla: 2.096 ± 1.34
0.898CysCys: 0.898 ± 1.449
1.797CysAsp: 1.797 ± 0.747
0.898CysGlu: 0.898 ± 0.465
1.797CysPhe: 1.797 ± 0.641
0.898CysGly: 0.898 ± 0.356
0.299CysHis: 0.299 ± 0.155
0.599CysIle: 0.599 ± 0.364
0.0CysLys: 0.0 ± 0.0
0.599CysLeu: 0.599 ± 0.31
0.299CysMet: 0.299 ± 0.758
0.599CysAsn: 0.599 ± 0.31
0.898CysPro: 0.898 ± 0.356
0.299CysGln: 0.299 ± 0.155
0.599CysArg: 0.599 ± 1.515
1.198CysSer: 1.198 ± 0.997
0.299CysThr: 0.299 ± 0.155
0.599CysVal: 0.599 ± 0.364
0.299CysTrp: 0.299 ± 0.758
0.299CysTyr: 0.299 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
5.091AspAla: 5.091 ± 0.905
0.599AspCys: 0.599 ± 0.31
5.091AspAsp: 5.091 ± 1.555
2.096AspGlu: 2.096 ± 1.057
3.594AspPhe: 3.594 ± 1.282
4.193AspGly: 4.193 ± 1.422
1.497AspHis: 1.497 ± 0.703
4.193AspIle: 4.193 ± 2.168
1.497AspLys: 1.497 ± 0.774
5.391AspLeu: 5.391 ± 1.226
1.797AspMet: 1.797 ± 0.611
3.594AspAsn: 3.594 ± 0.993
3.294AspPro: 3.294 ± 0.628
2.096AspGln: 2.096 ± 0.614
3.294AspArg: 3.294 ± 1.306
2.995AspSer: 2.995 ± 1.053
2.096AspThr: 2.096 ± 1.062
4.792AspVal: 4.792 ± 1.215
1.797AspTrp: 1.797 ± 0.446
1.797AspTyr: 1.797 ± 0.747
0.0AspXaa: 0.0 ± 0.0
Glu
2.995GluAla: 2.995 ± 0.799
0.299GluCys: 0.299 ± 0.155
3.294GluAsp: 3.294 ± 1.346
2.396GluGlu: 2.396 ± 0.905
1.198GluPhe: 1.198 ± 0.62
1.497GluGly: 1.497 ± 0.51
2.995GluHis: 2.995 ± 1.649
2.396GluIle: 2.396 ± 2.347
2.995GluLys: 2.995 ± 0.828
4.792GluLeu: 4.792 ± 1.123
1.198GluMet: 1.198 ± 0.62
1.497GluAsn: 1.497 ± 1.093
2.096GluPro: 2.096 ± 1.084
1.497GluGln: 1.497 ± 0.665
1.797GluArg: 1.797 ± 0.749
1.198GluSer: 1.198 ± 0.62
3.594GluThr: 3.594 ± 0.993
4.193GluVal: 4.193 ± 0.867
0.599GluTrp: 0.599 ± 0.587
1.198GluTyr: 1.198 ± 0.62
0.0GluXaa: 0.0 ± 0.0
Phe
2.995PheAla: 2.995 ± 2.229
0.299PheCys: 0.299 ± 0.758
2.995PheAsp: 2.995 ± 0.766
1.797PheGlu: 1.797 ± 0.929
1.497PhePhe: 1.497 ± 0.51
3.594PheGly: 3.594 ± 1.441
1.497PheHis: 1.497 ± 0.792
3.893PheIle: 3.893 ± 1.043
2.096PheLys: 2.096 ± 0.765
2.096PheLeu: 2.096 ± 0.618
0.599PheMet: 0.599 ± 0.455
2.096PheAsn: 2.096 ± 0.668
0.898PhePro: 0.898 ± 0.745
2.096PheGln: 2.096 ± 0.759
2.396PheArg: 2.396 ± 1.318
1.797PheSer: 1.797 ± 0.631
1.497PheThr: 1.497 ± 1.008
2.995PheVal: 2.995 ± 0.544
0.0PheTrp: 0.0 ± 0.0
1.797PheTyr: 1.797 ± 0.631
0.0PheXaa: 0.0 ± 0.0
Gly
5.99GlyAla: 5.99 ± 1.015
0.299GlyCys: 0.299 ± 0.155
3.594GlyAsp: 3.594 ± 1.026
1.497GlyGlu: 1.497 ± 0.51
2.096GlyPhe: 2.096 ± 0.74
2.995GlyGly: 2.995 ± 2.091
1.797GlyHis: 1.797 ± 0.749
1.497GlyIle: 1.497 ± 1.336
4.792GlyLys: 4.792 ± 1.193
3.594GlyLeu: 3.594 ± 1.396
1.198GlyMet: 1.198 ± 0.549
1.797GlyAsn: 1.797 ± 0.713
1.497GlyPro: 1.497 ± 0.61
1.497GlyGln: 1.497 ± 0.792
4.193GlyArg: 4.193 ± 1.766
3.294GlySer: 3.294 ± 1.475
5.091GlyThr: 5.091 ± 1.037
3.893GlyVal: 3.893 ± 0.877
0.299GlyTrp: 0.299 ± 0.155
3.594GlyTyr: 3.594 ± 1.263
0.0GlyXaa: 0.0 ± 0.0
His
2.995HisAla: 2.995 ± 1.197
1.198HisCys: 1.198 ± 0.678
1.198HisAsp: 1.198 ± 0.412
1.797HisGlu: 1.797 ± 0.594
0.898HisPhe: 0.898 ± 0.465
1.797HisGly: 1.797 ± 0.446
2.396HisHis: 2.396 ± 1.326
2.096HisIle: 2.096 ± 0.74
1.497HisLys: 1.497 ± 0.72
1.797HisLeu: 1.797 ± 1.092
0.898HisMet: 0.898 ± 0.465
0.599HisAsn: 0.599 ± 0.31
3.294HisPro: 3.294 ± 0.685
1.198HisGln: 1.198 ± 1.533
1.497HisArg: 1.497 ± 0.61
1.797HisSer: 1.797 ± 1.342
2.396HisThr: 2.396 ± 1.426
2.396HisVal: 2.396 ± 0.99
0.599HisTrp: 0.599 ± 0.31
1.198HisTyr: 1.198 ± 0.531
0.0HisXaa: 0.0 ± 0.0
Ile
8.985IleAla: 8.985 ± 1.176
0.898IleCys: 0.898 ± 0.671
2.096IleAsp: 2.096 ± 0.74
1.497IleGlu: 1.497 ± 0.666
2.396IlePhe: 2.396 ± 0.775
3.294IleGly: 3.294 ± 0.535
3.294IleHis: 3.294 ± 1.408
3.294IleIle: 3.294 ± 1.959
3.294IleLys: 3.294 ± 0.685
3.893IleLeu: 3.893 ± 0.89
0.898IleMet: 0.898 ± 0.356
5.091IleAsn: 5.091 ± 1.561
2.396IlePro: 2.396 ± 0.89
2.396IleGln: 2.396 ± 1.773
2.396IleArg: 2.396 ± 1.061
3.594IleSer: 3.594 ± 2.613
5.69IleThr: 5.69 ± 0.896
4.193IleVal: 4.193 ± 1.012
0.898IleTrp: 0.898 ± 0.671
1.198IleTyr: 1.198 ± 0.62
0.0IleXaa: 0.0 ± 0.0
Lys
4.193LysAla: 4.193 ± 1.277
0.299LysCys: 0.299 ± 0.431
1.198LysAsp: 1.198 ± 0.412
1.497LysGlu: 1.497 ± 0.774
1.797LysPhe: 1.797 ± 0.715
0.898LysGly: 0.898 ± 0.465
1.497LysHis: 1.497 ± 0.72
5.091LysIle: 5.091 ± 1.494
2.695LysLys: 2.695 ± 1.05
3.893LysLeu: 3.893 ± 0.551
0.898LysMet: 0.898 ± 0.538
1.797LysAsn: 1.797 ± 0.914
3.594LysPro: 3.594 ± 0.616
3.893LysGln: 3.893 ± 1.541
2.995LysArg: 2.995 ± 1.549
2.695LysSer: 2.695 ± 1.508
4.193LysThr: 4.193 ± 0.574
3.594LysVal: 3.594 ± 1.235
0.0LysTrp: 0.0 ± 0.0
1.198LysTyr: 1.198 ± 0.728
0.0LysXaa: 0.0 ± 0.0
Leu
8.086LeuAla: 8.086 ± 1.177
1.797LeuCys: 1.797 ± 1.047
5.69LeuAsp: 5.69 ± 1.954
5.69LeuGlu: 5.69 ± 1.876
2.396LeuPhe: 2.396 ± 0.715
5.091LeuGly: 5.091 ± 0.835
2.995LeuHis: 2.995 ± 1.236
4.492LeuIle: 4.492 ± 0.707
3.594LeuLys: 3.594 ± 1.026
8.386LeuLeu: 8.386 ± 0.871
1.198LeuMet: 1.198 ± 0.593
3.294LeuAsn: 3.294 ± 0.883
2.695LeuPro: 2.695 ± 1.069
4.492LeuGln: 4.492 ± 1.9
5.391LeuArg: 5.391 ± 0.999
4.792LeuSer: 4.792 ± 1.312
6.589LeuThr: 6.589 ± 1.901
5.391LeuVal: 5.391 ± 1.535
1.198LeuTrp: 1.198 ± 0.62
1.497LeuTyr: 1.497 ± 0.485
0.0LeuXaa: 0.0 ± 0.0
Met
0.599MetAla: 0.599 ± 0.31
0.299MetCys: 0.299 ± 0.155
0.299MetAsp: 0.299 ± 0.155
0.599MetGlu: 0.599 ± 0.31
0.599MetPhe: 0.599 ± 0.699
0.599MetGly: 0.599 ± 0.862
0.0MetHis: 0.0 ± 0.0
0.299MetIle: 0.299 ± 0.155
1.497MetLys: 1.497 ± 0.703
2.695MetLeu: 2.695 ± 0.914
0.299MetMet: 0.299 ± 0.758
1.198MetAsn: 1.198 ± 0.62
1.497MetPro: 1.497 ± 0.72
1.198MetGln: 1.198 ± 0.62
0.898MetArg: 0.898 ± 0.538
0.299MetSer: 0.299 ± 0.669
1.797MetThr: 1.797 ± 0.715
1.497MetVal: 1.497 ± 0.51
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
6.289AsnAla: 6.289 ± 1.351
0.599AsnCys: 0.599 ± 0.699
3.893AsnAsp: 3.893 ± 0.994
2.995AsnGlu: 2.995 ± 0.558
1.497AsnPhe: 1.497 ± 0.665
3.594AsnGly: 3.594 ± 0.921
0.898AsnHis: 0.898 ± 0.671
3.893AsnIle: 3.893 ± 1.314
2.695AsnLys: 2.695 ± 0.593
4.193AsnLeu: 4.193 ± 1.288
0.599AsnMet: 0.599 ± 0.345
2.995AsnAsn: 2.995 ± 1.623
0.299AsnPro: 0.299 ± 0.758
2.096AsnGln: 2.096 ± 3.314
2.396AsnArg: 2.396 ± 1.054
3.893AsnSer: 3.893 ± 3.201
3.893AsnThr: 3.893 ± 1.294
3.893AsnVal: 3.893 ± 1.323
0.0AsnTrp: 0.0 ± 0.0
2.695AsnTyr: 2.695 ± 0.921
0.0AsnXaa: 0.0 ± 0.0
Pro
4.193ProAla: 4.193 ± 0.919
0.599ProCys: 0.599 ± 0.587
3.594ProAsp: 3.594 ± 1.263
2.695ProGlu: 2.695 ± 1.394
0.299ProPhe: 0.299 ± 0.669
3.594ProGly: 3.594 ± 1.496
2.096ProHis: 2.096 ± 0.882
2.096ProIle: 2.096 ± 0.902
2.695ProLys: 2.695 ± 0.914
4.193ProLeu: 4.193 ± 0.961
0.599ProMet: 0.599 ± 0.31
2.995ProAsn: 2.995 ± 0.966
1.797ProPro: 1.797 ± 0.594
1.497ProGln: 1.497 ± 0.61
1.497ProArg: 1.497 ± 0.665
1.797ProSer: 1.797 ± 2.123
3.893ProThr: 3.893 ± 0.781
3.294ProVal: 3.294 ± 1.625
0.299ProTrp: 0.299 ± 0.155
1.198ProTyr: 1.198 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
2.995GlnAla: 2.995 ± 0.962
0.0GlnCys: 0.0 ± 0.0
1.497GlnAsp: 1.497 ± 0.774
1.497GlnGlu: 1.497 ± 0.51
4.193GlnPhe: 4.193 ± 2.121
1.497GlnGly: 1.497 ± 0.51
2.096GlnHis: 2.096 ± 0.765
2.695GlnIle: 2.695 ± 0.494
2.695GlnLys: 2.695 ± 0.927
5.99GlnLeu: 5.99 ± 2.894
0.0GlnMet: 0.0 ± 0.0
2.695GlnAsn: 2.695 ± 1.47
2.096GlnPro: 2.096 ± 0.793
5.391GlnGln: 5.391 ± 4.184
2.096GlnArg: 2.096 ± 0.758
2.096GlnSer: 2.096 ± 1.405
5.091GlnThr: 5.091 ± 3.958
2.695GlnVal: 2.695 ± 1.05
0.0GlnTrp: 0.0 ± 0.0
1.797GlnTyr: 1.797 ± 0.715
0.0GlnXaa: 0.0 ± 0.0
Arg
5.391ArgAla: 5.391 ± 1.465
0.599ArgCys: 0.599 ± 0.31
3.594ArgAsp: 3.594 ± 1.116
2.995ArgGlu: 2.995 ± 1.524
2.995ArgPhe: 2.995 ± 0.962
2.096ArgGly: 2.096 ± 0.765
1.198ArgHis: 1.198 ± 0.412
2.995ArgIle: 2.995 ± 1.385
1.198ArgLys: 1.198 ± 0.678
4.193ArgLeu: 4.193 ± 2.133
0.599ArgMet: 0.599 ± 0.31
3.594ArgAsn: 3.594 ± 1.073
2.096ArgPro: 2.096 ± 0.754
3.594ArgGln: 3.594 ± 1.58
5.091ArgArg: 5.091 ± 1.715
5.69ArgSer: 5.69 ± 2.347
3.594ArgThr: 3.594 ± 1.498
4.792ArgVal: 4.792 ± 1.282
0.898ArgTrp: 0.898 ± 0.465
1.797ArgTyr: 1.797 ± 1.239
0.0ArgXaa: 0.0 ± 0.0
Ser
7.188SerAla: 7.188 ± 2.492
1.497SerCys: 1.497 ± 0.703
2.096SerAsp: 2.096 ± 0.765
2.396SerGlu: 2.396 ± 1.723
3.294SerPhe: 3.294 ± 1.248
3.594SerGly: 3.594 ± 1.231
2.396SerHis: 2.396 ± 0.543
3.294SerIle: 3.294 ± 1.968
2.396SerLys: 2.396 ± 0.715
5.99SerLeu: 5.99 ± 3.045
0.898SerMet: 0.898 ± 0.783
3.594SerAsn: 3.594 ± 1.583
1.797SerPro: 1.797 ± 1.516
4.193SerGln: 4.193 ± 1.54
4.193SerArg: 4.193 ± 2.771
5.091SerSer: 5.091 ± 3.617
2.695SerThr: 2.695 ± 1.05
2.396SerVal: 2.396 ± 0.824
0.299SerTrp: 0.299 ± 0.155
1.198SerTyr: 1.198 ± 0.678
0.0SerXaa: 0.0 ± 0.0
Thr
3.893ThrAla: 3.893 ± 1.478
1.198ThrCys: 1.198 ± 0.678
5.091ThrAsp: 5.091 ± 1.211
1.797ThrGlu: 1.797 ± 0.715
1.497ThrPhe: 1.497 ± 0.485
4.492ThrGly: 4.492 ± 0.997
1.497ThrHis: 1.497 ± 0.774
5.99ThrIle: 5.99 ± 1.471
3.294ThrLys: 3.294 ± 1.254
6.289ThrLeu: 6.289 ± 0.711
1.198ThrMet: 1.198 ± 0.412
5.391ThrAsn: 5.391 ± 4.407
5.69ThrPro: 5.69 ± 1.236
2.995ThrGln: 2.995 ± 2.091
4.492ThrArg: 4.492 ± 1.301
3.893ThrSer: 3.893 ± 1.411
5.091ThrThr: 5.091 ± 0.862
5.391ThrVal: 5.391 ± 1.244
0.599ThrTrp: 0.599 ± 0.699
0.898ThrTyr: 0.898 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
7.188ValAla: 7.188 ± 2.418
1.198ValCys: 1.198 ± 0.62
4.792ValAsp: 4.792 ± 0.93
2.396ValGlu: 2.396 ± 0.905
2.096ValPhe: 2.096 ± 0.908
3.594ValGly: 3.594 ± 1.073
1.497ValHis: 1.497 ± 0.774
3.594ValIle: 3.594 ± 1.235
5.391ValLys: 5.391 ± 1.632
4.492ValLeu: 4.492 ± 1.875
0.299ValMet: 0.299 ± 0.155
2.695ValAsn: 2.695 ± 0.921
4.792ValPro: 4.792 ± 1.037
3.893ValGln: 3.893 ± 1.795
5.99ValArg: 5.99 ± 1.418
4.193ValSer: 4.193 ± 1.386
4.193ValThr: 4.193 ± 1.144
3.893ValVal: 3.893 ± 0.447
0.299ValTrp: 0.299 ± 0.155
2.096ValTyr: 2.096 ± 0.458
0.0ValXaa: 0.0 ± 0.0
Trp
1.497TrpAla: 1.497 ± 0.568
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.599TrpPhe: 0.599 ± 0.587
0.299TrpGly: 0.299 ± 0.431
0.0TrpHis: 0.0 ± 0.0
0.898TrpIle: 0.898 ± 0.465
0.0TrpLys: 0.0 ± 0.0
1.497TrpLeu: 1.497 ± 0.72
0.299TrpMet: 0.299 ± 0.155
0.299TrpAsn: 0.299 ± 0.155
0.0TrpPro: 0.0 ± 0.0
0.299TrpGln: 0.299 ± 0.155
0.599TrpArg: 0.599 ± 0.31
0.898TrpSer: 0.898 ± 0.465
0.599TrpThr: 0.599 ± 1.247
1.497TrpVal: 1.497 ± 1.115
0.0TrpTrp: 0.0 ± 0.0
0.599TrpTyr: 0.599 ± 0.364
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.893TyrAla: 3.893 ± 0.91
0.599TyrCys: 0.599 ± 0.699
4.193TyrAsp: 4.193 ± 1.197
2.695TyrGlu: 2.695 ± 1.05
0.898TyrPhe: 0.898 ± 0.671
1.497TyrGly: 1.497 ± 0.485
1.198TyrHis: 1.198 ± 0.531
0.299TyrIle: 0.299 ± 0.155
0.898TyrLys: 0.898 ± 0.465
2.695TyrLeu: 2.695 ± 1.05
0.898TyrMet: 0.898 ± 0.356
0.599TyrAsn: 0.599 ± 0.725
0.299TyrPro: 0.299 ± 0.155
0.898TyrGln: 0.898 ± 0.538
1.797TyrArg: 1.797 ± 0.929
2.096TyrSer: 2.096 ± 0.754
1.797TyrThr: 1.797 ± 0.81
1.797TyrVal: 1.797 ± 0.631
0.599TyrTrp: 0.599 ± 0.587
2.096TyrTyr: 2.096 ± 0.765
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3340 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski