Amino acid dipepetide frequency for Ground squirrel hepatitis virus (strain 27) (GSHV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.797AlaAla: 6.797 ± 1.98
1.699AlaCys: 1.699 ± 0.746
2.974AlaAsp: 2.974 ± 1.558
1.699AlaGlu: 1.699 ± 0.752
1.699AlaPhe: 1.699 ± 0.874
4.673AlaGly: 4.673 ± 1.35
0.425AlaHis: 0.425 ± 0.285
2.124AlaIle: 2.124 ± 0.817
0.425AlaLys: 0.425 ± 0.659
9.346AlaLeu: 9.346 ± 1.762
0.0AlaMet: 0.0 ± 0.0
1.699AlaAsn: 1.699 ± 1.142
2.549AlaPro: 2.549 ± 1.268
2.974AlaGln: 2.974 ± 1.169
5.098AlaArg: 5.098 ± 0.849
3.823AlaSer: 3.823 ± 1.335
2.549AlaThr: 2.549 ± 0.587
1.274AlaVal: 1.274 ± 0.856
0.85AlaTrp: 0.85 ± 0.385
3.398AlaTyr: 3.398 ± 0.935
0.0AlaXaa: 0.0 ± 0.0
Cys
0.425CysAla: 0.425 ± 0.659
3.398CysCys: 3.398 ± 1.898
0.0CysAsp: 0.0 ± 0.0
1.274CysGlu: 1.274 ± 0.856
0.425CysPhe: 0.425 ± 0.659
0.85CysGly: 0.85 ± 0.571
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.274CysLys: 1.274 ± 0.681
5.098CysLeu: 5.098 ± 1.204
1.699CysMet: 1.699 ± 0.633
1.699CysAsn: 1.699 ± 0.511
2.974CysPro: 2.974 ± 1.418
0.85CysGln: 0.85 ± 0.618
2.124CysArg: 2.124 ± 0.751
2.124CysSer: 2.124 ± 0.855
4.673CysThr: 4.673 ± 1.671
0.85CysVal: 0.85 ± 0.491
2.549CysTrp: 2.549 ± 0.661
0.85CysTyr: 0.85 ± 0.571
0.0CysXaa: 0.0 ± 0.0
Asp
2.124AspAla: 2.124 ± 1.016
0.0AspCys: 0.0 ± 0.0
0.85AspAsp: 0.85 ± 0.571
0.0AspGlu: 0.0 ± 0.0
2.124AspPhe: 2.124 ± 0.825
0.425AspGly: 0.425 ± 0.285
1.699AspHis: 1.699 ± 0.794
1.699AspIle: 1.699 ± 0.979
2.124AspLys: 2.124 ± 1.427
2.974AspLeu: 2.974 ± 1.033
0.425AspMet: 0.425 ± 0.518
2.549AspAsn: 2.549 ± 1.286
3.398AspPro: 3.398 ± 1.579
0.85AspGln: 0.85 ± 0.748
0.425AspArg: 0.425 ± 0.285
1.274AspSer: 1.274 ± 1.245
2.549AspThr: 2.549 ± 1.063
1.274AspVal: 1.274 ± 0.701
3.398AspTrp: 3.398 ± 1.4
0.425AspTyr: 0.425 ± 0.285
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
1.274GluCys: 1.274 ± 0.681
0.425GluAsp: 0.425 ± 0.285
4.673GluGlu: 4.673 ± 2.653
1.699GluPhe: 1.699 ± 1.389
1.699GluGly: 1.699 ± 0.702
2.974GluHis: 2.974 ± 1.301
0.85GluIle: 0.85 ± 0.847
1.699GluLys: 1.699 ± 1.142
3.398GluLeu: 3.398 ± 1.144
0.85GluMet: 0.85 ± 0.558
1.274GluAsn: 1.274 ± 0.634
0.425GluPro: 0.425 ± 0.285
0.85GluGln: 0.85 ± 0.571
1.274GluArg: 1.274 ± 0.856
2.124GluSer: 2.124 ± 1.427
1.699GluThr: 1.699 ± 0.952
1.274GluVal: 1.274 ± 0.634
1.699GluTrp: 1.699 ± 0.511
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.673PheAla: 4.673 ± 0.751
0.425PheCys: 0.425 ± 0.285
1.274PheAsp: 1.274 ± 0.529
0.85PheGlu: 0.85 ± 0.385
2.974PhePhe: 2.974 ± 0.486
3.398PheGly: 3.398 ± 1.923
1.274PheHis: 1.274 ± 1.135
2.974PheIle: 2.974 ± 1.169
1.274PheLys: 1.274 ± 0.529
8.496PheLeu: 8.496 ± 1.912
0.425PheMet: 0.425 ± 0.285
0.425PheAsn: 0.425 ± 0.285
5.098PhePro: 5.098 ± 0.846
1.699PheGln: 1.699 ± 0.511
3.398PheArg: 3.398 ± 1.039
3.398PheSer: 3.398 ± 0.727
3.398PheThr: 3.398 ± 0.935
1.274PheVal: 1.274 ± 1.245
1.699PheTrp: 1.699 ± 0.949
1.699PheTyr: 1.699 ± 0.874
0.0PheXaa: 0.0 ± 0.0
Gly
1.274GlyAla: 1.274 ± 0.856
0.425GlyCys: 0.425 ± 0.659
2.124GlyAsp: 2.124 ± 0.963
1.274GlyGlu: 1.274 ± 0.856
3.823GlyPhe: 3.823 ± 1.159
4.248GlyGly: 4.248 ± 1.317
2.124GlyHis: 2.124 ± 0.634
7.222GlyIle: 7.222 ± 1.426
0.85GlyLys: 0.85 ± 0.571
8.921GlyLeu: 8.921 ± 2.075
0.425GlyMet: 0.425 ± 0.659
2.124GlyAsn: 2.124 ± 0.769
3.398GlyPro: 3.398 ± 1.131
3.823GlyGln: 3.823 ± 0.696
2.124GlyArg: 2.124 ± 0.825
4.248GlySer: 4.248 ± 1.202
2.974GlyThr: 2.974 ± 1.166
2.124GlyVal: 2.124 ± 0.855
0.85GlyTrp: 0.85 ± 0.491
1.274GlyTyr: 1.274 ± 0.856
0.0GlyXaa: 0.0 ± 0.0
His
0.425HisAla: 0.425 ± 0.659
1.274HisCys: 1.274 ± 0.634
0.425HisAsp: 0.425 ± 0.285
0.0HisGlu: 0.0 ± 0.0
1.274HisPhe: 1.274 ± 0.856
0.0HisGly: 0.0 ± 0.0
2.549HisHis: 2.549 ± 0.481
1.699HisIle: 1.699 ± 1.142
1.699HisLys: 1.699 ± 0.874
6.372HisLeu: 6.372 ± 0.725
0.0HisMet: 0.0 ± 0.0
1.274HisAsn: 1.274 ± 0.439
1.274HisPro: 1.274 ± 0.529
0.425HisGln: 0.425 ± 0.659
1.274HisArg: 1.274 ± 0.856
0.85HisSer: 0.85 ± 0.571
4.248HisThr: 4.248 ± 1.925
1.274HisVal: 1.274 ± 0.634
0.425HisTrp: 0.425 ± 0.285
0.425HisTyr: 0.425 ± 0.285
0.0HisXaa: 0.0 ± 0.0
Ile
2.549IleAla: 2.549 ± 0.587
2.124IleCys: 2.124 ± 1.427
2.124IleAsp: 2.124 ± 0.795
0.425IleGlu: 0.425 ± 0.285
2.124IlePhe: 2.124 ± 0.769
0.85IleGly: 0.85 ± 0.558
1.274IleHis: 1.274 ± 0.856
2.974IleIle: 2.974 ± 1.243
1.274IleLys: 1.274 ± 0.758
5.098IleLeu: 5.098 ± 0.92
1.274IleMet: 1.274 ± 0.479
0.85IleAsn: 0.85 ± 0.571
6.372IlePro: 6.372 ± 1.56
2.124IleGln: 2.124 ± 1.427
4.248IleArg: 4.248 ± 1.875
3.823IleSer: 3.823 ± 0.939
1.699IleThr: 1.699 ± 0.696
2.124IleVal: 2.124 ± 0.817
3.823IleTrp: 3.823 ± 2.042
1.699IleTyr: 1.699 ± 0.952
0.0IleXaa: 0.0 ± 0.0
Lys
0.425LysAla: 0.425 ± 0.285
0.425LysCys: 0.425 ± 0.285
0.85LysAsp: 0.85 ± 0.618
1.699LysGlu: 1.699 ± 0.596
0.85LysPhe: 0.85 ± 0.571
2.549LysGly: 2.549 ± 0.587
1.699LysHis: 1.699 ± 0.511
2.549LysIle: 2.549 ± 0.587
1.274LysLys: 1.274 ± 0.856
3.823LysLeu: 3.823 ± 1.263
0.0LysMet: 0.0 ± 0.0
2.974LysAsn: 2.974 ± 1.048
2.974LysPro: 2.974 ± 1.008
0.425LysGln: 0.425 ± 0.285
1.274LysArg: 1.274 ± 0.701
2.549LysSer: 2.549 ± 1.286
1.699LysThr: 1.699 ± 1.142
1.699LysVal: 1.699 ± 0.596
0.425LysTrp: 0.425 ± 0.285
1.274LysTyr: 1.274 ± 0.701
0.0LysXaa: 0.0 ± 0.0
Leu
6.372LeuAla: 6.372 ± 1.526
1.699LeuCys: 1.699 ± 1.126
5.947LeuAsp: 5.947 ± 0.672
1.699LeuGlu: 1.699 ± 1.142
4.673LeuPhe: 4.673 ± 0.874
8.071LeuGly: 8.071 ± 1.73
1.274LeuHis: 1.274 ± 0.856
4.248LeuIle: 4.248 ± 0.956
4.248LeuLys: 4.248 ± 0.446
24.214LeuLeu: 24.214 ± 5.18
1.699LeuMet: 1.699 ± 0.952
3.398LeuAsn: 3.398 ± 1.393
10.195LeuPro: 10.195 ± 0.472
6.372LeuGln: 6.372 ± 1.815
6.372LeuArg: 6.372 ± 1.667
9.346LeuSer: 9.346 ± 1.563
11.895LeuThr: 11.895 ± 2.129
8.921LeuVal: 8.921 ± 1.933
6.372LeuTrp: 6.372 ± 0.632
3.823LeuTyr: 3.823 ± 1.828
0.0LeuXaa: 0.0 ± 0.0
Met
0.425MetAla: 0.425 ± 0.659
0.0MetCys: 0.0 ± 0.0
1.699MetAsp: 1.699 ± 0.856
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.699MetGly: 1.699 ± 0.758
0.85MetHis: 0.85 ± 0.571
1.274MetIle: 1.274 ± 0.681
0.85MetLys: 0.85 ± 0.558
0.0MetLeu: 0.0 ± 0.0
2.124MetMet: 2.124 ± 1.382
0.0MetAsn: 0.0 ± 0.0
1.274MetPro: 1.274 ± 0.681
0.425MetGln: 0.425 ± 0.659
0.0MetArg: 0.0 ± 0.0
3.823MetSer: 3.823 ± 1.183
0.425MetThr: 0.425 ± 0.424
0.85MetVal: 0.85 ± 0.571
0.0MetTrp: 0.0 ± 0.0
1.699MetTyr: 1.699 ± 0.822
0.0MetXaa: 0.0 ± 0.0
Asn
2.124AsnAla: 2.124 ± 1.299
3.823AsnCys: 3.823 ± 1.183
0.0AsnAsp: 0.0 ± 0.0
0.85AsnGlu: 0.85 ± 0.571
1.699AsnPhe: 1.699 ± 0.856
2.124AsnGly: 2.124 ± 0.47
0.85AsnHis: 0.85 ± 0.571
1.699AsnIle: 1.699 ± 0.758
0.85AsnLys: 0.85 ± 0.385
4.248AsnLeu: 4.248 ± 1.57
0.0AsnMet: 0.0 ± 0.0
2.974AsnAsn: 2.974 ± 1.042
2.124AsnPro: 2.124 ± 1.125
2.974AsnGln: 2.974 ± 1.048
1.274AsnArg: 1.274 ± 0.529
3.398AsnSer: 3.398 ± 1.428
2.124AsnThr: 2.124 ± 1.299
1.274AsnVal: 1.274 ± 0.856
1.699AsnTrp: 1.699 ± 0.758
1.699AsnTyr: 1.699 ± 1.142
0.0AsnXaa: 0.0 ± 0.0
Pro
5.947ProAla: 5.947 ± 0.728
2.124ProCys: 2.124 ± 0.849
0.85ProAsp: 0.85 ± 0.694
2.549ProGlu: 2.549 ± 0.661
3.823ProPhe: 3.823 ± 0.844
2.549ProGly: 2.549 ± 1.132
2.124ProHis: 2.124 ± 0.571
4.673ProIle: 4.673 ± 1.132
1.274ProLys: 1.274 ± 0.529
7.222ProLeu: 7.222 ± 0.679
2.124ProMet: 2.124 ± 0.459
2.549ProAsn: 2.549 ± 0.678
6.797ProPro: 6.797 ± 2.012
0.85ProGln: 0.85 ± 0.385
4.248ProArg: 4.248 ± 1.875
7.222ProSer: 7.222 ± 1.775
7.222ProThr: 7.222 ± 2.887
8.496ProVal: 8.496 ± 1.846
1.274ProTrp: 1.274 ± 0.701
3.823ProTyr: 3.823 ± 1.102
0.0ProXaa: 0.0 ± 0.0
Gln
2.549GlnAla: 2.549 ± 1.134
1.274GlnCys: 1.274 ± 0.681
1.699GlnAsp: 1.699 ± 0.511
1.699GlnGlu: 1.699 ± 0.596
2.124GlnPhe: 2.124 ± 0.47
1.274GlnGly: 1.274 ± 0.893
1.699GlnHis: 1.699 ± 0.696
0.85GlnIle: 0.85 ± 0.571
0.85GlnLys: 0.85 ± 0.618
4.673GlnLeu: 4.673 ± 2.007
0.0GlnMet: 0.0 ± 0.0
1.274GlnAsn: 1.274 ± 0.681
1.274GlnPro: 1.274 ± 0.701
2.124GlnGln: 2.124 ± 1.016
2.124GlnArg: 2.124 ± 1.125
4.248GlnSer: 4.248 ± 1.008
6.372GlnThr: 6.372 ± 0.926
1.274GlnVal: 1.274 ± 0.681
1.699GlnTrp: 1.699 ± 0.511
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.124ArgAla: 2.124 ± 1.001
0.85ArgCys: 0.85 ± 0.571
2.974ArgAsp: 2.974 ± 1.462
0.85ArgGlu: 0.85 ± 0.694
3.823ArgPhe: 3.823 ± 0.939
4.248ArgGly: 4.248 ± 1.2
1.274ArgHis: 1.274 ± 0.701
2.124ArgIle: 2.124 ± 0.855
1.699ArgLys: 1.699 ± 0.758
4.673ArgLeu: 4.673 ± 1.742
0.0ArgMet: 0.0 ± 0.0
2.974ArgAsn: 2.974 ± 1.998
2.974ArgPro: 2.974 ± 0.683
4.248ArgGln: 4.248 ± 1.202
12.744ArgArg: 12.744 ± 7.002
5.523ArgSer: 5.523 ± 2.223
2.549ArgThr: 2.549 ± 1.431
1.699ArgVal: 1.699 ± 1.142
1.274ArgTrp: 1.274 ± 0.681
0.425ArgTyr: 0.425 ± 0.424
0.0ArgXaa: 0.0 ± 0.0
Ser
5.098SerAla: 5.098 ± 2.445
2.974SerCys: 2.974 ± 0.486
2.124SerAsp: 2.124 ± 0.665
1.699SerGlu: 1.699 ± 0.696
4.248SerPhe: 4.248 ± 0.781
5.098SerGly: 5.098 ± 1.642
1.274SerHis: 1.274 ± 0.856
2.549SerIle: 2.549 ± 0.587
2.549SerLys: 2.549 ± 1.327
8.496SerLeu: 8.496 ± 1.974
0.425SerMet: 0.425 ± 0.285
3.823SerAsn: 3.823 ± 1.038
10.62SerPro: 10.62 ± 2.397
2.974SerGln: 2.974 ± 1.301
4.248SerArg: 4.248 ± 1.2
7.647SerSer: 7.647 ± 2.373
5.098SerThr: 5.098 ± 0.644
1.699SerVal: 1.699 ± 0.874
2.974SerTrp: 2.974 ± 1.169
2.549SerTyr: 2.549 ± 0.661
0.0SerXaa: 0.0 ± 0.0
Thr
5.098ThrAla: 5.098 ± 0.969
5.098ThrCys: 5.098 ± 1.722
0.425ThrAsp: 0.425 ± 0.285
3.823ThrGlu: 3.823 ± 0.746
5.523ThrPhe: 5.523 ± 1.209
5.098ThrGly: 5.098 ± 0.572
1.274ThrHis: 1.274 ± 0.529
4.673ThrIle: 4.673 ± 1.319
3.398ThrLys: 3.398 ± 1.039
5.947ThrLeu: 5.947 ± 0.87
1.274ThrMet: 1.274 ± 0.627
1.699ThrAsn: 1.699 ± 0.758
5.947ThrPro: 5.947 ± 1.638
0.425ThrGln: 0.425 ± 0.285
2.549ThrArg: 2.549 ± 0.678
6.797ThrSer: 6.797 ± 1.243
9.771ThrThr: 9.771 ± 2.516
6.797ThrVal: 6.797 ± 2.784
2.124ThrTrp: 2.124 ± 1.299
2.124ThrTyr: 2.124 ± 1.016
0.0ThrXaa: 0.0 ± 0.0
Val
2.549ValAla: 2.549 ± 1.712
2.549ValCys: 2.549 ± 0.661
2.549ValAsp: 2.549 ± 1.268
0.425ValGlu: 0.425 ± 0.285
2.974ValPhe: 2.974 ± 1.597
1.699ValGly: 1.699 ± 0.625
0.85ValHis: 0.85 ± 0.571
1.274ValIle: 1.274 ± 0.634
0.0ValLys: 0.0 ± 0.0
6.797ValLeu: 6.797 ± 2.174
0.0ValMet: 0.0 ± 0.0
2.974ValAsn: 2.974 ± 1.243
5.098ValPro: 5.098 ± 1.216
2.124ValGln: 2.124 ± 1.037
2.974ValArg: 2.974 ± 1.301
4.248ValSer: 4.248 ± 0.528
2.124ValThr: 2.124 ± 0.881
3.398ValVal: 3.398 ± 0.792
2.124ValTrp: 2.124 ± 0.895
1.699ValTyr: 1.699 ± 0.511
0.0ValXaa: 0.0 ± 0.0
Trp
4.673TrpAla: 4.673 ± 1.671
0.0TrpCys: 0.0 ± 0.0
0.85TrpAsp: 0.85 ± 0.634
3.398TrpGlu: 3.398 ± 0.659
2.124TrpPhe: 2.124 ± 0.895
3.398TrpGly: 3.398 ± 0.445
0.85TrpHis: 0.85 ± 0.618
2.124TrpIle: 2.124 ± 0.855
1.699TrpLys: 1.699 ± 0.511
5.523TrpLeu: 5.523 ± 1.456
3.398TrpMet: 3.398 ± 1.431
0.0TrpAsn: 0.0 ± 0.0
2.549TrpPro: 2.549 ± 0.662
0.425TrpGln: 0.425 ± 0.424
0.425TrpArg: 0.425 ± 0.285
0.0TrpSer: 0.0 ± 0.0
3.823TrpThr: 3.823 ± 1.102
0.0TrpVal: 0.0 ± 0.0
3.398TrpTrp: 3.398 ± 1.4
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.425TyrAla: 0.425 ± 0.285
1.699TyrCys: 1.699 ± 0.511
0.425TyrAsp: 0.425 ± 0.285
1.274TyrGlu: 1.274 ± 0.634
2.124TyrPhe: 2.124 ± 0.47
1.699TyrGly: 1.699 ± 0.486
0.85TyrHis: 0.85 ± 0.571
1.699TyrIle: 1.699 ± 0.949
2.124TyrLys: 2.124 ± 0.571
5.523TyrLeu: 5.523 ± 1.456
0.85TyrMet: 0.85 ± 0.618
1.274TyrAsn: 1.274 ± 0.856
0.425TyrPro: 0.425 ± 0.285
2.124TyrGln: 2.124 ± 0.571
0.85TyrArg: 0.85 ± 0.694
1.699TyrSer: 1.699 ± 0.874
2.974TyrThr: 2.974 ± 1.561
0.85TyrVal: 0.85 ± 0.571
0.0TyrTrp: 0.0 ± 0.0
0.85TyrTyr: 0.85 ± 0.385
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2355 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski