Amino acid dipeptide frequency for Hubei toti-like virus 16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
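As an illustration of the calculation, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of overlapping windows that never span two proteins, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table's three-letter columns;
# "X" stands for Xaa (assumption: any non-standard letter is mapped to X).
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Toy input, not data from this proteome.
freqs = dipeptide_per_mille(["MAAG", "AAGX"])
print(len(freqs), round(freqs["AA"], 3))
```

The result always has 441 keys (21 x 21), so dipeptides that never occur are reported as 0.0 rather than being absent.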

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
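The arithmetic behind the uniform baseline and the unit conversion can be sketched as follows. The helper names `classify` and `to_fraction` are hypothetical, and comparing against a flat 1/400 baseline is an assumption taken from the text above; the page's actual colouring rule may use a different expectation.

```python
N_STANDARD = 20

# Uniform baseline: each of the 20 x 20 standard dipeptides equally likely.
expected_fraction = 1 / N_STANDARD ** 2          # 0.0025, i.e. 0.25%
expected_per_mille = 1000 * expected_fraction    # 2.5 per mille

def classify(per_mille, baseline=expected_per_mille):
    """Return the colour the text describes: red if above, blue if below the baseline."""
    if per_mille > baseline:
        return "red"
    if per_mille < baseline:
        return "blue"
    return "none"

def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3

# Three values copied from the table (per mille).
examples = {"AlaAla": 10.447, "CysCys": 0.418, "GluPhe": 2.507}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
print(to_fraction(examples["AlaAla"]))
```

Under this assumed baseline, AlaAla and GluPhe fall above 2.5‰ and CysCys below it.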

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.447 ± 3.882
AlaCys: 0.836 ± 0.695
AlaAsp: 3.343 ± 0.241
AlaGlu: 3.343 ± 0.726
AlaPhe: 3.343 ± 1.026
AlaGly: 7.522 ± 1.459
AlaHis: 1.254 ± 0.778
AlaIle: 2.089 ± 0.438
AlaLys: 4.179 ± 1.355
AlaLeu: 6.686 ± 3.609
AlaMet: 2.925 ± 1.283
AlaAsn: 3.761 ± 1.051
AlaPro: 4.597 ± 1.344
AlaGln: 2.507 ± 1.159
AlaArg: 8.358 ± 1.899
AlaSer: 5.85 ± 1.842
AlaThr: 4.597 ± 0.887
AlaVal: 6.268 ± 2.181
AlaTrp: 0.418 ± 0.259
AlaTyr: 1.254 ± 0.778
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.836 ± 0.196
CysCys: 0.418 ± 0.347
CysAsp: 0.836 ± 0.196
CysGlu: 0.418 ± 0.259
CysPhe: 0.836 ± 0.695
CysGly: 0.836 ± 0.196
CysHis: 0.418 ± 0.347
CysIle: 0.836 ± 0.196
CysLys: 0.836 ± 0.196
CysLeu: 2.089 ± 0.677
CysMet: 0.0 ± 0.0
CysAsn: 0.418 ± 0.347
CysPro: 0.418 ± 0.347
CysGln: 0.418 ± 0.259
CysArg: 0.836 ± 0.519
CysSer: 0.836 ± 0.695
CysThr: 1.254 ± 0.302
CysVal: 0.836 ± 0.196
CysTrp: 0.418 ± 0.259
CysTyr: 0.418 ± 0.347
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.686 ± 2.053
AspCys: 0.418 ± 0.347
AspAsp: 2.925 ± 0.482
AspGlu: 1.254 ± 1.609
AspPhe: 2.089 ± 0.677
AspGly: 3.343 ± 2.907
AspHis: 0.836 ± 0.519
AspIle: 3.343 ± 1.328
AspLys: 1.672 ± 0.84
AspLeu: 4.597 ± 0.978
AspMet: 1.254 ± 0.302
AspAsn: 1.254 ± 0.721
AspPro: 4.597 ± 0.887
AspGln: 3.343 ± 1.328
AspArg: 4.179 ± 2.779
AspSer: 3.761 ± 2.591
AspThr: 1.254 ± 0.501
AspVal: 3.761 ± 1.504
AspTrp: 1.672 ± 0.393
AspTyr: 1.672 ± 0.393
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.925 ± 0.819
GluCys: 0.418 ± 0.347
GluAsp: 1.672 ± 0.393
GluGlu: 2.089 ± 0.438
GluPhe: 2.507 ± 0.365
GluGly: 2.507 ± 0.365
GluHis: 0.418 ± 0.347
GluIle: 2.089 ± 1.184
GluLys: 1.672 ± 0.393
GluLeu: 4.179 ± 0.245
GluMet: 0.418 ± 0.347
GluAsn: 0.836 ± 0.695
GluPro: 2.089 ± 1.825
GluGln: 1.672 ± 1.038
GluArg: 1.672 ± 0.535
GluSer: 2.925 ± 2.035
GluThr: 3.343 ± 2.076
GluVal: 3.343 ± 1.54
GluTrp: 2.089 ± 0.677
GluTyr: 2.089 ± 1.297
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.925 ± 0.482
PheCys: 1.254 ± 0.501
PheAsp: 2.925 ± 2.108
PheGlu: 1.254 ± 0.656
PhePhe: 1.672 ± 0.527
PheGly: 4.179 ± 1.818
PheHis: 0.0 ± 0.0
PheIle: 2.925 ± 1.283
PheLys: 1.672 ± 0.84
PheLeu: 2.925 ± 0.819
PheMet: 0.418 ± 0.347
PheAsn: 2.089 ± 0.438
PhePro: 1.672 ± 0.527
PheGln: 3.343 ± 0.726
PheArg: 2.089 ± 0.993
PheSer: 4.179 ± 1.498
PheThr: 1.672 ± 0.527
PheVal: 2.089 ± 0.438
PheTrp: 0.418 ± 0.259
PheTyr: 1.254 ± 1.008
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.015 ± 1.415
GlyCys: 0.418 ± 0.259
GlyAsp: 5.433 ± 0.782
GlyGlu: 4.179 ± 0.876
GlyPhe: 5.015 ± 1.415
GlyGly: 4.597 ± 0.585
GlyHis: 1.254 ± 0.721
GlyIle: 4.597 ± 1.695
GlyLys: 2.089 ± 0.438
GlyLeu: 8.358 ± 1.107
GlyMet: 2.507 ± 0.581
GlyAsn: 2.089 ± 1.297
GlyPro: 2.925 ± 0.608
GlyGln: 4.597 ± 1.134
GlyArg: 1.672 ± 0.766
GlySer: 10.447 ± 8.421
GlyThr: 4.597 ± 2.315
GlyVal: 7.522 ± 1.323
GlyTrp: 0.836 ± 0.196
GlyTyr: 2.507 ± 0.603
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.836 ± 0.519
HisCys: 0.418 ± 0.259
HisAsp: 0.836 ± 0.695
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 0.836 ± 0.519
HisHis: 1.254 ± 0.656
HisIle: 0.418 ± 0.347
HisLys: 0.0 ± 0.0
HisLeu: 1.672 ± 1.39
HisMet: 0.418 ± 0.259
HisAsn: 1.254 ± 0.778
HisPro: 0.418 ± 0.259
HisGln: 1.254 ± 0.721
HisArg: 0.418 ± 0.803
HisSer: 1.672 ± 0.393
HisThr: 0.418 ± 0.259
HisVal: 0.836 ± 0.695
HisTrp: 0.418 ± 0.347
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.089 ± 0.525
IleCys: 0.0 ± 0.0
IleAsp: 3.343 ± 0.757
IleGlu: 2.089 ± 0.993
IlePhe: 2.925 ± 0.862
IleGly: 6.686 ± 0.511
IleHis: 1.254 ± 0.302
IleIle: 5.015 ± 0.373
IleLys: 2.925 ± 1.016
IleLeu: 5.433 ± 1.852
IleMet: 1.254 ± 0.436
IleAsn: 2.507 ± 0.589
IlePro: 3.761 ± 1.299
IleGln: 1.254 ± 0.302
IleArg: 1.672 ± 0.766
IleSer: 6.686 ± 1.715
IleThr: 1.254 ± 0.302
IleVal: 2.507 ± 0.589
IleTrp: 1.254 ± 1.042
IleTyr: 1.672 ± 0.527
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.089 ± 0.438
LysCys: 0.0 ± 0.0
LysAsp: 1.672 ± 1.251
LysGlu: 2.089 ± 0.438
LysPhe: 0.836 ± 0.519
LysGly: 2.507 ± 0.603
LysHis: 0.418 ± 0.347
LysIle: 2.925 ± 0.862
LysLys: 3.343 ± 0.726
LysLeu: 2.925 ± 1.339
LysMet: 0.418 ± 0.347
LysAsn: 3.343 ± 1.68
LysPro: 0.836 ± 0.843
LysGln: 2.089 ± 0.677
LysArg: 2.089 ± 0.993
LysSer: 2.507 ± 0.736
LysThr: 2.507 ± 0.589
LysVal: 3.761 ± 1.051
LysTrp: 1.672 ± 0.393
LysTyr: 1.672 ± 0.535
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.522 ± 1.572
LeuCys: 0.418 ± 0.347
LeuAsp: 4.597 ± 2.178
LeuGlu: 2.925 ± 0.329
LeuPhe: 4.179 ± 0.336
LeuGly: 6.686 ± 2.109
LeuHis: 0.836 ± 0.695
LeuIle: 2.507 ± 1.529
LeuLys: 2.925 ± 1.339
LeuLeu: 6.268 ± 0.873
LeuMet: 2.507 ± 1.557
LeuAsn: 4.179 ± 0.876
LeuPro: 6.268 ± 0.578
LeuGln: 5.433 ± 2.832
LeuArg: 5.015 ± 0.932
LeuSer: 9.611 ± 2.093
LeuThr: 6.686 ± 1.323
LeuVal: 7.104 ± 1.823
LeuTrp: 0.418 ± 0.259
LeuTyr: 2.507 ± 0.365
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.761 ± 1.798
MetCys: 0.836 ± 0.196
MetAsp: 1.254 ± 0.302
MetGlu: 0.836 ± 0.519
MetPhe: 0.836 ± 0.519
MetGly: 1.672 ± 0.527
MetHis: 0.418 ± 0.259
MetIle: 2.507 ± 1.028
MetLys: 0.836 ± 0.196
MetLeu: 0.418 ± 0.347
MetMet: 1.254 ± 0.302
MetAsn: 0.836 ± 0.196
MetPro: 0.418 ± 0.259
MetGln: 1.254 ± 0.778
MetArg: 1.254 ± 0.302
MetSer: 2.925 ± 0.819
MetThr: 1.672 ± 0.535
MetVal: 1.254 ± 1.042
MetTrp: 0.836 ± 0.519
MetTyr: 0.836 ± 0.196
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.507 ± 0.365
AsnCys: 0.418 ± 0.259
AsnAsp: 2.507 ± 1.529
AsnGlu: 1.672 ± 0.812
AsnPhe: 2.507 ± 0.736
AsnGly: 3.343 ± 1.055
AsnHis: 0.418 ± 0.259
AsnIle: 2.925 ± 0.862
AsnLys: 3.343 ± 0.241
AsnLeu: 2.089 ± 0.438
AsnMet: 0.836 ± 0.519
AsnAsn: 3.761 ± 1.299
AsnPro: 1.672 ± 1.038
AsnGln: 2.089 ± 1.297
AsnArg: 1.672 ± 0.393
AsnSer: 2.925 ± 1.016
AsnThr: 0.418 ± 0.259
AsnVal: 5.85 ± 2.176
AsnTrp: 0.418 ± 0.259
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.85 ± 1.356
ProCys: 0.836 ± 0.196
ProAsp: 3.343 ± 1.246
ProGlu: 2.507 ± 0.603
ProPhe: 0.836 ± 0.196
ProGly: 5.015 ± 0.841
ProHis: 0.0 ± 0.0
ProIle: 2.089 ± 1.184
ProLys: 0.418 ± 0.259
ProLeu: 6.686 ± 1.65
ProMet: 2.507 ± 1.028
ProAsn: 1.254 ± 0.778
ProPro: 5.015 ± 1.582
ProGln: 2.925 ± 1.816
ProArg: 3.761 ± 1.969
ProSer: 2.507 ± 0.365
ProThr: 4.179 ± 1.443
ProVal: 3.343 ± 1.055
ProTrp: 0.418 ± 0.259
ProTyr: 2.089 ± 0.438
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.761 ± 1.798
GlnCys: 0.836 ± 0.519
GlnAsp: 0.418 ± 0.259
GlnGlu: 1.254 ± 0.721
GlnPhe: 0.418 ± 0.347
GlnGly: 5.015 ± 1.994
GlnHis: 1.254 ± 0.501
GlnIle: 1.254 ± 0.778
GlnLys: 1.254 ± 0.302
GlnLeu: 7.104 ± 1.066
GlnMet: 1.254 ± 0.778
GlnAsn: 0.836 ± 0.519
GlnPro: 2.507 ± 1.028
GlnGln: 3.761 ± 0.136
GlnArg: 2.925 ± 0.812
GlnSer: 4.179 ± 1.549
GlnThr: 2.925 ± 0.812
GlnVal: 2.925 ± 0.819
GlnTrp: 1.254 ± 0.501
GlnTyr: 1.254 ± 0.778
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.343 ± 0.785
ArgCys: 0.836 ± 0.695
ArgAsp: 5.433 ± 0.646
ArgGlu: 2.925 ± 0.819
ArgPhe: 2.089 ± 1.488
ArgGly: 3.343 ± 0.757
ArgHis: 0.0 ± 0.0
ArgIle: 4.179 ± 1.407
ArgLys: 1.254 ± 0.501
ArgLeu: 6.268 ± 1.973
ArgMet: 2.089 ± 1.184
ArgAsn: 1.672 ± 1.436
ArgPro: 3.343 ± 2.83
ArgGln: 0.836 ± 0.196
ArgArg: 5.015 ± 4.032
ArgSer: 5.433 ± 4.368
ArgThr: 1.254 ± 1.501
ArgVal: 4.179 ± 1.407
ArgTrp: 1.254 ± 0.501
ArgTyr: 1.254 ± 0.656
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.776 ± 3.463
SerCys: 1.254 ± 0.302
SerAsp: 4.597 ± 0.887
SerGlu: 4.179 ± 1.355
SerPhe: 2.089 ± 0.525
SerGly: 9.611 ± 6.18
SerHis: 1.254 ± 0.302
SerIle: 5.85 ± 1.216
SerLys: 2.507 ± 1.275
SerLeu: 8.358 ± 0.49
SerMet: 2.089 ± 0.589
SerAsn: 2.925 ± 1.384
SerPro: 5.433 ± 1.433
SerGln: 5.015 ± 2.055
SerArg: 5.015 ± 2.867
SerSer: 9.193 ± 1.199
SerThr: 5.015 ± 2.636
SerVal: 5.433 ± 2.429
SerTrp: 1.672 ± 1.038
SerTyr: 1.254 ± 0.721
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.761 ± 1.299
ThrCys: 1.254 ± 0.302
ThrAsp: 3.761 ± 1.933
ThrGlu: 2.507 ± 0.633
ThrPhe: 1.672 ± 0.812
ThrGly: 4.597 ± 0.835
ThrHis: 0.418 ± 0.347
ThrIle: 3.343 ± 1.246
ThrLys: 2.089 ± 0.438
ThrLeu: 4.179 ± 1.549
ThrMet: 0.0 ± 0.0
ThrAsn: 2.925 ± 0.819
ThrPro: 4.179 ± 2.079
ThrGln: 1.672 ± 1.038
ThrArg: 2.507 ± 0.365
ThrSer: 5.433 ± 0.782
ThrThr: 5.433 ± 0.947
ThrVal: 2.925 ± 0.329
ThrTrp: 1.254 ± 0.302
ThrTyr: 1.672 ± 0.535
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.522 ± 0.875
ValCys: 2.089 ± 0.677
ValAsp: 2.925 ± 1.583
ValGlu: 2.089 ± 0.677
ValPhe: 3.761 ± 1.075
ValGly: 3.761 ± 0.68
ValHis: 0.836 ± 0.718
ValIle: 3.761 ± 0.905
ValLys: 5.015 ± 1.507
ValLeu: 5.015 ± 2.005
ValMet: 2.507 ± 1.557
ValAsn: 4.179 ± 1.355
ValPro: 2.089 ± 0.774
ValGln: 2.089 ± 0.438
ValArg: 3.761 ± 1.504
ValSer: 7.94 ± 1.748
ValThr: 4.597 ± 0.073
ValVal: 5.85 ± 2.773
ValTrp: 0.418 ± 0.347
ValTyr: 1.672 ± 0.84
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.672 ± 1.038
TrpCys: 0.836 ± 0.695
TrpAsp: 0.418 ± 0.347
TrpGlu: 2.089 ± 0.774
TrpPhe: 1.254 ± 0.501
TrpGly: 1.672 ± 0.527
TrpHis: 0.0 ± 0.0
TrpIle: 1.672 ± 0.393
TrpLys: 0.418 ± 0.347
TrpLeu: 2.089 ± 0.438
TrpMet: 0.418 ± 0.259
TrpAsn: 0.418 ± 0.259
TrpPro: 0.836 ± 0.519
TrpGln: 0.418 ± 0.259
TrpArg: 0.836 ± 0.196
TrpSer: 0.836 ± 0.196
TrpThr: 0.836 ± 0.196
TrpVal: 0.836 ± 0.695
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.418 ± 0.347
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.254 ± 0.778
TyrCys: 0.418 ± 0.259
TyrAsp: 1.672 ± 1.251
TyrGlu: 1.254 ± 0.656
TyrPhe: 2.089 ± 0.525
TyrGly: 2.925 ± 0.329
TyrHis: 0.836 ± 0.695
TyrIle: 1.672 ± 0.527
TyrLys: 1.254 ± 0.721
TyrLeu: 1.254 ± 0.302
TyrMet: 0.418 ± 0.259
TyrAsn: 0.836 ± 0.196
TyrPro: 2.507 ± 1.028
TyrGln: 0.418 ± 0.259
TyrArg: 1.254 ± 1.042
TyrSer: 1.672 ± 0.393
TyrThr: 1.672 ± 0.535
TyrVal: 1.254 ± 0.302
TyrTrp: 0.836 ± 0.519
TyrTyr: 0.836 ± 0.196
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2394 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
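A protein-level bootstrap of this kind might look like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread is reported as a population standard deviation. The function names, the fixed seed, and the choice of standard deviation as the error measure are not taken from the source.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy sequences, not the three proteins of this proteome.
toy = ["AAAA", "GGGG", "AGAG"]
print(bootstrap_error(toy, "AA"))
```

With only a few proteins in the set, as here, each resample can differ a lot from the original, which is consistent with the large errors seen for some dipeptides in the table.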

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski