Amino acid dipepetide frequency for Hubei rhabdo-like virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.749AlaAla: 8.749 ± 5.202
0.761AlaCys: 0.761 ± 0.382
5.706AlaAsp: 5.706 ± 1.281
3.043AlaGlu: 3.043 ± 2.15
3.423AlaPhe: 3.423 ± 4.16
6.466AlaGly: 6.466 ± 2.194
1.902AlaHis: 1.902 ± 1.13
5.325AlaIle: 5.325 ± 1.92
3.043AlaLys: 3.043 ± 1.651
8.749AlaLeu: 8.749 ± 0.388
2.282AlaMet: 2.282 ± 2.539
2.663AlaAsn: 2.663 ± 1.465
3.043AlaPro: 3.043 ± 0.738
1.141AlaGln: 1.141 ± 0.515
5.325AlaArg: 5.325 ± 1.266
6.847AlaSer: 6.847 ± 2.483
2.663AlaThr: 2.663 ± 1.016
4.945AlaVal: 4.945 ± 1.267
1.521AlaTrp: 1.521 ± 0.462
2.282AlaTyr: 2.282 ± 0.572
0.0AlaXaa: 0.0 ± 0.0
Cys
1.521CysAla: 1.521 ± 0.462
0.0CysCys: 0.0 ± 0.0
0.38CysAsp: 0.38 ± 0.854
0.38CysGlu: 0.38 ± 0.191
0.761CysPhe: 0.761 ± 0.382
0.761CysGly: 0.761 ± 0.382
0.38CysHis: 0.38 ± 0.191
0.761CysIle: 0.761 ± 0.382
0.38CysLys: 0.38 ± 0.191
2.282CysLeu: 2.282 ± 1.145
0.0CysMet: 0.0 ± 0.0
0.38CysAsn: 0.38 ± 0.191
0.0CysPro: 0.0 ± 0.0
0.761CysGln: 0.761 ± 0.382
1.141CysArg: 1.141 ± 0.573
1.141CysSer: 1.141 ± 0.573
1.141CysThr: 1.141 ± 0.573
0.761CysVal: 0.761 ± 0.382
0.38CysTrp: 0.38 ± 0.191
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.043AspAla: 3.043 ± 0.924
1.141AspCys: 1.141 ± 0.573
2.282AspAsp: 2.282 ± 1.145
3.423AspGlu: 3.423 ± 3.33
2.663AspPhe: 2.663 ± 1.016
1.902AspGly: 1.902 ± 0.8
1.521AspHis: 1.521 ± 0.725
3.804AspIle: 3.804 ± 0.382
2.663AspLys: 2.663 ± 0.924
7.227AspLeu: 7.227 ± 2.208
2.282AspMet: 2.282 ± 1.295
2.282AspAsn: 2.282 ± 1.145
4.184AspPro: 4.184 ± 1.641
1.902AspGln: 1.902 ± 0.552
1.521AspArg: 1.521 ± 0.725
3.423AspSer: 3.423 ± 0.392
0.761AspThr: 0.761 ± 0.625
1.902AspVal: 1.902 ± 0.954
1.141AspTrp: 1.141 ± 0.573
0.761AspTyr: 0.761 ± 0.382
0.0AspXaa: 0.0 ± 0.0
Glu
6.847GluAla: 6.847 ± 2.031
1.521GluCys: 1.521 ± 0.744
3.804GluAsp: 3.804 ± 3.163
4.945GluGlu: 4.945 ± 5.405
2.663GluPhe: 2.663 ± 0.96
2.282GluGly: 2.282 ± 0.895
1.902GluHis: 1.902 ± 0.484
3.423GluIle: 3.423 ± 0.556
3.423GluLys: 3.423 ± 1.062
6.086GluLeu: 6.086 ± 0.649
3.043GluMet: 3.043 ± 0.287
0.761GluAsn: 0.761 ± 0.773
1.902GluPro: 1.902 ± 0.552
1.902GluGln: 1.902 ± 1.3
3.043GluArg: 3.043 ± 0.738
4.945GluSer: 4.945 ± 1.086
1.521GluThr: 1.521 ± 1.25
7.607GluVal: 7.607 ± 2.21
1.521GluTrp: 1.521 ± 0.764
1.902GluTyr: 1.902 ± 0.484
0.0GluXaa: 0.0 ± 0.0
Phe
2.282PheAla: 2.282 ± 1.875
0.38PheCys: 0.38 ± 0.191
2.282PheAsp: 2.282 ± 0.572
1.521PheGlu: 1.521 ± 0.744
1.902PhePhe: 1.902 ± 0.484
1.902PheGly: 1.902 ± 0.954
1.521PheHis: 1.521 ± 0.764
1.902PheIle: 1.902 ± 0.484
1.141PheLys: 1.141 ± 0.573
1.902PheLeu: 1.902 ± 1.13
0.761PheMet: 0.761 ± 0.625
1.902PheAsn: 1.902 ± 1.3
5.706PhePro: 5.706 ± 2.382
1.521PheGln: 1.521 ± 0.744
1.521PheArg: 1.521 ± 0.764
3.423PheSer: 3.423 ± 0.556
1.521PheThr: 1.521 ± 0.764
1.521PheVal: 1.521 ± 0.764
0.761PheTrp: 0.761 ± 0.382
0.761PheTyr: 0.761 ± 1.535
0.0PheXaa: 0.0 ± 0.0
Gly
4.564GlyAla: 4.564 ± 0.195
0.38GlyCys: 0.38 ± 0.191
2.282GlyAsp: 2.282 ± 0.572
4.564GlyGlu: 4.564 ± 1.555
2.282GlyPhe: 2.282 ± 0.397
7.227GlyGly: 7.227 ± 0.931
1.521GlyHis: 1.521 ± 0.725
3.043GlyIle: 3.043 ± 1.139
3.804GlyLys: 3.804 ± 0.382
4.945GlyLeu: 4.945 ± 0.306
1.902GlyMet: 1.902 ± 0.552
1.902GlyAsn: 1.902 ± 0.484
1.902GlyPro: 1.902 ± 1.497
3.043GlyGln: 3.043 ± 1.157
4.945GlyArg: 4.945 ± 1.99
5.325GlySer: 5.325 ± 0.578
3.804GlyThr: 3.804 ± 0.382
4.184GlyVal: 4.184 ± 0.235
0.0GlyTrp: 0.0 ± 0.0
3.043GlyTyr: 3.043 ± 0.856
0.0GlyXaa: 0.0 ± 0.0
His
0.761HisAla: 0.761 ± 1.088
0.761HisCys: 0.761 ± 0.382
0.761HisAsp: 0.761 ± 0.382
1.902HisGlu: 1.902 ± 1.3
0.38HisPhe: 0.38 ± 0.191
1.521HisGly: 1.521 ± 1.25
1.141HisHis: 1.141 ± 0.573
0.38HisIle: 0.38 ± 0.767
0.38HisLys: 0.38 ± 0.191
3.804HisLeu: 3.804 ± 1.196
1.521HisMet: 1.521 ± 0.462
0.761HisAsn: 0.761 ± 0.625
0.38HisPro: 0.38 ± 0.191
0.38HisGln: 0.38 ± 0.191
2.282HisArg: 2.282 ± 0.895
3.423HisSer: 3.423 ± 0.392
0.761HisThr: 0.761 ± 0.773
1.141HisVal: 1.141 ± 0.515
0.0HisTrp: 0.0 ± 0.0
0.38HisTyr: 0.38 ± 0.767
0.0HisXaa: 0.0 ± 0.0
Ile
7.988IleAla: 7.988 ± 2.495
0.0IleCys: 0.0 ± 0.0
3.043IleAsp: 3.043 ± 1.139
3.423IleGlu: 3.423 ± 1.533
2.282IlePhe: 2.282 ± 1.031
3.804IleGly: 3.804 ± 1.196
1.521IleHis: 1.521 ± 0.462
2.282IleIle: 2.282 ± 0.895
2.282IleLys: 2.282 ± 1.145
7.227IleLeu: 7.227 ± 1.855
0.0IleMet: 0.0 ± 0.0
1.902IleAsn: 1.902 ± 0.8
4.564IlePro: 4.564 ± 2.223
0.761IleGln: 0.761 ± 1.088
3.423IleArg: 3.423 ± 1.022
4.945IleSer: 4.945 ± 0.306
3.804IleThr: 3.804 ± 1.196
3.804IleVal: 3.804 ± 1.47
0.761IleTrp: 0.761 ± 0.382
1.902IleTyr: 1.902 ± 2.01
0.0IleXaa: 0.0 ± 0.0
Lys
3.423LysAla: 3.423 ± 1.546
1.141LysCys: 1.141 ± 0.573
2.282LysAsp: 2.282 ± 1.145
4.945LysGlu: 4.945 ± 1.081
1.521LysPhe: 1.521 ± 1.25
3.043LysGly: 3.043 ± 0.287
1.141LysHis: 1.141 ± 0.734
4.184LysIle: 4.184 ± 1.147
2.282LysLys: 2.282 ± 1.468
3.043LysLeu: 3.043 ± 0.738
0.761LysMet: 0.761 ± 0.382
0.761LysAsn: 0.761 ± 0.382
0.38LysPro: 0.38 ± 0.854
2.282LysGln: 2.282 ± 1.468
2.663LysArg: 2.663 ± 0.924
3.423LysSer: 3.423 ± 1.478
2.663LysThr: 2.663 ± 0.703
3.043LysVal: 3.043 ± 1.449
0.0LysTrp: 0.0 ± 0.0
0.38LysTyr: 0.38 ± 0.854
0.0LysXaa: 0.0 ± 0.0
Leu
7.607LeuAla: 7.607 ± 1.313
1.141LeuCys: 1.141 ± 0.573
5.325LeuAsp: 5.325 ± 2.672
3.804LeuGlu: 3.804 ± 0.382
1.902LeuPhe: 1.902 ± 1.13
6.466LeuGly: 6.466 ± 1.263
1.902LeuHis: 1.902 ± 0.484
6.086LeuIle: 6.086 ± 1.848
3.804LeuLys: 3.804 ± 0.546
12.552LeuLeu: 12.552 ± 2.624
2.282LeuMet: 2.282 ± 0.572
3.043LeuAsn: 3.043 ± 0.287
7.227LeuPro: 7.227 ± 2.027
2.663LeuGln: 2.663 ± 1.016
7.607LeuArg: 7.607 ± 2.611
13.693LeuSer: 13.693 ± 5.342
5.706LeuThr: 5.706 ± 1.661
4.184LeuVal: 4.184 ± 1.374
0.761LeuTrp: 0.761 ± 0.382
4.945LeuTyr: 4.945 ± 2.481
0.0LeuXaa: 0.0 ± 0.0
Met
1.521MetAla: 1.521 ± 0.462
0.38MetCys: 0.38 ± 0.191
0.38MetAsp: 0.38 ± 0.191
3.043MetGlu: 3.043 ± 0.738
0.761MetPhe: 0.761 ± 0.382
1.521MetGly: 1.521 ± 0.462
0.761MetHis: 0.761 ± 0.625
1.141MetIle: 1.141 ± 0.734
1.141MetLys: 1.141 ± 0.515
2.663MetLeu: 2.663 ± 0.289
1.141MetMet: 1.141 ± 0.573
0.0MetAsn: 0.0 ± 0.0
1.521MetPro: 1.521 ± 1.25
1.141MetGln: 1.141 ± 0.905
1.902MetArg: 1.902 ± 0.954
2.663MetSer: 2.663 ± 0.289
1.521MetThr: 1.521 ± 0.725
3.423MetVal: 3.423 ± 1.311
0.38MetTrp: 0.38 ± 0.191
0.761MetTyr: 0.761 ± 0.382
0.0MetXaa: 0.0 ± 0.0
Asn
1.902AsnAla: 1.902 ± 1.495
0.38AsnCys: 0.38 ± 0.191
0.38AsnAsp: 0.38 ± 0.191
1.902AsnGlu: 1.902 ± 0.484
2.282AsnPhe: 2.282 ± 0.572
1.141AsnGly: 1.141 ± 0.515
0.0AsnHis: 0.0 ± 0.0
1.521AsnIle: 1.521 ± 0.764
1.141AsnLys: 1.141 ± 0.905
1.141AsnLeu: 1.141 ± 0.905
0.761AsnMet: 0.761 ± 0.773
1.521AsnAsn: 1.521 ± 0.764
2.282AsnPro: 2.282 ± 1.145
0.761AsnGln: 0.761 ± 0.382
1.902AsnArg: 1.902 ± 0.552
2.663AsnSer: 2.663 ± 1.241
2.663AsnThr: 2.663 ± 1.465
1.521AsnVal: 1.521 ± 0.764
0.38AsnTrp: 0.38 ± 0.191
1.141AsnTyr: 1.141 ± 0.573
0.0AsnXaa: 0.0 ± 0.0
Pro
3.043ProAla: 3.043 ± 0.287
0.38ProCys: 0.38 ± 0.191
3.043ProAsp: 3.043 ± 0.924
5.325ProGlu: 5.325 ± 2.465
2.282ProPhe: 2.282 ± 0.895
2.663ProGly: 2.663 ± 0.289
0.761ProHis: 0.761 ± 0.382
5.706ProIle: 5.706 ± 0.674
2.663ProLys: 2.663 ± 1.241
5.325ProLeu: 5.325 ± 2.672
0.38ProMet: 0.38 ± 0.767
0.761ProAsn: 0.761 ± 0.382
4.945ProPro: 4.945 ± 2.607
1.902ProGln: 1.902 ± 1.497
2.663ProArg: 2.663 ± 0.924
4.564ProSer: 4.564 ± 1.758
2.663ProThr: 2.663 ± 2.341
3.804ProVal: 3.804 ± 0.382
0.38ProTrp: 0.38 ± 0.767
2.282ProTyr: 2.282 ± 0.572
0.0ProXaa: 0.0 ± 0.0
Gln
3.043GlnAla: 3.043 ± 2.225
0.38GlnCys: 0.38 ± 0.191
1.141GlnAsp: 1.141 ± 0.905
4.945GlnGlu: 4.945 ± 2.927
0.38GlnPhe: 0.38 ± 0.191
1.141GlnGly: 1.141 ± 0.515
0.761GlnHis: 0.761 ± 1.088
2.663GlnIle: 2.663 ± 1.241
1.902GlnLys: 1.902 ± 0.8
1.902GlnLeu: 1.902 ± 0.484
0.761GlnMet: 0.761 ± 0.382
0.38GlnAsn: 0.38 ± 0.191
1.141GlnPro: 1.141 ± 0.515
1.141GlnGln: 1.141 ± 0.573
1.521GlnArg: 1.521 ± 0.744
2.282GlnSer: 2.282 ± 0.895
1.521GlnThr: 1.521 ± 0.744
2.282GlnVal: 2.282 ± 1.468
0.761GlnTrp: 0.761 ± 0.382
0.38GlnTyr: 0.38 ± 0.191
0.0GlnXaa: 0.0 ± 0.0
Arg
4.184ArgAla: 4.184 ± 0.718
1.141ArgCys: 1.141 ± 0.573
2.282ArgAsp: 2.282 ± 1.145
4.184ArgGlu: 4.184 ± 1.416
2.282ArgPhe: 2.282 ± 1.031
5.706ArgGly: 5.706 ± 2.374
0.761ArgHis: 0.761 ± 0.382
3.423ArgIle: 3.423 ± 1.022
3.423ArgLys: 3.423 ± 0.556
6.466ArgLeu: 6.466 ± 0.659
3.423ArgMet: 3.423 ± 1.062
1.902ArgAsn: 1.902 ± 0.484
2.663ArgPro: 2.663 ± 1.241
1.902ArgGln: 1.902 ± 1.495
5.706ArgArg: 5.706 ± 1.743
6.086ArgSer: 6.086 ± 1.641
3.043ArgThr: 3.043 ± 0.856
4.564ArgVal: 4.564 ± 1.814
1.141ArgTrp: 1.141 ± 0.573
2.663ArgTyr: 2.663 ± 0.703
0.0ArgXaa: 0.0 ± 0.0
Ser
7.227SerAla: 7.227 ± 2.785
1.521SerCys: 1.521 ± 0.462
5.325SerAsp: 5.325 ± 0.471
3.423SerGlu: 3.423 ± 1.062
3.043SerPhe: 3.043 ± 1.527
3.423SerGly: 3.423 ± 0.392
2.663SerHis: 2.663 ± 0.924
4.945SerIle: 4.945 ± 1.086
3.804SerLys: 3.804 ± 0.382
13.693SerLeu: 13.693 ± 1.514
1.141SerMet: 1.141 ± 0.734
2.282SerAsn: 2.282 ± 1.468
4.564SerPro: 4.564 ± 0.195
2.282SerGln: 2.282 ± 0.572
7.607SerArg: 7.607 ± 3.2
12.172SerSer: 12.172 ± 4.963
4.945SerThr: 4.945 ± 0.668
4.945SerVal: 4.945 ± 0.306
1.141SerTrp: 1.141 ± 0.515
3.804SerTyr: 3.804 ± 1.473
0.0SerXaa: 0.0 ± 0.0
Thr
3.043ThrAla: 3.043 ± 0.738
0.38ThrCys: 0.38 ± 0.191
3.804ThrAsp: 3.804 ± 1.105
3.423ThrGlu: 3.423 ± 0.556
1.521ThrPhe: 1.521 ± 0.462
3.423ThrGly: 3.423 ± 1.546
0.0ThrHis: 0.0 ± 0.0
3.804ThrIle: 3.804 ± 1.473
1.521ThrLys: 1.521 ± 0.744
4.184ThrLeu: 4.184 ± 0.235
2.282ThrMet: 2.282 ± 0.572
1.902ThrAsn: 1.902 ± 1.495
3.043ThrPro: 3.043 ± 1.487
1.141ThrGln: 1.141 ± 0.734
4.564ThrArg: 4.564 ± 0.195
4.184ThrSer: 4.184 ± 1.147
2.663ThrThr: 2.663 ± 0.96
3.804ThrVal: 3.804 ± 0.968
1.141ThrTrp: 1.141 ± 0.573
1.521ThrTyr: 1.521 ± 0.462
0.0ThrXaa: 0.0 ± 0.0
Val
5.706ValAla: 5.706 ± 1.485
1.141ValCys: 1.141 ± 0.573
4.564ValAsp: 4.564 ± 1.814
4.184ValGlu: 4.184 ± 1.004
3.043ValPhe: 3.043 ± 1.487
5.706ValGly: 5.706 ± 2.349
1.141ValHis: 1.141 ± 0.515
2.282ValIle: 2.282 ± 1.031
2.663ValLys: 2.663 ± 1.752
6.086ValLeu: 6.086 ± 1.848
1.521ValMet: 1.521 ± 0.764
1.521ValAsn: 1.521 ± 0.462
2.282ValPro: 2.282 ± 1.031
2.663ValGln: 2.663 ± 0.289
5.325ValArg: 5.325 ± 1.266
3.043ValSer: 3.043 ± 2.225
2.663ValThr: 2.663 ± 0.289
4.184ValVal: 4.184 ± 1.374
0.0ValTrp: 0.0 ± 0.0
3.423ValTyr: 3.423 ± 1.022
0.0ValXaa: 0.0 ± 0.0
Trp
1.141TrpAla: 1.141 ± 0.515
0.38TrpCys: 0.38 ± 0.191
0.38TrpAsp: 0.38 ± 0.191
1.141TrpGlu: 1.141 ± 0.573
0.761TrpPhe: 0.761 ± 0.382
0.761TrpGly: 0.761 ± 0.382
0.0TrpHis: 0.0 ± 0.0
0.761TrpIle: 0.761 ± 0.382
1.141TrpLys: 1.141 ± 0.515
0.761TrpLeu: 0.761 ± 0.382
0.38TrpMet: 0.38 ± 0.191
0.38TrpAsn: 0.38 ± 0.191
0.38TrpPro: 0.38 ± 0.191
0.38TrpGln: 0.38 ± 0.767
0.761TrpArg: 0.761 ± 0.382
1.141TrpSer: 1.141 ± 0.573
1.141TrpThr: 1.141 ± 0.573
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.38TrpTyr: 0.38 ± 0.191
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.282TyrAla: 2.282 ± 0.572
0.0TyrCys: 0.0 ± 0.0
0.761TyrAsp: 0.761 ± 0.625
1.521TyrGlu: 1.521 ± 0.764
0.0TyrPhe: 0.0 ± 0.0
3.804TyrGly: 3.804 ± 0.546
1.521TyrHis: 1.521 ± 0.764
2.282TyrIle: 2.282 ± 0.572
0.761TyrLys: 0.761 ± 0.382
2.663TyrLeu: 2.663 ± 0.703
0.761TyrMet: 0.761 ± 0.525
0.38TyrAsn: 0.38 ± 0.191
3.423TyrPro: 3.423 ± 1.022
0.761TyrGln: 0.761 ± 0.625
1.521TyrArg: 1.521 ± 0.764
4.564TyrSer: 4.564 ± 1.386
4.184TyrThr: 4.184 ± 1.042
1.521TyrVal: 1.521 ± 0.462
0.0TyrTrp: 0.0 ± 0.0
1.141TyrTyr: 1.141 ± 0.573
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2630 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski