Amino acid dipepetide frequency for Hubei rhabdo-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.862AlaAla: 6.862 ± 2.261
0.792AlaCys: 0.792 ± 0.402
2.375AlaAsp: 2.375 ± 0.982
4.487AlaGlu: 4.487 ± 1.294
1.847AlaPhe: 1.847 ± 1.009
4.223AlaGly: 4.223 ± 1.071
1.32AlaHis: 1.32 ± 0.308
4.487AlaIle: 4.487 ± 1.679
2.639AlaLys: 2.639 ± 1.249
8.446AlaLeu: 8.446 ± 1.661
1.32AlaMet: 1.32 ± 0.59
3.167AlaAsn: 3.167 ± 1.432
3.167AlaPro: 3.167 ± 1.362
2.903AlaGln: 2.903 ± 0.697
4.223AlaArg: 4.223 ± 1.597
4.223AlaSer: 4.223 ± 1.313
4.223AlaThr: 4.223 ± 1.161
2.903AlaVal: 2.903 ± 0.514
0.792AlaTrp: 0.792 ± 0.368
1.584AlaTyr: 1.584 ± 0.716
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.528CysCys: 0.528 ± 0.293
0.264CysAsp: 0.264 ± 0.322
0.0CysGlu: 0.0 ± 0.0
0.528CysPhe: 0.528 ± 0.574
0.792CysGly: 0.792 ± 0.703
0.792CysHis: 0.792 ± 0.477
0.0CysIle: 0.0 ± 0.0
1.056CysLys: 1.056 ± 0.401
2.639CysLeu: 2.639 ± 0.913
0.0CysMet: 0.0 ± 0.0
1.056CysAsn: 1.056 ± 0.394
0.528CysPro: 0.528 ± 0.644
0.264CysGln: 0.264 ± 0.322
1.056CysArg: 1.056 ± 0.657
1.32CysSer: 1.32 ± 0.582
0.528CysThr: 0.528 ± 0.265
1.32CysVal: 1.32 ± 0.32
0.264CysTrp: 0.264 ± 0.394
0.264CysTyr: 0.264 ± 0.159
0.0CysXaa: 0.0 ± 0.0
Asp
2.639AspAla: 2.639 ± 1.088
0.792AspCys: 0.792 ± 0.323
1.847AspAsp: 1.847 ± 0.558
4.223AspGlu: 4.223 ± 0.864
2.639AspPhe: 2.639 ± 1.061
2.903AspGly: 2.903 ± 0.688
1.584AspHis: 1.584 ± 0.587
2.111AspIle: 2.111 ± 0.549
2.903AspLys: 2.903 ± 0.75
5.278AspLeu: 5.278 ± 1.407
0.792AspMet: 0.792 ± 0.362
2.639AspAsn: 2.639 ± 0.586
2.903AspPro: 2.903 ± 1.307
1.584AspGln: 1.584 ± 0.526
1.584AspArg: 1.584 ± 0.609
3.431AspSer: 3.431 ± 1.155
2.375AspThr: 2.375 ± 0.684
2.111AspVal: 2.111 ± 0.674
0.792AspTrp: 0.792 ± 0.296
0.264AspTyr: 0.264 ± 0.159
0.0AspXaa: 0.0 ± 0.0
Glu
4.487GluAla: 4.487 ± 0.484
0.264GluCys: 0.264 ± 0.322
1.847GluAsp: 1.847 ± 0.363
4.751GluGlu: 4.751 ± 1.258
2.111GluPhe: 2.111 ± 0.605
3.431GluGly: 3.431 ± 0.915
2.111GluHis: 2.111 ± 0.632
4.223GluIle: 4.223 ± 0.713
4.223GluLys: 4.223 ± 0.958
5.806GluLeu: 5.806 ± 0.768
1.32GluMet: 1.32 ± 0.561
1.847GluAsn: 1.847 ± 0.781
1.584GluPro: 1.584 ± 0.665
1.32GluGln: 1.32 ± 0.334
2.903GluArg: 2.903 ± 1.121
6.07GluSer: 6.07 ± 1.692
5.015GluThr: 5.015 ± 0.522
3.695GluVal: 3.695 ± 0.851
0.792GluTrp: 0.792 ± 0.477
0.792GluTyr: 0.792 ± 0.716
0.0GluXaa: 0.0 ± 0.0
Phe
2.375PheAla: 2.375 ± 0.318
0.264PheCys: 0.264 ± 0.394
2.639PheAsp: 2.639 ± 0.938
2.111PheGlu: 2.111 ± 0.612
1.584PhePhe: 1.584 ± 0.839
3.695PheGly: 3.695 ± 0.584
1.056PheHis: 1.056 ± 0.321
2.111PheIle: 2.111 ± 0.348
1.32PheLys: 1.32 ± 0.428
3.695PheLeu: 3.695 ± 1.622
1.056PheMet: 1.056 ± 0.588
1.584PheAsn: 1.584 ± 0.2
4.487PhePro: 4.487 ± 1.322
1.847PheGln: 1.847 ± 0.542
1.584PheArg: 1.584 ± 0.665
3.959PheSer: 3.959 ± 0.865
2.903PheThr: 2.903 ± 0.696
2.375PheVal: 2.375 ± 0.765
0.264PheTrp: 0.264 ± 0.339
1.056PheTyr: 1.056 ± 0.947
0.0PheXaa: 0.0 ± 0.0
Gly
3.959GlyAla: 3.959 ± 1.055
0.528GlyCys: 0.528 ± 0.318
4.223GlyAsp: 4.223 ± 0.974
3.431GlyGlu: 3.431 ± 0.752
3.431GlyPhe: 3.431 ± 0.64
3.695GlyGly: 3.695 ± 0.334
2.111GlyHis: 2.111 ± 0.944
3.167GlyIle: 3.167 ± 0.852
2.903GlyLys: 2.903 ± 0.922
5.278GlyLeu: 5.278 ± 0.645
2.375GlyMet: 2.375 ± 1.09
2.375GlyAsn: 2.375 ± 1.138
2.903GlyPro: 2.903 ± 1.024
1.32GlyGln: 1.32 ± 0.795
2.111GlyArg: 2.111 ± 0.404
6.598GlySer: 6.598 ± 0.892
3.431GlyThr: 3.431 ± 0.605
6.334GlyVal: 6.334 ± 1.323
1.32GlyTrp: 1.32 ± 0.795
1.847GlyTyr: 1.847 ± 0.665
0.0GlyXaa: 0.0 ± 0.0
His
1.584HisAla: 1.584 ± 0.587
0.792HisCys: 0.792 ± 0.368
1.584HisAsp: 1.584 ± 0.2
1.056HisGlu: 1.056 ± 0.394
0.792HisPhe: 0.792 ± 0.323
0.792HisGly: 0.792 ± 0.477
1.584HisHis: 1.584 ± 0.491
1.847HisIle: 1.847 ± 0.738
1.056HisLys: 1.056 ± 0.394
3.431HisLeu: 3.431 ± 0.718
0.264HisMet: 0.264 ± 0.159
0.264HisAsn: 0.264 ± 0.339
2.375HisPro: 2.375 ± 0.893
0.792HisGln: 0.792 ± 0.323
1.584HisArg: 1.584 ± 0.37
2.639HisSer: 2.639 ± 1.047
1.056HisThr: 1.056 ± 0.315
0.792HisVal: 0.792 ± 0.296
0.0HisTrp: 0.0 ± 0.0
1.32HisTyr: 1.32 ± 0.555
0.0HisXaa: 0.0 ± 0.0
Ile
5.542IleAla: 5.542 ± 0.833
1.056IleCys: 1.056 ± 0.636
1.847IleAsp: 1.847 ± 0.734
3.695IleGlu: 3.695 ± 1.37
3.167IlePhe: 3.167 ± 0.698
4.223IleGly: 4.223 ± 0.413
1.584IleHis: 1.584 ± 0.425
3.695IleIle: 3.695 ± 2.126
1.847IleLys: 1.847 ± 0.855
6.334IleLeu: 6.334 ± 0.611
1.584IleMet: 1.584 ± 0.647
2.111IleAsn: 2.111 ± 0.639
4.751IlePro: 4.751 ± 0.768
1.847IleGln: 1.847 ± 0.337
6.07IleArg: 6.07 ± 1.228
3.431IleSer: 3.431 ± 0.738
2.903IleThr: 2.903 ± 0.568
3.167IleVal: 3.167 ± 0.963
1.584IleTrp: 1.584 ± 0.371
1.32IleTyr: 1.32 ± 0.795
0.0IleXaa: 0.0 ± 0.0
Lys
3.695LysAla: 3.695 ± 2.006
0.528LysCys: 0.528 ± 0.473
1.584LysAsp: 1.584 ± 0.491
3.959LysGlu: 3.959 ± 1.201
2.639LysPhe: 2.639 ± 0.923
3.431LysGly: 3.431 ± 1.162
0.528LysHis: 0.528 ± 0.323
4.223LysIle: 4.223 ± 0.484
2.111LysLys: 2.111 ± 0.735
4.751LysLeu: 4.751 ± 0.567
0.528LysMet: 0.528 ± 0.509
2.111LysAsn: 2.111 ± 1.042
2.375LysPro: 2.375 ± 0.588
1.056LysGln: 1.056 ± 0.627
2.903LysArg: 2.903 ± 0.837
2.375LysSer: 2.375 ± 0.874
3.959LysThr: 3.959 ± 1.031
4.487LysVal: 4.487 ± 0.59
1.056LysTrp: 1.056 ± 0.657
0.264LysTyr: 0.264 ± 0.159
0.0LysXaa: 0.0 ± 0.0
Leu
4.751LeuAla: 4.751 ± 1.327
1.847LeuCys: 1.847 ± 0.507
5.278LeuAsp: 5.278 ± 1.04
6.598LeuGlu: 6.598 ± 1.201
4.223LeuPhe: 4.223 ± 1.481
6.862LeuGly: 6.862 ± 1.416
1.847LeuHis: 1.847 ± 0.772
7.654LeuIle: 7.654 ± 1.756
6.598LeuLys: 6.598 ± 1.31
12.404LeuLeu: 12.404 ± 0.713
2.903LeuMet: 2.903 ± 1.119
3.695LeuAsn: 3.695 ± 0.826
3.695LeuPro: 3.695 ± 1.057
4.223LeuGln: 4.223 ± 0.717
7.918LeuArg: 7.918 ± 1.213
10.293LeuSer: 10.293 ± 2.015
6.862LeuThr: 6.862 ± 1.27
6.598LeuVal: 6.598 ± 1.023
0.528LeuTrp: 0.528 ± 0.265
2.375LeuTyr: 2.375 ± 0.775
0.0LeuXaa: 0.0 ± 0.0
Met
1.584MetAla: 1.584 ± 0.516
0.0MetCys: 0.0 ± 0.0
1.32MetAsp: 1.32 ± 0.707
1.056MetGlu: 1.056 ± 0.394
1.584MetPhe: 1.584 ± 0.425
0.792MetGly: 0.792 ± 0.327
0.264MetHis: 0.264 ± 0.394
2.639MetIle: 2.639 ± 0.605
0.528MetLys: 0.528 ± 0.265
1.584MetLeu: 1.584 ± 0.371
1.32MetMet: 1.32 ± 0.549
0.792MetAsn: 0.792 ± 0.703
0.528MetPro: 0.528 ± 0.323
0.528MetGln: 0.528 ± 0.509
1.584MetArg: 1.584 ± 0.2
2.639MetSer: 2.639 ± 0.362
1.584MetThr: 1.584 ± 0.526
1.32MetVal: 1.32 ± 0.718
0.0MetTrp: 0.0 ± 0.0
0.528MetTyr: 0.528 ± 0.323
0.0MetXaa: 0.0 ± 0.0
Asn
1.584AsnAla: 1.584 ± 0.785
0.792AsnCys: 0.792 ± 0.735
0.264AsnAsp: 0.264 ± 0.394
1.056AsnGlu: 1.056 ± 0.423
2.111AsnPhe: 2.111 ± 0.375
2.111AsnGly: 2.111 ± 0.627
0.264AsnHis: 0.264 ± 0.159
2.375AsnIle: 2.375 ± 0.394
0.792AsnLys: 0.792 ± 0.875
4.487AsnLeu: 4.487 ± 1.281
0.264AsnMet: 0.264 ± 0.339
0.528AsnAsn: 0.528 ± 0.586
4.223AsnPro: 4.223 ± 1.032
1.584AsnGln: 1.584 ± 1.515
2.375AsnArg: 2.375 ± 0.829
3.959AsnSer: 3.959 ± 0.808
1.847AsnThr: 1.847 ± 0.384
1.584AsnVal: 1.584 ± 0.568
1.32AsnTrp: 1.32 ± 0.582
0.792AsnTyr: 0.792 ± 0.323
0.0AsnXaa: 0.0 ± 0.0
Pro
2.903ProAla: 2.903 ± 0.409
0.528ProCys: 0.528 ± 0.318
3.431ProAsp: 3.431 ± 1.086
3.167ProGlu: 3.167 ± 0.587
2.375ProPhe: 2.375 ± 0.829
2.375ProGly: 2.375 ± 1.036
2.375ProHis: 2.375 ± 0.588
3.167ProIle: 3.167 ± 0.699
3.431ProLys: 3.431 ± 1.135
5.542ProLeu: 5.542 ± 1.517
0.792ProMet: 0.792 ± 0.477
2.375ProAsn: 2.375 ± 0.318
4.487ProPro: 4.487 ± 0.737
1.847ProGln: 1.847 ± 1.033
2.375ProArg: 2.375 ± 0.63
5.542ProSer: 5.542 ± 0.771
3.431ProThr: 3.431 ± 1.118
4.223ProVal: 4.223 ± 1.039
1.056ProTrp: 1.056 ± 0.321
2.375ProTyr: 2.375 ± 1.013
0.0ProXaa: 0.0 ± 0.0
Gln
2.639GlnAla: 2.639 ± 1.143
0.792GlnCys: 0.792 ± 0.477
1.847GlnAsp: 1.847 ± 1.034
1.32GlnGlu: 1.32 ± 0.506
1.32GlnPhe: 1.32 ± 0.309
2.375GlnGly: 2.375 ± 0.604
1.056GlnHis: 1.056 ± 0.426
1.584GlnIle: 1.584 ± 0.97
2.903GlnLys: 2.903 ± 0.461
3.431GlnLeu: 3.431 ± 1.127
1.056GlnMet: 1.056 ± 0.465
1.584GlnAsn: 1.584 ± 0.389
0.792GlnPro: 0.792 ± 0.323
1.847GlnGln: 1.847 ± 0.619
2.375GlnArg: 2.375 ± 1.236
2.639GlnSer: 2.639 ± 0.255
2.375GlnThr: 2.375 ± 0.834
2.639GlnVal: 2.639 ± 0.82
0.528GlnTrp: 0.528 ± 0.323
0.528GlnTyr: 0.528 ± 0.293
0.0GlnXaa: 0.0 ± 0.0
Arg
4.487ArgAla: 4.487 ± 1.203
1.32ArgCys: 1.32 ± 0.381
2.111ArgAsp: 2.111 ± 0.606
2.903ArgGlu: 2.903 ± 0.561
2.111ArgPhe: 2.111 ± 0.924
4.751ArgGly: 4.751 ± 1.031
1.32ArgHis: 1.32 ± 0.795
3.695ArgIle: 3.695 ± 0.649
2.375ArgLys: 2.375 ± 0.954
5.278ArgLeu: 5.278 ± 1.081
1.847ArgMet: 1.847 ± 1.01
1.32ArgAsn: 1.32 ± 0.707
2.375ArgPro: 2.375 ± 0.318
2.639ArgGln: 2.639 ± 0.543
3.959ArgArg: 3.959 ± 0.819
6.07ArgSer: 6.07 ± 0.39
3.959ArgThr: 3.959 ± 1.323
3.167ArgVal: 3.167 ± 1.058
0.528ArgTrp: 0.528 ± 0.265
2.639ArgTyr: 2.639 ± 0.362
0.0ArgXaa: 0.0 ± 0.0
Ser
5.015SerAla: 5.015 ± 1.678
1.32SerCys: 1.32 ± 0.807
4.487SerAsp: 4.487 ± 1.139
5.278SerGlu: 5.278 ± 1.59
3.959SerPhe: 3.959 ± 1.097
4.487SerGly: 4.487 ± 0.647
1.847SerHis: 1.847 ± 0.538
5.542SerIle: 5.542 ± 1.109
6.07SerLys: 6.07 ± 1.393
11.085SerLeu: 11.085 ± 1.515
1.584SerMet: 1.584 ± 0.646
2.903SerAsn: 2.903 ± 1.032
4.751SerPro: 4.751 ± 1.419
3.167SerGln: 3.167 ± 0.587
5.278SerArg: 5.278 ± 0.591
8.182SerSer: 8.182 ± 2.248
5.015SerThr: 5.015 ± 0.761
5.015SerVal: 5.015 ± 0.915
0.792SerTrp: 0.792 ± 0.296
4.487SerTyr: 4.487 ± 1.334
0.0SerXaa: 0.0 ± 0.0
Thr
4.751ThrAla: 4.751 ± 1.301
0.528ThrCys: 0.528 ± 0.293
2.375ThrAsp: 2.375 ± 0.737
3.695ThrGlu: 3.695 ± 0.726
2.639ThrPhe: 2.639 ± 1.274
5.015ThrGly: 5.015 ± 1.692
2.375ThrHis: 2.375 ± 0.497
3.695ThrIle: 3.695 ± 0.615
1.056ThrLys: 1.056 ± 0.451
7.918ThrLeu: 7.918 ± 0.545
0.528ThrMet: 0.528 ± 0.323
0.264ThrAsn: 0.264 ± 0.417
5.278ThrPro: 5.278 ± 1.183
2.639ThrGln: 2.639 ± 0.834
3.959ThrArg: 3.959 ± 1.004
5.542ThrSer: 5.542 ± 0.849
3.959ThrThr: 3.959 ± 1.763
3.959ThrVal: 3.959 ± 0.786
2.111ThrTrp: 2.111 ± 0.975
1.056ThrTyr: 1.056 ± 0.451
0.0ThrXaa: 0.0 ± 0.0
Val
5.806ValAla: 5.806 ± 1.848
0.264ValCys: 0.264 ± 0.339
4.487ValAsp: 4.487 ± 0.694
3.431ValGlu: 3.431 ± 0.694
1.32ValPhe: 1.32 ± 0.309
3.431ValGly: 3.431 ± 1.083
1.056ValHis: 1.056 ± 0.636
3.695ValIle: 3.695 ± 1.136
2.375ValLys: 2.375 ± 0.823
6.334ValLeu: 6.334 ± 0.799
1.32ValMet: 1.32 ± 0.523
2.903ValAsn: 2.903 ± 0.51
2.903ValPro: 2.903 ± 0.51
2.375ValGln: 2.375 ± 1.299
2.903ValArg: 2.903 ± 0.775
5.806ValSer: 5.806 ± 1.646
5.015ValThr: 5.015 ± 1.601
5.015ValVal: 5.015 ± 0.487
0.264ValTrp: 0.264 ± 0.339
2.639ValTyr: 2.639 ± 0.724
0.0ValXaa: 0.0 ± 0.0
Trp
1.056TrpAla: 1.056 ± 0.315
0.264TrpCys: 0.264 ± 0.394
1.056TrpAsp: 1.056 ± 0.601
1.056TrpGlu: 1.056 ± 0.314
0.528TrpPhe: 0.528 ± 0.52
1.847TrpGly: 1.847 ± 0.814
0.0TrpHis: 0.0 ± 0.0
0.264TrpIle: 0.264 ± 0.159
1.056TrpLys: 1.056 ± 0.627
0.528TrpLeu: 0.528 ± 0.318
0.264TrpMet: 0.264 ± 0.394
0.264TrpAsn: 0.264 ± 0.159
0.264TrpPro: 0.264 ± 0.159
0.528TrpGln: 0.528 ± 0.293
0.528TrpArg: 0.528 ± 0.318
2.111TrpSer: 2.111 ± 0.385
1.847TrpThr: 1.847 ± 0.814
0.528TrpVal: 0.528 ± 0.265
0.528TrpTrp: 0.528 ± 0.52
0.528TrpTyr: 0.528 ± 0.318
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.528TyrAla: 0.528 ± 0.318
0.0TyrCys: 0.0 ± 0.0
0.528TyrAsp: 0.528 ± 0.265
1.32TyrGlu: 1.32 ± 0.308
1.056TyrPhe: 1.056 ± 0.426
2.111TyrGly: 2.111 ± 0.605
0.528TyrHis: 0.528 ± 0.318
1.584TyrIle: 1.584 ± 0.756
1.32TyrLys: 1.32 ± 0.795
2.903TyrLeu: 2.903 ± 0.461
0.792TyrMet: 0.792 ± 0.296
0.528TyrAsn: 0.528 ± 0.293
3.431TyrPro: 3.431 ± 0.907
1.32TyrGln: 1.32 ± 0.428
1.584TyrArg: 1.584 ± 0.419
3.431TyrSer: 3.431 ± 0.963
1.056TyrThr: 1.056 ± 0.463
2.111TyrVal: 2.111 ± 0.641
0.528TyrTrp: 0.528 ± 0.323
1.056TyrTyr: 1.056 ± 0.53
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3790 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski