Amino acid dipeptide frequency for Hubei diptera virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
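The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the mapping of non-standard residues to `X`, and the choice to count overlapping pairs within each protein and normalize by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2  # 1/400 = 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein (never across
    protein boundaries) and normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def representation(freq_per_mille):
    """Classify a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

Applied to values from the table, `representation(12.009)` (SerLeu) gives `"over"` and `representation(0.25)` (CysAla) gives `"under"`.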

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.002 ± 0.777
AlaCys: 0.5 ± 0.26
AlaAsp: 2.502 ± 1.135
AlaGlu: 1.751 ± 0.681
AlaPhe: 1.001 ± 0.439
AlaGly: 2.502 ± 0.946
AlaHis: 1.001 ± 0.385
AlaIle: 4.003 ± 0.867
AlaLys: 3.503 ± 0.894
AlaLeu: 2.252 ± 0.608
AlaMet: 0.25 ± 0.314
AlaAsn: 3.753 ± 1.208
AlaPro: 0.751 ± 0.88
AlaGln: 2.252 ± 0.856
AlaArg: 2.252 ± 0.467
AlaSer: 2.002 ± 0.725
AlaThr: 2.752 ± 0.845
AlaVal: 2.252 ± 0.828
AlaTrp: 0.751 ± 0.291
AlaTyr: 1.751 ± 0.509
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.25 ± 0.154
CysCys: 0.0 ± 0.0
CysAsp: 0.5 ± 0.349
CysGlu: 0.5 ± 0.273
CysPhe: 0.0 ± 0.0
CysGly: 1.001 ± 0.27
CysHis: 0.751 ± 0.586
CysIle: 1.751 ± 0.503
CysLys: 0.5 ± 0.508
CysLeu: 1.001 ± 0.475
CysMet: 0.25 ± 0.325
CysAsn: 1.501 ± 0.338
CysPro: 1.251 ± 0.529
CysGln: 0.5 ± 0.295
CysArg: 0.5 ± 0.309
CysSer: 2.252 ± 1.6
CysThr: 1.751 ± 0.585
CysVal: 0.5 ± 0.273
CysTrp: 0.25 ± 0.154
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.251 ± 0.512
AspCys: 0.5 ± 0.26
AspAsp: 3.252 ± 0.942
AspGlu: 3.753 ± 0.818
AspPhe: 2.252 ± 0.296
AspGly: 1.751 ± 0.774
AspHis: 1.251 ± 0.548
AspIle: 2.752 ± 0.696
AspLys: 2.252 ± 0.391
AspLeu: 5.254 ± 1.566
AspMet: 1.751 ± 0.609
AspAsn: 3.002 ± 1.325
AspPro: 2.752 ± 0.475
AspGln: 2.252 ± 0.805
AspArg: 2.752 ± 0.766
AspSer: 1.751 ± 0.829
AspThr: 2.252 ± 0.584
AspVal: 4.754 ± 0.597
AspTrp: 2.502 ± 0.678
AspTyr: 2.752 ± 0.766
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.252 ± 2.333
GluCys: 1.001 ± 0.615
GluAsp: 3.252 ± 1.346
GluGlu: 3.753 ± 0.513
GluPhe: 2.752 ± 0.862
GluGly: 3.252 ± 1.014
GluHis: 0.751 ± 0.424
GluIle: 6.255 ± 1.052
GluLys: 3.252 ± 1.024
GluLeu: 7.255 ± 2.07
GluMet: 1.001 ± 0.376
GluAsn: 2.502 ± 1.297
GluPro: 3.002 ± 0.591
GluGln: 1.001 ± 0.486
GluArg: 2.752 ± 0.892
GluSer: 5.004 ± 0.97
GluThr: 3.503 ± 0.733
GluVal: 4.253 ± 0.884
GluTrp: 1.251 ± 1.099
GluTyr: 3.002 ± 0.722
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.751 ± 0.463
PheCys: 0.751 ± 0.703
PheAsp: 2.252 ± 0.912
PheGlu: 3.252 ± 1.003
PhePhe: 2.752 ± 0.495
PheGly: 2.502 ± 0.251
PheHis: 1.751 ± 0.577
PheIle: 3.252 ± 0.82
PheLys: 2.752 ± 0.646
PheLeu: 4.754 ± 1.288
PheMet: 1.001 ± 0.459
PheAsn: 3.252 ± 0.83
PhePro: 4.003 ± 0.333
PheGln: 2.502 ± 0.538
PheArg: 1.751 ± 0.504
PheSer: 3.753 ± 0.755
PheThr: 0.5 ± 0.273
PheVal: 2.002 ± 0.714
PheTrp: 0.5 ± 0.508
PheTyr: 2.752 ± 0.653
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.002 ± 0.777
GlyCys: 0.25 ± 0.314
GlyAsp: 2.252 ± 0.849
GlyGlu: 4.003 ± 1.241
GlyPhe: 3.503 ± 0.888
GlyGly: 2.002 ± 1.235
GlyHis: 1.001 ± 0.465
GlyIle: 3.252 ± 0.967
GlyLys: 2.502 ± 0.251
GlyLeu: 6.005 ± 1.126
GlyMet: 0.5 ± 0.506
GlyAsn: 1.751 ± 0.608
GlyPro: 2.502 ± 1.14
GlyGln: 1.751 ± 0.605
GlyArg: 1.251 ± 0.416
GlySer: 4.253 ± 1.233
GlyThr: 3.503 ± 0.383
GlyVal: 4.003 ± 0.856
GlyTrp: 1.001 ± 0.475
GlyTyr: 1.251 ± 0.512
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.751 ± 0.463
HisCys: 0.5 ± 0.349
HisAsp: 0.5 ± 0.661
HisGlu: 1.001 ± 0.52
HisPhe: 1.251 ± 0.692
HisGly: 1.501 ± 0.896
HisHis: 0.5 ± 0.309
HisIle: 2.002 ± 0.62
HisLys: 2.252 ± 0.859
HisLeu: 2.752 ± 0.772
HisMet: 0.25 ± 0.33
HisAsn: 1.251 ± 0.722
HisPro: 1.751 ± 0.839
HisGln: 1.751 ± 0.65
HisArg: 0.751 ± 0.291
HisSer: 2.002 ± 1.434
HisThr: 1.751 ± 1.306
HisVal: 2.002 ± 0.813
HisTrp: 0.25 ± 0.154
HisTyr: 0.751 ± 0.463
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.252 ± 0.872
IleCys: 1.251 ± 0.599
IleAsp: 3.753 ± 0.65
IleGlu: 4.003 ± 0.925
IlePhe: 3.753 ± 1.7
IleGly: 4.503 ± 0.983
IleHis: 1.501 ± 0.896
IleIle: 7.255 ± 0.732
IleLys: 3.503 ± 0.694
IleLeu: 3.503 ± 1.065
IleMet: 1.001 ± 0.523
IleAsn: 4.253 ± 1.019
IlePro: 4.754 ± 0.995
IleGln: 2.752 ± 0.93
IleArg: 6.755 ± 0.435
IleSer: 7.005 ± 1.977
IleThr: 5.004 ± 1.653
IleVal: 3.753 ± 0.537
IleTrp: 0.25 ± 0.33
IleTyr: 3.252 ± 0.779
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.001 ± 0.387
LysCys: 1.001 ± 0.627
LysAsp: 3.252 ± 0.862
LysGlu: 3.753 ± 0.938
LysPhe: 2.752 ± 0.968
LysGly: 2.252 ± 0.888
LysHis: 0.751 ± 0.388
LysIle: 5.504 ± 2.259
LysLys: 5.004 ± 1.448
LysLeu: 6.005 ± 0.604
LysMet: 1.751 ± 0.313
LysAsn: 2.752 ± 0.98
LysPro: 2.002 ± 0.51
LysGln: 1.001 ± 0.956
LysArg: 2.002 ± 0.785
LysSer: 3.503 ± 1.415
LysThr: 4.754 ± 1.393
LysVal: 2.252 ± 0.702
LysTrp: 2.252 ± 0.734
LysTyr: 2.752 ± 0.535
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.254 ± 1.056
LeuCys: 2.002 ± 1.198
LeuAsp: 4.503 ± 0.692
LeuGlu: 6.505 ± 1.423
LeuPhe: 5.254 ± 0.919
LeuGly: 6.005 ± 1.159
LeuHis: 2.752 ± 0.815
LeuIle: 9.257 ± 2.126
LeuLys: 7.506 ± 1.241
LeuLeu: 9.007 ± 2.619
LeuMet: 1.751 ± 0.529
LeuAsn: 6.255 ± 1.687
LeuPro: 3.503 ± 0.694
LeuGln: 3.503 ± 0.887
LeuArg: 6.755 ± 1.563
LeuSer: 9.757 ± 2.381
LeuThr: 7.255 ± 1.92
LeuVal: 5.504 ± 0.903
LeuTrp: 0.25 ± 0.33
LeuTyr: 3.753 ± 0.821
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.251 ± 0.416
MetCys: 0.0 ± 0.0
MetAsp: 0.751 ± 0.424
MetGlu: 1.501 ± 0.334
MetPhe: 1.251 ± 0.421
MetGly: 0.5 ± 0.295
MetHis: 0.0 ± 0.0
MetIle: 1.001 ± 0.303
MetLys: 1.001 ± 0.827
MetLeu: 2.752 ± 0.879
MetMet: 0.25 ± 0.414
MetAsn: 1.251 ± 0.717
MetPro: 0.0 ± 0.0
MetGln: 0.25 ± 0.154
MetArg: 1.251 ± 0.35
MetSer: 2.252 ± 0.674
MetThr: 0.25 ± 0.154
MetVal: 1.251 ± 0.53
MetTrp: 0.0 ± 0.0
MetTyr: 0.25 ± 0.154
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.002 ± 0.428
AsnCys: 0.751 ± 0.296
AsnAsp: 1.251 ± 0.505
AsnGlu: 2.752 ± 0.605
AsnPhe: 2.252 ± 0.812
AsnGly: 2.752 ± 0.672
AsnHis: 3.002 ± 0.948
AsnIle: 3.503 ± 0.426
AsnLys: 4.003 ± 1.019
AsnLeu: 8.256 ± 2.766
AsnMet: 1.251 ± 0.646
AsnAsn: 4.003 ± 0.464
AsnPro: 5.004 ± 1.759
AsnGln: 1.751 ± 0.615
AsnArg: 2.502 ± 0.347
AsnSer: 4.503 ± 0.808
AsnThr: 3.252 ± 1.233
AsnVal: 3.252 ± 1.202
AsnTrp: 1.251 ± 0.436
AsnTyr: 3.252 ± 1.207
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.502 ± 0.953
ProCys: 0.0 ± 0.0
ProAsp: 3.753 ± 0.72
ProGlu: 2.752 ± 1.03
ProPhe: 2.752 ± 0.494
ProGly: 2.002 ± 1.599
ProHis: 2.002 ± 0.953
ProIle: 2.252 ± 0.597
ProLys: 2.002 ± 0.723
ProLeu: 7.255 ± 0.725
ProMet: 0.25 ± 0.154
ProAsn: 3.252 ± 1.021
ProPro: 2.502 ± 0.636
ProGln: 1.001 ± 0.439
ProArg: 2.002 ± 0.62
ProSer: 6.255 ± 1.245
ProThr: 4.754 ± 1.23
ProVal: 1.251 ± 0.282
ProTrp: 0.751 ± 0.463
ProTyr: 1.501 ± 1.128
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.251 ± 0.722
GlnCys: 0.751 ± 0.296
GlnAsp: 2.002 ± 0.753
GlnGlu: 2.252 ± 0.858
GlnPhe: 2.002 ± 0.757
GlnGly: 1.501 ± 0.518
GlnHis: 1.251 ± 0.529
GlnIle: 2.502 ± 0.519
GlnLys: 1.501 ± 0.413
GlnLeu: 0.751 ± 0.416
GlnMet: 0.25 ± 0.474
GlnAsn: 2.502 ± 1.0
GlnPro: 0.751 ± 0.348
GlnGln: 0.5 ± 0.499
GlnArg: 1.751 ± 0.517
GlnSer: 5.004 ± 1.536
GlnThr: 3.252 ± 0.94
GlnVal: 2.002 ± 0.941
GlnTrp: 0.751 ± 0.338
GlnTyr: 1.001 ± 0.392
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.252 ± 0.735
ArgCys: 0.751 ± 0.388
ArgAsp: 3.002 ± 0.403
ArgGlu: 3.753 ± 1.419
ArgPhe: 3.002 ± 0.712
ArgGly: 2.252 ± 0.672
ArgHis: 0.751 ± 0.463
ArgIle: 3.002 ± 1.139
ArgLys: 1.001 ± 0.395
ArgLeu: 5.254 ± 1.215
ArgMet: 0.25 ± 0.154
ArgAsn: 3.503 ± 1.239
ArgPro: 3.252 ± 1.612
ArgGln: 2.252 ± 0.997
ArgArg: 2.002 ± 0.778
ArgSer: 6.755 ± 0.979
ArgThr: 2.502 ± 1.231
ArgVal: 3.503 ± 0.701
ArgTrp: 0.751 ± 0.463
ArgTyr: 1.001 ± 0.385
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.753 ± 1.574
SerCys: 1.501 ± 0.643
SerAsp: 6.005 ± 2.065
SerGlu: 5.504 ± 0.622
SerPhe: 4.253 ± 0.694
SerGly: 2.752 ± 0.501
SerHis: 2.002 ± 1.016
SerIle: 6.005 ± 1.566
SerLys: 4.003 ± 1.579
SerLeu: 12.009 ± 1.083
SerMet: 2.002 ± 0.534
SerAsn: 3.002 ± 0.535
SerPro: 4.003 ± 0.839
SerGln: 3.753 ± 0.775
SerArg: 5.004 ± 1.593
SerSer: 6.255 ± 1.056
SerThr: 4.253 ± 1.169
SerVal: 3.753 ± 1.115
SerTrp: 3.002 ± 1.05
SerTyr: 3.002 ± 1.307
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.503 ± 1.175
ThrCys: 1.501 ± 0.571
ThrAsp: 2.252 ± 0.856
ThrGlu: 4.754 ± 0.772
ThrPhe: 1.501 ± 0.879
ThrGly: 3.002 ± 0.743
ThrHis: 2.502 ± 1.604
ThrIle: 4.003 ± 1.081
ThrLys: 3.002 ± 0.819
ThrLeu: 7.756 ± 2.031
ThrMet: 0.5 ± 0.309
ThrAsn: 4.253 ± 0.84
ThrPro: 3.002 ± 1.387
ThrGln: 2.002 ± 0.79
ThrArg: 4.003 ± 0.807
ThrSer: 5.754 ± 1.037
ThrThr: 4.003 ± 0.843
ThrVal: 4.253 ± 1.218
ThrTrp: 0.751 ± 0.291
ThrTyr: 2.002 ± 0.875
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.751 ± 0.783
ValCys: 1.751 ± 0.62
ValAsp: 3.252 ± 0.636
ValGlu: 4.253 ± 1.468
ValPhe: 1.251 ± 0.436
ValGly: 3.002 ± 0.545
ValHis: 0.5 ± 0.62
ValIle: 5.254 ± 1.092
ValLys: 2.502 ± 0.434
ValLeu: 6.005 ± 1.021
ValMet: 1.501 ± 0.485
ValAsn: 4.754 ± 0.743
ValPro: 3.252 ± 0.596
ValGln: 1.251 ± 0.282
ValArg: 1.501 ± 0.518
ValSer: 3.753 ± 0.943
ValThr: 5.254 ± 1.225
ValVal: 3.503 ± 0.976
ValTrp: 0.25 ± 0.154
ValTyr: 1.501 ± 1.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.751 ± 0.555
TrpCys: 0.25 ± 0.154
TrpAsp: 1.501 ± 0.674
TrpGlu: 1.001 ± 0.475
TrpPhe: 1.501 ± 0.592
TrpGly: 1.251 ± 0.507
TrpHis: 0.751 ± 0.526
TrpIle: 0.751 ± 0.703
TrpLys: 0.751 ± 0.463
TrpLeu: 1.501 ± 0.294
TrpMet: 0.25 ± 0.33
TrpAsn: 1.751 ± 0.416
TrpPro: 0.25 ± 0.154
TrpGln: 1.001 ± 0.387
TrpArg: 0.25 ± 0.154
TrpSer: 1.501 ± 0.496
TrpThr: 1.501 ± 0.896
TrpVal: 0.5 ± 0.26
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.25 ± 0.33
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.001 ± 0.387
TyrCys: 0.0 ± 0.0
TyrAsp: 1.001 ± 0.392
TyrGlu: 1.251 ± 0.872
TyrPhe: 1.751 ± 0.659
TyrGly: 2.252 ± 0.835
TyrHis: 0.751 ± 0.291
TyrIle: 1.751 ± 0.366
TyrLys: 3.503 ± 2.642
TyrLeu: 7.005 ± 1.183
TyrMet: 0.751 ± 0.291
TyrAsn: 2.002 ± 1.377
TyrPro: 2.502 ± 1.286
TyrGln: 0.5 ± 0.451
TyrArg: 3.002 ± 0.73
TyrSer: 2.502 ± 0.701
TyrThr: 2.252 ± 0.683
TyrVal: 1.501 ± 0.598
TyrTrp: 0.5 ± 0.451
TyrTyr: 2.002 ± 1.229
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3998 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
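A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure used by the database: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the sample standard deviation across replicates is reported as the error. The function names, the normalization, the fixed seed, and the use of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Per-mille frequencies of overlapping dipeptides, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling whole proteins.

    Assumption: the error is the sample standard deviation over n_boot replicates,
    each built by drawing len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_freqs(sample).get(dipeptide, 0.0))
    return statistics.stdev(values)
```

Under this scheme a dipeptide that never occurs in any protein has a frequency of 0 in every replicate and therefore an error of 0, which is consistent with the `0.0 ± 0.0` entries in the table.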

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski