Amino acid dipeptide frequency for His1 virus (isolate Australia/Victoria) (His1V) (Haloarcula hispanica virus 1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (the expected 0.25% corresponds to 2.5‰).
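As an illustration of how such per mille values could be obtained, here is a minimal Python sketch. It is not the database's own code: the function names, the toy sequences, and the choice to normalise by the total number of dipeptides are all assumptions made for the example, and the database's normalisation may differ (for instance, dividing by the total residue count).

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein (never across proteins)."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille (assumed normalisation)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy sequences, not taken from the His1V proteome.
proteins = ["MKTAYIAKQR", "MAAK"]
freqs = dipeptide_per_mille(proteins)

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected = 1000.0 / 400

over = sorted(d for d, f in freqs.items() if f > expected)
print(f"expected: {expected} per mille; overrepresented: {over}")
```

Comparing each value with the uniform expectation of 2.5‰ mirrors the red/blue marking described above, although the page does not state the exact threshold it uses.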

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.58AlaAla: 1.58 ± 0.575
0.677AlaCys: 0.677 ± 0.394
4.288AlaAsp: 4.288 ± 0.911
4.514AlaGlu: 4.514 ± 1.531
2.708AlaPhe: 2.708 ± 0.939
4.288AlaGly: 4.288 ± 0.832
0.226AlaHis: 0.226 ± 0.234
4.288AlaIle: 4.288 ± 1.323
5.868AlaLys: 5.868 ± 1.631
2.257AlaLeu: 2.257 ± 1.049
0.903AlaMet: 0.903 ± 0.539
5.191AlaAsn: 5.191 ± 1.088
1.354AlaPro: 1.354 ± 0.476
0.677AlaGln: 0.677 ± 0.385
4.062AlaArg: 4.062 ± 1.155
2.934AlaSer: 2.934 ± 0.965
4.288AlaThr: 4.288 ± 1.146
4.739AlaVal: 4.739 ± 0.859
2.031AlaTrp: 2.031 ± 0.55
2.257AlaTyr: 2.257 ± 0.679
0.0AlaXaa: 0.0 ± 0.0
Cys
1.128CysAla: 1.128 ± 0.569
0.0CysCys: 0.0 ± 0.0
0.451CysAsp: 0.451 ± 0.282
1.128CysGlu: 1.128 ± 0.528
0.451CysPhe: 0.451 ± 0.319
2.257CysGly: 2.257 ± 0.883
0.451CysHis: 0.451 ± 0.358
0.677CysIle: 0.677 ± 0.364
0.903CysLys: 0.903 ± 0.435
0.903CysLeu: 0.903 ± 0.577
0.451CysMet: 0.451 ± 0.497
0.903CysAsn: 0.903 ± 0.39
1.805CysPro: 1.805 ± 0.777
1.128CysGln: 1.128 ± 0.427
0.677CysArg: 0.677 ± 0.854
1.58CysSer: 1.58 ± 0.672
0.451CysThr: 0.451 ± 0.294
1.128CysVal: 1.128 ± 0.489
0.677CysTrp: 0.677 ± 0.37
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.805AspAla: 1.805 ± 0.81
1.805AspCys: 1.805 ± 0.787
3.16AspAsp: 3.16 ± 0.996
2.257AspGlu: 2.257 ± 0.679
3.611AspPhe: 3.611 ± 0.785
5.642AspGly: 5.642 ± 1.101
0.677AspHis: 0.677 ± 0.366
4.514AspIle: 4.514 ± 1.074
2.934AspLys: 2.934 ± 0.646
3.611AspLeu: 3.611 ± 0.817
2.257AspMet: 2.257 ± 0.865
4.514AspAsn: 4.514 ± 0.74
1.354AspPro: 1.354 ± 0.466
0.451AspGln: 0.451 ± 0.302
1.58AspArg: 1.58 ± 0.573
5.642AspSer: 5.642 ± 1.445
2.708AspThr: 2.708 ± 0.778
2.708AspVal: 2.708 ± 0.914
0.451AspTrp: 0.451 ± 0.309
2.708AspTyr: 2.708 ± 0.685
0.0AspXaa: 0.0 ± 0.0
Glu
3.837GluAla: 3.837 ± 1.295
1.805GluCys: 1.805 ± 0.978
2.031GluAsp: 2.031 ± 0.628
4.965GluGlu: 4.965 ± 1.693
1.354GluPhe: 1.354 ± 0.731
2.934GluGly: 2.934 ± 0.558
1.128GluHis: 1.128 ± 0.6
5.416GluIle: 5.416 ± 0.827
6.996GluLys: 6.996 ± 1.259
7.448GluLeu: 7.448 ± 1.825
2.483GluMet: 2.483 ± 0.922
5.868GluAsn: 5.868 ± 1.241
1.58GluPro: 1.58 ± 0.785
3.837GluGln: 3.837 ± 0.907
2.257GluArg: 2.257 ± 0.767
5.191GluSer: 5.191 ± 1.44
5.191GluThr: 5.191 ± 1.145
4.965GluVal: 4.965 ± 1.127
1.354GluTrp: 1.354 ± 0.391
4.062GluTyr: 4.062 ± 0.78
0.0GluXaa: 0.0 ± 0.0
Phe
1.354PheAla: 1.354 ± 0.462
0.903PheCys: 0.903 ± 0.386
2.934PheAsp: 2.934 ± 0.792
2.483PheGlu: 2.483 ± 0.551
1.58PhePhe: 1.58 ± 0.575
2.257PheGly: 2.257 ± 0.907
0.226PheHis: 0.226 ± 0.264
3.385PheIle: 3.385 ± 0.773
1.805PheLys: 1.805 ± 0.697
4.062PheLeu: 4.062 ± 0.857
1.805PheMet: 1.805 ± 0.619
2.483PheAsn: 2.483 ± 0.979
1.128PhePro: 1.128 ± 0.491
1.128PheGln: 1.128 ± 0.513
0.903PheArg: 0.903 ± 0.459
3.837PheSer: 3.837 ± 1.409
2.031PheThr: 2.031 ± 0.897
4.288PheVal: 4.288 ± 1.171
0.677PheTrp: 0.677 ± 0.441
2.031PheTyr: 2.031 ± 0.743
0.0PheXaa: 0.0 ± 0.0
Gly
4.062GlyAla: 4.062 ± 1.02
1.128GlyCys: 1.128 ± 0.47
3.385GlyAsp: 3.385 ± 0.692
4.514GlyGlu: 4.514 ± 1.102
3.385GlyPhe: 3.385 ± 0.664
2.708GlyGly: 2.708 ± 0.83
0.677GlyHis: 0.677 ± 0.36
4.062GlyIle: 4.062 ± 1.339
4.288GlyLys: 4.288 ± 1.583
7.222GlyLeu: 7.222 ± 1.507
2.257GlyMet: 2.257 ± 0.862
2.934GlyAsn: 2.934 ± 1.164
2.031GlyPro: 2.031 ± 0.529
2.934GlyGln: 2.934 ± 1.008
2.934GlyArg: 2.934 ± 0.935
4.288GlySer: 4.288 ± 1.202
3.837GlyThr: 3.837 ± 0.709
3.837GlyVal: 3.837 ± 0.694
0.677GlyTrp: 0.677 ± 0.439
3.385GlyTyr: 3.385 ± 0.894
0.0GlyXaa: 0.0 ± 0.0
His
0.903HisAla: 0.903 ± 0.384
0.0HisCys: 0.0 ± 0.0
1.128HisAsp: 1.128 ± 0.432
0.451HisGlu: 0.451 ± 0.293
0.677HisPhe: 0.677 ± 0.354
0.451HisGly: 0.451 ± 0.36
0.0HisHis: 0.0 ± 0.0
1.128HisIle: 1.128 ± 0.551
1.128HisLys: 1.128 ± 0.715
0.903HisLeu: 0.903 ± 0.602
0.903HisMet: 0.903 ± 0.424
1.128HisAsn: 1.128 ± 0.577
0.226HisPro: 0.226 ± 0.234
0.226HisGln: 0.226 ± 0.248
0.677HisArg: 0.677 ± 0.467
1.128HisSer: 1.128 ± 0.641
0.451HisThr: 0.451 ± 0.327
0.903HisVal: 0.903 ± 0.458
0.451HisTrp: 0.451 ± 0.497
0.903HisTyr: 0.903 ± 0.427
0.0HisXaa: 0.0 ± 0.0
Ile
4.288IleAla: 4.288 ± 1.278
1.354IleCys: 1.354 ± 0.495
4.739IleAsp: 4.739 ± 1.249
6.996IleGlu: 6.996 ± 0.952
3.837IlePhe: 3.837 ± 0.933
5.191IleGly: 5.191 ± 1.303
0.903IleHis: 0.903 ± 0.484
6.319IleIle: 6.319 ± 2.095
4.062IleLys: 4.062 ± 0.961
4.965IleLeu: 4.965 ± 0.952
1.805IleMet: 1.805 ± 0.536
2.257IleAsn: 2.257 ± 0.604
1.805IlePro: 1.805 ± 0.574
2.483IleGln: 2.483 ± 0.794
1.128IleArg: 1.128 ± 0.654
4.288IleSer: 4.288 ± 1.017
6.545IleThr: 6.545 ± 1.046
3.385IleVal: 3.385 ± 0.788
1.805IleTrp: 1.805 ± 0.612
2.483IleTyr: 2.483 ± 0.687
0.0IleXaa: 0.0 ± 0.0
Lys
4.062LysAla: 4.062 ± 1.124
1.354LysCys: 1.354 ± 0.657
4.062LysAsp: 4.062 ± 1.101
4.514LysGlu: 4.514 ± 1.394
2.031LysPhe: 2.031 ± 0.547
4.514LysGly: 4.514 ± 0.715
1.58LysHis: 1.58 ± 0.71
4.288LysIle: 4.288 ± 0.93
2.934LysLys: 2.934 ± 0.925
3.16LysLeu: 3.16 ± 0.642
1.58LysMet: 1.58 ± 0.547
3.385LysAsn: 3.385 ± 1.074
2.257LysPro: 2.257 ± 0.628
2.708LysGln: 2.708 ± 0.704
2.934LysArg: 2.934 ± 1.085
4.288LysSer: 4.288 ± 0.948
4.288LysThr: 4.288 ± 0.821
2.934LysVal: 2.934 ± 0.622
1.58LysTrp: 1.58 ± 0.553
2.934LysTyr: 2.934 ± 0.85
0.0LysXaa: 0.0 ± 0.0
Leu
7.448LeuAla: 7.448 ± 1.052
1.58LeuCys: 1.58 ± 0.677
2.708LeuAsp: 2.708 ± 0.946
7.899LeuGlu: 7.899 ± 1.188
3.837LeuPhe: 3.837 ± 1.19
6.545LeuGly: 6.545 ± 1.126
1.354LeuHis: 1.354 ± 0.564
4.062LeuIle: 4.062 ± 0.838
4.739LeuLys: 4.739 ± 1.018
8.576LeuLeu: 8.576 ± 2.036
2.257LeuMet: 2.257 ± 1.09
3.16LeuAsn: 3.16 ± 0.863
3.385LeuPro: 3.385 ± 1.01
2.483LeuGln: 2.483 ± 0.857
2.483LeuArg: 2.483 ± 0.89
5.642LeuSer: 5.642 ± 0.975
5.868LeuThr: 5.868 ± 1.259
5.416LeuVal: 5.416 ± 0.945
1.805LeuTrp: 1.805 ± 0.569
1.354LeuTyr: 1.354 ± 0.485
0.0LeuXaa: 0.0 ± 0.0
Met
4.288MetAla: 4.288 ± 1.069
0.226MetCys: 0.226 ± 0.235
0.903MetAsp: 0.903 ± 0.413
1.354MetGlu: 1.354 ± 0.535
0.451MetPhe: 0.451 ± 0.363
0.677MetGly: 0.677 ± 0.372
0.0MetHis: 0.0 ± 0.0
2.934MetIle: 2.934 ± 0.93
1.128MetLys: 1.128 ± 0.561
3.385MetLeu: 3.385 ± 1.082
1.354MetMet: 1.354 ± 0.695
1.58MetAsn: 1.58 ± 0.571
1.128MetPro: 1.128 ± 0.484
1.805MetGln: 1.805 ± 0.691
0.677MetArg: 0.677 ± 0.488
3.611MetSer: 3.611 ± 1.029
2.257MetThr: 2.257 ± 0.639
3.16MetVal: 3.16 ± 0.86
0.451MetTrp: 0.451 ± 0.26
2.031MetTyr: 2.031 ± 0.745
0.0MetXaa: 0.0 ± 0.0
Asn
3.385AsnAla: 3.385 ± 1.049
1.58AsnCys: 1.58 ± 0.706
2.257AsnAsp: 2.257 ± 0.56
4.514AsnGlu: 4.514 ± 1.298
1.354AsnPhe: 1.354 ± 0.538
3.837AsnGly: 3.837 ± 0.782
0.677AsnHis: 0.677 ± 0.328
4.739AsnIle: 4.739 ± 1.432
2.257AsnLys: 2.257 ± 0.789
4.965AsnLeu: 4.965 ± 0.889
2.031AsnMet: 2.031 ± 0.611
2.257AsnAsn: 2.257 ± 1.094
2.483AsnPro: 2.483 ± 0.621
2.483AsnGln: 2.483 ± 0.592
2.031AsnArg: 2.031 ± 0.698
4.739AsnSer: 4.739 ± 0.752
3.611AsnThr: 3.611 ± 0.86
3.385AsnVal: 3.385 ± 1.059
0.903AsnTrp: 0.903 ± 0.364
2.031AsnTyr: 2.031 ± 0.683
0.0AsnXaa: 0.0 ± 0.0
Pro
1.58ProAla: 1.58 ± 0.565
0.226ProCys: 0.226 ± 0.204
2.031ProAsp: 2.031 ± 0.651
3.837ProGlu: 3.837 ± 1.158
2.708ProPhe: 2.708 ± 0.848
2.708ProGly: 2.708 ± 0.847
0.0ProHis: 0.0 ± 0.0
1.354ProIle: 1.354 ± 0.646
1.58ProLys: 1.58 ± 0.696
2.031ProLeu: 2.031 ± 0.711
1.805ProMet: 1.805 ± 0.655
2.483ProAsn: 2.483 ± 0.735
1.354ProPro: 1.354 ± 0.558
0.903ProGln: 0.903 ± 0.394
0.677ProArg: 0.677 ± 0.403
2.257ProSer: 2.257 ± 0.833
1.58ProThr: 1.58 ± 0.621
2.257ProVal: 2.257 ± 0.827
0.451ProTrp: 0.451 ± 0.438
1.128ProTyr: 1.128 ± 0.523
0.0ProXaa: 0.0 ± 0.0
Gln
1.354GlnAla: 1.354 ± 0.553
0.0GlnCys: 0.0 ± 0.0
2.257GlnAsp: 2.257 ± 0.585
4.062GlnGlu: 4.062 ± 0.806
1.58GlnPhe: 1.58 ± 0.564
1.58GlnGly: 1.58 ± 0.681
0.903GlnHis: 0.903 ± 0.543
2.257GlnIle: 2.257 ± 0.749
2.708GlnLys: 2.708 ± 0.854
1.805GlnLeu: 1.805 ± 0.446
1.354GlnMet: 1.354 ± 0.8
1.58GlnAsn: 1.58 ± 0.537
1.354GlnPro: 1.354 ± 0.586
0.903GlnGln: 0.903 ± 0.511
0.677GlnArg: 0.677 ± 0.426
2.031GlnSer: 2.031 ± 0.634
4.062GlnThr: 4.062 ± 1.025
2.934GlnVal: 2.934 ± 0.81
1.128GlnTrp: 1.128 ± 0.548
1.58GlnTyr: 1.58 ± 0.555
0.0GlnXaa: 0.0 ± 0.0
Arg
1.58ArgAla: 1.58 ± 0.608
0.0ArgCys: 0.0 ± 0.0
2.708ArgAsp: 2.708 ± 0.864
4.288ArgGlu: 4.288 ± 1.14
1.58ArgPhe: 1.58 ± 0.628
2.031ArgGly: 2.031 ± 0.719
0.226ArgHis: 0.226 ± 0.204
2.031ArgIle: 2.031 ± 0.738
2.934ArgLys: 2.934 ± 0.934
3.385ArgLeu: 3.385 ± 0.768
0.677ArgMet: 0.677 ± 0.438
1.354ArgAsn: 1.354 ± 0.801
1.354ArgPro: 1.354 ± 0.455
1.805ArgGln: 1.805 ± 0.489
2.257ArgArg: 2.257 ± 0.729
1.354ArgSer: 1.354 ± 0.595
0.677ArgThr: 0.677 ± 0.356
2.257ArgVal: 2.257 ± 0.801
0.903ArgTrp: 0.903 ± 0.522
2.031ArgTyr: 2.031 ± 0.755
0.0ArgXaa: 0.0 ± 0.0
Ser
2.934SerAla: 2.934 ± 0.979
0.451SerCys: 0.451 ± 0.376
4.965SerAsp: 4.965 ± 1.052
5.868SerGlu: 5.868 ± 1.626
2.934SerPhe: 2.934 ± 0.955
5.191SerGly: 5.191 ± 1.431
1.58SerHis: 1.58 ± 0.651
5.191SerIle: 5.191 ± 1.346
2.708SerLys: 2.708 ± 0.82
5.642SerLeu: 5.642 ± 1.069
2.934SerMet: 2.934 ± 0.858
4.288SerAsn: 4.288 ± 1.279
1.805SerPro: 1.805 ± 0.603
3.837SerGln: 3.837 ± 1.123
2.257SerArg: 2.257 ± 0.844
2.483SerSer: 2.483 ± 0.637
4.062SerThr: 4.062 ± 0.855
6.545SerVal: 6.545 ± 1.728
0.451SerTrp: 0.451 ± 0.363
3.611SerTyr: 3.611 ± 0.778
0.0SerXaa: 0.0 ± 0.0
Thr
5.642ThrAla: 5.642 ± 0.977
0.903ThrCys: 0.903 ± 0.591
4.062ThrAsp: 4.062 ± 0.954
4.288ThrGlu: 4.288 ± 1.055
2.708ThrPhe: 2.708 ± 0.632
4.514ThrGly: 4.514 ± 1.114
0.903ThrHis: 0.903 ± 0.438
6.545ThrIle: 6.545 ± 1.554
3.16ThrLys: 3.16 ± 0.826
6.093ThrLeu: 6.093 ± 1.122
1.354ThrMet: 1.354 ± 0.602
4.288ThrAsn: 4.288 ± 1.314
1.58ThrPro: 1.58 ± 0.484
1.128ThrGln: 1.128 ± 0.473
2.031ThrArg: 2.031 ± 0.803
2.934ThrSer: 2.934 ± 0.736
5.642ThrThr: 5.642 ± 1.962
6.093ThrVal: 6.093 ± 1.584
0.451ThrTrp: 0.451 ± 0.269
2.934ThrTyr: 2.934 ± 0.596
0.0ThrXaa: 0.0 ± 0.0
Val
2.934ValAla: 2.934 ± 0.754
1.805ValCys: 1.805 ± 0.59
2.934ValAsp: 2.934 ± 1.126
3.385ValGlu: 3.385 ± 0.991
2.934ValPhe: 2.934 ± 0.957
4.739ValGly: 4.739 ± 1.016
0.677ValHis: 0.677 ± 0.373
3.16ValIle: 3.16 ± 1.001
5.191ValLys: 5.191 ± 1.151
6.77ValLeu: 6.77 ± 1.509
2.483ValMet: 2.483 ± 0.894
2.934ValAsn: 2.934 ± 0.807
2.483ValPro: 2.483 ± 0.618
2.031ValGln: 2.031 ± 0.778
2.483ValArg: 2.483 ± 0.893
7.222ValSer: 7.222 ± 1.692
6.093ValThr: 6.093 ± 1.392
5.868ValVal: 5.868 ± 1.509
1.354ValTrp: 1.354 ± 0.579
1.805ValTyr: 1.805 ± 0.485
0.0ValXaa: 0.0 ± 0.0
Trp
2.031TrpAla: 2.031 ± 0.786
0.226TrpCys: 0.226 ± 0.204
0.903TrpAsp: 0.903 ± 0.352
1.354TrpGlu: 1.354 ± 0.481
0.677TrpPhe: 0.677 ± 0.359
0.0TrpGly: 0.0 ± 0.0
0.677TrpHis: 0.677 ± 0.363
2.031TrpIle: 2.031 ± 0.956
1.128TrpLys: 1.128 ± 0.562
1.805TrpLeu: 1.805 ± 0.514
0.677TrpMet: 0.677 ± 0.375
1.128TrpAsn: 1.128 ± 0.499
0.903TrpPro: 0.903 ± 0.438
2.031TrpGln: 2.031 ± 0.549
0.677TrpArg: 0.677 ± 0.412
0.903TrpSer: 0.903 ± 0.53
0.451TrpThr: 0.451 ± 0.359
0.903TrpVal: 0.903 ± 0.527
0.226TrpTrp: 0.226 ± 0.25
0.226TrpTyr: 0.226 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.934TyrAla: 2.934 ± 0.91
1.128TyrCys: 1.128 ± 0.658
2.483TyrAsp: 2.483 ± 0.969
2.257TyrGlu: 2.257 ± 0.551
0.903TyrPhe: 0.903 ± 0.506
2.483TyrGly: 2.483 ± 0.585
0.903TyrHis: 0.903 ± 0.456
2.257TyrIle: 2.257 ± 0.711
2.934TyrLys: 2.934 ± 0.825
3.611TyrLeu: 3.611 ± 0.836
1.805TyrMet: 1.805 ± 0.642
1.805TyrAsn: 1.805 ± 0.563
1.58TyrPro: 1.58 ± 0.637
1.128TyrGln: 1.128 ± 0.477
2.031TyrArg: 2.031 ± 0.944
3.385TyrSer: 3.385 ± 1.089
3.16TyrThr: 3.16 ± 0.649
1.58TyrVal: 1.58 ± 0.493
1.128TyrTrp: 1.128 ± 0.467
2.257TyrTyr: 2.257 ± 0.864
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 35 proteins (4432 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
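A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names, the toy sequences, the fixed random seed, and the use of the sample standard deviation as the reported error are all hypothetical choices, since the page does not describe the procedure in more detail.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread
    (sample standard deviation) of the per mille estimates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the His1V proteome.
proteins = ["MKTAYIAKQR", "MAAK", "GGLLKA"]
print(per_mille(proteins, "AK"), "+/-", bootstrap_error(proteins, "AK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide that never occurs in the proteome gets an error of exactly zero, as in the 0.0 ± 0.0 entries of the table.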

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski