Amino acid dipeptide frequency for Halorubrum pleomorphic virus 1 (HRPV-1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
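The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_per_mille` and `classify` are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein and that the red/blue threshold is the uniform expectation of 2.5‰ for the 400 standard dipeptides.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

For example, `classify(8.893)` labels the AlaAla value from the table as overrepresented, while `classify(0.445)` labels CysGlu as underrepresented.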

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
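As a small worked example of this unit conversion, the following sketch turns a per mille value into a plain fraction or a percentage. The helper names are hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 8.893 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(8.893)
ala_ala_percent = per_mille_to_percent(8.893)
```

Here the AlaAla value of 8.893‰ corresponds to a fraction of about 0.008893, or roughly 0.89%, compared with the uniform expectation of 0.25%.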

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.893 ± 2.434
AlaCys: 1.334 ± 0.712
AlaAsp: 7.114 ± 1.385
AlaGlu: 3.112 ± 1.437
AlaPhe: 1.334 ± 0.976
AlaGly: 7.559 ± 1.255
AlaHis: 0.889 ± 0.782
AlaIle: 3.112 ± 1.642
AlaLys: 3.112 ± 1.236
AlaLeu: 7.114 ± 1.478
AlaMet: 1.779 ± 0.752
AlaAsn: 2.668 ± 1.311
AlaPro: 2.668 ± 1.005
AlaGln: 2.223 ± 1.282
AlaArg: 6.225 ± 1.528
AlaSer: 8.893 ± 1.71
AlaThr: 4.446 ± 1.671
AlaVal: 7.114 ± 1.685
AlaTrp: 2.223 ± 0.701
AlaTyr: 4.002 ± 1.191
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.889 ± 0.439
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.445 ± 0.391
CysPhe: 0.445 ± 0.391
CysGly: 0.889 ± 0.573
CysHis: 0.445 ± 0.428
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.445 ± 0.501
CysMet: 0.445 ± 0.53
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.445 ± 0.474
CysSer: 1.334 ± 0.808
CysThr: 0.0 ± 0.0
CysVal: 0.445 ± 0.391
CysTrp: 0.0 ± 0.0
CysTyr: 0.445 ± 0.391
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.557 ± 1.119
AspCys: 0.445 ± 0.53
AspAsp: 5.336 ± 1.691
AspGlu: 4.891 ± 1.839
AspPhe: 2.223 ± 0.884
AspGly: 5.336 ± 1.414
AspHis: 0.445 ± 0.391
AspIle: 0.889 ± 0.558
AspLys: 0.889 ± 0.782
AspLeu: 9.782 ± 3.881
AspMet: 3.557 ± 1.126
AspAsn: 4.002 ± 0.859
AspPro: 2.668 ± 1.119
AspGln: 0.445 ± 0.428
AspArg: 7.114 ± 1.482
AspSer: 7.559 ± 1.587
AspThr: 5.78 ± 1.879
AspVal: 5.336 ± 1.204
AspTrp: 0.445 ± 0.391
AspTyr: 3.112 ± 0.878
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.225 ± 1.361
GluCys: 0.889 ± 0.474
GluAsp: 2.668 ± 0.875
GluGlu: 3.557 ± 2.116
GluPhe: 3.112 ± 1.076
GluGly: 5.336 ± 1.112
GluHis: 0.889 ± 0.782
GluIle: 4.002 ± 1.113
GluLys: 3.557 ± 1.141
GluLeu: 5.336 ± 0.922
GluMet: 0.889 ± 0.782
GluAsn: 4.002 ± 1.772
GluPro: 3.557 ± 0.914
GluGln: 3.557 ± 1.129
GluArg: 2.223 ± 1.163
GluSer: 5.336 ± 1.281
GluThr: 3.112 ± 0.615
GluVal: 3.557 ± 1.534
GluTrp: 1.779 ± 0.953
GluTyr: 2.223 ± 0.777
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.112 ± 0.939
PheCys: 0.445 ± 0.501
PheAsp: 3.112 ± 1.268
PheGlu: 2.668 ± 1.171
PhePhe: 0.445 ± 0.33
PheGly: 2.668 ± 0.618
PheHis: 0.445 ± 0.391
PheIle: 1.334 ± 0.437
PheLys: 0.0 ± 0.0
PheLeu: 2.223 ± 0.784
PheMet: 1.334 ± 0.857
PheAsn: 0.889 ± 0.588
PhePro: 0.445 ± 0.391
PheGln: 0.889 ± 0.558
PheArg: 2.223 ± 0.738
PheSer: 0.889 ± 0.681
PheThr: 2.668 ± 0.683
PheVal: 2.668 ± 1.228
PheTrp: 0.889 ± 0.521
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.446 ± 1.021
GlyCys: 0.0 ± 0.0
GlyAsp: 2.668 ± 1.221
GlyGlu: 6.225 ± 1.908
GlyPhe: 0.889 ± 0.474
GlyGly: 7.114 ± 2.478
GlyHis: 0.445 ± 0.428
GlyIle: 3.557 ± 1.226
GlyLys: 3.557 ± 1.037
GlyLeu: 7.559 ± 0.863
GlyMet: 2.668 ± 0.973
GlyAsn: 1.334 ± 0.745
GlyPro: 4.446 ± 1.292
GlyGln: 1.779 ± 1.036
GlyArg: 1.779 ± 0.653
GlySer: 8.448 ± 2.083
GlyThr: 5.336 ± 1.093
GlyVal: 6.225 ± 1.39
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.779 ± 0.834
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.779 ± 1.265
HisCys: 0.0 ± 0.0
HisAsp: 1.779 ± 1.061
HisGlu: 0.445 ± 0.391
HisPhe: 0.889 ± 0.521
HisGly: 1.334 ± 0.825
HisHis: 0.0 ± 0.0
HisIle: 1.334 ± 0.712
HisLys: 0.445 ± 0.428
HisLeu: 1.779 ± 1.265
HisMet: 0.0 ± 0.0
HisAsn: 0.445 ± 0.391
HisPro: 0.889 ± 0.607
HisGln: 0.445 ± 0.428
HisArg: 0.445 ± 0.391
HisSer: 0.889 ± 0.596
HisThr: 2.668 ± 0.682
HisVal: 0.445 ± 0.391
HisTrp: 0.0 ± 0.0
HisTyr: 1.334 ± 0.712
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.67 ± 1.503
IleCys: 0.0 ± 0.0
IleAsp: 4.891 ± 1.183
IleGlu: 3.112 ± 0.768
IlePhe: 0.0 ± 0.0
IleGly: 3.557 ± 1.541
IleHis: 1.334 ± 0.789
IleIle: 1.334 ± 0.711
IleLys: 1.779 ± 0.599
IleLeu: 2.223 ± 1.111
IleMet: 0.889 ± 0.427
IleAsn: 1.779 ± 0.772
IlePro: 1.779 ± 0.698
IleGln: 2.223 ± 0.857
IleArg: 1.334 ± 0.885
IleSer: 4.002 ± 1.362
IleThr: 2.668 ± 2.191
IleVal: 2.668 ± 1.017
IleTrp: 0.0 ± 0.0
IleTyr: 0.445 ± 0.33
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.223 ± 1.056
LysCys: 0.0 ± 0.0
LysAsp: 4.446 ± 1.645
LysGlu: 0.0 ± 0.0
LysPhe: 0.445 ± 0.53
LysGly: 1.779 ± 0.596
LysHis: 1.334 ± 0.825
LysIle: 0.889 ± 0.573
LysLys: 0.445 ± 0.523
LysLeu: 3.557 ± 1.166
LysMet: 0.445 ± 0.428
LysAsn: 1.779 ± 0.937
LysPro: 1.334 ± 1.089
LysGln: 0.889 ± 0.472
LysArg: 2.668 ± 1.022
LysSer: 2.668 ± 0.818
LysThr: 4.002 ± 0.738
LysVal: 2.223 ± 0.724
LysTrp: 1.334 ± 0.508
LysTyr: 1.779 ± 0.854
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.337 ± 1.486
LeuCys: 0.445 ± 0.391
LeuAsp: 7.559 ± 1.607
LeuGlu: 10.671 ± 3.77
LeuPhe: 2.668 ± 1.132
LeuGly: 5.336 ± 1.614
LeuHis: 1.334 ± 0.712
LeuIle: 4.446 ± 1.437
LeuLys: 2.223 ± 1.116
LeuLeu: 13.784 ± 3.791
LeuMet: 3.112 ± 1.173
LeuAsn: 2.223 ± 0.763
LeuPro: 3.112 ± 1.212
LeuGln: 2.223 ± 0.608
LeuArg: 5.336 ± 1.441
LeuSer: 8.448 ± 1.741
LeuThr: 4.446 ± 1.957
LeuVal: 4.446 ± 2.047
LeuTrp: 1.334 ± 0.551
LeuTyr: 3.557 ± 1.712
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.779 ± 0.954
MetCys: 0.0 ± 0.0
MetAsp: 1.334 ± 0.448
MetGlu: 0.889 ± 0.57
MetPhe: 0.889 ± 0.66
MetGly: 1.334 ± 0.707
MetHis: 0.0 ± 0.0
MetIle: 1.334 ± 0.716
MetLys: 1.334 ± 0.655
MetLeu: 2.668 ± 1.318
MetMet: 1.334 ± 0.759
MetAsn: 1.779 ± 0.594
MetPro: 1.334 ± 0.712
MetGln: 0.445 ± 0.474
MetArg: 0.0 ± 0.0
MetSer: 3.112 ± 0.989
MetThr: 2.223 ± 0.523
MetVal: 1.779 ± 0.632
MetTrp: 0.889 ± 0.573
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.002 ± 1.301
AsnCys: 0.0 ± 0.0
AsnAsp: 3.557 ± 1.479
AsnGlu: 0.445 ± 0.33
AsnPhe: 3.112 ± 1.125
AsnGly: 2.223 ± 0.523
AsnHis: 0.445 ± 0.391
AsnIle: 0.445 ± 0.33
AsnLys: 1.334 ± 0.745
AsnLeu: 4.002 ± 1.266
AsnMet: 0.0 ± 0.0
AsnAsn: 3.557 ± 0.852
AsnPro: 3.557 ± 1.125
AsnGln: 2.223 ± 0.934
AsnArg: 1.779 ± 0.806
AsnSer: 5.336 ± 1.528
AsnThr: 4.446 ± 1.681
AsnVal: 2.223 ± 1.039
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.223 ± 0.951
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.336 ± 1.419
ProCys: 0.445 ± 0.391
ProAsp: 4.891 ± 1.235
ProGlu: 2.668 ± 1.336
ProPhe: 1.334 ± 1.054
ProGly: 1.779 ± 0.698
ProHis: 0.445 ± 0.391
ProIle: 1.779 ± 1.106
ProLys: 2.223 ± 0.822
ProLeu: 2.668 ± 1.518
ProMet: 0.0 ± 0.0
ProAsn: 2.668 ± 0.987
ProPro: 1.334 ± 0.639
ProGln: 0.445 ± 0.391
ProArg: 0.889 ± 0.734
ProSer: 3.557 ± 1.033
ProThr: 2.223 ± 1.007
ProVal: 2.223 ± 0.997
ProTrp: 0.445 ± 0.428
ProTyr: 2.223 ± 1.155
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.334 ± 0.752
GlnCys: 0.0 ± 0.0
GlnAsp: 2.223 ± 0.98
GlnGlu: 1.334 ± 0.448
GlnPhe: 2.668 ± 1.039
GlnGly: 0.889 ± 0.599
GlnHis: 1.334 ± 0.766
GlnIle: 1.334 ± 0.658
GlnLys: 0.445 ± 0.33
GlnLeu: 2.668 ± 1.449
GlnMet: 0.445 ± 0.33
GlnAsn: 2.223 ± 0.961
GlnPro: 1.779 ± 0.596
GlnGln: 0.0 ± 0.0
GlnArg: 0.889 ± 0.427
GlnSer: 3.112 ± 0.928
GlnThr: 3.557 ± 1.041
GlnVal: 2.223 ± 1.38
GlnTrp: 1.334 ± 0.448
GlnTyr: 0.445 ± 0.33
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.891 ± 1.08
ArgCys: 0.0 ± 0.0
ArgAsp: 1.779 ± 1.555
ArgGlu: 3.112 ± 0.965
ArgPhe: 1.779 ± 1.132
ArgGly: 4.891 ± 1.757
ArgHis: 1.334 ± 0.774
ArgIle: 4.446 ± 0.662
ArgLys: 0.889 ± 0.521
ArgLeu: 5.336 ± 1.551
ArgMet: 1.779 ± 0.706
ArgAsn: 1.779 ± 0.826
ArgPro: 0.445 ± 0.428
ArgGln: 2.223 ± 0.657
ArgArg: 2.223 ± 0.903
ArgSer: 4.002 ± 0.797
ArgThr: 2.223 ± 0.701
ArgVal: 2.223 ± 1.095
ArgTrp: 1.779 ± 0.806
ArgTyr: 2.668 ± 1.526
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.225 ± 1.157
SerCys: 1.779 ± 0.632
SerAsp: 6.67 ± 1.487
SerGlu: 10.671 ± 2.362
SerPhe: 2.223 ± 1.11
SerGly: 6.67 ± 2.272
SerHis: 2.223 ± 1.116
SerIle: 4.002 ± 1.24
SerLys: 4.891 ± 1.442
SerLeu: 8.004 ± 1.944
SerMet: 1.779 ± 0.982
SerAsn: 4.891 ± 1.566
SerPro: 4.002 ± 0.883
SerGln: 2.223 ± 0.737
SerArg: 5.78 ± 1.565
SerSer: 11.561 ± 4.263
SerThr: 5.336 ± 1.421
SerVal: 7.559 ± 2.246
SerTrp: 2.223 ± 1.155
SerTyr: 1.779 ± 0.632
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.446 ± 2.788
ThrCys: 0.0 ± 0.0
ThrAsp: 4.446 ± 0.899
ThrGlu: 4.446 ± 1.846
ThrPhe: 1.779 ± 0.988
ThrGly: 4.002 ± 0.858
ThrHis: 1.779 ± 0.914
ThrIle: 2.668 ± 1.52
ThrLys: 3.112 ± 1.284
ThrLeu: 6.67 ± 1.658
ThrMet: 1.334 ± 0.437
ThrAsn: 3.112 ± 0.898
ThrPro: 3.112 ± 1.186
ThrGln: 1.779 ± 0.737
ThrArg: 1.334 ± 1.173
ThrSer: 7.114 ± 1.873
ThrThr: 8.004 ± 1.738
ThrVal: 7.559 ± 1.894
ThrTrp: 1.779 ± 0.718
ThrTyr: 1.334 ± 0.655
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.891 ± 1.494
ValCys: 0.889 ± 0.573
ValAsp: 5.336 ± 1.065
ValGlu: 4.446 ± 1.48
ValPhe: 3.112 ± 0.757
ValGly: 5.78 ± 2.48
ValHis: 0.889 ± 0.427
ValIle: 3.557 ± 1.699
ValLys: 1.779 ± 0.599
ValLeu: 4.891 ± 1.282
ValMet: 1.779 ± 1.114
ValAsn: 2.668 ± 1.636
ValPro: 2.668 ± 1.074
ValGln: 2.668 ± 1.423
ValArg: 4.891 ± 2.03
ValSer: 5.78 ± 1.324
ValThr: 4.446 ± 1.566
ValVal: 6.67 ± 3.12
ValTrp: 0.445 ± 0.428
ValTyr: 1.334 ± 0.56
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.668 ± 1.35
TrpCys: 0.0 ± 0.0
TrpAsp: 0.889 ± 0.521
TrpGlu: 0.445 ± 0.391
TrpPhe: 0.0 ± 0.0
TrpGly: 0.445 ± 0.441
TrpHis: 0.0 ± 0.0
TrpIle: 1.334 ± 0.745
TrpLys: 0.889 ± 0.427
TrpLeu: 2.668 ± 1.046
TrpMet: 0.445 ± 0.33
TrpAsn: 0.889 ± 0.521
TrpPro: 0.445 ± 0.428
TrpGln: 1.779 ± 0.574
TrpArg: 0.889 ± 0.474
TrpSer: 4.002 ± 1.049
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.557 ± 1.326
TyrCys: 0.0 ± 0.0
TyrAsp: 3.112 ± 0.768
TyrGlu: 2.223 ± 0.748
TyrPhe: 0.0 ± 0.0
TyrGly: 1.334 ± 0.54
TyrHis: 1.334 ± 0.89
TyrIle: 0.889 ± 0.598
TyrLys: 1.334 ± 0.759
TyrLeu: 2.668 ± 1.388
TyrMet: 0.0 ± 0.0
TyrAsn: 2.223 ± 1.056
TyrPro: 0.0 ± 0.0
TyrGln: 1.779 ± 0.914
TyrArg: 1.334 ± 0.665
TyrSer: 4.446 ± 1.483
TyrThr: 2.223 ± 1.342
TyrVal: 1.334 ± 0.665
TyrTrp: 0.889 ± 0.782
TyrTyr: 1.334 ± 0.665
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2250 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
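A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, the error is taken as the population standard deviation across resamples, and dipeptides are counted as overlapping pairs within each protein.

```python
import random
import statistics
from collections import Counter


def dipeptide_fractions(proteins):
    """Return {dipeptide: fraction} over overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_resamples=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Each resample draws len(proteins) proteins with replacement; the error
    is the standard deviation of the frequency across resamples (assumed).
    `proteins` must be a non-empty list of sequences.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    samples = []
    for _ in range(n_resamples):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(dipeptide_fractions(resampled))
    # A dipeptide absent from a resample contributes a frequency of 0.
    pairs = set().union(*samples)
    return {
        pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
        for pair in pairs
    }
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins, which is large when a proteome contains only a handful of them.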

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski