Amino acid dipepetide frequency for Haloarcula hispanica pleomorphic virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.899AlaAla: 17.899 ± 1.976
1.167AlaCys: 1.167 ± 0.511
9.339AlaAsp: 9.339 ± 2.157
3.502AlaGlu: 3.502 ± 1.568
2.724AlaPhe: 2.724 ± 1.121
12.84AlaGly: 12.84 ± 1.641
2.335AlaHis: 2.335 ± 0.65
3.502AlaIle: 3.502 ± 0.718
3.113AlaLys: 3.113 ± 0.809
6.226AlaLeu: 6.226 ± 1.645
2.335AlaMet: 2.335 ± 0.538
3.113AlaAsn: 3.113 ± 0.713
3.891AlaPro: 3.891 ± 1.717
2.335AlaGln: 2.335 ± 1.158
5.837AlaArg: 5.837 ± 1.882
5.837AlaSer: 5.837 ± 1.111
7.004AlaThr: 7.004 ± 1.111
7.004AlaVal: 7.004 ± 2.081
0.778AlaTrp: 0.778 ± 0.321
1.556AlaTyr: 1.556 ± 0.588
0.0AlaXaa: 0.0 ± 0.0
Cys
1.556CysAla: 1.556 ± 0.831
0.389CysCys: 0.389 ± 0.281
2.335CysAsp: 2.335 ± 1.332
0.389CysGlu: 0.389 ± 0.281
0.0CysPhe: 0.0 ± 0.0
1.167CysGly: 1.167 ± 0.599
0.389CysHis: 0.389 ± 0.331
0.0CysIle: 0.0 ± 0.0
0.778CysLys: 0.778 ± 0.563
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.167CysPro: 1.167 ± 0.844
0.389CysGln: 0.389 ± 0.281
0.778CysArg: 0.778 ± 0.446
2.724CysSer: 2.724 ± 1.171
0.778CysThr: 0.778 ± 0.563
1.556CysVal: 1.556 ± 0.851
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.393AspAla: 7.393 ± 1.764
1.167AspCys: 1.167 ± 0.599
7.782AspAsp: 7.782 ± 2.584
9.728AspGlu: 9.728 ± 2.194
1.556AspPhe: 1.556 ± 0.859
8.171AspGly: 8.171 ± 1.682
1.556AspHis: 1.556 ± 0.788
1.556AspIle: 1.556 ± 0.579
0.778AspLys: 0.778 ± 0.321
8.56AspLeu: 8.56 ± 2.715
1.556AspMet: 1.556 ± 0.491
1.556AspAsn: 1.556 ± 0.586
5.447AspPro: 5.447 ± 1.067
1.556AspGln: 1.556 ± 0.641
6.226AspArg: 6.226 ± 2.315
5.447AspSer: 5.447 ± 1.475
4.28AspThr: 4.28 ± 0.801
6.615AspVal: 6.615 ± 1.303
1.556AspTrp: 1.556 ± 0.851
2.724AspTyr: 2.724 ± 0.667
0.0AspXaa: 0.0 ± 0.0
Glu
2.335GluAla: 2.335 ± 0.668
2.724GluCys: 2.724 ± 1.236
6.615GluAsp: 6.615 ± 1.222
5.837GluGlu: 5.837 ± 1.691
2.335GluPhe: 2.335 ± 0.807
2.724GluGly: 2.724 ± 0.883
2.724GluHis: 2.724 ± 1.66
1.556GluIle: 1.556 ± 0.743
5.447GluLys: 5.447 ± 0.953
3.891GluLeu: 3.891 ± 1.52
1.556GluMet: 1.556 ± 0.478
1.556GluAsn: 1.556 ± 0.597
5.058GluPro: 5.058 ± 1.378
5.058GluGln: 5.058 ± 0.972
7.393GluArg: 7.393 ± 1.915
3.891GluSer: 3.891 ± 0.976
5.837GluThr: 5.837 ± 1.199
7.004GluVal: 7.004 ± 1.355
1.167GluTrp: 1.167 ± 0.844
2.724GluTyr: 2.724 ± 0.869
0.0GluXaa: 0.0 ± 0.0
Phe
4.669PheAla: 4.669 ± 2.08
0.0PheCys: 0.0 ± 0.0
2.335PheAsp: 2.335 ± 0.876
1.556PheGlu: 1.556 ± 0.836
1.556PhePhe: 1.556 ± 0.935
1.946PheGly: 1.946 ± 0.65
0.778PheHis: 0.778 ± 0.433
0.0PheIle: 0.0 ± 0.0
1.556PheLys: 1.556 ± 0.787
1.946PheLeu: 1.946 ± 1.23
0.389PheMet: 0.389 ± 0.485
0.389PheAsn: 0.389 ± 0.354
0.778PhePro: 0.778 ± 0.524
1.167PheGln: 1.167 ± 1.062
0.778PheArg: 0.778 ± 0.476
1.946PheSer: 1.946 ± 0.806
2.335PheThr: 2.335 ± 0.869
4.28PheVal: 4.28 ± 1.17
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
7.004GlyAla: 7.004 ± 1.25
1.167GlyCys: 1.167 ± 0.594
8.56GlyAsp: 8.56 ± 2.057
8.949GlyGlu: 8.949 ± 1.785
2.335GlyPhe: 2.335 ± 0.908
8.171GlyGly: 8.171 ± 2.206
0.0GlyHis: 0.0 ± 0.0
2.724GlyIle: 2.724 ± 1.22
2.724GlyLys: 2.724 ± 0.574
3.891GlyLeu: 3.891 ± 0.906
0.778GlyMet: 0.778 ± 0.505
3.891GlyAsn: 3.891 ± 1.402
2.724GlyPro: 2.724 ± 1.242
0.778GlyGln: 0.778 ± 0.549
3.113GlyArg: 3.113 ± 1.117
6.615GlySer: 6.615 ± 1.331
4.669GlyThr: 4.669 ± 1.068
5.447GlyVal: 5.447 ± 1.364
0.778GlyTrp: 0.778 ± 0.548
3.113GlyTyr: 3.113 ± 0.746
0.0GlyXaa: 0.0 ± 0.0
His
2.335HisAla: 2.335 ± 0.736
0.778HisCys: 0.778 ± 0.433
1.556HisAsp: 1.556 ± 0.849
1.946HisGlu: 1.946 ± 0.834
0.389HisPhe: 0.389 ± 0.331
1.946HisGly: 1.946 ± 1.086
1.556HisHis: 1.556 ± 0.646
0.0HisIle: 0.0 ± 0.0
1.167HisLys: 1.167 ± 0.6
1.167HisLeu: 1.167 ± 0.671
0.0HisMet: 0.0 ± 0.0
0.389HisAsn: 0.389 ± 0.417
0.389HisPro: 0.389 ± 0.331
0.0HisGln: 0.0 ± 0.0
0.389HisArg: 0.389 ± 0.417
0.778HisSer: 0.778 ± 0.321
1.556HisThr: 1.556 ± 0.471
1.946HisVal: 1.946 ± 0.69
0.389HisTrp: 0.389 ± 0.281
0.389HisTyr: 0.389 ± 0.354
0.0HisXaa: 0.0 ± 0.0
Ile
3.891IleAla: 3.891 ± 1.322
0.389IleCys: 0.389 ± 0.331
2.724IleAsp: 2.724 ± 1.001
2.335IleGlu: 2.335 ± 0.85
0.778IlePhe: 0.778 ± 0.515
1.167IleGly: 1.167 ± 1.038
0.0IleHis: 0.0 ± 0.0
1.167IleIle: 1.167 ± 0.989
1.556IleLys: 1.556 ± 1.113
2.335IleLeu: 2.335 ± 1.062
0.0IleMet: 0.0 ± 0.0
0.778IleAsn: 0.778 ± 0.533
3.891IlePro: 3.891 ± 1.701
1.167IleGln: 1.167 ± 0.614
1.946IleArg: 1.946 ± 0.863
0.778IleSer: 0.778 ± 0.321
2.724IleThr: 2.724 ± 1.195
2.724IleVal: 2.724 ± 0.952
0.389IleTrp: 0.389 ± 0.45
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.113LysAla: 3.113 ± 0.96
0.0LysCys: 0.0 ± 0.0
1.167LysAsp: 1.167 ± 0.599
2.724LysGlu: 2.724 ± 1.136
0.778LysPhe: 0.778 ± 0.394
1.946LysGly: 1.946 ± 0.693
1.556LysHis: 1.556 ± 0.729
1.167LysIle: 1.167 ± 0.641
0.778LysLys: 0.778 ± 0.321
3.113LysLeu: 3.113 ± 0.579
0.389LysMet: 0.389 ± 0.429
1.556LysAsn: 1.556 ± 0.682
1.167LysPro: 1.167 ± 0.802
1.167LysGln: 1.167 ± 0.621
1.946LysArg: 1.946 ± 0.974
2.335LysSer: 2.335 ± 0.836
1.167LysThr: 1.167 ± 0.702
3.113LysVal: 3.113 ± 0.887
0.389LysTrp: 0.389 ± 0.429
0.389LysTyr: 0.389 ± 0.354
0.0LysXaa: 0.0 ± 0.0
Leu
5.837LeuAla: 5.837 ± 1.871
0.778LeuCys: 0.778 ± 0.476
7.782LeuAsp: 7.782 ± 2.191
7.782LeuGlu: 7.782 ± 1.3
1.946LeuPhe: 1.946 ± 0.904
5.837LeuGly: 5.837 ± 1.136
1.167LeuHis: 1.167 ± 0.346
1.946LeuIle: 1.946 ± 0.79
1.946LeuLys: 1.946 ± 0.689
8.949LeuLeu: 8.949 ± 2.285
1.556LeuMet: 1.556 ± 0.777
1.556LeuAsn: 1.556 ± 0.648
4.28LeuPro: 4.28 ± 1.049
0.778LeuGln: 0.778 ± 0.576
7.393LeuArg: 7.393 ± 1.494
4.669LeuSer: 4.669 ± 2.093
2.724LeuThr: 2.724 ± 0.848
6.226LeuVal: 6.226 ± 2.46
0.778LeuTrp: 0.778 ± 0.834
1.556LeuTyr: 1.556 ± 0.533
0.0LeuXaa: 0.0 ± 0.0
Met
2.724MetAla: 2.724 ± 1.494
0.0MetCys: 0.0 ± 0.0
0.389MetAsp: 0.389 ± 0.331
0.389MetGlu: 0.389 ± 0.281
0.0MetPhe: 0.0 ± 0.0
0.778MetGly: 0.778 ± 0.708
0.0MetHis: 0.0 ± 0.0
0.389MetIle: 0.389 ± 0.281
0.389MetLys: 0.389 ± 0.354
1.946MetLeu: 1.946 ± 0.652
0.778MetMet: 0.778 ± 0.548
0.778MetAsn: 0.778 ± 0.576
0.389MetPro: 0.389 ± 0.485
1.556MetGln: 1.556 ± 1.096
0.389MetArg: 0.389 ± 0.281
3.113MetSer: 3.113 ± 0.657
3.113MetThr: 3.113 ± 1.247
0.778MetVal: 0.778 ± 0.476
0.778MetTrp: 0.778 ± 0.563
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.113AsnAla: 3.113 ± 1.36
0.0AsnCys: 0.0 ± 0.0
1.556AsnAsp: 1.556 ± 0.744
1.556AsnGlu: 1.556 ± 0.683
0.778AsnPhe: 0.778 ± 0.591
2.724AsnGly: 2.724 ± 0.798
0.778AsnHis: 0.778 ± 0.321
1.167AsnIle: 1.167 ± 0.779
0.389AsnLys: 0.389 ± 0.354
1.946AsnLeu: 1.946 ± 0.835
0.778AsnMet: 0.778 ± 0.662
1.946AsnAsn: 1.946 ± 0.96
1.556AsnPro: 1.556 ± 0.916
1.556AsnGln: 1.556 ± 0.905
1.167AsnArg: 1.167 ± 0.671
2.335AsnSer: 2.335 ± 0.962
2.335AsnThr: 2.335 ± 1.229
2.335AsnVal: 2.335 ± 0.412
0.0AsnTrp: 0.0 ± 0.0
1.556AsnTyr: 1.556 ± 0.849
0.0AsnXaa: 0.0 ± 0.0
Pro
3.113ProAla: 3.113 ± 1.163
0.0ProCys: 0.0 ± 0.0
5.447ProAsp: 5.447 ± 1.887
3.113ProGlu: 3.113 ± 1.3
1.556ProPhe: 1.556 ± 0.649
1.556ProGly: 1.556 ± 0.646
0.778ProHis: 0.778 ± 0.433
0.778ProIle: 0.778 ± 0.321
1.167ProLys: 1.167 ± 0.798
3.891ProLeu: 3.891 ± 1.029
1.167ProMet: 1.167 ± 0.346
1.556ProAsn: 1.556 ± 0.845
1.946ProPro: 1.946 ± 1.086
0.389ProGln: 0.389 ± 0.281
3.891ProArg: 3.891 ± 1.641
5.447ProSer: 5.447 ± 1.599
2.335ProThr: 2.335 ± 0.84
6.226ProVal: 6.226 ± 2.123
0.778ProTrp: 0.778 ± 0.591
1.946ProTyr: 1.946 ± 0.703
0.0ProXaa: 0.0 ± 0.0
Gln
1.167GlnAla: 1.167 ± 0.802
0.0GlnCys: 0.0 ± 0.0
1.167GlnAsp: 1.167 ± 0.346
1.946GlnGlu: 1.946 ± 1.063
1.556GlnPhe: 1.556 ± 0.914
1.556GlnGly: 1.556 ± 0.492
0.778GlnHis: 0.778 ± 0.549
1.556GlnIle: 1.556 ± 0.747
1.946GlnLys: 1.946 ± 0.689
0.389GlnLeu: 0.389 ± 0.354
0.0GlnMet: 0.0 ± 0.0
1.167GlnAsn: 1.167 ± 0.702
1.167GlnPro: 1.167 ± 0.867
1.167GlnGln: 1.167 ± 0.824
1.556GlnArg: 1.556 ± 0.779
3.891GlnSer: 3.891 ± 0.957
2.724GlnThr: 2.724 ± 1.305
1.167GlnVal: 1.167 ± 0.614
0.778GlnTrp: 0.778 ± 0.394
2.335GlnTyr: 2.335 ± 0.923
0.0GlnXaa: 0.0 ± 0.0
Arg
7.782ArgAla: 7.782 ± 2.311
2.335ArgCys: 2.335 ± 1.351
4.669ArgAsp: 4.669 ± 1.043
4.28ArgGlu: 4.28 ± 1.478
1.556ArgPhe: 1.556 ± 1.011
2.724ArgGly: 2.724 ± 1.291
0.0ArgHis: 0.0 ± 0.0
1.167ArgIle: 1.167 ± 1.079
0.778ArgLys: 0.778 ± 0.612
4.669ArgLeu: 4.669 ± 1.65
1.167ArgMet: 1.167 ± 0.74
1.556ArgAsn: 1.556 ± 0.502
2.724ArgPro: 2.724 ± 1.319
0.778ArgGln: 0.778 ± 0.321
5.837ArgArg: 5.837 ± 1.695
7.782ArgSer: 7.782 ± 2.022
5.058ArgThr: 5.058 ± 1.025
5.447ArgVal: 5.447 ± 1.319
0.778ArgTrp: 0.778 ± 0.563
2.335ArgTyr: 2.335 ± 1.043
0.0ArgXaa: 0.0 ± 0.0
Ser
8.171SerAla: 8.171 ± 1.602
1.167SerCys: 1.167 ± 0.594
7.393SerAsp: 7.393 ± 1.471
7.004SerGlu: 7.004 ± 1.236
3.891SerPhe: 3.891 ± 1.343
6.226SerGly: 6.226 ± 1.187
1.167SerHis: 1.167 ± 0.511
3.113SerIle: 3.113 ± 1.312
1.946SerLys: 1.946 ± 0.806
7.393SerLeu: 7.393 ± 1.849
0.778SerMet: 0.778 ± 0.505
2.335SerAsn: 2.335 ± 0.711
1.556SerPro: 1.556 ± 0.729
3.113SerGln: 3.113 ± 1.292
4.669SerArg: 4.669 ± 1.964
4.669SerSer: 4.669 ± 0.886
4.669SerThr: 4.669 ± 0.747
6.226SerVal: 6.226 ± 2.234
1.167SerTrp: 1.167 ± 0.672
2.335SerTyr: 2.335 ± 0.644
0.0SerXaa: 0.0 ± 0.0
Thr
8.56ThrAla: 8.56 ± 1.304
0.778ThrCys: 0.778 ± 0.563
4.28ThrAsp: 4.28 ± 1.017
7.782ThrGlu: 7.782 ± 1.064
2.335ThrPhe: 2.335 ± 0.473
5.837ThrGly: 5.837 ± 1.396
1.556ThrHis: 1.556 ± 0.437
2.335ThrIle: 2.335 ± 1.01
0.778ThrLys: 0.778 ± 0.588
4.28ThrLeu: 4.28 ± 0.781
1.946ThrMet: 1.946 ± 0.506
1.946ThrAsn: 1.946 ± 1.101
1.556ThrPro: 1.556 ± 0.98
2.724ThrGln: 2.724 ± 0.759
1.946ThrArg: 1.946 ± 0.684
3.113ThrSer: 3.113 ± 1.594
6.615ThrThr: 6.615 ± 2.985
8.171ThrVal: 8.171 ± 1.565
1.167ThrTrp: 1.167 ± 0.409
1.167ThrTyr: 1.167 ± 0.798
0.0ThrXaa: 0.0 ± 0.0
Val
9.339ValAla: 9.339 ± 1.043
1.167ValCys: 1.167 ± 0.844
7.393ValAsp: 7.393 ± 1.422
5.837ValGlu: 5.837 ± 1.299
1.556ValPhe: 1.556 ± 0.45
7.782ValGly: 7.782 ± 2.316
0.778ValHis: 0.778 ± 0.563
5.447ValIle: 5.447 ± 1.587
2.335ValLys: 2.335 ± 0.729
6.615ValLeu: 6.615 ± 1.683
1.946ValMet: 1.946 ± 0.623
1.946ValAsn: 1.946 ± 0.818
6.226ValPro: 6.226 ± 1.886
0.778ValGln: 0.778 ± 0.663
5.058ValArg: 5.058 ± 1.77
8.56ValSer: 8.56 ± 1.837
5.058ValThr: 5.058 ± 1.126
7.004ValVal: 7.004 ± 1.055
0.778ValTrp: 0.778 ± 0.505
1.167ValTyr: 1.167 ± 1.062
0.0ValXaa: 0.0 ± 0.0
Trp
1.167TrpAla: 1.167 ± 0.488
0.0TrpCys: 0.0 ± 0.0
0.389TrpAsp: 0.389 ± 0.331
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.778TrpGly: 0.778 ± 0.476
0.778TrpHis: 0.778 ± 0.433
0.778TrpIle: 0.778 ± 0.616
0.389TrpLys: 0.389 ± 0.281
1.167TrpLeu: 1.167 ± 0.634
0.778TrpMet: 0.778 ± 0.48
0.389TrpAsn: 0.389 ± 0.331
0.778TrpPro: 0.778 ± 0.394
0.389TrpGln: 0.389 ± 0.331
1.556TrpArg: 1.556 ± 0.891
1.167TrpSer: 1.167 ± 0.47
0.778TrpThr: 0.778 ± 0.563
1.167TrpVal: 1.167 ± 0.6
0.389TrpTrp: 0.389 ± 0.417
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.556TyrAla: 1.556 ± 0.588
0.389TyrCys: 0.389 ± 0.281
2.335TyrAsp: 2.335 ± 1.084
0.778TyrGlu: 0.778 ± 0.523
0.778TyrPhe: 0.778 ± 0.708
1.556TyrGly: 1.556 ± 0.682
0.389TyrHis: 0.389 ± 0.281
0.389TyrIle: 0.389 ± 0.354
0.0TyrLys: 0.0 ± 0.0
3.113TyrLeu: 3.113 ± 0.928
0.389TyrMet: 0.389 ± 0.354
1.167TyrAsn: 1.167 ± 0.642
0.389TyrPro: 0.389 ± 0.417
1.167TyrGln: 1.167 ± 0.672
1.556TyrArg: 1.556 ± 0.698
3.891TyrSer: 3.891 ± 1.962
3.113TyrThr: 3.113 ± 1.148
2.335TyrVal: 2.335 ± 0.894
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2571 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski