Amino acid dipeptide frequency for Haloarcula hispanica virus SH1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
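The calculation described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes one-letter sequences, overlapping windows of width two that never span two proteins, and normalization by the number of windows. The function names and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping windows."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein separately,
        # so no dipeptide is formed across a protein boundary.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, n_residues=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 pairs)."""
    expected = 1000.0 / n_residues ** 2
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the SH1 proteome.
toy = ["AAGD", "DDA"]
freqs = dipeptide_per_mille(toy)
print(freqs)

# Values from the table: AlaAla (13.901) and AlaCys (0.302).
print(classify(13.901), classify(0.302))
```

With 20 residues the threshold is 2.5‰; passing `n_residues=21` would use the 441-pair expectation instead.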

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
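As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper name is hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(13.901)  # AlaAla entry from the table
uniform = 1 / 400                        # expectation if all 400 dipeptides were equally likely
fold = ala_ala / uniform                 # how many times more frequent than expected

print(f"fraction={ala_ala:.6f}, fold over uniform={fold:.2f}")
```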

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.901 ± 2.355
AlaCys: 0.302 ± 0.171
AlaAsp: 12.39 ± 1.698
AlaGlu: 7.857 ± 0.947
AlaPhe: 4.634 ± 0.872
AlaGly: 8.865 ± 1.851
AlaHis: 2.015 ± 0.461
AlaIle: 2.216 ± 0.512
AlaLys: 2.115 ± 0.517
AlaLeu: 7.354 ± 0.879
AlaMet: 2.619 ± 0.637
AlaAsn: 1.612 ± 0.504
AlaPro: 3.727 ± 0.692
AlaGln: 3.022 ± 0.698
AlaArg: 7.757 ± 0.964
AlaSer: 4.936 ± 1.028
AlaThr: 6.346 ± 0.789
AlaVal: 9.771 ± 0.896
AlaTrp: 2.72 ± 0.726
AlaTyr: 2.518 ± 0.505
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.403 ± 0.173
CysCys: 0.101 ± 0.106
CysAsp: 1.209 ± 0.437
CysGlu: 0.604 ± 0.267
CysPhe: 0.101 ± 0.107
CysGly: 1.41 ± 0.606
CysHis: 0.201 ± 0.214
CysIle: 0.201 ± 0.142
CysLys: 0.101 ± 0.106
CysLeu: 0.604 ± 0.215
CysMet: 0.201 ± 0.211
CysAsn: 0.101 ± 0.095
CysPro: 1.209 ± 0.437
CysGln: 0.604 ± 0.32
CysArg: 0.907 ± 0.402
CysSer: 1.209 ± 0.427
CysThr: 0.201 ± 0.132
CysVal: 0.101 ± 0.116
CysTrp: 0.302 ± 0.263
CysTyr: 0.101 ± 0.104
CysXaa: 0.0 ± 0.0
Asp
AspAla: 10.98 ± 1.797
AspCys: 1.007 ± 0.389
AspAsp: 15.413 ± 2.163
AspGlu: 9.872 ± 1.301
AspPhe: 3.324 ± 0.726
AspGly: 11.988 ± 1.457
AspHis: 1.41 ± 0.355
AspIle: 3.123 ± 0.64
AspLys: 1.209 ± 0.429
AspLeu: 6.749 ± 1.006
AspMet: 0.806 ± 0.264
AspAsn: 1.914 ± 0.343
AspPro: 7.152 ± 0.761
AspGln: 2.921 ± 0.501
AspArg: 5.44 ± 0.854
AspSer: 6.044 ± 0.772
AspThr: 5.238 ± 0.806
AspVal: 6.346 ± 0.94
AspTrp: 1.31 ± 0.455
AspTyr: 2.821 ± 0.519
AspXaa: 0.0 ± 0.0
Glu
GluAla: 10.879 ± 1.361
GluCys: 1.007 ± 0.333
GluAsp: 7.354 ± 0.949
GluGlu: 8.764 ± 1.445
GluPhe: 1.914 ± 0.365
GluGly: 7.253 ± 0.891
GluHis: 2.821 ± 0.655
GluIle: 1.713 ± 0.464
GluLys: 2.518 ± 0.489
GluLeu: 3.727 ± 0.663
GluMet: 1.612 ± 0.515
GluAsn: 2.821 ± 0.467
GluPro: 2.518 ± 0.525
GluGln: 3.123 ± 0.396
GluArg: 6.145 ± 0.964
GluSer: 2.921 ± 0.471
GluThr: 4.634 ± 0.768
GluVal: 6.649 ± 0.708
GluTrp: 1.31 ± 0.362
GluTyr: 2.72 ± 0.568
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.123 ± 0.602
PheCys: 0.201 ± 0.144
PheAsp: 3.425 ± 0.655
PheGlu: 3.324 ± 0.59
PhePhe: 0.907 ± 0.346
PheGly: 3.224 ± 0.623
PheHis: 0.403 ± 0.178
PheIle: 0.806 ± 0.283
PheLys: 0.604 ± 0.26
PheLeu: 1.209 ± 0.29
PheMet: 0.403 ± 0.229
PheAsn: 1.41 ± 0.451
PhePro: 0.604 ± 0.255
PheGln: 0.604 ± 0.325
PheArg: 1.108 ± 0.361
PheSer: 1.511 ± 0.494
PheThr: 1.813 ± 0.538
PheVal: 2.115 ± 0.613
PheTrp: 0.504 ± 0.219
PheTyr: 0.705 ± 0.281
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.764 ± 2.086
GlyCys: 1.007 ± 0.366
GlyAsp: 10.577 ± 1.345
GlyGlu: 5.037 ± 0.778
GlyPhe: 2.216 ± 0.457
GlyGly: 12.995 ± 1.862
GlyHis: 1.31 ± 0.32
GlyIle: 2.921 ± 0.616
GlyLys: 3.425 ± 0.684
GlyLeu: 5.843 ± 0.999
GlyMet: 1.209 ± 0.34
GlyAsn: 2.015 ± 0.501
GlyPro: 5.943 ± 1.183
GlyGln: 4.332 ± 0.67
GlyArg: 6.649 ± 0.764
GlySer: 5.44 ± 1.002
GlyThr: 5.54 ± 0.924
GlyVal: 8.462 ± 1.193
GlyTrp: 1.31 ± 0.394
GlyTyr: 3.626 ± 0.707
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.108 ± 0.383
HisCys: 0.0 ± 0.0
HisAsp: 1.914 ± 0.396
HisGlu: 1.209 ± 0.337
HisPhe: 0.403 ± 0.194
HisGly: 2.015 ± 0.54
HisHis: 0.201 ± 0.122
HisIle: 0.806 ± 0.352
HisLys: 0.201 ± 0.109
HisLeu: 1.511 ± 0.432
HisMet: 0.0 ± 0.0
HisAsn: 0.604 ± 0.241
HisPro: 1.41 ± 0.414
HisGln: 0.604 ± 0.272
HisArg: 1.612 ± 0.426
HisSer: 0.302 ± 0.169
HisThr: 1.41 ± 0.318
HisVal: 1.41 ± 0.381
HisTrp: 0.201 ± 0.141
HisTyr: 0.705 ± 0.271
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.921 ± 0.558
IleCys: 0.101 ± 0.095
IleAsp: 2.317 ± 0.526
IleGlu: 2.216 ± 0.369
IlePhe: 0.504 ± 0.198
IleGly: 2.518 ± 0.45
IleHis: 0.403 ± 0.279
IleIle: 0.705 ± 0.266
IleLys: 0.302 ± 0.182
IleLeu: 1.007 ± 0.381
IleMet: 0.403 ± 0.164
IleAsn: 0.806 ± 0.277
IlePro: 1.813 ± 0.456
IleGln: 0.907 ± 0.312
IleArg: 1.612 ± 0.4
IleSer: 1.612 ± 0.507
IleThr: 1.31 ± 0.417
IleVal: 2.518 ± 0.686
IleTrp: 0.201 ± 0.161
IleTyr: 1.007 ± 0.297
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.626 ± 0.626
LysCys: 0.302 ± 0.2
LysAsp: 2.015 ± 0.464
LysGlu: 1.007 ± 0.256
LysPhe: 0.403 ± 0.195
LysGly: 1.108 ± 0.3
LysHis: 0.705 ± 0.254
LysIle: 0.201 ± 0.137
LysLys: 0.705 ± 0.251
LysLeu: 1.914 ± 0.464
LysMet: 0.302 ± 0.179
LysAsn: 1.007 ± 0.318
LysPro: 1.007 ± 0.346
LysGln: 1.108 ± 0.304
LysArg: 2.015 ± 0.455
LysSer: 1.209 ± 0.327
LysThr: 1.41 ± 0.317
LysVal: 1.813 ± 0.366
LysTrp: 0.101 ± 0.108
LysTyr: 0.604 ± 0.203
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.454 ± 0.798
LeuCys: 0.604 ± 0.272
LeuAsp: 5.641 ± 0.633
LeuGlu: 4.936 ± 1.144
LeuPhe: 1.31 ± 0.318
LeuGly: 6.346 ± 0.876
LeuHis: 0.806 ± 0.352
LeuIle: 1.108 ± 0.337
LeuLys: 1.813 ± 0.399
LeuLeu: 6.447 ± 0.997
LeuMet: 1.511 ± 0.475
LeuAsn: 0.907 ± 0.257
LeuPro: 3.022 ± 0.694
LeuGln: 1.813 ± 0.567
LeuArg: 7.454 ± 1.108
LeuSer: 4.231 ± 0.624
LeuThr: 4.835 ± 0.694
LeuVal: 4.029 ± 0.604
LeuTrp: 0.302 ± 0.171
LeuTyr: 2.015 ± 0.492
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.115 ± 0.543
MetCys: 0.302 ± 0.201
MetAsp: 1.31 ± 0.371
MetGlu: 0.806 ± 0.251
MetPhe: 0.403 ± 0.238
MetGly: 1.511 ± 0.26
MetHis: 0.0 ± 0.0
MetIle: 0.504 ± 0.216
MetLys: 0.302 ± 0.186
MetLeu: 0.705 ± 0.26
MetMet: 0.101 ± 0.112
MetAsn: 1.31 ± 0.39
MetPro: 0.705 ± 0.263
MetGln: 0.504 ± 0.213
MetArg: 0.806 ± 0.361
MetSer: 1.41 ± 0.529
MetThr: 1.612 ± 0.475
MetVal: 1.108 ± 0.32
MetTrp: 0.101 ± 0.105
MetTyr: 0.101 ± 0.096
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.115 ± 0.525
AsnCys: 0.403 ± 0.192
AsnAsp: 2.72 ± 0.76
AsnGlu: 1.41 ± 0.354
AsnPhe: 0.403 ± 0.183
AsnGly: 3.526 ± 0.855
AsnHis: 0.504 ± 0.245
AsnIle: 1.108 ± 0.322
AsnLys: 0.604 ± 0.247
AsnLeu: 2.115 ± 0.486
AsnMet: 0.504 ± 0.198
AsnAsn: 0.907 ± 0.354
AsnPro: 2.418 ± 0.518
AsnGln: 1.007 ± 0.359
AsnArg: 1.209 ± 0.373
AsnSer: 1.209 ± 0.481
AsnThr: 1.713 ± 0.386
AsnVal: 1.007 ± 0.295
AsnTrp: 0.907 ± 0.343
AsnTyr: 0.403 ± 0.163
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.929 ± 0.697
ProCys: 0.302 ± 0.216
ProAsp: 6.951 ± 0.98
ProGlu: 5.54 ± 0.959
ProPhe: 1.914 ± 0.345
ProGly: 4.533 ± 1.382
ProHis: 1.007 ± 0.375
ProIle: 1.713 ± 0.399
ProLys: 0.604 ± 0.241
ProLeu: 3.224 ± 0.559
ProMet: 1.007 ± 0.456
ProAsn: 1.612 ± 0.364
ProPro: 3.123 ± 0.745
ProGln: 1.41 ± 0.407
ProArg: 3.022 ± 0.723
ProSer: 2.72 ± 0.803
ProThr: 4.231 ± 0.941
ProVal: 3.324 ± 0.512
ProTrp: 0.403 ± 0.218
ProTyr: 0.806 ± 0.282
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.339 ± 0.793
GlnCys: 0.0 ± 0.0
GlnAsp: 3.425 ± 0.54
GlnGlu: 1.713 ± 0.312
GlnPhe: 1.31 ± 0.49
GlnGly: 3.526 ± 0.627
GlnHis: 0.504 ± 0.264
GlnIle: 1.612 ± 0.448
GlnLys: 1.209 ± 0.32
GlnLeu: 1.612 ± 0.392
GlnMet: 0.806 ± 0.323
GlnAsn: 1.41 ± 0.327
GlnPro: 1.713 ± 0.407
GlnGln: 2.015 ± 0.521
GlnArg: 2.317 ± 0.598
GlnSer: 1.31 ± 0.491
GlnThr: 2.619 ± 0.421
GlnVal: 2.619 ± 0.593
GlnTrp: 0.504 ± 0.227
GlnTyr: 1.209 ± 0.292
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.54 ± 0.751
ArgCys: 1.108 ± 0.389
ArgAsp: 6.85 ± 0.995
ArgGlu: 6.85 ± 0.902
ArgPhe: 2.317 ± 0.546
ArgGly: 4.533 ± 0.77
ArgHis: 0.705 ± 0.268
ArgIle: 1.511 ± 0.343
ArgLys: 2.115 ± 0.42
ArgLeu: 5.943 ± 0.649
ArgMet: 1.41 ± 0.386
ArgAsn: 1.209 ± 0.257
ArgPro: 3.022 ± 0.707
ArgGln: 3.727 ± 0.554
ArgArg: 6.346 ± 1.207
ArgSer: 3.425 ± 0.798
ArgThr: 5.641 ± 0.705
ArgVal: 4.432 ± 0.623
ArgTrp: 1.612 ± 0.538
ArgTyr: 1.914 ± 0.421
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.138 ± 0.818
SerCys: 0.504 ± 0.274
SerAsp: 6.246 ± 1.076
SerGlu: 3.324 ± 0.623
SerPhe: 1.209 ± 0.362
SerGly: 7.354 ± 1.485
SerHis: 0.504 ± 0.211
SerIle: 1.108 ± 0.35
SerLys: 1.31 ± 0.398
SerLeu: 2.821 ± 0.445
SerMet: 0.604 ± 0.231
SerAsn: 1.914 ± 0.39
SerPro: 3.022 ± 0.749
SerGln: 1.914 ± 0.475
SerArg: 3.626 ± 0.75
SerSer: 3.425 ± 0.791
SerThr: 3.626 ± 0.554
SerVal: 4.533 ± 0.704
SerTrp: 0.705 ± 0.248
SerTyr: 1.209 ± 0.354
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.051 ± 1.028
ThrCys: 1.108 ± 0.44
ThrAsp: 6.145 ± 0.779
ThrGlu: 6.145 ± 0.801
ThrPhe: 1.713 ± 0.359
ThrGly: 6.246 ± 0.891
ThrHis: 0.705 ± 0.224
ThrIle: 1.813 ± 0.34
ThrLys: 0.504 ± 0.211
ThrLeu: 4.936 ± 0.623
ThrMet: 0.302 ± 0.168
ThrAsn: 1.813 ± 0.502
ThrPro: 3.324 ± 0.558
ThrGln: 1.713 ± 0.393
ThrArg: 3.224 ± 0.574
ThrSer: 3.425 ± 0.513
ThrThr: 4.634 ± 0.691
ThrVal: 5.742 ± 0.834
ThrTrp: 1.108 ± 0.458
ThrTyr: 2.216 ± 0.522
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.563 ± 0.943
ValCys: 0.604 ± 0.254
ValAsp: 5.843 ± 0.733
ValGlu: 8.563 ± 0.964
ValPhe: 1.511 ± 0.467
ValGly: 6.044 ± 0.859
ValHis: 2.115 ± 0.595
ValIle: 1.41 ± 0.314
ValLys: 1.813 ± 0.383
ValLeu: 4.634 ± 0.81
ValMet: 1.007 ± 0.312
ValAsn: 1.612 ± 0.386
ValPro: 3.425 ± 0.734
ValGln: 3.022 ± 0.727
ValArg: 5.037 ± 0.692
ValSer: 5.742 ± 0.584
ValThr: 4.13 ± 0.644
ValVal: 5.54 ± 0.862
ValTrp: 1.209 ± 0.383
ValTyr: 3.324 ± 0.484
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.713 ± 0.403
TrpCys: 0.504 ± 0.273
TrpAsp: 0.806 ± 0.253
TrpGlu: 1.813 ± 0.385
TrpPhe: 0.504 ± 0.193
TrpGly: 1.108 ± 0.353
TrpHis: 0.705 ± 0.298
TrpIle: 0.201 ± 0.152
TrpLys: 0.705 ± 0.204
TrpLeu: 1.41 ± 0.445
TrpMet: 0.504 ± 0.206
TrpAsn: 0.302 ± 0.196
TrpPro: 0.302 ± 0.173
TrpGln: 0.403 ± 0.193
TrpArg: 0.907 ± 0.266
TrpSer: 0.504 ± 0.266
TrpThr: 1.209 ± 0.362
TrpVal: 1.108 ± 0.317
TrpTrp: 0.101 ± 0.096
TrpTyr: 0.201 ± 0.173
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.015 ± 0.376
TyrCys: 0.403 ± 0.23
TyrAsp: 2.518 ± 0.645
TyrGlu: 1.41 ± 0.339
TyrPhe: 1.007 ± 0.338
TyrGly: 2.518 ± 0.589
TyrHis: 0.604 ± 0.201
TyrIle: 0.403 ± 0.229
TyrLys: 0.403 ± 0.223
TyrLeu: 2.921 ± 0.471
TyrMet: 0.302 ± 0.169
TyrAsn: 0.907 ± 0.307
TyrPro: 1.914 ± 0.454
TyrGln: 2.015 ± 0.546
TyrArg: 3.022 ± 0.599
TyrSer: 1.511 ± 0.395
TyrThr: 1.713 ± 0.457
TyrVal: 2.518 ± 0.545
TyrTrp: 0.101 ± 0.096
TyrTyr: 0.806 ± 0.33
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (9928 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
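A protein-level bootstrap of this kind can be sketched as below. The source states only that the error comes from 100 bootstrap rounds at the protein level; using the standard deviation of the resampled estimates as the reported error, the window-based normalization, and all function names are assumptions made for illustration.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping windows."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Each round draws as many proteins as the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation across rounds.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not taken from SH1.
toy = ["AAAA", "GGGG", "AAGG"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the estimate is made at the protein level.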

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski