Amino acid dipeptide frequency for Halomonas phage QHHSV-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
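As an illustration, the calculation can be sketched in a few lines of Python. This is a minimal sketch, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the use of one-letter codes, the overlapping-window counting, and the normalization by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and frequencies are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence; pairs never
        # span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input (one-letter codes), not taken from the phage proteome.
toy = ["MAAL", "AAK"]
freqs = dipeptide_per_mille(toy)
```

For the toy input, five dipeptides are counted in total (MA, AA, AL, AA, AK), so `freqs["AA"]` is two fifths of 1000.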

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
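The comparison against the uniform expectation can be sketched as follows. The helper `classify` and the exact threshold rule are assumptions for illustration; the page does not state how the red and blue marking is computed. The three example values are taken from the table below.

```python
N_STANDARD = 20
# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)

def classify(freq_per_mille, expected=expected_per_mille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"

# Values (per mille) copied from the table for this proteome.
examples = {"AlaAla": 13.556, "CysCys": 0.092, "GluSer": 2.49}
labels = {name: classify(value) for name, value in examples.items()}

# Per mille to plain fraction: multiply by 10^-3.
ala_ala_fraction = examples["AlaAla"] * 1e-3
```

Under this assumed rule, AlaAla is overrepresented, while CysCys and GluSer (just below 2.5‰) are underrepresented.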

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
13.556AlaAla: 13.556 ± 1.383
1.752AlaCys: 1.752 ± 0.449
5.256AlaAsp: 5.256 ± 0.844
8.115AlaGlu: 8.115 ± 0.92
2.951AlaPhe: 2.951 ± 0.461
8.945AlaGly: 8.945 ± 0.836
1.937AlaHis: 1.937 ± 0.362
5.533AlaIle: 5.533 ± 0.571
4.703AlaLys: 4.703 ± 0.673
11.527AlaLeu: 11.527 ± 1.146
4.334AlaMet: 4.334 ± 0.705
3.043AlaAsn: 3.043 ± 0.472
5.164AlaPro: 5.164 ± 0.968
5.902AlaGln: 5.902 ± 0.671
9.959AlaArg: 9.959 ± 1.221
6.64AlaSer: 6.64 ± 0.821
8.115AlaThr: 8.115 ± 1.065
7.008AlaVal: 7.008 ± 0.629
2.029AlaTrp: 2.029 ± 0.508
2.305AlaTyr: 2.305 ± 0.391
0.0AlaXaa: 0.0 ± 0.0
Cys
0.922CysAla: 0.922 ± 0.311
0.092CysCys: 0.092 ± 0.08
0.646CysAsp: 0.646 ± 0.263
0.646CysGlu: 0.646 ± 0.259
0.369CysPhe: 0.369 ± 0.185
1.291CysGly: 1.291 ± 0.464
0.369CysHis: 0.369 ± 0.179
0.277CysIle: 0.277 ± 0.127
0.646CysLys: 0.646 ± 0.218
0.277CysLeu: 0.277 ± 0.153
0.184CysMet: 0.184 ± 0.133
0.277CysAsn: 0.277 ± 0.151
0.646CysPro: 0.646 ± 0.311
0.369CysGln: 0.369 ± 0.181
0.553CysArg: 0.553 ± 0.219
0.738CysSer: 0.738 ± 0.212
0.738CysThr: 0.738 ± 0.275
0.553CysVal: 0.553 ± 0.201
0.092CysTrp: 0.092 ± 0.074
0.277CysTyr: 0.277 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
7.931AspAla: 7.931 ± 1.012
1.383AspCys: 1.383 ± 0.368
5.81AspAsp: 5.81 ± 0.827
5.349AspGlu: 5.349 ± 0.688
1.107AspPhe: 1.107 ± 0.305
6.916AspGly: 6.916 ± 0.949
1.383AspHis: 1.383 ± 0.326
3.135AspIle: 3.135 ± 0.55
1.937AspLys: 1.937 ± 0.476
5.441AspLeu: 5.441 ± 0.638
1.107AspMet: 1.107 ± 0.277
1.475AspAsn: 1.475 ± 0.338
4.058AspPro: 4.058 ± 0.763
2.121AspGln: 2.121 ± 0.473
3.689AspArg: 3.689 ± 0.521
3.32AspSer: 3.32 ± 0.587
3.043AspThr: 3.043 ± 0.463
3.228AspVal: 3.228 ± 0.62
1.291AspTrp: 1.291 ± 0.324
1.568AspTyr: 1.568 ± 0.379
0.0AspXaa: 0.0 ± 0.0
Glu
8.115GluAla: 8.115 ± 0.828
0.277GluCys: 0.277 ± 0.138
4.242GluAsp: 4.242 ± 0.52
4.98GluGlu: 4.98 ± 0.795
2.49GluPhe: 2.49 ± 0.536
5.994GluGly: 5.994 ± 0.864
2.029GluHis: 2.029 ± 0.405
2.674GluIle: 2.674 ± 0.571
0.83GluLys: 0.83 ± 0.287
8.3GluLeu: 8.3 ± 0.823
1.937GluMet: 1.937 ± 0.486
1.937GluAsn: 1.937 ± 0.396
3.135GluPro: 3.135 ± 0.562
2.951GluGln: 2.951 ± 0.59
6.824GluArg: 6.824 ± 0.703
2.49GluSer: 2.49 ± 0.377
4.611GluThr: 4.611 ± 0.599
5.625GluVal: 5.625 ± 0.96
1.752GluTrp: 1.752 ± 0.432
1.014GluTyr: 1.014 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
3.504PheAla: 3.504 ± 0.544
0.277PheCys: 0.277 ± 0.159
2.398PheAsp: 2.398 ± 0.402
1.383PheGlu: 1.383 ± 0.316
0.369PhePhe: 0.369 ± 0.168
2.213PheGly: 2.213 ± 0.461
1.014PheHis: 1.014 ± 0.289
0.922PheIle: 0.922 ± 0.298
0.461PheLys: 0.461 ± 0.204
2.029PheLeu: 2.029 ± 0.493
0.738PheMet: 0.738 ± 0.263
0.553PheAsn: 0.553 ± 0.199
0.922PhePro: 0.922 ± 0.312
0.83PheGln: 0.83 ± 0.273
1.568PheArg: 1.568 ± 0.316
1.199PheSer: 1.199 ± 0.32
1.383PheThr: 1.383 ± 0.287
2.49PheVal: 2.49 ± 0.485
0.277PheTrp: 0.277 ± 0.158
0.369PheTyr: 0.369 ± 0.184
0.0PheXaa: 0.0 ± 0.0
Gly
8.023GlyAla: 8.023 ± 0.917
0.738GlyCys: 0.738 ± 0.237
5.256GlyAsp: 5.256 ± 0.618
5.625GlyGlu: 5.625 ± 0.625
2.398GlyPhe: 2.398 ± 0.464
6.916GlyGly: 6.916 ± 0.866
2.121GlyHis: 2.121 ± 0.508
3.412GlyIle: 3.412 ± 0.485
3.32GlyLys: 3.32 ± 0.707
6.732GlyLeu: 6.732 ± 0.818
2.213GlyMet: 2.213 ± 0.502
2.582GlyAsn: 2.582 ± 0.441
2.121GlyPro: 2.121 ± 0.541
3.965GlyGln: 3.965 ± 0.583
6.363GlyArg: 6.363 ± 0.68
5.533GlySer: 5.533 ± 0.659
4.795GlyThr: 4.795 ± 0.753
4.611GlyVal: 4.611 ± 0.672
2.582GlyTrp: 2.582 ± 0.399
1.199GlyTyr: 1.199 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
2.305HisAla: 2.305 ± 0.484
0.277HisCys: 0.277 ± 0.156
2.121HisAsp: 2.121 ± 0.324
2.213HisGlu: 2.213 ± 0.456
0.277HisPhe: 0.277 ± 0.158
1.844HisGly: 1.844 ± 0.385
0.738HisHis: 0.738 ± 0.253
1.291HisIle: 1.291 ± 0.379
0.277HisLys: 0.277 ± 0.18
1.107HisLeu: 1.107 ± 0.345
0.553HisMet: 0.553 ± 0.203
0.738HisAsn: 0.738 ± 0.25
1.568HisPro: 1.568 ± 0.447
0.646HisGln: 0.646 ± 0.224
1.752HisArg: 1.752 ± 0.338
1.107HisSer: 1.107 ± 0.343
1.291HisThr: 1.291 ± 0.326
0.922HisVal: 0.922 ± 0.339
0.184HisTrp: 0.184 ± 0.126
1.199HisTyr: 1.199 ± 0.314
0.0HisXaa: 0.0 ± 0.0
Ile
6.086IleAla: 6.086 ± 0.837
0.369IleCys: 0.369 ± 0.173
3.689IleAsp: 3.689 ± 0.561
3.596IleGlu: 3.596 ± 0.589
0.553IlePhe: 0.553 ± 0.213
3.596IleGly: 3.596 ± 0.573
0.738IleHis: 0.738 ± 0.262
1.937IleIle: 1.937 ± 0.396
1.568IleLys: 1.568 ± 0.327
2.305IleLeu: 2.305 ± 0.53
0.83IleMet: 0.83 ± 0.276
1.383IleAsn: 1.383 ± 0.29
1.568IlePro: 1.568 ± 0.392
2.029IleGln: 2.029 ± 0.395
3.32IleArg: 3.32 ± 0.542
2.582IleSer: 2.582 ± 0.585
4.242IleThr: 4.242 ± 0.67
2.305IleVal: 2.305 ± 0.432
0.922IleTrp: 0.922 ± 0.325
0.83IleTyr: 0.83 ± 0.255
0.0IleXaa: 0.0 ± 0.0
Lys
3.504LysAla: 3.504 ± 0.701
0.184LysCys: 0.184 ± 0.124
1.568LysAsp: 1.568 ± 0.425
2.398LysGlu: 2.398 ± 0.48
0.553LysPhe: 0.553 ± 0.313
2.49LysGly: 2.49 ± 0.349
1.107LysHis: 1.107 ± 0.273
0.83LysIle: 0.83 ± 0.274
1.199LysLys: 1.199 ± 0.283
1.844LysLeu: 1.844 ± 0.396
0.738LysMet: 0.738 ± 0.268
0.83LysAsn: 0.83 ± 0.279
1.568LysPro: 1.568 ± 0.353
2.121LysGln: 2.121 ± 0.407
3.043LysArg: 3.043 ± 0.519
1.937LysSer: 1.937 ± 0.415
1.937LysThr: 1.937 ± 0.533
2.121LysVal: 2.121 ± 0.397
0.461LysTrp: 0.461 ± 0.184
0.646LysTyr: 0.646 ± 0.267
0.0LysXaa: 0.0 ± 0.0
Leu
11.619LeuAla: 11.619 ± 0.977
1.199LeuCys: 1.199 ± 0.359
7.193LeuAsp: 7.193 ± 0.755
7.008LeuGlu: 7.008 ± 0.742
1.844LeuPhe: 1.844 ± 0.364
5.441LeuGly: 5.441 ± 0.743
1.291LeuHis: 1.291 ± 0.368
4.15LeuIle: 4.15 ± 0.589
2.859LeuLys: 2.859 ± 0.411
5.164LeuLeu: 5.164 ± 0.746
1.568LeuMet: 1.568 ± 0.341
1.475LeuAsn: 1.475 ± 0.416
4.334LeuPro: 4.334 ± 0.566
2.951LeuGln: 2.951 ± 0.514
5.81LeuArg: 5.81 ± 0.607
5.533LeuSer: 5.533 ± 0.676
4.98LeuThr: 4.98 ± 0.655
5.902LeuVal: 5.902 ± 0.754
1.014LeuTrp: 1.014 ± 0.314
2.305LeuTyr: 2.305 ± 0.443
0.0LeuXaa: 0.0 ± 0.0
Met
3.043MetAla: 3.043 ± 0.596
0.0MetCys: 0.0 ± 0.0
1.107MetAsp: 1.107 ± 0.289
1.383MetGlu: 1.383 ± 0.323
0.277MetPhe: 0.277 ± 0.152
1.568MetGly: 1.568 ± 0.357
0.553MetHis: 0.553 ± 0.207
1.383MetIle: 1.383 ± 0.346
0.83MetLys: 0.83 ± 0.373
1.568MetLeu: 1.568 ± 0.373
0.553MetMet: 0.553 ± 0.263
0.369MetAsn: 0.369 ± 0.162
1.568MetPro: 1.568 ± 0.356
0.83MetGln: 0.83 ± 0.277
2.49MetArg: 2.49 ± 0.478
1.844MetSer: 1.844 ± 0.43
2.859MetThr: 2.859 ± 0.492
1.107MetVal: 1.107 ± 0.281
0.369MetTrp: 0.369 ± 0.2
0.369MetTyr: 0.369 ± 0.174
0.0MetXaa: 0.0 ± 0.0
Asn
3.043AsnAla: 3.043 ± 0.536
0.092AsnCys: 0.092 ± 0.083
1.475AsnAsp: 1.475 ± 0.347
1.383AsnGlu: 1.383 ± 0.428
0.738AsnPhe: 0.738 ± 0.252
2.674AsnGly: 2.674 ± 0.449
0.738AsnHis: 0.738 ± 0.285
0.738AsnIle: 0.738 ± 0.313
0.922AsnLys: 0.922 ± 0.229
2.398AsnLeu: 2.398 ± 0.361
0.369AsnMet: 0.369 ± 0.185
0.461AsnAsn: 0.461 ± 0.228
1.66AsnPro: 1.66 ± 0.375
0.922AsnGln: 0.922 ± 0.313
1.66AsnArg: 1.66 ± 0.403
1.014AsnSer: 1.014 ± 0.246
1.475AsnThr: 1.475 ± 0.325
1.291AsnVal: 1.291 ± 0.29
0.369AsnTrp: 0.369 ± 0.263
0.738AsnTyr: 0.738 ± 0.287
0.0AsnXaa: 0.0 ± 0.0
Pro
7.101ProAla: 7.101 ± 0.977
0.277ProCys: 0.277 ± 0.166
2.859ProAsp: 2.859 ± 0.607
3.965ProGlu: 3.965 ± 0.677
0.553ProPhe: 0.553 ± 0.213
4.519ProGly: 4.519 ± 0.952
0.83ProHis: 0.83 ± 0.279
1.844ProIle: 1.844 ± 0.419
1.383ProLys: 1.383 ± 0.327
3.504ProLeu: 3.504 ± 0.485
0.922ProMet: 0.922 ± 0.315
1.199ProAsn: 1.199 ± 0.3
2.859ProPro: 2.859 ± 0.567
1.014ProGln: 1.014 ± 0.341
2.49ProArg: 2.49 ± 0.402
3.504ProSer: 3.504 ± 0.636
4.242ProThr: 4.242 ± 0.796
3.32ProVal: 3.32 ± 0.559
0.922ProTrp: 0.922 ± 0.281
0.738ProTyr: 0.738 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
6.824GlnAla: 6.824 ± 0.848
0.184GlnCys: 0.184 ± 0.123
2.767GlnAsp: 2.767 ± 0.552
2.951GlnGlu: 2.951 ± 0.55
1.199GlnPhe: 1.199 ± 0.336
3.781GlnGly: 3.781 ± 0.507
1.014GlnHis: 1.014 ± 0.318
1.66GlnIle: 1.66 ± 0.391
1.014GlnLys: 1.014 ± 0.276
4.15GlnLeu: 4.15 ± 0.549
0.646GlnMet: 0.646 ± 0.226
0.922GlnAsn: 0.922 ± 0.315
1.937GlnPro: 1.937 ± 0.438
2.582GlnGln: 2.582 ± 0.601
4.426GlnArg: 4.426 ± 0.614
2.121GlnSer: 2.121 ± 0.511
1.66GlnThr: 1.66 ± 0.338
3.412GlnVal: 3.412 ± 0.516
0.461GlnTrp: 0.461 ± 0.2
0.646GlnTyr: 0.646 ± 0.294
0.0GlnXaa: 0.0 ± 0.0
Arg
9.683ArgAla: 9.683 ± 1.051
0.646ArgCys: 0.646 ± 0.249
3.965ArgAsp: 3.965 ± 0.529
6.086ArgGlu: 6.086 ± 0.895
3.32ArgPhe: 3.32 ± 0.62
5.349ArgGly: 5.349 ± 0.632
2.121ArgHis: 2.121 ± 0.358
3.965ArgIle: 3.965 ± 0.515
2.859ArgLys: 2.859 ± 0.469
7.654ArgLeu: 7.654 ± 0.736
1.844ArgMet: 1.844 ± 0.388
1.937ArgAsn: 1.937 ± 0.403
3.965ArgPro: 3.965 ± 0.489
4.703ArgGln: 4.703 ± 0.863
6.363ArgArg: 6.363 ± 0.839
3.412ArgSer: 3.412 ± 0.59
3.135ArgThr: 3.135 ± 0.453
4.15ArgVal: 4.15 ± 0.48
2.121ArgTrp: 2.121 ± 0.512
2.674ArgTyr: 2.674 ± 0.459
0.0ArgXaa: 0.0 ± 0.0
Ser
6.732SerAla: 6.732 ± 0.858
0.092SerCys: 0.092 ± 0.08
3.781SerAsp: 3.781 ± 0.595
4.058SerGlu: 4.058 ± 0.449
1.291SerPhe: 1.291 ± 0.253
4.334SerGly: 4.334 ± 0.672
0.83SerHis: 0.83 ± 0.246
2.398SerIle: 2.398 ± 0.395
1.568SerLys: 1.568 ± 0.33
5.256SerLeu: 5.256 ± 0.646
1.291SerMet: 1.291 ± 0.341
1.568SerAsn: 1.568 ± 0.324
2.582SerPro: 2.582 ± 0.355
3.043SerGln: 3.043 ± 0.52
5.164SerArg: 5.164 ± 0.606
4.15SerSer: 4.15 ± 0.806
3.412SerThr: 3.412 ± 0.647
3.504SerVal: 3.504 ± 0.638
1.199SerTrp: 1.199 ± 0.429
1.291SerTyr: 1.291 ± 0.282
0.0SerXaa: 0.0 ± 0.0
Thr
6.086ThrAla: 6.086 ± 0.692
0.461ThrCys: 0.461 ± 0.246
4.058ThrAsp: 4.058 ± 0.653
4.058ThrGlu: 4.058 ± 0.591
1.568ThrPhe: 1.568 ± 0.468
5.441ThrGly: 5.441 ± 0.69
1.014ThrHis: 1.014 ± 0.328
4.519ThrIle: 4.519 ± 0.548
1.291ThrLys: 1.291 ± 0.325
5.994ThrLeu: 5.994 ± 0.755
1.107ThrMet: 1.107 ± 0.33
1.014ThrAsn: 1.014 ± 0.293
3.412ThrPro: 3.412 ± 0.499
2.121ThrGln: 2.121 ± 0.389
5.164ThrArg: 5.164 ± 0.567
4.15ThrSer: 4.15 ± 0.501
4.795ThrThr: 4.795 ± 0.785
4.426ThrVal: 4.426 ± 0.568
1.475ThrTrp: 1.475 ± 0.329
1.568ThrTyr: 1.568 ± 0.313
0.0ThrXaa: 0.0 ± 0.0
Val
6.547ValAla: 6.547 ± 0.767
0.461ValCys: 0.461 ± 0.214
4.887ValAsp: 4.887 ± 0.805
5.349ValGlu: 5.349 ± 0.75
1.937ValPhe: 1.937 ± 0.335
4.15ValGly: 4.15 ± 0.678
1.199ValHis: 1.199 ± 0.283
2.767ValIle: 2.767 ± 0.422
1.937ValLys: 1.937 ± 0.423
5.072ValLeu: 5.072 ± 0.72
1.937ValMet: 1.937 ± 0.408
1.66ValAsn: 1.66 ± 0.349
2.859ValPro: 2.859 ± 0.483
2.49ValGln: 2.49 ± 0.347
4.611ValArg: 4.611 ± 0.881
3.873ValSer: 3.873 ± 0.556
4.15ValThr: 4.15 ± 0.665
4.426ValVal: 4.426 ± 0.523
1.199ValTrp: 1.199 ± 0.345
1.291ValTyr: 1.291 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
1.475TrpAla: 1.475 ± 0.362
0.553TrpCys: 0.553 ± 0.237
1.107TrpAsp: 1.107 ± 0.288
1.199TrpGlu: 1.199 ± 0.398
0.83TrpPhe: 0.83 ± 0.275
1.107TrpGly: 1.107 ± 0.333
0.553TrpHis: 0.553 ± 0.219
0.553TrpIle: 0.553 ± 0.219
0.83TrpLys: 0.83 ± 0.239
1.475TrpLeu: 1.475 ± 0.353
0.553TrpMet: 0.553 ± 0.226
0.277TrpAsn: 0.277 ± 0.183
1.475TrpPro: 1.475 ± 0.328
1.66TrpGln: 1.66 ± 0.44
1.66TrpArg: 1.66 ± 0.349
1.199TrpSer: 1.199 ± 0.316
1.014TrpThr: 1.014 ± 0.267
1.199TrpVal: 1.199 ± 0.327
0.369TrpTrp: 0.369 ± 0.236
0.553TrpTyr: 0.553 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.674TyrAla: 2.674 ± 0.604
0.646TyrCys: 0.646 ± 0.225
1.107TyrAsp: 1.107 ± 0.272
0.461TyrGlu: 0.461 ± 0.197
0.277TyrPhe: 0.277 ± 0.156
1.937TyrGly: 1.937 ± 0.399
0.83TyrHis: 0.83 ± 0.289
0.461TyrIle: 0.461 ± 0.206
0.646TyrLys: 0.646 ± 0.279
1.937TyrLeu: 1.937 ± 0.304
0.553TyrMet: 0.553 ± 0.222
0.461TyrAsn: 0.461 ± 0.213
0.646TyrPro: 0.646 ± 0.262
1.107TyrGln: 1.107 ± 0.354
3.043TyrArg: 3.043 ± 0.532
1.199TyrSer: 1.199 ± 0.334
1.844TyrThr: 1.844 ± 0.378
1.199TyrVal: 1.199 ± 0.392
0.553TyrTrp: 0.553 ± 0.18
0.922TyrTyr: 0.922 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (10845 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
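A protein-level bootstrap of this kind could look like the sketch below. The page states only that bootstrapping (x100) at the protein level was used; resampling whole proteins with replacement and reporting the standard deviation of the replicate estimates as the error are assumptions, as are the names `pair_counts` and `bootstrap_error`.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round resamples whole proteins with replacement,
    keeping the number of proteins fixed.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter yields 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy sequences (one-letter codes), not the phage proteome.
mean_aa, err_aa = bootstrap_error(["MAAL", "AAK", "GAAG"], "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.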

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski