Amino acid dipepetide frequency for Achromobacter phage phiAxp-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.795AlaAla: 15.795 ± 2.108
0.935AlaCys: 0.935 ± 0.262
6.703AlaAsp: 6.703 ± 0.606
5.56AlaGlu: 5.56 ± 0.748
3.793AlaPhe: 3.793 ± 0.402
7.638AlaGly: 7.638 ± 0.672
1.195AlaHis: 1.195 ± 0.239
5.04AlaIle: 5.04 ± 0.619
4.313AlaLys: 4.313 ± 0.632
9.56AlaLeu: 9.56 ± 0.952
3.585AlaMet: 3.585 ± 0.365
5.144AlaAsn: 5.144 ± 0.697
5.196AlaPro: 5.196 ± 0.794
5.144AlaGln: 5.144 ± 0.727
5.923AlaArg: 5.923 ± 0.596
5.404AlaSer: 5.404 ± 0.756
6.287AlaThr: 6.287 ± 0.753
7.274AlaVal: 7.274 ± 0.818
1.871AlaTrp: 1.871 ± 0.325
3.221AlaTyr: 3.221 ± 0.39
0.0AlaXaa: 0.0 ± 0.0
Cys
1.039CysAla: 1.039 ± 0.289
0.052CysCys: 0.052 ± 0.051
0.779CysAsp: 0.779 ± 0.231
0.26CysGlu: 0.26 ± 0.121
0.416CysPhe: 0.416 ± 0.141
0.779CysGly: 0.779 ± 0.229
0.208CysHis: 0.208 ± 0.106
0.468CysIle: 0.468 ± 0.121
0.364CysLys: 0.364 ± 0.124
0.52CysLeu: 0.52 ± 0.173
0.52CysMet: 0.52 ± 0.18
0.26CysAsn: 0.26 ± 0.11
0.572CysPro: 0.572 ± 0.172
0.416CysGln: 0.416 ± 0.177
0.572CysArg: 0.572 ± 0.239
0.364CysSer: 0.364 ± 0.14
0.727CysThr: 0.727 ± 0.24
0.883CysVal: 0.883 ± 0.269
0.364CysTrp: 0.364 ± 0.127
0.156CysTyr: 0.156 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
7.066AspAla: 7.066 ± 0.924
0.26AspCys: 0.26 ± 0.106
4.001AspAsp: 4.001 ± 0.481
4.884AspGlu: 4.884 ± 0.703
1.559AspPhe: 1.559 ± 0.252
4.988AspGly: 4.988 ± 0.533
1.403AspHis: 1.403 ± 0.266
3.118AspIle: 3.118 ± 0.425
2.806AspLys: 2.806 ± 0.329
5.04AspLeu: 5.04 ± 0.656
1.715AspMet: 1.715 ± 0.362
1.974AspAsn: 1.974 ± 0.307
2.754AspPro: 2.754 ± 0.394
1.922AspGln: 1.922 ± 0.344
3.066AspArg: 3.066 ± 0.356
2.754AspSer: 2.754 ± 0.376
2.91AspThr: 2.91 ± 0.326
4.728AspVal: 4.728 ± 0.461
1.143AspTrp: 1.143 ± 0.329
2.13AspTyr: 2.13 ± 0.443
0.0AspXaa: 0.0 ± 0.0
Glu
5.715GluAla: 5.715 ± 0.595
0.883GluCys: 0.883 ± 0.275
3.689GluAsp: 3.689 ± 0.633
4.001GluGlu: 4.001 ± 0.428
1.819GluPhe: 1.819 ± 0.295
3.585GluGly: 3.585 ± 0.536
0.831GluHis: 0.831 ± 0.225
2.702GluIle: 2.702 ± 0.344
2.338GluLys: 2.338 ± 0.35
6.079GluLeu: 6.079 ± 0.715
1.767GluMet: 1.767 ± 0.294
2.026GluAsn: 2.026 ± 0.346
2.442GluPro: 2.442 ± 0.398
2.442GluGln: 2.442 ± 0.373
4.624GluArg: 4.624 ± 0.629
3.221GluSer: 3.221 ± 0.38
3.273GluThr: 3.273 ± 0.486
3.689GluVal: 3.689 ± 0.442
1.143GluTrp: 1.143 ± 0.289
2.13GluTyr: 2.13 ± 0.285
0.0GluXaa: 0.0 ± 0.0
Phe
3.014PheAla: 3.014 ± 0.426
0.364PheCys: 0.364 ± 0.154
2.962PheAsp: 2.962 ± 0.531
1.974PheGlu: 1.974 ± 0.272
1.091PhePhe: 1.091 ± 0.226
2.078PheGly: 2.078 ± 0.256
0.468PheHis: 0.468 ± 0.15
2.182PheIle: 2.182 ± 0.252
1.611PheLys: 1.611 ± 0.291
2.702PheLeu: 2.702 ± 0.397
1.039PheMet: 1.039 ± 0.242
2.286PheAsn: 2.286 ± 0.301
1.143PhePro: 1.143 ± 0.167
1.611PheGln: 1.611 ± 0.298
1.767PheArg: 1.767 ± 0.306
1.611PheSer: 1.611 ± 0.307
2.65PheThr: 2.65 ± 0.464
2.078PheVal: 2.078 ± 0.328
0.52PheTrp: 0.52 ± 0.164
1.351PheTyr: 1.351 ± 0.265
0.0PheXaa: 0.0 ± 0.0
Gly
6.183GlyAla: 6.183 ± 0.72
0.675GlyCys: 0.675 ± 0.181
4.572GlyAsp: 4.572 ± 0.447
4.572GlyGlu: 4.572 ± 0.597
2.806GlyPhe: 2.806 ± 0.407
5.404GlyGly: 5.404 ± 0.706
1.039GlyHis: 1.039 ± 0.235
3.897GlyIle: 3.897 ± 0.405
4.261GlyLys: 4.261 ± 0.559
5.3GlyLeu: 5.3 ± 0.473
2.026GlyMet: 2.026 ± 0.349
3.585GlyAsn: 3.585 ± 0.515
2.754GlyPro: 2.754 ± 0.459
3.221GlyGln: 3.221 ± 0.473
5.767GlyArg: 5.767 ± 0.51
4.832GlySer: 4.832 ± 0.767
5.456GlyThr: 5.456 ± 0.641
4.572GlyVal: 4.572 ± 0.47
1.507GlyTrp: 1.507 ± 0.264
3.014GlyTyr: 3.014 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
1.455HisAla: 1.455 ± 0.374
0.416HisCys: 0.416 ± 0.167
1.143HisAsp: 1.143 ± 0.256
0.779HisGlu: 0.779 ± 0.19
0.624HisPhe: 0.624 ± 0.179
1.403HisGly: 1.403 ± 0.347
0.364HisHis: 0.364 ± 0.133
1.143HisIle: 1.143 ± 0.219
0.52HisLys: 0.52 ± 0.226
1.351HisLeu: 1.351 ± 0.353
0.416HisMet: 0.416 ± 0.15
0.727HisAsn: 0.727 ± 0.195
0.779HisPro: 0.779 ± 0.208
0.468HisGln: 0.468 ± 0.179
0.727HisArg: 0.727 ± 0.25
0.779HisSer: 0.779 ± 0.211
0.935HisThr: 0.935 ± 0.23
0.675HisVal: 0.675 ± 0.203
0.364HisTrp: 0.364 ± 0.113
0.883HisTyr: 0.883 ± 0.225
0.0HisXaa: 0.0 ± 0.0
Ile
5.248IleAla: 5.248 ± 0.535
0.416IleCys: 0.416 ± 0.159
3.377IleAsp: 3.377 ± 0.323
3.169IleGlu: 3.169 ± 0.396
1.767IlePhe: 1.767 ± 0.319
3.533IleGly: 3.533 ± 0.436
1.091IleHis: 1.091 ± 0.3
1.871IleIle: 1.871 ± 0.322
2.65IleLys: 2.65 ± 0.311
3.118IleLeu: 3.118 ± 0.365
1.403IleMet: 1.403 ± 0.294
2.442IleAsn: 2.442 ± 0.437
2.546IlePro: 2.546 ± 0.373
1.922IleGln: 1.922 ± 0.366
3.533IleArg: 3.533 ± 0.348
2.65IleSer: 2.65 ± 0.426
3.741IleThr: 3.741 ± 0.467
3.637IleVal: 3.637 ± 0.414
0.468IleTrp: 0.468 ± 0.123
1.143IleTyr: 1.143 ± 0.203
0.0IleXaa: 0.0 ± 0.0
Lys
4.832LysAla: 4.832 ± 0.562
0.312LysCys: 0.312 ± 0.148
1.922LysAsp: 1.922 ± 0.269
2.91LysGlu: 2.91 ± 0.388
1.767LysPhe: 1.767 ± 0.244
3.273LysGly: 3.273 ± 0.447
0.883LysHis: 0.883 ± 0.21
2.39LysIle: 2.39 ± 0.32
1.663LysLys: 1.663 ± 0.317
4.468LysLeu: 4.468 ± 0.579
1.715LysMet: 1.715 ± 0.285
1.871LysAsn: 1.871 ± 0.283
2.806LysPro: 2.806 ± 0.497
1.455LysGln: 1.455 ± 0.255
2.806LysArg: 2.806 ± 0.389
1.767LysSer: 1.767 ± 0.277
2.598LysThr: 2.598 ± 0.341
2.806LysVal: 2.806 ± 0.395
0.883LysTrp: 0.883 ± 0.177
1.195LysTyr: 1.195 ± 0.28
0.0LysXaa: 0.0 ± 0.0
Leu
11.275LeuAla: 11.275 ± 0.941
0.935LeuCys: 0.935 ± 0.293
4.988LeuAsp: 4.988 ± 0.474
4.78LeuGlu: 4.78 ± 0.58
2.39LeuPhe: 2.39 ± 0.447
6.287LeuGly: 6.287 ± 0.764
1.299LeuHis: 1.299 ± 0.309
3.481LeuIle: 3.481 ± 0.421
3.533LeuLys: 3.533 ± 0.643
6.079LeuLeu: 6.079 ± 0.654
2.598LeuMet: 2.598 ± 0.404
2.91LeuAsn: 2.91 ± 0.503
3.949LeuPro: 3.949 ± 0.463
2.338LeuGln: 2.338 ± 0.566
6.651LeuArg: 6.651 ± 0.674
3.949LeuSer: 3.949 ± 0.585
6.027LeuThr: 6.027 ± 0.457
5.664LeuVal: 5.664 ± 0.658
1.039LeuTrp: 1.039 ± 0.226
2.338LeuTyr: 2.338 ± 0.382
0.0LeuXaa: 0.0 ± 0.0
Met
3.481MetAla: 3.481 ± 0.379
0.26MetCys: 0.26 ± 0.116
1.299MetAsp: 1.299 ± 0.242
1.195MetGlu: 1.195 ± 0.304
0.727MetPhe: 0.727 ± 0.191
2.338MetGly: 2.338 ± 0.401
0.624MetHis: 0.624 ± 0.163
0.831MetIle: 0.831 ± 0.163
1.611MetLys: 1.611 ± 0.258
2.286MetLeu: 2.286 ± 0.48
0.831MetMet: 0.831 ± 0.167
1.403MetAsn: 1.403 ± 0.231
1.819MetPro: 1.819 ± 0.379
1.455MetGln: 1.455 ± 0.232
2.234MetArg: 2.234 ± 0.321
2.182MetSer: 2.182 ± 0.395
2.182MetThr: 2.182 ± 0.359
1.819MetVal: 1.819 ± 0.249
0.364MetTrp: 0.364 ± 0.117
0.364MetTyr: 0.364 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
4.468AsnAla: 4.468 ± 0.69
0.364AsnCys: 0.364 ± 0.157
1.663AsnAsp: 1.663 ± 0.246
2.078AsnGlu: 2.078 ± 0.3
1.403AsnPhe: 1.403 ± 0.264
4.468AsnGly: 4.468 ± 0.52
0.52AsnHis: 0.52 ± 0.16
1.715AsnIle: 1.715 ± 0.301
2.026AsnLys: 2.026 ± 0.469
3.845AsnLeu: 3.845 ± 0.41
1.247AsnMet: 1.247 ± 0.252
1.507AsnAsn: 1.507 ± 0.342
2.754AsnPro: 2.754 ± 0.441
1.663AsnGln: 1.663 ± 0.46
2.546AsnArg: 2.546 ± 0.387
1.922AsnSer: 1.922 ± 0.339
2.39AsnThr: 2.39 ± 0.38
2.858AsnVal: 2.858 ± 0.348
0.883AsnTrp: 0.883 ± 0.203
1.715AsnTyr: 1.715 ± 0.353
0.0AsnXaa: 0.0 ± 0.0
Pro
6.183ProAla: 6.183 ± 0.72
0.572ProCys: 0.572 ± 0.185
3.481ProAsp: 3.481 ± 0.452
3.066ProGlu: 3.066 ± 0.419
2.182ProPhe: 2.182 ± 0.379
4.572ProGly: 4.572 ± 0.656
0.675ProHis: 0.675 ± 0.223
2.234ProIle: 2.234 ± 0.365
1.403ProLys: 1.403 ± 0.263
2.806ProLeu: 2.806 ± 0.407
1.299ProMet: 1.299 ± 0.233
2.078ProAsn: 2.078 ± 0.413
2.702ProPro: 2.702 ± 0.481
1.611ProGln: 1.611 ± 0.294
1.974ProArg: 1.974 ± 0.273
2.858ProSer: 2.858 ± 0.471
2.754ProThr: 2.754 ± 0.441
4.417ProVal: 4.417 ± 0.578
0.987ProTrp: 0.987 ± 0.25
1.559ProTyr: 1.559 ± 0.262
0.0ProXaa: 0.0 ± 0.0
Gln
4.053GlnAla: 4.053 ± 0.434
0.675GlnCys: 0.675 ± 0.199
2.182GlnAsp: 2.182 ± 0.343
2.078GlnGlu: 2.078 ± 0.469
1.819GlnPhe: 1.819 ± 0.263
2.442GlnGly: 2.442 ± 0.376
0.364GlnHis: 0.364 ± 0.117
2.598GlnIle: 2.598 ± 0.373
1.455GlnLys: 1.455 ± 0.269
4.001GlnLeu: 4.001 ± 0.609
1.247GlnMet: 1.247 ± 0.242
1.195GlnAsn: 1.195 ± 0.355
2.078GlnPro: 2.078 ± 0.376
2.182GlnGln: 2.182 ± 0.378
3.377GlnArg: 3.377 ± 0.4
2.598GlnSer: 2.598 ± 0.793
2.026GlnThr: 2.026 ± 0.314
3.273GlnVal: 3.273 ± 0.416
0.727GlnTrp: 0.727 ± 0.241
0.831GlnTyr: 0.831 ± 0.211
0.0GlnXaa: 0.0 ± 0.0
Arg
6.391ArgAla: 6.391 ± 0.745
0.364ArgCys: 0.364 ± 0.169
4.053ArgAsp: 4.053 ± 0.471
3.793ArgGlu: 3.793 ± 0.454
2.13ArgPhe: 2.13 ± 0.334
4.209ArgGly: 4.209 ± 0.464
1.455ArgHis: 1.455 ± 0.385
4.468ArgIle: 4.468 ± 0.398
3.014ArgLys: 3.014 ± 0.436
5.715ArgLeu: 5.715 ± 0.415
2.494ArgMet: 2.494 ± 0.397
2.91ArgAsn: 2.91 ± 0.354
2.598ArgPro: 2.598 ± 0.477
2.806ArgGln: 2.806 ± 0.43
3.741ArgArg: 3.741 ± 0.467
2.806ArgSer: 2.806 ± 0.347
2.546ArgThr: 2.546 ± 0.458
5.144ArgVal: 5.144 ± 0.574
1.195ArgTrp: 1.195 ± 0.304
1.819ArgTyr: 1.819 ± 0.332
0.0ArgXaa: 0.0 ± 0.0
Ser
5.923SerAla: 5.923 ± 0.656
0.468SerCys: 0.468 ± 0.187
3.273SerAsp: 3.273 ± 0.419
3.118SerGlu: 3.118 ± 0.394
1.507SerPhe: 1.507 ± 0.298
4.313SerGly: 4.313 ± 0.542
0.935SerHis: 0.935 ± 0.246
2.702SerIle: 2.702 ± 0.38
2.858SerLys: 2.858 ± 0.391
4.676SerLeu: 4.676 ± 0.617
1.091SerMet: 1.091 ± 0.236
2.338SerAsn: 2.338 ± 0.342
2.39SerPro: 2.39 ± 0.448
2.234SerGln: 2.234 ± 0.452
3.118SerArg: 3.118 ± 0.262
2.598SerSer: 2.598 ± 0.462
2.806SerThr: 2.806 ± 0.528
3.793SerVal: 3.793 ± 0.455
1.091SerTrp: 1.091 ± 0.269
1.922SerTyr: 1.922 ± 0.262
0.0SerXaa: 0.0 ± 0.0
Thr
5.56ThrAla: 5.56 ± 1.11
0.468ThrCys: 0.468 ± 0.143
3.221ThrAsp: 3.221 ± 0.439
3.118ThrGlu: 3.118 ± 0.403
2.494ThrPhe: 2.494 ± 0.415
5.612ThrGly: 5.612 ± 0.543
0.624ThrHis: 0.624 ± 0.185
3.273ThrIle: 3.273 ± 0.499
2.598ThrLys: 2.598 ± 0.439
5.456ThrLeu: 5.456 ± 0.699
1.299ThrMet: 1.299 ± 0.248
2.546ThrAsn: 2.546 ± 0.442
3.793ThrPro: 3.793 ± 0.598
2.286ThrGln: 2.286 ± 0.402
3.273ThrArg: 3.273 ± 0.427
3.845ThrSer: 3.845 ± 0.492
4.78ThrThr: 4.78 ± 0.48
4.936ThrVal: 4.936 ± 0.408
0.831ThrTrp: 0.831 ± 0.185
1.767ThrTyr: 1.767 ± 0.39
0.0ThrXaa: 0.0 ± 0.0
Val
7.898ValAla: 7.898 ± 0.733
0.52ValCys: 0.52 ± 0.196
3.793ValAsp: 3.793 ± 0.41
4.468ValGlu: 4.468 ± 0.451
2.39ValPhe: 2.39 ± 0.376
4.78ValGly: 4.78 ± 0.458
1.039ValHis: 1.039 ± 0.24
3.273ValIle: 3.273 ± 0.401
3.793ValLys: 3.793 ± 0.581
5.56ValLeu: 5.56 ± 0.456
1.559ValMet: 1.559 ± 0.284
3.066ValAsn: 3.066 ± 0.378
3.481ValPro: 3.481 ± 0.531
3.429ValGln: 3.429 ± 0.446
4.676ValArg: 4.676 ± 0.519
4.209ValSer: 4.209 ± 0.433
4.78ValThr: 4.78 ± 0.72
5.404ValVal: 5.404 ± 0.598
0.675ValTrp: 0.675 ± 0.208
1.922ValTyr: 1.922 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
1.299TrpAla: 1.299 ± 0.214
0.416TrpCys: 0.416 ± 0.153
0.987TrpAsp: 0.987 ± 0.242
0.987TrpGlu: 0.987 ± 0.24
0.883TrpPhe: 0.883 ± 0.21
0.935TrpGly: 0.935 ± 0.302
0.26TrpHis: 0.26 ± 0.122
0.883TrpIle: 0.883 ± 0.234
0.624TrpLys: 0.624 ± 0.192
1.663TrpLeu: 1.663 ± 0.297
0.364TrpMet: 0.364 ± 0.119
0.779TrpAsn: 0.779 ± 0.181
1.143TrpPro: 1.143 ± 0.25
0.883TrpGln: 0.883 ± 0.215
1.143TrpArg: 1.143 ± 0.277
1.195TrpSer: 1.195 ± 0.245
1.039TrpThr: 1.039 ± 0.283
0.831TrpVal: 0.831 ± 0.218
0.572TrpTrp: 0.572 ± 0.187
0.675TrpTyr: 0.675 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.702TyrAla: 2.702 ± 0.397
0.364TyrCys: 0.364 ± 0.12
2.13TyrAsp: 2.13 ± 0.303
1.611TyrGlu: 1.611 ± 0.256
0.935TyrPhe: 0.935 ± 0.179
2.806TyrGly: 2.806 ± 0.499
0.675TyrHis: 0.675 ± 0.142
1.455TyrIle: 1.455 ± 0.303
0.883TyrLys: 0.883 ± 0.22
2.338TyrLeu: 2.338 ± 0.354
1.039TyrMet: 1.039 ± 0.229
0.987TyrAsn: 0.987 ± 0.221
1.715TyrPro: 1.715 ± 0.299
1.715TyrGln: 1.715 ± 0.321
2.13TyrArg: 2.13 ± 0.361
1.611TyrSer: 1.611 ± 0.272
1.871TyrThr: 1.871 ± 0.433
2.234TyrVal: 2.234 ± 0.401
0.935TyrTrp: 0.935 ± 0.198
0.675TyrTyr: 0.675 ± 0.197
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (19247 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski