Amino acid dipepetide frequency for Acinetobacter phage AM24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.166AlaAla: 3.166 ± 0.514
0.695AlaCys: 0.695 ± 0.162
4.016AlaAsp: 4.016 ± 0.498
4.016AlaGlu: 4.016 ± 0.373
2.201AlaPhe: 2.201 ± 0.333
3.514AlaGly: 3.514 ± 0.344
0.618AlaHis: 0.618 ± 0.156
3.823AlaIle: 3.823 ± 0.423
5.599AlaLys: 5.599 ± 0.516
5.715AlaLeu: 5.715 ± 0.52
1.66AlaMet: 1.66 ± 0.194
2.394AlaAsn: 2.394 ± 0.35
1.081AlaPro: 1.081 ± 0.175
2.471AlaGln: 2.471 ± 0.319
3.205AlaArg: 3.205 ± 0.336
3.359AlaSer: 3.359 ± 0.419
3.436AlaThr: 3.436 ± 0.587
3.823AlaVal: 3.823 ± 0.368
0.927AlaTrp: 0.927 ± 0.171
2.433AlaTyr: 2.433 ± 0.308
0.0AlaXaa: 0.0 ± 0.0
Cys
0.772CysAla: 0.772 ± 0.17
0.077CysCys: 0.077 ± 0.05
0.888CysAsp: 0.888 ± 0.189
0.656CysGlu: 0.656 ± 0.146
0.502CysPhe: 0.502 ± 0.133
0.927CysGly: 0.927 ± 0.178
0.348CysHis: 0.348 ± 0.105
0.772CysIle: 0.772 ± 0.178
0.772CysLys: 0.772 ± 0.204
0.965CysLeu: 0.965 ± 0.206
0.232CysMet: 0.232 ± 0.092
0.386CysAsn: 0.386 ± 0.116
0.309CysPro: 0.309 ± 0.095
0.463CysGln: 0.463 ± 0.17
0.618CysArg: 0.618 ± 0.156
0.811CysSer: 0.811 ± 0.211
0.772CysThr: 0.772 ± 0.155
0.695CysVal: 0.695 ± 0.168
0.348CysTrp: 0.348 ± 0.103
0.849CysTyr: 0.849 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
3.668AspAla: 3.668 ± 0.341
0.849AspCys: 0.849 ± 0.176
4.17AspAsp: 4.17 ± 0.468
4.672AspGlu: 4.672 ± 0.42
3.243AspPhe: 3.243 ± 0.365
4.865AspGly: 4.865 ± 0.438
1.236AspHis: 1.236 ± 0.203
3.861AspIle: 3.861 ± 0.399
4.17AspLys: 4.17 ± 0.433
6.718AspLeu: 6.718 ± 0.548
1.738AspMet: 1.738 ± 0.26
2.626AspAsn: 2.626 ± 0.309
2.124AspPro: 2.124 ± 0.291
2.046AspGln: 2.046 ± 0.31
2.896AspArg: 2.896 ± 0.267
3.166AspSer: 3.166 ± 0.331
3.089AspThr: 3.089 ± 0.346
5.83AspVal: 5.83 ± 0.464
1.197AspTrp: 1.197 ± 0.224
2.703AspTyr: 2.703 ± 0.375
0.0AspXaa: 0.0 ± 0.0
Glu
3.861GluAla: 3.861 ± 0.383
0.927GluCys: 0.927 ± 0.208
4.865GluAsp: 4.865 ± 0.499
6.332GluGlu: 6.332 ± 0.592
2.973GluPhe: 2.973 ± 0.333
5.058GluGly: 5.058 ± 0.449
1.351GluHis: 1.351 ± 0.246
5.406GluIle: 5.406 ± 0.496
5.56GluLys: 5.56 ± 0.487
5.135GluLeu: 5.135 ± 0.458
2.201GluMet: 2.201 ± 0.32
3.321GluAsn: 3.321 ± 0.368
1.467GluPro: 1.467 ± 0.236
3.012GluGln: 3.012 ± 0.347
3.012GluArg: 3.012 ± 0.351
3.359GluSer: 3.359 ± 0.338
3.243GluThr: 3.243 ± 0.35
6.796GluVal: 6.796 ± 0.566
0.965GluTrp: 0.965 ± 0.202
3.05GluTyr: 3.05 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
2.162PheAla: 2.162 ± 0.305
0.502PheCys: 0.502 ± 0.115
3.398PheAsp: 3.398 ± 0.335
4.016PheGlu: 4.016 ± 0.356
1.506PhePhe: 1.506 ± 0.283
2.703PheGly: 2.703 ± 0.323
0.811PheHis: 0.811 ± 0.181
2.934PheIle: 2.934 ± 0.367
3.823PheLys: 3.823 ± 0.445
2.703PheLeu: 2.703 ± 0.33
1.197PheMet: 1.197 ± 0.189
2.124PheAsn: 2.124 ± 0.296
0.888PhePro: 0.888 ± 0.198
1.313PheGln: 1.313 ± 0.202
1.583PheArg: 1.583 ± 0.214
2.819PheSer: 2.819 ± 0.328
2.664PheThr: 2.664 ± 0.306
2.587PheVal: 2.587 ± 0.322
0.27PheTrp: 0.27 ± 0.094
1.506PheTyr: 1.506 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
3.938GlyAla: 3.938 ± 0.463
1.197GlyCys: 1.197 ± 0.218
5.715GlyAsp: 5.715 ± 0.613
4.247GlyGlu: 4.247 ± 0.473
3.128GlyPhe: 3.128 ± 0.341
5.019GlyGly: 5.019 ± 0.584
1.66GlyHis: 1.66 ± 0.254
3.9GlyIle: 3.9 ± 0.316
5.908GlyLys: 5.908 ± 0.447
5.174GlyLeu: 5.174 ± 0.455
2.162GlyMet: 2.162 ± 0.275
3.591GlyAsn: 3.591 ± 0.454
0.695GlyPro: 0.695 ± 0.149
2.008GlyGln: 2.008 ± 0.308
3.552GlyArg: 3.552 ± 0.377
4.44GlySer: 4.44 ± 0.344
4.093GlyThr: 4.093 ± 0.482
5.29GlyVal: 5.29 ± 0.421
1.158GlyTrp: 1.158 ± 0.194
3.745GlyTyr: 3.745 ± 0.457
0.0GlyXaa: 0.0 ± 0.0
His
0.965HisAla: 0.965 ± 0.209
0.309HisCys: 0.309 ± 0.117
0.695HisAsp: 0.695 ± 0.167
1.429HisGlu: 1.429 ± 0.306
0.849HisPhe: 0.849 ± 0.19
1.12HisGly: 1.12 ± 0.253
0.425HisHis: 0.425 ± 0.103
1.158HisIle: 1.158 ± 0.21
1.738HisLys: 1.738 ± 0.254
2.008HisLeu: 2.008 ± 0.3
0.463HisMet: 0.463 ± 0.129
0.927HisAsn: 0.927 ± 0.203
0.734HisPro: 0.734 ± 0.191
0.541HisGln: 0.541 ± 0.12
0.772HisArg: 0.772 ± 0.22
1.197HisSer: 1.197 ± 0.206
1.043HisThr: 1.043 ± 0.195
1.081HisVal: 1.081 ± 0.229
0.27HisTrp: 0.27 ± 0.102
0.927HisTyr: 0.927 ± 0.168
0.0HisXaa: 0.0 ± 0.0
Ile
3.668IleAla: 3.668 ± 0.401
0.965IleCys: 0.965 ± 0.197
3.9IleAsp: 3.9 ± 0.373
4.865IleGlu: 4.865 ± 0.426
2.355IlePhe: 2.355 ± 0.297
4.711IleGly: 4.711 ± 0.52
0.965IleHis: 0.965 ± 0.186
3.012IleIle: 3.012 ± 0.362
4.981IleLys: 4.981 ± 0.432
4.402IleLeu: 4.402 ± 0.436
1.544IleMet: 1.544 ± 0.239
3.707IleAsn: 3.707 ± 0.57
2.471IlePro: 2.471 ± 0.331
2.664IleGln: 2.664 ± 0.32
2.973IleArg: 2.973 ± 0.323
4.826IleSer: 4.826 ± 0.456
3.514IleThr: 3.514 ± 0.458
4.633IleVal: 4.633 ± 0.392
0.541IleTrp: 0.541 ± 0.151
1.892IleTyr: 1.892 ± 0.215
0.0IleXaa: 0.0 ± 0.0
Lys
5.019LysAla: 5.019 ± 0.518
0.811LysCys: 0.811 ± 0.18
5.251LysAsp: 5.251 ± 0.46
5.019LysGlu: 5.019 ± 0.479
2.819LysPhe: 2.819 ± 0.353
4.556LysGly: 4.556 ± 0.428
1.969LysHis: 1.969 ± 0.355
5.019LysIle: 5.019 ± 0.439
4.44LysLys: 4.44 ± 0.529
6.718LysLeu: 6.718 ± 0.503
2.394LysMet: 2.394 ± 0.284
3.05LysAsn: 3.05 ± 0.387
3.012LysPro: 3.012 ± 0.329
2.548LysGln: 2.548 ± 0.349
3.977LysArg: 3.977 ± 0.465
4.17LysSer: 4.17 ± 0.332
4.054LysThr: 4.054 ± 0.498
7.336LysVal: 7.336 ± 0.57
0.965LysTrp: 0.965 ± 0.186
3.591LysTyr: 3.591 ± 0.36
0.0LysXaa: 0.0 ± 0.0
Leu
5.251LeuAla: 5.251 ± 0.546
1.043LeuCys: 1.043 ± 0.207
6.216LeuAsp: 6.216 ± 0.443
6.332LeuGlu: 6.332 ± 0.473
3.089LeuPhe: 3.089 ± 0.333
5.521LeuGly: 5.521 ± 0.477
1.12LeuHis: 1.12 ± 0.246
4.942LeuIle: 4.942 ± 0.407
6.525LeuLys: 6.525 ± 0.454
6.062LeuLeu: 6.062 ± 0.504
2.278LeuMet: 2.278 ± 0.295
4.131LeuAsn: 4.131 ± 0.438
2.433LeuPro: 2.433 ± 0.312
3.243LeuGln: 3.243 ± 0.34
2.973LeuArg: 2.973 ± 0.309
5.869LeuSer: 5.869 ± 0.585
4.595LeuThr: 4.595 ± 0.384
5.753LeuVal: 5.753 ± 0.563
0.965LeuTrp: 0.965 ± 0.179
2.78LeuTyr: 2.78 ± 0.293
0.0LeuXaa: 0.0 ± 0.0
Met
2.278MetAla: 2.278 ± 0.303
0.348MetCys: 0.348 ± 0.105
1.429MetAsp: 1.429 ± 0.259
2.046MetGlu: 2.046 ± 0.299
1.043MetPhe: 1.043 ± 0.189
1.274MetGly: 1.274 ± 0.219
0.425MetHis: 0.425 ± 0.134
1.467MetIle: 1.467 ± 0.253
2.008MetLys: 2.008 ± 0.271
1.931MetLeu: 1.931 ± 0.289
0.849MetMet: 0.849 ± 0.179
1.004MetAsn: 1.004 ± 0.187
1.12MetPro: 1.12 ± 0.203
0.927MetGln: 0.927 ± 0.184
1.043MetArg: 1.043 ± 0.225
2.046MetSer: 2.046 ± 0.291
1.622MetThr: 1.622 ± 0.282
1.815MetVal: 1.815 ± 0.292
0.386MetTrp: 0.386 ± 0.124
1.197MetTyr: 1.197 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
2.741AsnAla: 2.741 ± 0.4
0.386AsnCys: 0.386 ± 0.121
2.124AsnAsp: 2.124 ± 0.265
2.587AsnGlu: 2.587 ± 0.371
2.046AsnPhe: 2.046 ± 0.321
4.788AsnGly: 4.788 ± 0.438
0.502AsnHis: 0.502 ± 0.142
3.668AsnIle: 3.668 ± 0.477
3.205AsnLys: 3.205 ± 0.413
4.093AsnLeu: 4.093 ± 0.399
1.274AsnMet: 1.274 ± 0.175
2.587AsnAsn: 2.587 ± 0.345
3.552AsnPro: 3.552 ± 0.466
1.776AsnGln: 1.776 ± 0.272
1.622AsnArg: 1.622 ± 0.223
2.78AsnSer: 2.78 ± 0.296
3.861AsnThr: 3.861 ± 0.371
3.05AsnVal: 3.05 ± 0.375
0.811AsnTrp: 0.811 ± 0.167
1.776AsnTyr: 1.776 ± 0.312
0.0AsnXaa: 0.0 ± 0.0
Pro
1.274ProAla: 1.274 ± 0.247
0.193ProCys: 0.193 ± 0.082
2.433ProAsp: 2.433 ± 0.301
3.166ProGlu: 3.166 ± 0.34
1.274ProPhe: 1.274 ± 0.277
0.0ProGly: 0.0 ± 0.0
0.579ProHis: 0.579 ± 0.154
2.433ProIle: 2.433 ± 0.276
2.819ProLys: 2.819 ± 0.418
2.239ProLeu: 2.239 ± 0.292
0.734ProMet: 0.734 ± 0.194
1.815ProAsn: 1.815 ± 0.27
0.772ProPro: 0.772 ± 0.19
1.158ProGln: 1.158 ± 0.184
1.197ProArg: 1.197 ± 0.191
2.278ProSer: 2.278 ± 0.323
2.471ProThr: 2.471 ± 0.341
2.626ProVal: 2.626 ± 0.256
0.27ProTrp: 0.27 ± 0.096
1.158ProTyr: 1.158 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
2.934GlnAla: 2.934 ± 0.372
0.386GlnCys: 0.386 ± 0.125
1.969GlnAsp: 1.969 ± 0.25
2.626GlnGlu: 2.626 ± 0.304
1.158GlnPhe: 1.158 ± 0.182
2.51GlnGly: 2.51 ± 0.291
0.772GlnHis: 0.772 ± 0.203
2.278GlnIle: 2.278 ± 0.271
2.433GlnLys: 2.433 ± 0.321
3.128GlnLeu: 3.128 ± 0.36
0.965GlnMet: 0.965 ± 0.174
2.008GlnAsn: 2.008 ± 0.249
1.004GlnPro: 1.004 ± 0.246
1.892GlnGln: 1.892 ± 0.317
1.274GlnArg: 1.274 ± 0.19
2.008GlnSer: 2.008 ± 0.282
2.433GlnThr: 2.433 ± 0.316
2.548GlnVal: 2.548 ± 0.312
0.618GlnTrp: 0.618 ± 0.151
2.046GlnTyr: 2.046 ± 0.299
0.0GlnXaa: 0.0 ± 0.0
Arg
2.394ArgAla: 2.394 ± 0.238
0.386ArgCys: 0.386 ± 0.118
2.124ArgAsp: 2.124 ± 0.269
3.282ArgGlu: 3.282 ± 0.378
2.085ArgPhe: 2.085 ± 0.259
2.896ArgGly: 2.896 ± 0.355
1.12ArgHis: 1.12 ± 0.166
2.78ArgIle: 2.78 ± 0.294
3.552ArgLys: 3.552 ± 0.352
3.436ArgLeu: 3.436 ± 0.375
1.351ArgMet: 1.351 ± 0.204
2.664ArgAsn: 2.664 ± 0.306
1.274ArgPro: 1.274 ± 0.219
1.274ArgGln: 1.274 ± 0.2
1.815ArgArg: 1.815 ± 0.316
2.664ArgSer: 2.664 ± 0.315
2.278ArgThr: 2.278 ± 0.253
3.745ArgVal: 3.745 ± 0.343
0.734ArgTrp: 0.734 ± 0.153
2.085ArgTyr: 2.085 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
2.973SerAla: 2.973 ± 0.431
0.734SerCys: 0.734 ± 0.153
3.745SerAsp: 3.745 ± 0.379
4.324SerGlu: 4.324 ± 0.407
3.321SerPhe: 3.321 ± 0.316
5.251SerGly: 5.251 ± 0.56
1.081SerHis: 1.081 ± 0.241
3.243SerIle: 3.243 ± 0.308
4.942SerLys: 4.942 ± 0.451
5.444SerLeu: 5.444 ± 0.395
1.197SerMet: 1.197 ± 0.196
2.973SerAsn: 2.973 ± 0.365
2.239SerPro: 2.239 ± 0.261
2.626SerGln: 2.626 ± 0.309
2.857SerArg: 2.857 ± 0.268
4.711SerSer: 4.711 ± 0.483
4.093SerThr: 4.093 ± 0.458
4.054SerVal: 4.054 ± 0.412
0.888SerTrp: 0.888 ± 0.212
2.51SerTyr: 2.51 ± 0.261
0.0SerXaa: 0.0 ± 0.0
Thr
3.629ThrAla: 3.629 ± 0.547
0.849ThrCys: 0.849 ± 0.197
3.475ThrAsp: 3.475 ± 0.465
3.591ThrGlu: 3.591 ± 0.356
2.355ThrPhe: 2.355 ± 0.286
5.367ThrGly: 5.367 ± 0.659
0.965ThrHis: 0.965 ± 0.209
3.784ThrIle: 3.784 ± 0.367
3.707ThrLys: 3.707 ± 0.404
4.324ThrLeu: 4.324 ± 0.368
1.274ThrMet: 1.274 ± 0.224
2.278ThrAsn: 2.278 ± 0.356
2.085ThrPro: 2.085 ± 0.27
2.587ThrGln: 2.587 ± 0.304
2.51ThrArg: 2.51 ± 0.322
4.479ThrSer: 4.479 ± 0.408
3.668ThrThr: 3.668 ± 0.459
4.054ThrVal: 4.054 ± 0.433
0.579ThrTrp: 0.579 ± 0.116
2.394ThrTyr: 2.394 ± 0.274
0.0ThrXaa: 0.0 ± 0.0
Val
4.363ValAla: 4.363 ± 0.425
0.502ValCys: 0.502 ± 0.135
5.019ValAsp: 5.019 ± 0.368
4.904ValGlu: 4.904 ± 0.53
3.05ValPhe: 3.05 ± 0.284
6.796ValGly: 6.796 ± 0.53
1.506ValHis: 1.506 ± 0.245
5.097ValIle: 5.097 ± 0.387
5.946ValLys: 5.946 ± 0.421
6.101ValLeu: 6.101 ± 0.481
1.544ValMet: 1.544 ± 0.27
4.17ValAsn: 4.17 ± 0.443
2.085ValPro: 2.085 ± 0.252
2.51ValGln: 2.51 ± 0.291
3.205ValArg: 3.205 ± 0.337
4.711ValSer: 4.711 ± 0.464
4.016ValThr: 4.016 ± 0.47
6.448ValVal: 6.448 ± 0.529
0.927ValTrp: 0.927 ± 0.168
3.9ValTyr: 3.9 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
0.502TrpAla: 0.502 ± 0.165
0.27TrpCys: 0.27 ± 0.096
1.004TrpAsp: 1.004 ± 0.201
1.158TrpGlu: 1.158 ± 0.207
0.849TrpPhe: 0.849 ± 0.232
0.734TrpGly: 0.734 ± 0.175
0.348TrpHis: 0.348 ± 0.116
0.656TrpIle: 0.656 ± 0.2
1.274TrpLys: 1.274 ± 0.221
1.313TrpLeu: 1.313 ± 0.288
0.232TrpMet: 0.232 ± 0.075
0.772TrpAsn: 0.772 ± 0.189
0.039TrpPro: 0.039 ± 0.04
0.502TrpGln: 0.502 ± 0.156
0.734TrpArg: 0.734 ± 0.169
0.965TrpSer: 0.965 ± 0.172
0.386TrpThr: 0.386 ± 0.16
1.004TrpVal: 1.004 ± 0.174
0.425TrpTrp: 0.425 ± 0.138
0.618TrpTyr: 0.618 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.587TyrAla: 2.587 ± 0.27
0.734TyrCys: 0.734 ± 0.171
2.433TyrAsp: 2.433 ± 0.337
2.626TyrGlu: 2.626 ± 0.326
1.66TyrPhe: 1.66 ± 0.241
3.012TyrGly: 3.012 ± 0.333
0.849TyrHis: 0.849 ± 0.157
2.355TyrIle: 2.355 ± 0.387
3.321TyrLys: 3.321 ± 0.361
3.552TyrLeu: 3.552 ± 0.356
0.927TyrMet: 0.927 ± 0.221
2.703TyrAsn: 2.703 ± 0.302
1.506TyrPro: 1.506 ± 0.249
1.429TyrGln: 1.429 ± 0.206
2.124TyrArg: 2.124 ± 0.26
2.548TyrSer: 2.548 ± 0.252
2.626TyrThr: 2.626 ± 0.289
3.629TyrVal: 3.629 ± 0.427
0.541TyrTrp: 0.541 ± 0.161
1.506TyrTyr: 1.506 ± 0.245
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 146 proteins (25900 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski