Amino acid dipeptide frequency for Acinetobacter phage AbKT21phiIII

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25%. As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
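The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the one-letter alphabet, the use of `X` for Xaa, and the toy sequences are all assumptions made for the example. In particular, the sketch normalises by the total number of dipeptides counted, which may differ from the normalisation used for the table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Hypothetical helper: normalises by the number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
UNIFORM_PERMILLE = 1000.0 / 400

toy = ["MAAL", "AAC"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_permille(toy)
```

For the toy input, the five dipeptides are MA, AA, AL, AA and AC, so `freqs["AA"]` is 400 per mille, and the dictionary always has 441 entries (21 x 21).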

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
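As a small worked example of the unit conversion and of the over/under-representation comparison, the sketch below converts per mille values to fractions and compares them with the uniform expectation of 2.5 ‰ (0.25%). The helper names are hypothetical, and using exactly this threshold for the red/blue marking is an assumption based on the description above. The three values are taken from the table.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # uniform expectation: 2.5 per mille


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"   # would be marked red under the assumed rule
    if value < expected:
        return "under"  # would be marked blue under the assumed rule
    return "expected"


# Values copied from the table (in per mille).
examples = {"AlaAla": 8.055, "CysCys": 0.087, "ProGly": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

Here AlaAla (8.055 ‰, a fraction of about 0.008055) falls above the expectation, while CysCys and ProGly fall below it.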

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.055 ± 0.998
AlaCys: 0.78 ± 0.312
AlaAsp: 5.37 ± 0.787
AlaGlu: 5.63 ± 0.665
AlaPhe: 1.992 ± 0.326
AlaGly: 6.583 ± 0.457
AlaHis: 1.299 ± 0.357
AlaIle: 4.851 ± 0.756
AlaLys: 5.024 ± 0.646
AlaLeu: 8.055 ± 0.623
AlaMet: 3.291 ± 0.53
AlaAsn: 3.725 ± 0.399
AlaPro: 2.079 ± 0.54
AlaGln: 4.677 ± 0.713
AlaArg: 2.685 ± 0.589
AlaSer: 3.984 ± 0.617
AlaThr: 4.851 ± 0.625
AlaVal: 5.197 ± 0.747
AlaTrp: 0.346 ± 0.174
AlaTyr: 3.118 ± 0.497
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.78 ± 0.262
CysCys: 0.087 ± 0.082
CysAsp: 0.953 ± 0.334
CysGlu: 0.346 ± 0.163
CysPhe: 0.52 ± 0.195
CysGly: 1.126 ± 0.328
CysHis: 0.433 ± 0.202
CysIle: 0.52 ± 0.216
CysLys: 0.0 ± 0.0
CysLeu: 1.299 ± 0.399
CysMet: 0.0 ± 0.0
CysAsn: 0.346 ± 0.22
CysPro: 0.173 ± 0.103
CysGln: 0.26 ± 0.142
CysArg: 0.173 ± 0.12
CysSer: 0.433 ± 0.179
CysThr: 0.52 ± 0.201
CysVal: 0.953 ± 0.309
CysTrp: 0.173 ± 0.113
CysTyr: 0.693 ± 0.222
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.024 ± 0.788
AspCys: 1.039 ± 0.31
AspAsp: 3.465 ± 0.623
AspGlu: 4.244 ± 0.667
AspPhe: 2.079 ± 0.4
AspGly: 3.378 ± 0.499
AspHis: 1.559 ± 0.428
AspIle: 5.024 ± 0.603
AspLys: 4.504 ± 0.655
AspLeu: 5.11 ± 0.635
AspMet: 1.646 ± 0.377
AspAsn: 2.945 ± 0.431
AspPro: 1.732 ± 0.405
AspGln: 2.079 ± 0.353
AspArg: 3.205 ± 0.599
AspSer: 3.205 ± 0.475
AspThr: 3.898 ± 0.594
AspVal: 4.158 ± 0.517
AspTrp: 1.819 ± 0.46
AspTyr: 2.425 ± 0.473
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.89 ± 0.628
GluCys: 0.26 ± 0.148
GluAsp: 4.937 ± 0.644
GluGlu: 3.638 ± 0.658
GluPhe: 1.992 ± 0.457
GluGly: 3.811 ± 0.489
GluHis: 1.299 ± 0.26
GluIle: 3.551 ± 0.733
GluLys: 2.685 ± 0.676
GluLeu: 6.756 ± 0.749
GluMet: 1.559 ± 0.29
GluAsn: 2.772 ± 0.47
GluPro: 2.599 ± 0.403
GluGln: 2.772 ± 0.529
GluArg: 2.599 ± 0.536
GluSer: 3.205 ± 0.493
GluThr: 1.992 ± 0.431
GluVal: 3.898 ± 0.527
GluTrp: 1.299 ± 0.341
GluTyr: 4.158 ± 0.612
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.425 ± 0.516
PheCys: 0.52 ± 0.241
PheAsp: 2.599 ± 0.536
PheGlu: 2.858 ± 0.64
PhePhe: 1.299 ± 0.323
PheGly: 2.599 ± 0.584
PheHis: 0.52 ± 0.217
PheIle: 1.559 ± 0.414
PheLys: 2.079 ± 0.348
PheLeu: 2.685 ± 0.52
PheMet: 0.78 ± 0.233
PheAsn: 2.512 ± 0.623
PhePro: 1.386 ± 0.304
PheGln: 1.386 ± 0.285
PheArg: 0.953 ± 0.281
PheSer: 2.512 ± 0.299
PheThr: 2.165 ± 0.385
PheVal: 1.992 ± 0.52
PheTrp: 0.26 ± 0.14
PheTyr: 0.78 ± 0.258
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.591 ± 0.831
GlyCys: 0.693 ± 0.277
GlyAsp: 3.811 ± 0.59
GlyGlu: 3.551 ± 0.596
GlyPhe: 2.339 ± 0.516
GlyGly: 6.323 ± 1.194
GlyHis: 1.039 ± 0.294
GlyIle: 4.244 ± 0.628
GlyLys: 6.756 ± 0.778
GlyLeu: 5.457 ± 0.764
GlyMet: 2.858 ± 0.762
GlyAsn: 3.551 ± 0.661
GlyPro: 1.126 ± 0.319
GlyGln: 2.945 ± 0.623
GlyArg: 3.811 ± 0.395
GlySer: 4.937 ± 0.835
GlyThr: 6.063 ± 1.132
GlyVal: 5.457 ± 0.598
GlyTrp: 0.953 ± 0.311
GlyTyr: 2.772 ± 0.51
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.299 ± 0.384
HisCys: 0.52 ± 0.232
HisAsp: 1.992 ± 0.446
HisGlu: 0.606 ± 0.257
HisPhe: 1.559 ± 0.426
HisGly: 1.819 ± 0.355
HisHis: 0.346 ± 0.149
HisIle: 0.953 ± 0.249
HisLys: 1.646 ± 0.54
HisLeu: 1.646 ± 0.348
HisMet: 0.26 ± 0.134
HisAsn: 1.039 ± 0.277
HisPro: 1.126 ± 0.253
HisGln: 0.953 ± 0.3
HisArg: 0.433 ± 0.211
HisSer: 0.953 ± 0.279
HisThr: 0.173 ± 0.129
HisVal: 1.646 ± 0.335
HisTrp: 0.606 ± 0.183
HisTyr: 0.866 ± 0.37
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.024 ± 0.605
IleCys: 0.26 ± 0.136
IleAsp: 4.677 ± 0.531
IleGlu: 3.725 ± 0.569
IlePhe: 1.039 ± 0.311
IleGly: 3.725 ± 0.62
IleHis: 1.732 ± 0.334
IleIle: 3.205 ± 0.422
IleLys: 4.244 ± 0.712
IleLeu: 3.551 ± 0.491
IleMet: 1.472 ± 0.372
IleAsn: 3.032 ± 0.356
IlePro: 2.079 ± 0.448
IleGln: 2.599 ± 0.459
IleArg: 2.945 ± 0.534
IleSer: 3.032 ± 0.472
IleThr: 3.291 ± 0.578
IleVal: 3.118 ± 0.479
IleTrp: 0.433 ± 0.18
IleTyr: 2.858 ± 0.47
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.457 ± 0.79
LysCys: 0.433 ± 0.176
LysAsp: 4.071 ± 0.523
LysGlu: 5.284 ± 0.591
LysPhe: 2.685 ± 0.536
LysGly: 5.803 ± 0.544
LysHis: 2.165 ± 0.475
LysIle: 2.685 ± 0.527
LysLys: 2.858 ± 0.675
LysLeu: 5.717 ± 0.565
LysMet: 1.819 ± 0.35
LysAsn: 2.512 ± 0.405
LysPro: 2.945 ± 0.693
LysGln: 2.858 ± 0.499
LysArg: 3.465 ± 0.5
LysSer: 3.898 ± 0.489
LysThr: 4.158 ± 0.762
LysVal: 4.331 ± 0.708
LysTrp: 0.606 ± 0.198
LysTyr: 2.685 ± 0.594
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.189 ± 0.696
LeuCys: 1.039 ± 0.28
LeuAsp: 5.717 ± 0.522
LeuGlu: 5.457 ± 0.714
LeuPhe: 1.992 ± 0.386
LeuGly: 5.717 ± 0.749
LeuHis: 1.472 ± 0.437
LeuIle: 4.504 ± 0.661
LeuLys: 6.063 ± 0.764
LeuLeu: 8.402 ± 1.155
LeuMet: 1.386 ± 0.274
LeuAsn: 4.417 ± 0.578
LeuPro: 3.551 ± 0.579
LeuGln: 4.158 ± 0.583
LeuArg: 4.504 ± 0.524
LeuSer: 5.024 ± 0.606
LeuThr: 5.11 ± 0.766
LeuVal: 5.717 ± 0.619
LeuTrp: 0.78 ± 0.245
LeuTyr: 3.205 ± 0.448
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.252 ± 0.603
MetCys: 0.26 ± 0.135
MetAsp: 2.079 ± 0.394
MetGlu: 0.866 ± 0.265
MetPhe: 1.039 ± 0.249
MetGly: 1.299 ± 0.246
MetHis: 0.433 ± 0.168
MetIle: 1.472 ± 0.384
MetLys: 1.819 ± 0.412
MetLeu: 1.906 ± 0.341
MetMet: 0.26 ± 0.119
MetAsn: 1.213 ± 0.412
MetPro: 0.78 ± 0.285
MetGln: 1.819 ± 0.306
MetArg: 1.992 ± 0.377
MetSer: 1.906 ± 0.319
MetThr: 2.165 ± 0.401
MetVal: 0.866 ± 0.243
MetTrp: 0.346 ± 0.168
MetTyr: 1.472 ± 0.313
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.811 ± 0.768
AsnCys: 0.52 ± 0.168
AsnAsp: 1.646 ± 0.4
AsnGlu: 2.339 ± 0.428
AsnPhe: 1.906 ± 0.42
AsnGly: 4.764 ± 0.631
AsnHis: 0.78 ± 0.274
AsnIle: 3.032 ± 0.428
AsnLys: 3.032 ± 0.418
AsnLeu: 3.551 ± 0.487
AsnMet: 1.646 ± 0.382
AsnAsn: 1.992 ± 0.429
AsnPro: 3.725 ± 0.727
AsnGln: 1.906 ± 0.423
AsnArg: 2.339 ± 0.632
AsnSer: 2.599 ± 0.479
AsnThr: 3.551 ± 0.615
AsnVal: 3.032 ± 0.483
AsnTrp: 0.52 ± 0.199
AsnTyr: 2.858 ± 0.329
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.032 ± 0.623
ProCys: 0.087 ± 0.086
ProAsp: 2.512 ± 0.39
ProGlu: 3.378 ± 0.462
ProPhe: 1.299 ± 0.29
ProGly: 0.0 ± 0.0
ProHis: 0.52 ± 0.21
ProIle: 2.599 ± 0.595
ProLys: 3.118 ± 0.563
ProLeu: 2.425 ± 0.342
ProMet: 1.039 ± 0.387
ProAsn: 2.772 ± 0.551
ProPro: 0.693 ± 0.175
ProGln: 1.906 ± 0.603
ProArg: 1.299 ± 0.308
ProSer: 2.252 ± 0.508
ProThr: 2.772 ± 0.412
ProVal: 3.118 ± 0.627
ProTrp: 0.087 ± 0.089
ProTyr: 2.079 ± 0.399
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.551 ± 0.537
GlnCys: 0.26 ± 0.203
GlnAsp: 1.819 ± 0.366
GlnGlu: 2.599 ± 0.491
GlnPhe: 2.339 ± 0.398
GlnGly: 3.725 ± 0.914
GlnHis: 1.732 ± 0.302
GlnIle: 2.339 ± 0.505
GlnLys: 2.685 ± 0.577
GlnLeu: 5.024 ± 0.586
GlnMet: 0.78 ± 0.224
GlnAsn: 1.386 ± 0.337
GlnPro: 1.213 ± 0.253
GlnGln: 2.599 ± 0.524
GlnArg: 2.685 ± 0.449
GlnSer: 2.165 ± 0.347
GlnThr: 2.079 ± 0.394
GlnVal: 3.984 ± 0.828
GlnTrp: 0.52 ± 0.219
GlnTyr: 3.032 ± 0.496
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.638 ± 0.463
ArgCys: 0.52 ± 0.206
ArgAsp: 2.079 ± 0.41
ArgGlu: 2.685 ± 0.567
ArgPhe: 1.992 ± 0.409
ArgGly: 3.291 ± 0.457
ArgHis: 0.78 ± 0.209
ArgIle: 3.205 ± 0.455
ArgLys: 3.205 ± 0.611
ArgLeu: 4.764 ± 0.796
ArgMet: 1.646 ± 0.42
ArgAsn: 3.205 ± 0.541
ArgPro: 1.472 ± 0.363
ArgGln: 2.079 ± 0.565
ArgArg: 3.032 ± 0.603
ArgSer: 2.512 ± 0.478
ArgThr: 2.165 ± 0.435
ArgVal: 3.725 ± 0.473
ArgTrp: 0.346 ± 0.147
ArgTyr: 1.992 ± 0.371
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.197 ± 0.692
SerCys: 0.52 ± 0.214
SerAsp: 3.811 ± 0.531
SerGlu: 3.291 ± 0.421
SerPhe: 2.252 ± 0.389
SerGly: 3.984 ± 0.582
SerHis: 1.213 ± 0.356
SerIle: 3.205 ± 0.541
SerLys: 4.071 ± 0.625
SerLeu: 3.638 ± 0.471
SerMet: 1.386 ± 0.376
SerAsn: 2.512 ± 0.523
SerPro: 1.819 ± 0.344
SerGln: 1.906 ± 0.52
SerArg: 2.858 ± 0.631
SerSer: 3.551 ± 0.661
SerThr: 5.024 ± 1.033
SerVal: 3.465 ± 0.558
SerTrp: 0.953 ± 0.317
SerTyr: 2.252 ± 0.421
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.677 ± 0.722
ThrCys: 0.606 ± 0.225
ThrAsp: 3.205 ± 0.536
ThrGlu: 3.378 ± 0.52
ThrPhe: 1.906 ± 0.338
ThrGly: 6.41 ± 0.619
ThrHis: 1.126 ± 0.266
ThrIle: 3.378 ± 0.47
ThrLys: 3.551 ± 0.448
ThrLeu: 4.937 ± 0.589
ThrMet: 1.299 ± 0.343
ThrAsn: 2.339 ± 0.591
ThrPro: 2.772 ± 0.604
ThrGln: 3.811 ± 0.791
ThrArg: 2.858 ± 0.519
ThrSer: 2.685 ± 0.477
ThrThr: 4.244 ± 0.576
ThrVal: 4.071 ± 0.838
ThrTrp: 0.606 ± 0.184
ThrTyr: 2.512 ± 0.432
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.89 ± 0.52
ValCys: 0.52 ± 0.168
ValAsp: 4.417 ± 0.659
ValGlu: 4.417 ± 0.792
ValPhe: 1.819 ± 0.358
ValGly: 4.937 ± 0.621
ValHis: 1.299 ± 0.268
ValIle: 3.032 ± 0.565
ValLys: 5.024 ± 0.664
ValLeu: 4.591 ± 0.571
ValMet: 1.299 ± 0.314
ValAsn: 3.898 ± 0.699
ValPro: 3.551 ± 0.525
ValGln: 3.118 ± 0.486
ValArg: 3.465 ± 0.511
ValSer: 4.158 ± 0.614
ValThr: 3.551 ± 0.65
ValVal: 5.284 ± 0.751
ValTrp: 0.693 ± 0.209
ValTyr: 2.772 ± 0.487
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.693 ± 0.229
TrpCys: 0.346 ± 0.145
TrpAsp: 0.78 ± 0.269
TrpGlu: 0.866 ± 0.253
TrpPhe: 0.606 ± 0.246
TrpGly: 0.693 ± 0.225
TrpHis: 0.173 ± 0.117
TrpIle: 0.52 ± 0.196
TrpLys: 1.213 ± 0.214
TrpLeu: 1.299 ± 0.295
TrpMet: 0.433 ± 0.173
TrpAsn: 0.52 ± 0.199
TrpPro: 0.0 ± 0.0
TrpGln: 0.173 ± 0.134
TrpArg: 0.693 ± 0.212
TrpSer: 0.78 ± 0.183
TrpThr: 0.78 ± 0.408
TrpVal: 0.52 ± 0.234
TrpTrp: 0.087 ± 0.084
TrpTyr: 0.52 ± 0.171
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.118 ± 0.607
TyrCys: 0.52 ± 0.235
TyrAsp: 2.599 ± 0.397
TyrGlu: 2.512 ± 0.543
TyrPhe: 1.213 ± 0.25
TyrGly: 3.291 ± 0.532
TyrHis: 0.52 ± 0.169
TyrIle: 2.165 ± 0.532
TyrLys: 2.858 ± 0.526
TyrLeu: 4.591 ± 0.632
TyrMet: 1.299 ± 0.299
TyrAsn: 2.858 ± 0.616
TyrPro: 2.165 ± 0.423
TyrGln: 2.512 ± 0.638
TyrArg: 2.339 ± 0.39
TyrSer: 3.205 ± 0.449
TyrThr: 1.819 ± 0.472
TyrVal: 3.205 ± 0.502
TyrTrp: 0.26 ± 0.14
TyrTyr: 1.559 ± 0.365
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (11546 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
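A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The function names are hypothetical, and taking the standard deviation of the replicates as the ± value, as well as normalising by the number of dipeptides, are assumptions not confirmed by the page.

```python
import random
import statistics


def dipeptide_count(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    total = sum(max(len(s) - 1, 0) for s in proteins)
    hits = sum(dipeptide_count(s, dipeptide) for s in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Assumed error estimate: standard deviation over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MAAL", "AAC", "GGGA"]  # made-up sequences, not from the proteome
err_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level, rather than shuffling individual residues, keeps each protein's internal composition intact, so the error reflects variation between proteins.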

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski