Amino acid dipepetide frequency for Acinetobacter phage AbP2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.436AlaAla: 5.436 ± 0.861
0.715AlaCys: 0.715 ± 0.262
3.791AlaAsp: 3.791 ± 0.473
3.576AlaGlu: 3.576 ± 0.501
2.575AlaPhe: 2.575 ± 0.376
4.506AlaGly: 4.506 ± 0.568
1.144AlaHis: 1.144 ± 0.283
6.151AlaIle: 6.151 ± 0.682
5.865AlaLys: 5.865 ± 0.75
6.437AlaLeu: 6.437 ± 0.754
2.36AlaMet: 2.36 ± 0.531
4.434AlaAsn: 4.434 ± 0.549
2.575AlaPro: 2.575 ± 0.398
3.433AlaGln: 3.433 ± 0.552
2.575AlaArg: 2.575 ± 0.331
4.291AlaSer: 4.291 ± 0.59
4.792AlaThr: 4.792 ± 0.768
3.719AlaVal: 3.719 ± 0.525
0.644AlaTrp: 0.644 ± 0.183
3.218AlaTyr: 3.218 ± 0.394
0.0AlaXaa: 0.0 ± 0.0
Cys
0.644CysAla: 0.644 ± 0.17
0.215CysCys: 0.215 ± 0.121
1.073CysAsp: 1.073 ± 0.299
1.001CysGlu: 1.001 ± 0.258
0.572CysPhe: 0.572 ± 0.186
0.715CysGly: 0.715 ± 0.204
0.143CysHis: 0.143 ± 0.095
0.501CysIle: 0.501 ± 0.179
0.93CysLys: 0.93 ± 0.281
1.001CysLeu: 1.001 ± 0.285
0.358CysMet: 0.358 ± 0.152
0.429CysAsn: 0.429 ± 0.16
0.358CysPro: 0.358 ± 0.146
0.215CysGln: 0.215 ± 0.121
0.715CysArg: 0.715 ± 0.236
0.572CysSer: 0.572 ± 0.181
0.501CysThr: 0.501 ± 0.192
1.43CysVal: 1.43 ± 0.258
0.215CysTrp: 0.215 ± 0.127
0.644CysTyr: 0.644 ± 0.231
0.0CysXaa: 0.0 ± 0.0
Asp
4.649AspAla: 4.649 ± 0.554
0.787AspCys: 0.787 ± 0.226
3.791AspAsp: 3.791 ± 0.536
4.291AspGlu: 4.291 ± 0.58
3.004AspPhe: 3.004 ± 0.427
4.506AspGly: 4.506 ± 0.623
0.501AspHis: 0.501 ± 0.16
4.22AspIle: 4.22 ± 0.495
4.863AspLys: 4.863 ± 0.643
3.862AspLeu: 3.862 ± 0.508
1.359AspMet: 1.359 ± 0.303
2.718AspAsn: 2.718 ± 0.329
1.502AspPro: 1.502 ± 0.294
2.932AspGln: 2.932 ± 0.403
2.789AspArg: 2.789 ± 0.395
3.505AspSer: 3.505 ± 0.498
2.503AspThr: 2.503 ± 0.431
4.291AspVal: 4.291 ± 0.555
1.502AspTrp: 1.502 ± 0.363
2.932AspTyr: 2.932 ± 0.437
0.0AspXaa: 0.0 ± 0.0
Glu
4.792GluAla: 4.792 ± 0.712
0.644GluCys: 0.644 ± 0.208
3.719GluAsp: 3.719 ± 0.68
4.935GluGlu: 4.935 ± 0.596
4.148GluPhe: 4.148 ± 0.602
3.791GluGly: 3.791 ± 0.569
1.359GluHis: 1.359 ± 0.276
4.72GluIle: 4.72 ± 0.59
4.506GluLys: 4.506 ± 0.565
6.079GluLeu: 6.079 ± 0.717
1.86GluMet: 1.86 ± 0.365
3.218GluAsn: 3.218 ± 0.44
1.86GluPro: 1.86 ± 0.368
3.361GluGln: 3.361 ± 0.536
1.931GluArg: 1.931 ± 0.362
4.72GluSer: 4.72 ± 0.521
2.36GluThr: 2.36 ± 0.555
4.863GluVal: 4.863 ± 0.57
1.001GluTrp: 1.001 ± 0.307
3.218GluTyr: 3.218 ± 0.511
0.0GluXaa: 0.0 ± 0.0
Phe
3.075PheAla: 3.075 ± 0.509
0.858PheCys: 0.858 ± 0.221
3.648PheAsp: 3.648 ± 0.412
2.932PheGlu: 2.932 ± 0.487
1.502PhePhe: 1.502 ± 0.296
3.004PheGly: 3.004 ± 0.355
0.787PheHis: 0.787 ± 0.222
3.934PheIle: 3.934 ± 0.536
3.648PheLys: 3.648 ± 0.445
2.861PheLeu: 2.861 ± 0.451
1.502PheMet: 1.502 ± 0.334
2.432PheAsn: 2.432 ± 0.38
0.858PhePro: 0.858 ± 0.213
1.216PheGln: 1.216 ± 0.321
1.287PheArg: 1.287 ± 0.257
2.289PheSer: 2.289 ± 0.38
2.932PheThr: 2.932 ± 0.413
2.718PheVal: 2.718 ± 0.393
0.572PheTrp: 0.572 ± 0.274
2.861PheTyr: 2.861 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
5.149GlyAla: 5.149 ± 0.742
0.715GlyCys: 0.715 ± 0.249
3.576GlyAsp: 3.576 ± 0.525
4.005GlyGlu: 4.005 ± 0.464
4.22GlyPhe: 4.22 ± 0.538
5.221GlyGly: 5.221 ± 0.67
1.287GlyHis: 1.287 ± 0.285
4.577GlyIle: 4.577 ± 0.664
4.291GlyLys: 4.291 ± 0.63
6.079GlyLeu: 6.079 ± 0.586
2.36GlyMet: 2.36 ± 0.37
3.791GlyAsn: 3.791 ± 0.466
0.572GlyPro: 0.572 ± 0.297
2.217GlyGln: 2.217 ± 0.485
2.861GlyArg: 2.861 ± 0.396
4.077GlySer: 4.077 ± 0.729
3.361GlyThr: 3.361 ± 0.485
5.865GlyVal: 5.865 ± 0.704
1.144GlyTrp: 1.144 ± 0.31
3.147GlyTyr: 3.147 ± 0.429
0.0GlyXaa: 0.0 ± 0.0
His
1.43HisAla: 1.43 ± 0.323
0.143HisCys: 0.143 ± 0.098
0.644HisAsp: 0.644 ± 0.2
1.931HisGlu: 1.931 ± 0.383
0.358HisPhe: 0.358 ± 0.17
1.144HisGly: 1.144 ± 0.272
0.286HisHis: 0.286 ± 0.123
1.287HisIle: 1.287 ± 0.28
1.502HisLys: 1.502 ± 0.358
1.144HisLeu: 1.144 ± 0.261
0.358HisMet: 0.358 ± 0.166
1.001HisAsn: 1.001 ± 0.272
0.715HisPro: 0.715 ± 0.223
0.715HisGln: 0.715 ± 0.244
0.429HisArg: 0.429 ± 0.201
0.644HisSer: 0.644 ± 0.181
0.787HisThr: 0.787 ± 0.246
0.858HisVal: 0.858 ± 0.276
0.215HisTrp: 0.215 ± 0.115
1.073HisTyr: 1.073 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
5.149IleAla: 5.149 ± 0.664
0.858IleCys: 0.858 ± 0.274
5.579IleAsp: 5.579 ± 0.542
6.58IleGlu: 6.58 ± 0.731
1.931IlePhe: 1.931 ± 0.347
4.649IleGly: 4.649 ± 0.652
1.502IleHis: 1.502 ± 0.407
3.862IleIle: 3.862 ± 0.506
7.295IleLys: 7.295 ± 0.67
5.078IleLeu: 5.078 ± 0.572
1.144IleMet: 1.144 ± 0.308
4.005IleAsn: 4.005 ± 0.474
3.576IlePro: 3.576 ± 0.476
2.432IleGln: 2.432 ± 0.375
2.217IleArg: 2.217 ± 0.423
4.863IleSer: 4.863 ± 0.595
4.577IleThr: 4.577 ± 0.67
4.22IleVal: 4.22 ± 0.562
0.644IleTrp: 0.644 ± 0.207
2.074IleTyr: 2.074 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
5.507LysAla: 5.507 ± 0.714
0.93LysCys: 0.93 ± 0.333
4.005LysAsp: 4.005 ± 0.529
5.65LysGlu: 5.65 ± 0.736
3.004LysPhe: 3.004 ± 0.525
5.579LysGly: 5.579 ± 0.551
0.93LysHis: 0.93 ± 0.255
5.579LysIle: 5.579 ± 0.808
5.936LysLys: 5.936 ± 0.813
5.722LysLeu: 5.722 ± 0.544
2.432LysMet: 2.432 ± 0.411
4.077LysAsn: 4.077 ± 0.515
2.575LysPro: 2.575 ± 0.417
3.004LysGln: 3.004 ± 0.397
4.22LysArg: 4.22 ± 0.586
4.005LysSer: 4.005 ± 0.609
4.434LysThr: 4.434 ± 0.565
5.436LysVal: 5.436 ± 0.552
1.073LysTrp: 1.073 ± 0.24
2.289LysTyr: 2.289 ± 0.35
0.0LysXaa: 0.0 ± 0.0
Leu
6.651LeuAla: 6.651 ± 0.747
0.644LeuCys: 0.644 ± 0.215
5.293LeuAsp: 5.293 ± 0.616
6.151LeuGlu: 6.151 ± 0.696
2.789LeuPhe: 2.789 ± 0.492
5.865LeuGly: 5.865 ± 0.658
1.573LeuHis: 1.573 ± 0.355
5.507LeuIle: 5.507 ± 0.528
6.58LeuLys: 6.58 ± 0.724
5.793LeuLeu: 5.793 ± 0.594
2.36LeuMet: 2.36 ± 0.465
6.723LeuAsn: 6.723 ± 0.78
1.573LeuPro: 1.573 ± 0.366
1.86LeuGln: 1.86 ± 0.4
3.648LeuArg: 3.648 ± 0.524
5.078LeuSer: 5.078 ± 0.496
4.434LeuThr: 4.434 ± 0.507
4.649LeuVal: 4.649 ± 0.508
0.787LeuTrp: 0.787 ± 0.266
2.217LeuTyr: 2.217 ± 0.443
0.0LeuXaa: 0.0 ± 0.0
Met
1.86MetAla: 1.86 ± 0.373
0.572MetCys: 0.572 ± 0.199
1.144MetAsp: 1.144 ± 0.404
1.86MetGlu: 1.86 ± 0.334
1.359MetPhe: 1.359 ± 0.37
1.716MetGly: 1.716 ± 0.426
0.501MetHis: 0.501 ± 0.22
1.645MetIle: 1.645 ± 0.348
2.003MetLys: 2.003 ± 0.346
2.217MetLeu: 2.217 ± 0.4
0.715MetMet: 0.715 ± 0.212
2.503MetAsn: 2.503 ± 0.461
0.858MetPro: 0.858 ± 0.223
1.573MetGln: 1.573 ± 0.311
1.573MetArg: 1.573 ± 0.333
2.646MetSer: 2.646 ± 0.439
2.003MetThr: 2.003 ± 0.309
0.93MetVal: 0.93 ± 0.225
0.286MetTrp: 0.286 ± 0.15
0.572MetTyr: 0.572 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
4.792AsnAla: 4.792 ± 0.513
0.358AsnCys: 0.358 ± 0.136
4.506AsnAsp: 4.506 ± 0.565
4.148AsnGlu: 4.148 ± 0.554
1.86AsnPhe: 1.86 ± 0.353
4.577AsnGly: 4.577 ± 0.759
1.073AsnHis: 1.073 ± 0.305
4.649AsnIle: 4.649 ± 0.546
3.075AsnLys: 3.075 ± 0.493
4.863AsnLeu: 4.863 ± 0.566
1.86AsnMet: 1.86 ± 0.439
3.719AsnAsn: 3.719 ± 0.569
2.575AsnPro: 2.575 ± 0.407
2.289AsnGln: 2.289 ± 0.377
2.003AsnArg: 2.003 ± 0.319
3.648AsnSer: 3.648 ± 0.529
3.433AsnThr: 3.433 ± 0.407
3.361AsnVal: 3.361 ± 0.501
0.501AsnTrp: 0.501 ± 0.174
2.646AsnTyr: 2.646 ± 0.442
0.0AsnXaa: 0.0 ± 0.0
Pro
1.716ProAla: 1.716 ± 0.277
0.501ProCys: 0.501 ± 0.224
1.788ProAsp: 1.788 ± 0.298
2.575ProGlu: 2.575 ± 0.369
1.144ProPhe: 1.144 ± 0.293
0.143ProGly: 0.143 ± 0.094
0.358ProHis: 0.358 ± 0.194
2.003ProIle: 2.003 ± 0.396
2.432ProLys: 2.432 ± 0.426
2.646ProLeu: 2.646 ± 0.418
0.93ProMet: 0.93 ± 0.321
2.289ProAsn: 2.289 ± 0.332
0.644ProPro: 0.644 ± 0.214
1.43ProGln: 1.43 ± 0.308
0.787ProArg: 0.787 ± 0.229
2.074ProSer: 2.074 ± 0.399
1.86ProThr: 1.86 ± 0.396
1.788ProVal: 1.788 ± 0.281
0.215ProTrp: 0.215 ± 0.123
1.716ProTyr: 1.716 ± 0.36
0.0ProXaa: 0.0 ± 0.0
Gln
3.147GlnAla: 3.147 ± 0.516
0.143GlnCys: 0.143 ± 0.101
2.146GlnAsp: 2.146 ± 0.461
2.575GlnGlu: 2.575 ± 0.453
2.003GlnPhe: 2.003 ± 0.425
2.432GlnGly: 2.432 ± 0.414
0.715GlnHis: 0.715 ± 0.182
2.432GlnIle: 2.432 ± 0.472
3.361GlnLys: 3.361 ± 0.583
3.576GlnLeu: 3.576 ± 0.63
1.216GlnMet: 1.216 ± 0.32
1.788GlnAsn: 1.788 ± 0.305
0.787GlnPro: 0.787 ± 0.232
1.43GlnGln: 1.43 ± 0.42
1.359GlnArg: 1.359 ± 0.362
2.646GlnSer: 2.646 ± 0.444
2.074GlnThr: 2.074 ± 0.356
2.003GlnVal: 2.003 ± 0.38
0.93GlnTrp: 0.93 ± 0.274
1.502GlnTyr: 1.502 ± 0.405
0.0GlnXaa: 0.0 ± 0.0
Arg
2.36ArgAla: 2.36 ± 0.445
1.001ArgCys: 1.001 ± 0.32
1.931ArgAsp: 1.931 ± 0.391
2.646ArgGlu: 2.646 ± 0.492
2.503ArgPhe: 2.503 ± 0.338
2.217ArgGly: 2.217 ± 0.395
1.073ArgHis: 1.073 ± 0.232
3.075ArgIle: 3.075 ± 0.609
3.576ArgLys: 3.576 ± 0.431
3.004ArgLeu: 3.004 ± 0.475
0.93ArgMet: 0.93 ± 0.21
1.931ArgAsn: 1.931 ± 0.317
0.858ArgPro: 0.858 ± 0.258
1.359ArgGln: 1.359 ± 0.331
0.93ArgArg: 0.93 ± 0.205
2.789ArgSer: 2.789 ± 0.51
2.074ArgThr: 2.074 ± 0.385
2.789ArgVal: 2.789 ± 0.417
0.429ArgTrp: 0.429 ± 0.156
1.287ArgTyr: 1.287 ± 0.395
0.0ArgXaa: 0.0 ± 0.0
Ser
3.361SerAla: 3.361 ± 0.496
0.572SerCys: 0.572 ± 0.215
3.29SerAsp: 3.29 ± 0.445
3.218SerGlu: 3.218 ± 0.519
3.361SerPhe: 3.361 ± 0.485
5.006SerGly: 5.006 ± 0.613
1.216SerHis: 1.216 ± 0.293
5.364SerIle: 5.364 ± 0.594
5.006SerLys: 5.006 ± 0.682
6.365SerLeu: 6.365 ± 0.686
1.86SerMet: 1.86 ± 0.403
4.148SerAsn: 4.148 ± 0.672
1.359SerPro: 1.359 ± 0.255
2.432SerGln: 2.432 ± 0.401
2.503SerArg: 2.503 ± 0.394
3.648SerSer: 3.648 ± 0.569
3.004SerThr: 3.004 ± 0.371
4.291SerVal: 4.291 ± 0.54
0.787SerTrp: 0.787 ± 0.227
2.217SerTyr: 2.217 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
4.148ThrAla: 4.148 ± 0.56
0.787ThrCys: 0.787 ± 0.17
2.932ThrAsp: 2.932 ± 0.476
1.645ThrGlu: 1.645 ± 0.293
1.645ThrPhe: 1.645 ± 0.344
4.72ThrGly: 4.72 ± 0.515
0.858ThrHis: 0.858 ± 0.257
4.649ThrIle: 4.649 ± 0.555
3.576ThrLys: 3.576 ± 0.454
4.863ThrLeu: 4.863 ± 0.628
1.645ThrMet: 1.645 ± 0.39
2.718ThrAsn: 2.718 ± 0.512
2.217ThrPro: 2.217 ± 0.358
1.931ThrGln: 1.931 ± 0.434
2.217ThrArg: 2.217 ± 0.352
2.646ThrSer: 2.646 ± 0.561
3.29ThrThr: 3.29 ± 0.576
4.22ThrVal: 4.22 ± 0.664
1.43ThrTrp: 1.43 ± 0.348
1.502ThrTyr: 1.502 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
4.506ValAla: 4.506 ± 0.624
0.572ValCys: 0.572 ± 0.205
4.005ValAsp: 4.005 ± 0.451
4.649ValGlu: 4.649 ± 0.57
3.576ValPhe: 3.576 ± 0.433
4.935ValGly: 4.935 ± 0.649
0.572ValHis: 0.572 ± 0.172
4.792ValIle: 4.792 ± 0.654
4.434ValLys: 4.434 ± 0.546
4.649ValLeu: 4.649 ± 0.664
1.788ValMet: 1.788 ± 0.406
4.363ValAsn: 4.363 ± 0.503
1.716ValPro: 1.716 ± 0.303
2.217ValGln: 2.217 ± 0.392
2.289ValArg: 2.289 ± 0.374
4.22ValSer: 4.22 ± 0.573
3.147ValThr: 3.147 ± 0.584
4.148ValVal: 4.148 ± 0.644
1.073ValTrp: 1.073 ± 0.308
2.718ValTyr: 2.718 ± 0.434
0.0ValXaa: 0.0 ± 0.0
Trp
1.645TrpAla: 1.645 ± 0.339
0.286TrpCys: 0.286 ± 0.154
0.787TrpAsp: 0.787 ± 0.232
0.572TrpGlu: 0.572 ± 0.168
1.216TrpPhe: 1.216 ± 0.278
0.715TrpGly: 0.715 ± 0.219
0.215TrpHis: 0.215 ± 0.128
0.858TrpIle: 0.858 ± 0.218
0.715TrpLys: 0.715 ± 0.206
1.216TrpLeu: 1.216 ± 0.28
0.429TrpMet: 0.429 ± 0.164
0.787TrpAsn: 0.787 ± 0.232
0.143TrpPro: 0.143 ± 0.103
0.644TrpGln: 0.644 ± 0.198
0.858TrpArg: 0.858 ± 0.247
1.073TrpSer: 1.073 ± 0.318
0.644TrpThr: 0.644 ± 0.183
0.858TrpVal: 0.858 ± 0.218
0.358TrpTrp: 0.358 ± 0.15
0.143TrpTyr: 0.143 ± 0.107
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.217TyrAla: 2.217 ± 0.392
0.93TyrCys: 0.93 ± 0.318
2.861TyrAsp: 2.861 ± 0.423
2.146TyrGlu: 2.146 ± 0.376
2.146TyrPhe: 2.146 ± 0.483
3.075TyrGly: 3.075 ± 0.503
0.572TyrHis: 0.572 ± 0.203
2.503TyrIle: 2.503 ± 0.346
2.789TyrLys: 2.789 ± 0.404
2.718TyrLeu: 2.718 ± 0.456
1.001TyrMet: 1.001 ± 0.219
2.861TyrAsn: 2.861 ± 0.415
1.716TyrPro: 1.716 ± 0.359
1.573TyrGln: 1.573 ± 0.348
1.573TyrArg: 1.573 ± 0.375
3.576TyrSer: 3.576 ± 0.487
1.43TyrThr: 1.43 ± 0.257
1.931TyrVal: 1.931 ± 0.343
0.358TyrTrp: 0.358 ± 0.163
1.359TyrTyr: 1.359 ± 0.331
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 88 proteins (13983 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski