Amino acid dipeptide frequency for Acinetobacter phage YMC/09/02/B1251

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
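The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify` are hypothetical, pairs are taken as overlapping windows within each protein, and normalizing by the number of windows is an assumption (the database may normalize differently, for example by total residue count).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs are counted within a protein, never across protein boundaries.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, n_letters=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 pairs)."""
    expected = 1000.0 / n_letters ** 2
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


toy = ["MKALAK", "ALAKK"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_permille(toy)
print(freqs["AL"], classify(freqs["AL"]))
```

Applied to values from the table, `classify(6.829)` (AlaAla) returns "over" and `classify(0.808)` (AlaCys) returns "under", which corresponds to the red and blue marking described above.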

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
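As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and into percent. The helper names are hypothetical and exist only for illustration; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 6.829  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))  # the same value as a fraction
print(permille_to_percent(ala_ala))   # the same value in percent
print(permille_to_percent(2.5))       # uniform expectation of 2.5 per mille in percent
```

The last line shows that the 0.25% expectation for 400 equally likely dipeptides corresponds to 2.5‰ on the scale used in the table.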

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by the first residue of the pair. Within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 6.829 ± 1.099
AlaCys: 0.808 ± 0.288
AlaAsp: 3.965 ± 0.548
AlaGlu: 6.535 ± 0.846
AlaPhe: 3.157 ± 0.489
AlaGly: 4.038 ± 0.662
AlaHis: 1.248 ± 0.35
AlaIle: 4.92 ± 0.728
AlaLys: 6.682 ± 0.717
AlaLeu: 7.49 ± 0.853
AlaMet: 2.79 ± 0.459
AlaAsn: 4.479 ± 0.549
AlaPro: 2.056 ± 0.465
AlaGln: 3.818 ± 1.049
AlaArg: 4.038 ± 0.552
AlaSer: 4.626 ± 0.943
AlaThr: 4.993 ± 0.906
AlaVal: 4.92 ± 0.577
AlaTrp: 1.248 ± 0.334
AlaTyr: 2.57 ± 0.511
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.514 ± 0.222
CysCys: 0.22 ± 0.139
CysAsp: 0.587 ± 0.226
CysGlu: 1.322 ± 0.335
CysPhe: 0.294 ± 0.138
CysGly: 0.661 ± 0.29
CysHis: 0.22 ± 0.118
CysIle: 0.587 ± 0.185
CysLys: 0.441 ± 0.197
CysLeu: 0.955 ± 0.33
CysMet: 0.22 ± 0.147
CysAsn: 0.367 ± 0.146
CysPro: 0.22 ± 0.12
CysGln: 0.514 ± 0.23
CysArg: 0.587 ± 0.224
CysSer: 0.294 ± 0.143
CysThr: 0.661 ± 0.249
CysVal: 0.441 ± 0.17
CysTrp: 0.0 ± 0.0
CysTyr: 0.22 ± 0.144
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.14 ± 0.672
AspCys: 0.367 ± 0.167
AspAsp: 3.378 ± 0.601
AspGlu: 3.965 ± 0.518
AspPhe: 3.231 ± 0.399
AspGly: 4.92 ± 0.701
AspHis: 1.175 ± 0.311
AspIle: 3.892 ± 0.641
AspLys: 3.671 ± 0.407
AspLeu: 5.507 ± 0.52
AspMet: 1.248 ± 0.295
AspAsn: 2.35 ± 0.389
AspPro: 2.203 ± 0.453
AspGln: 2.203 ± 0.325
AspArg: 2.129 ± 0.386
AspSer: 3.524 ± 0.595
AspThr: 2.423 ± 0.416
AspVal: 3.818 ± 0.791
AspTrp: 0.514 ± 0.208
AspTyr: 2.864 ± 0.444
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.388 ± 0.826
GluCys: 0.514 ± 0.2
GluAsp: 2.643 ± 0.414
GluGlu: 4.699 ± 0.67
GluPhe: 3.671 ± 0.522
GluGly: 3.378 ± 0.395
GluHis: 1.395 ± 0.381
GluIle: 5.948 ± 0.609
GluLys: 5.58 ± 0.776
GluLeu: 8.004 ± 0.711
GluMet: 1.689 ± 0.373
GluAsn: 4.038 ± 0.706
GluPro: 1.689 ± 0.392
GluGln: 3.965 ± 0.792
GluArg: 3.892 ± 0.534
GluSer: 3.892 ± 0.528
GluThr: 2.937 ± 0.494
GluVal: 4.92 ± 0.55
GluTrp: 0.661 ± 0.273
GluTyr: 2.57 ± 0.448
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.717 ± 0.422
PheCys: 1.175 ± 0.338
PheAsp: 2.497 ± 0.537
PheGlu: 3.231 ± 0.453
PhePhe: 1.615 ± 0.44
PheGly: 2.276 ± 0.385
PheHis: 0.955 ± 0.253
PheIle: 2.937 ± 0.441
PheLys: 3.598 ± 0.562
PheLeu: 3.157 ± 0.719
PheMet: 0.661 ± 0.175
PheAsn: 2.57 ± 0.429
PhePro: 1.322 ± 0.338
PheGln: 1.983 ± 0.304
PheArg: 1.175 ± 0.275
PheSer: 1.983 ± 0.354
PheThr: 2.643 ± 0.475
PheVal: 2.276 ± 0.466
PheTrp: 0.367 ± 0.189
PheTyr: 1.322 ± 0.255
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.14 ± 1.12
GlyCys: 0.367 ± 0.129
GlyAsp: 3.671 ± 0.577
GlyGlu: 4.406 ± 0.515
GlyPhe: 3.671 ± 0.733
GlyGly: 6.315 ± 0.846
GlyHis: 0.661 ± 0.197
GlyIle: 3.524 ± 0.477
GlyLys: 4.846 ± 0.719
GlyLeu: 5.36 ± 0.659
GlyMet: 1.175 ± 0.315
GlyAsn: 2.35 ± 0.395
GlyPro: 1.615 ± 0.414
GlyGln: 2.717 ± 0.506
GlyArg: 2.864 ± 0.521
GlySer: 3.671 ± 0.536
GlyThr: 3.157 ± 0.724
GlyVal: 3.598 ± 0.515
GlyTrp: 1.248 ± 0.361
GlyTyr: 2.937 ± 0.478
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.028 ± 0.271
HisCys: 0.147 ± 0.092
HisAsp: 1.322 ± 0.333
HisGlu: 1.615 ± 0.412
HisPhe: 0.808 ± 0.234
HisGly: 0.441 ± 0.162
HisHis: 0.294 ± 0.127
HisIle: 1.322 ± 0.431
HisLys: 1.322 ± 0.399
HisLeu: 2.423 ± 0.486
HisMet: 0.514 ± 0.178
HisAsn: 0.587 ± 0.258
HisPro: 0.587 ± 0.176
HisGln: 0.587 ± 0.173
HisArg: 0.441 ± 0.165
HisSer: 0.734 ± 0.294
HisThr: 0.514 ± 0.215
HisVal: 0.881 ± 0.258
HisTrp: 0.22 ± 0.117
HisTyr: 0.661 ± 0.24
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.14 ± 0.795
IleCys: 0.441 ± 0.192
IleAsp: 4.626 ± 0.62
IleGlu: 5.287 ± 0.733
IlePhe: 1.395 ± 0.287
IleGly: 3.304 ± 0.474
IleHis: 0.514 ± 0.186
IleIle: 3.084 ± 0.492
IleLys: 4.846 ± 0.662
IleLeu: 3.304 ± 0.66
IleMet: 0.881 ± 0.214
IleAsn: 3.892 ± 0.661
IlePro: 2.864 ± 0.405
IleGln: 3.157 ± 0.642
IleArg: 2.643 ± 0.532
IleSer: 3.892 ± 0.505
IleThr: 4.479 ± 0.521
IleVal: 3.818 ± 0.408
IleTrp: 0.367 ± 0.171
IleTyr: 3.084 ± 0.63
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.902 ± 1.189
LysCys: 0.294 ± 0.168
LysAsp: 4.773 ± 0.701
LysGlu: 6.315 ± 0.99
LysPhe: 2.57 ± 0.452
LysGly: 4.332 ± 0.532
LysHis: 1.836 ± 0.383
LysIle: 5.58 ± 0.766
LysLys: 5.287 ± 0.919
LysLeu: 7.196 ± 0.735
LysMet: 1.983 ± 0.395
LysAsn: 3.011 ± 0.433
LysPro: 3.378 ± 0.701
LysGln: 4.185 ± 0.989
LysArg: 3.818 ± 0.514
LysSer: 4.479 ± 0.575
LysThr: 4.552 ± 0.503
LysVal: 4.699 ± 0.454
LysTrp: 1.101 ± 0.28
LysTyr: 2.276 ± 0.432
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.857 ± 0.893
LeuCys: 0.881 ± 0.265
LeuAsp: 5.36 ± 0.724
LeuGlu: 5.213 ± 0.75
LeuPhe: 2.35 ± 0.43
LeuGly: 5.654 ± 0.66
LeuHis: 1.248 ± 0.316
LeuIle: 5.58 ± 0.829
LeuLys: 7.563 ± 0.964
LeuLeu: 6.168 ± 0.696
LeuMet: 1.983 ± 0.323
LeuAsn: 6.168 ± 0.685
LeuPro: 3.231 ± 0.618
LeuGln: 3.524 ± 0.536
LeuArg: 4.185 ± 0.49
LeuSer: 5.654 ± 0.58
LeuThr: 5.948 ± 0.657
LeuVal: 5.36 ± 0.634
LeuTrp: 0.587 ± 0.192
LeuTyr: 2.864 ± 0.583
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.056 ± 0.378
MetCys: 0.073 ± 0.063
MetAsp: 2.056 ± 0.4
MetGlu: 0.881 ± 0.213
MetPhe: 0.955 ± 0.261
MetGly: 1.762 ± 0.37
MetHis: 0.073 ± 0.077
MetIle: 1.028 ± 0.219
MetLys: 1.689 ± 0.474
MetLeu: 1.836 ± 0.287
MetMet: 0.514 ± 0.162
MetAsn: 1.322 ± 0.351
MetPro: 1.175 ± 0.266
MetGln: 0.881 ± 0.22
MetArg: 1.248 ± 0.399
MetSer: 1.469 ± 0.342
MetThr: 1.615 ± 0.325
MetVal: 1.395 ± 0.398
MetTrp: 0.0 ± 0.0
MetTyr: 0.294 ± 0.165
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.112 ± 0.564
AsnCys: 0.881 ± 0.276
AsnAsp: 3.084 ± 0.566
AsnGlu: 4.259 ± 0.4
AsnPhe: 2.35 ± 0.505
AsnGly: 4.626 ± 0.76
AsnHis: 0.734 ± 0.328
AsnIle: 2.203 ± 0.471
AsnLys: 3.304 ± 0.403
AsnLeu: 4.773 ± 0.597
AsnMet: 1.469 ± 0.289
AsnAsn: 3.818 ± 0.647
AsnPro: 2.497 ± 0.476
AsnGln: 2.423 ± 0.457
AsnArg: 1.395 ± 0.286
AsnSer: 3.965 ± 0.596
AsnThr: 3.231 ± 0.444
AsnVal: 3.011 ± 0.556
AsnTrp: 1.028 ± 0.312
AsnTyr: 1.836 ± 0.353
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.818 ± 0.608
ProCys: 0.073 ± 0.074
ProAsp: 3.231 ± 0.558
ProGlu: 3.084 ± 0.527
ProPhe: 1.395 ± 0.288
ProGly: 1.395 ± 0.545
ProHis: 0.367 ± 0.164
ProIle: 2.203 ± 0.436
ProLys: 3.157 ± 0.472
ProLeu: 2.79 ± 0.561
ProMet: 0.808 ± 0.217
ProAsn: 1.909 ± 0.42
ProPro: 1.615 ± 0.366
ProGln: 1.322 ± 0.261
ProArg: 0.881 ± 0.308
ProSer: 2.203 ± 0.383
ProThr: 2.129 ± 0.442
ProVal: 2.35 ± 0.398
ProTrp: 0.147 ± 0.112
ProTyr: 1.248 ± 0.32
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.479 ± 0.721
GlnCys: 0.367 ± 0.181
GlnAsp: 2.203 ± 0.38
GlnGlu: 3.892 ± 0.826
GlnPhe: 0.955 ± 0.293
GlnGly: 2.35 ± 0.377
GlnHis: 1.395 ± 0.303
GlnIle: 2.937 ± 0.471
GlnLys: 3.818 ± 0.653
GlnLeu: 3.818 ± 0.394
GlnMet: 0.881 ± 0.25
GlnAsn: 2.643 ± 0.403
GlnPro: 0.955 ± 0.281
GlnGln: 2.056 ± 0.448
GlnArg: 1.836 ± 0.407
GlnSer: 2.643 ± 0.574
GlnThr: 3.157 ± 0.642
GlnVal: 3.598 ± 0.635
GlnTrp: 0.587 ± 0.181
GlnTyr: 1.909 ± 0.382
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.717 ± 0.369
ArgCys: 0.587 ± 0.211
ArgAsp: 2.203 ± 0.424
ArgGlu: 2.57 ± 0.385
ArgPhe: 2.056 ± 0.39
ArgGly: 2.864 ± 0.505
ArgHis: 0.734 ± 0.225
ArgIle: 3.011 ± 0.465
ArgLys: 3.524 ± 0.48
ArgLeu: 5.287 ± 0.521
ArgMet: 1.028 ± 0.26
ArgAsn: 2.717 ± 0.562
ArgPro: 1.322 ± 0.27
ArgGln: 2.423 ± 0.505
ArgArg: 2.276 ± 0.298
ArgSer: 2.056 ± 0.498
ArgThr: 2.129 ± 0.405
ArgVal: 2.57 ± 0.372
ArgTrp: 0.587 ± 0.142
ArgTyr: 1.615 ± 0.408
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.406 ± 0.923
SerCys: 0.367 ± 0.157
SerAsp: 2.864 ± 0.573
SerGlu: 4.038 ± 0.58
SerPhe: 3.451 ± 0.581
SerGly: 4.552 ± 0.551
SerHis: 1.028 ± 0.241
SerIle: 3.231 ± 0.456
SerLys: 6.094 ± 0.702
SerLeu: 4.993 ± 0.63
SerMet: 1.028 ± 0.253
SerAsn: 3.157 ± 0.503
SerPro: 2.35 ± 0.493
SerGln: 2.864 ± 0.479
SerArg: 2.423 ± 0.504
SerSer: 5.727 ± 0.769
SerThr: 2.35 ± 0.489
SerVal: 4.479 ± 0.612
SerTrp: 0.587 ± 0.19
SerTyr: 1.615 ± 0.343
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.038 ± 0.658
ThrCys: 0.587 ± 0.233
ThrAsp: 3.084 ± 0.549
ThrGlu: 3.304 ± 0.396
ThrPhe: 2.57 ± 0.529
ThrGly: 3.892 ± 0.551
ThrHis: 1.175 ± 0.273
ThrIle: 3.671 ± 0.555
ThrLys: 4.259 ± 0.447
ThrLeu: 4.846 ± 0.572
ThrMet: 0.367 ± 0.16
ThrAsn: 3.231 ± 0.767
ThrPro: 3.304 ± 0.59
ThrGln: 2.35 ± 0.566
ThrArg: 2.643 ± 0.426
ThrSer: 3.084 ± 0.369
ThrThr: 2.717 ± 0.7
ThrVal: 3.965 ± 0.695
ThrTrp: 0.661 ± 0.231
ThrTyr: 1.762 ± 0.385
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.626 ± 0.857
ValCys: 0.734 ± 0.27
ValAsp: 3.671 ± 0.635
ValGlu: 4.259 ± 0.6
ValPhe: 2.35 ± 0.53
ValGly: 3.598 ± 0.48
ValHis: 0.514 ± 0.171
ValIle: 2.643 ± 0.448
ValLys: 5.287 ± 0.672
ValLeu: 5.434 ± 0.76
ValMet: 2.129 ± 0.365
ValAsn: 3.965 ± 0.701
ValPro: 2.57 ± 0.471
ValGln: 3.157 ± 0.594
ValArg: 2.937 ± 0.519
ValSer: 4.259 ± 0.591
ValThr: 3.378 ± 0.652
ValVal: 4.626 ± 0.64
ValTrp: 1.028 ± 0.243
ValTyr: 2.129 ± 0.384
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.734 ± 0.26
TrpCys: 0.294 ± 0.176
TrpAsp: 0.808 ± 0.264
TrpGlu: 0.808 ± 0.321
TrpPhe: 0.294 ± 0.16
TrpGly: 0.587 ± 0.212
TrpHis: 0.22 ± 0.13
TrpIle: 0.955 ± 0.306
TrpLys: 0.514 ± 0.227
TrpLeu: 0.808 ± 0.229
TrpMet: 0.294 ± 0.152
TrpAsn: 0.587 ± 0.219
TrpPro: 0.367 ± 0.163
TrpGln: 0.808 ± 0.27
TrpArg: 0.661 ± 0.213
TrpSer: 0.808 ± 0.242
TrpThr: 0.661 ± 0.236
TrpVal: 0.441 ± 0.177
TrpTrp: 0.147 ± 0.113
TrpTyr: 0.514 ± 0.151
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.643 ± 0.344
TyrCys: 0.147 ± 0.108
TyrAsp: 2.497 ± 0.372
TyrGlu: 2.79 ± 0.535
TyrPhe: 1.322 ± 0.333
TyrGly: 2.35 ± 0.597
TyrHis: 0.661 ± 0.21
TyrIle: 1.762 ± 0.332
TyrLys: 3.084 ± 0.465
TyrLeu: 3.231 ± 0.488
TyrMet: 0.587 ± 0.181
TyrAsn: 1.836 ± 0.36
TyrPro: 0.808 ± 0.275
TyrGln: 1.542 ± 0.32
TyrArg: 2.203 ± 0.517
TyrSer: 2.643 ± 0.408
TyrThr: 1.836 ± 0.378
TyrVal: 2.129 ± 0.481
TyrTrp: 0.22 ± 0.128
TyrTyr: 1.101 ± 0.262
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 62 proteins (13620 amino acids)

Note: The errors were estimated by bootstrapping (100 replicates) at the protein level.
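A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used by Proteome-pI is not given here, so this is an illustration under assumptions: whole proteins are resampled with replacement, the error is taken as the standard deviation across replicates, and frequencies are normalized by the number of overlapping pairs. The function names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs in the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKALAK", "ALAKK", "GGALGG", "KKKK"]  # made-up sequences, not from the proteome
print(pair_permille(toy, "AL"), bootstrap_error(toy, "AL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.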

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski