Amino acid dipepetide frequency for Acinetobacter phage IMEAB3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.941AlaAla: 10.941 ± 1.835
0.587AlaCys: 0.587 ± 0.161
4.406AlaAsp: 4.406 ± 0.596
5.434AlaGlu: 5.434 ± 0.529
3.304AlaPhe: 3.304 ± 0.439
6.021AlaGly: 6.021 ± 0.834
2.056AlaHis: 2.056 ± 0.322
5.287AlaIle: 5.287 ± 0.766
5.874AlaLys: 5.874 ± 0.911
7.343AlaLeu: 7.343 ± 1.286
2.57AlaMet: 2.57 ± 0.578
5.727AlaAsn: 5.727 ± 0.608
3.304AlaPro: 3.304 ± 0.475
4.038AlaGln: 4.038 ± 0.751
3.524AlaArg: 3.524 ± 0.371
5.874AlaSer: 5.874 ± 0.815
6.315AlaThr: 6.315 ± 1.2
5.507AlaVal: 5.507 ± 0.941
1.395AlaTrp: 1.395 ± 0.428
2.79AlaTyr: 2.79 ± 0.429
0.0AlaXaa: 0.0 ± 0.0
Cys
0.441CysAla: 0.441 ± 0.221
0.367CysCys: 0.367 ± 0.17
0.661CysAsp: 0.661 ± 0.219
0.734CysGlu: 0.734 ± 0.216
0.441CysPhe: 0.441 ± 0.145
0.661CysGly: 0.661 ± 0.273
0.147CysHis: 0.147 ± 0.105
0.734CysIle: 0.734 ± 0.225
0.661CysLys: 0.661 ± 0.273
0.587CysLeu: 0.587 ± 0.235
0.147CysMet: 0.147 ± 0.098
0.441CysAsn: 0.441 ± 0.179
0.808CysPro: 0.808 ± 0.331
0.514CysGln: 0.514 ± 0.221
0.661CysArg: 0.661 ± 0.289
0.514CysSer: 0.514 ± 0.224
0.367CysThr: 0.367 ± 0.138
0.661CysVal: 0.661 ± 0.253
0.073CysTrp: 0.073 ± 0.085
0.367CysTyr: 0.367 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
5.14AspAla: 5.14 ± 0.602
0.587AspCys: 0.587 ± 0.225
4.626AspAsp: 4.626 ± 0.672
4.406AspGlu: 4.406 ± 0.515
2.57AspPhe: 2.57 ± 0.429
5.434AspGly: 5.434 ± 0.68
0.441AspHis: 0.441 ± 0.155
4.406AspIle: 4.406 ± 0.51
3.231AspLys: 3.231 ± 0.538
4.92AspLeu: 4.92 ± 0.594
1.909AspMet: 1.909 ± 0.363
2.35AspAsn: 2.35 ± 0.439
1.909AspPro: 1.909 ± 0.497
2.129AspGln: 2.129 ± 0.344
3.157AspArg: 3.157 ± 0.593
2.717AspSer: 2.717 ± 0.469
2.937AspThr: 2.937 ± 0.456
3.892AspVal: 3.892 ± 0.446
1.322AspTrp: 1.322 ± 0.391
1.983AspTyr: 1.983 ± 0.309
0.0AspXaa: 0.0 ± 0.0
Glu
4.699GluAla: 4.699 ± 0.668
0.514GluCys: 0.514 ± 0.165
2.643GluAsp: 2.643 ± 0.634
2.79GluGlu: 2.79 ± 0.469
2.497GluPhe: 2.497 ± 0.375
4.552GluGly: 4.552 ± 0.68
1.395GluHis: 1.395 ± 0.327
4.185GluIle: 4.185 ± 0.744
4.259GluLys: 4.259 ± 0.528
5.507GluLeu: 5.507 ± 0.765
2.203GluMet: 2.203 ± 0.382
3.084GluAsn: 3.084 ± 0.577
2.35GluPro: 2.35 ± 0.45
3.598GluGln: 3.598 ± 0.416
3.304GluArg: 3.304 ± 0.654
2.79GluSer: 2.79 ± 0.452
3.965GluThr: 3.965 ± 0.585
2.35GluVal: 2.35 ± 0.383
1.028GluTrp: 1.028 ± 0.306
3.084GluTyr: 3.084 ± 0.483
0.0GluXaa: 0.0 ± 0.0
Phe
3.084PheAla: 3.084 ± 0.514
0.441PheCys: 0.441 ± 0.191
3.818PheAsp: 3.818 ± 0.58
2.129PheGlu: 2.129 ± 0.405
1.542PhePhe: 1.542 ± 0.372
3.965PheGly: 3.965 ± 0.484
0.147PheHis: 0.147 ± 0.102
3.011PheIle: 3.011 ± 0.477
2.276PheLys: 2.276 ± 0.4
1.762PheLeu: 1.762 ± 0.312
1.028PheMet: 1.028 ± 0.28
1.689PheAsn: 1.689 ± 0.423
1.395PhePro: 1.395 ± 0.327
0.881PheGln: 0.881 ± 0.241
1.615PheArg: 1.615 ± 0.324
1.983PheSer: 1.983 ± 0.383
3.011PheThr: 3.011 ± 0.497
2.717PheVal: 2.717 ± 0.451
0.514PheTrp: 0.514 ± 0.191
1.689PheTyr: 1.689 ± 0.369
0.0PheXaa: 0.0 ± 0.0
Gly
6.241GlyAla: 6.241 ± 0.795
0.514GlyCys: 0.514 ± 0.256
4.552GlyAsp: 4.552 ± 0.646
4.332GlyGlu: 4.332 ± 0.714
4.185GlyPhe: 4.185 ± 0.716
6.829GlyGly: 6.829 ± 1.141
1.615GlyHis: 1.615 ± 0.355
4.332GlyIle: 4.332 ± 0.453
4.479GlyLys: 4.479 ± 0.502
5.654GlyLeu: 5.654 ± 0.644
1.983GlyMet: 1.983 ± 0.353
3.451GlyAsn: 3.451 ± 0.83
0.0GlyPro: 0.0 ± 0.0
2.937GlyGln: 2.937 ± 0.759
2.423GlyArg: 2.423 ± 0.391
4.332GlySer: 4.332 ± 0.648
4.406GlyThr: 4.406 ± 0.553
6.094GlyVal: 6.094 ± 0.812
1.469GlyTrp: 1.469 ± 0.412
2.497GlyTyr: 2.497 ± 0.457
0.0GlyXaa: 0.0 ± 0.0
His
1.248HisAla: 1.248 ± 0.333
0.147HisCys: 0.147 ± 0.113
0.514HisAsp: 0.514 ± 0.147
1.028HisGlu: 1.028 ± 0.238
0.808HisPhe: 0.808 ± 0.257
0.587HisGly: 0.587 ± 0.166
0.514HisHis: 0.514 ± 0.193
1.689HisIle: 1.689 ± 0.351
0.955HisLys: 0.955 ± 0.375
1.322HisLeu: 1.322 ± 0.342
0.441HisMet: 0.441 ± 0.175
0.881HisAsn: 0.881 ± 0.314
0.367HisPro: 0.367 ± 0.195
0.514HisGln: 0.514 ± 0.251
0.808HisArg: 0.808 ± 0.288
1.101HisSer: 1.101 ± 0.283
0.808HisThr: 0.808 ± 0.283
0.734HisVal: 0.734 ± 0.261
0.147HisTrp: 0.147 ± 0.112
0.808HisTyr: 0.808 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
6.682IleAla: 6.682 ± 0.721
0.808IleCys: 0.808 ± 0.271
4.259IleAsp: 4.259 ± 0.547
5.213IleGlu: 5.213 ± 0.609
1.322IlePhe: 1.322 ± 0.418
4.406IleGly: 4.406 ± 0.545
0.661IleHis: 0.661 ± 0.251
2.864IleIle: 2.864 ± 0.47
5.434IleLys: 5.434 ± 0.526
2.864IleLeu: 2.864 ± 0.469
1.322IleMet: 1.322 ± 0.313
3.451IleAsn: 3.451 ± 0.548
3.378IlePro: 3.378 ± 0.599
1.689IleGln: 1.689 ± 0.323
2.57IleArg: 2.57 ± 0.413
4.406IleSer: 4.406 ± 0.438
4.259IleThr: 4.259 ± 0.614
4.038IleVal: 4.038 ± 0.542
1.248IleTrp: 1.248 ± 0.322
2.643IleTyr: 2.643 ± 0.491
0.0IleXaa: 0.0 ± 0.0
Lys
6.902LysAla: 6.902 ± 0.896
0.808LysCys: 0.808 ± 0.313
4.552LysAsp: 4.552 ± 0.667
4.332LysGlu: 4.332 ± 0.598
1.762LysPhe: 1.762 ± 0.41
3.745LysGly: 3.745 ± 0.562
1.248LysHis: 1.248 ± 0.38
3.524LysIle: 3.524 ± 0.458
4.699LysLys: 4.699 ± 0.799
4.699LysLeu: 4.699 ± 0.582
2.203LysMet: 2.203 ± 0.409
2.497LysAsn: 2.497 ± 0.427
2.864LysPro: 2.864 ± 0.515
2.57LysGln: 2.57 ± 0.373
2.57LysArg: 2.57 ± 0.469
2.643LysSer: 2.643 ± 0.378
4.773LysThr: 4.773 ± 0.757
3.965LysVal: 3.965 ± 0.559
0.661LysTrp: 0.661 ± 0.185
1.983LysTyr: 1.983 ± 0.372
0.0LysXaa: 0.0 ± 0.0
Leu
6.535LeuAla: 6.535 ± 1.169
0.661LeuCys: 0.661 ± 0.199
5.14LeuAsp: 5.14 ± 0.771
4.773LeuGlu: 4.773 ± 0.569
3.231LeuPhe: 3.231 ± 0.501
5.14LeuGly: 5.14 ± 0.742
1.028LeuHis: 1.028 ± 0.342
4.406LeuIle: 4.406 ± 0.768
4.552LeuLys: 4.552 ± 0.649
5.654LeuLeu: 5.654 ± 0.633
1.615LeuMet: 1.615 ± 0.346
4.773LeuAsn: 4.773 ± 0.524
3.598LeuPro: 3.598 ± 0.429
3.524LeuGln: 3.524 ± 0.539
3.745LeuArg: 3.745 ± 0.437
4.773LeuSer: 4.773 ± 0.557
5.948LeuThr: 5.948 ± 0.713
3.671LeuVal: 3.671 ± 0.587
0.587LeuTrp: 0.587 ± 0.177
2.497LeuTyr: 2.497 ± 0.409
0.0LeuXaa: 0.0 ± 0.0
Met
3.157MetAla: 3.157 ± 0.455
0.0MetCys: 0.0 ± 0.0
1.248MetAsp: 1.248 ± 0.355
1.028MetGlu: 1.028 ± 0.226
0.808MetPhe: 0.808 ± 0.251
1.836MetGly: 1.836 ± 0.399
0.22MetHis: 0.22 ± 0.153
1.395MetIle: 1.395 ± 0.263
2.276MetLys: 2.276 ± 0.502
2.717MetLeu: 2.717 ± 0.431
0.808MetMet: 0.808 ± 0.218
0.881MetAsn: 0.881 ± 0.252
1.615MetPro: 1.615 ± 0.279
1.469MetGln: 1.469 ± 0.325
0.808MetArg: 0.808 ± 0.27
2.129MetSer: 2.129 ± 0.373
1.395MetThr: 1.395 ± 0.368
1.395MetVal: 1.395 ± 0.337
0.661MetTrp: 0.661 ± 0.191
1.101MetTyr: 1.101 ± 0.268
0.0MetXaa: 0.0 ± 0.0
Asn
5.14AsnAla: 5.14 ± 0.786
0.514AsnCys: 0.514 ± 0.165
3.304AsnAsp: 3.304 ± 0.533
3.231AsnGlu: 3.231 ± 0.723
1.395AsnPhe: 1.395 ± 0.318
5.58AsnGly: 5.58 ± 1.029
0.367AsnHis: 0.367 ± 0.184
2.717AsnIle: 2.717 ± 0.485
3.011AsnLys: 3.011 ± 0.484
3.378AsnLeu: 3.378 ± 0.582
1.395AsnMet: 1.395 ± 0.312
3.745AsnAsn: 3.745 ± 0.529
3.011AsnPro: 3.011 ± 0.404
2.423AsnGln: 2.423 ± 0.508
2.35AsnArg: 2.35 ± 0.405
3.231AsnSer: 3.231 ± 0.455
3.524AsnThr: 3.524 ± 0.572
4.259AsnVal: 4.259 ± 0.559
0.734AsnTrp: 0.734 ± 0.203
2.129AsnTyr: 2.129 ± 0.573
0.0AsnXaa: 0.0 ± 0.0
Pro
2.717ProAla: 2.717 ± 0.466
0.661ProCys: 0.661 ± 0.162
2.056ProAsp: 2.056 ± 0.434
2.35ProGlu: 2.35 ± 0.463
1.469ProPhe: 1.469 ± 0.311
0.0ProGly: 0.0 ± 0.0
0.808ProHis: 0.808 ± 0.208
2.423ProIle: 2.423 ± 0.448
2.864ProLys: 2.864 ± 0.682
2.717ProLeu: 2.717 ± 0.48
1.175ProMet: 1.175 ± 0.285
2.864ProAsn: 2.864 ± 0.56
1.909ProPro: 1.909 ± 0.517
2.056ProGln: 2.056 ± 0.319
1.836ProArg: 1.836 ± 0.43
2.203ProSer: 2.203 ± 0.441
3.671ProThr: 3.671 ± 0.567
3.304ProVal: 3.304 ± 0.571
0.22ProTrp: 0.22 ± 0.127
1.469ProTyr: 1.469 ± 0.475
0.0ProXaa: 0.0 ± 0.0
Gln
3.671GlnAla: 3.671 ± 0.945
0.367GlnCys: 0.367 ± 0.197
1.615GlnAsp: 1.615 ± 0.38
1.689GlnGlu: 1.689 ± 0.354
1.689GlnPhe: 1.689 ± 0.349
2.57GlnGly: 2.57 ± 0.833
0.661GlnHis: 0.661 ± 0.236
2.35GlnIle: 2.35 ± 0.525
2.056GlnLys: 2.056 ± 0.478
4.038GlnLeu: 4.038 ± 0.437
1.836GlnMet: 1.836 ± 0.421
2.57GlnAsn: 2.57 ± 0.495
1.762GlnPro: 1.762 ± 0.405
3.671GlnGln: 3.671 ± 1.51
2.79GlnArg: 2.79 ± 0.569
2.276GlnSer: 2.276 ± 0.464
2.276GlnThr: 2.276 ± 0.418
2.79GlnVal: 2.79 ± 0.411
0.441GlnTrp: 0.441 ± 0.147
1.248GlnTyr: 1.248 ± 0.375
0.0GlnXaa: 0.0 ± 0.0
Arg
3.157ArgAla: 3.157 ± 0.498
0.514ArgCys: 0.514 ± 0.264
2.276ArgAsp: 2.276 ± 0.485
2.056ArgGlu: 2.056 ± 0.573
1.983ArgPhe: 1.983 ± 0.408
2.276ArgGly: 2.276 ± 0.372
0.955ArgHis: 0.955 ± 0.344
3.524ArgIle: 3.524 ± 0.602
2.423ArgLys: 2.423 ± 0.35
4.626ArgLeu: 4.626 ± 0.454
1.836ArgMet: 1.836 ± 0.42
2.864ArgAsn: 2.864 ± 0.439
1.469ArgPro: 1.469 ± 0.306
1.175ArgGln: 1.175 ± 0.32
2.056ArgArg: 2.056 ± 0.404
2.57ArgSer: 2.57 ± 0.275
2.35ArgThr: 2.35 ± 0.487
2.79ArgVal: 2.79 ± 0.452
0.661ArgTrp: 0.661 ± 0.214
1.542ArgTyr: 1.542 ± 0.277
0.0ArgXaa: 0.0 ± 0.0
Ser
5.654SerAla: 5.654 ± 0.763
0.294SerCys: 0.294 ± 0.128
3.965SerAsp: 3.965 ± 0.541
3.157SerGlu: 3.157 ± 0.435
2.497SerPhe: 2.497 ± 0.4
4.699SerGly: 4.699 ± 0.754
0.587SerHis: 0.587 ± 0.231
3.965SerIle: 3.965 ± 0.526
3.671SerLys: 3.671 ± 0.676
5.36SerLeu: 5.36 ± 0.559
0.881SerMet: 0.881 ± 0.238
3.524SerAsn: 3.524 ± 0.532
2.423SerPro: 2.423 ± 0.391
2.203SerGln: 2.203 ± 0.404
1.469SerArg: 1.469 ± 0.352
3.451SerSer: 3.451 ± 0.508
3.671SerThr: 3.671 ± 0.545
3.231SerVal: 3.231 ± 0.51
1.175SerTrp: 1.175 ± 0.316
2.643SerTyr: 2.643 ± 0.558
0.0SerXaa: 0.0 ± 0.0
Thr
6.608ThrAla: 6.608 ± 1.006
0.441ThrCys: 0.441 ± 0.219
3.671ThrAsp: 3.671 ± 0.438
5.36ThrGlu: 5.36 ± 0.532
2.57ThrPhe: 2.57 ± 0.414
4.552ThrGly: 4.552 ± 0.578
0.734ThrHis: 0.734 ± 0.219
4.552ThrIle: 4.552 ± 0.537
3.451ThrLys: 3.451 ± 0.645
4.846ThrLeu: 4.846 ± 0.532
1.248ThrMet: 1.248 ± 0.34
3.965ThrAsn: 3.965 ± 0.474
2.57ThrPro: 2.57 ± 0.538
2.643ThrGln: 2.643 ± 0.618
2.35ThrArg: 2.35 ± 0.407
3.598ThrSer: 3.598 ± 0.623
3.598ThrThr: 3.598 ± 0.599
5.213ThrVal: 5.213 ± 0.695
0.661ThrTrp: 0.661 ± 0.241
2.203ThrTyr: 2.203 ± 0.316
0.0ThrXaa: 0.0 ± 0.0
Val
6.388ValAla: 6.388 ± 1.009
0.881ValCys: 0.881 ± 0.276
3.304ValAsp: 3.304 ± 0.44
3.965ValGlu: 3.965 ± 0.545
2.276ValPhe: 2.276 ± 0.438
4.993ValGly: 4.993 ± 0.852
0.808ValHis: 0.808 ± 0.219
4.626ValIle: 4.626 ± 0.476
3.965ValLys: 3.965 ± 0.572
4.479ValLeu: 4.479 ± 0.439
1.469ValMet: 1.469 ± 0.378
4.552ValAsn: 4.552 ± 0.561
2.35ValPro: 2.35 ± 0.416
2.203ValGln: 2.203 ± 0.362
2.203ValArg: 2.203 ± 0.473
4.406ValSer: 4.406 ± 0.598
4.332ValThr: 4.332 ± 0.394
4.626ValVal: 4.626 ± 0.836
1.542ValTrp: 1.542 ± 0.291
2.203ValTyr: 2.203 ± 0.375
0.0ValXaa: 0.0 ± 0.0
Trp
1.689TrpAla: 1.689 ± 0.329
0.147TrpCys: 0.147 ± 0.091
0.808TrpAsp: 0.808 ± 0.249
0.734TrpGlu: 0.734 ± 0.188
0.881TrpPhe: 0.881 ± 0.235
1.542TrpGly: 1.542 ± 0.44
0.514TrpHis: 0.514 ± 0.19
0.587TrpIle: 0.587 ± 0.233
0.734TrpLys: 0.734 ± 0.27
1.248TrpLeu: 1.248 ± 0.305
0.073TrpMet: 0.073 ± 0.074
0.587TrpAsn: 0.587 ± 0.231
0.0TrpPro: 0.0 ± 0.0
0.881TrpGln: 0.881 ± 0.202
1.101TrpArg: 1.101 ± 0.394
1.028TrpSer: 1.028 ± 0.285
0.955TrpThr: 0.955 ± 0.353
1.175TrpVal: 1.175 ± 0.27
0.441TrpTrp: 0.441 ± 0.199
0.587TrpTyr: 0.587 ± 0.215
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.203TyrAla: 2.203 ± 0.393
0.734TyrCys: 0.734 ± 0.207
2.643TyrAsp: 2.643 ± 0.552
2.056TyrGlu: 2.056 ± 0.445
1.469TyrPhe: 1.469 ± 0.398
2.937TyrGly: 2.937 ± 0.54
0.514TyrHis: 0.514 ± 0.233
2.937TyrIle: 2.937 ± 0.453
2.129TyrLys: 2.129 ± 0.397
2.129TyrLeu: 2.129 ± 0.318
0.734TyrMet: 0.734 ± 0.304
1.542TyrAsn: 1.542 ± 0.299
1.689TyrPro: 1.689 ± 0.505
1.248TyrGln: 1.248 ± 0.314
1.762TyrArg: 1.762 ± 0.386
2.497TyrSer: 2.497 ± 0.446
2.276TyrThr: 2.276 ± 0.449
3.231TyrVal: 3.231 ± 0.465
0.734TyrTrp: 0.734 ± 0.281
1.322TyrTyr: 1.322 ± 0.47
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13620 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski