Amino acid dipepetide frequency for Mycobacterium phage Flathead

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.82AlaAla: 13.82 ± 1.668
0.936AlaCys: 0.936 ± 0.239
6.993AlaAsp: 6.993 ± 0.705
7.984AlaGlu: 7.984 ± 0.68
2.588AlaPhe: 2.588 ± 0.443
8.865AlaGly: 8.865 ± 1.261
2.643AlaHis: 2.643 ± 0.462
4.295AlaIle: 4.295 ± 0.557
3.689AlaLys: 3.689 ± 0.465
8.149AlaLeu: 8.149 ± 0.832
2.808AlaMet: 2.808 ± 0.444
3.193AlaAsn: 3.193 ± 0.455
5.396AlaPro: 5.396 ± 0.576
3.799AlaGln: 3.799 ± 0.516
6.883AlaArg: 6.883 ± 0.66
6.057AlaSer: 6.057 ± 0.621
6.112AlaThr: 6.112 ± 0.605
6.993AlaVal: 6.993 ± 0.517
2.918AlaTrp: 2.918 ± 0.481
2.202AlaTyr: 2.202 ± 0.29
0.0AlaXaa: 0.0 ± 0.0
Cys
1.046CysAla: 1.046 ± 0.267
0.165CysCys: 0.165 ± 0.097
1.156CysAsp: 1.156 ± 0.28
0.716CysGlu: 0.716 ± 0.208
0.275CysPhe: 0.275 ± 0.127
1.652CysGly: 1.652 ± 0.36
0.385CysHis: 0.385 ± 0.151
0.385CysIle: 0.385 ± 0.129
0.275CysLys: 0.275 ± 0.128
0.881CysLeu: 0.881 ± 0.245
0.275CysMet: 0.275 ± 0.111
0.496CysAsn: 0.496 ± 0.162
1.542CysPro: 1.542 ± 0.363
0.496CysGln: 0.496 ± 0.187
0.881CysArg: 0.881 ± 0.256
0.496CysSer: 0.496 ± 0.135
0.936CysThr: 0.936 ± 0.231
0.496CysVal: 0.496 ± 0.162
0.33CysTrp: 0.33 ± 0.141
0.165CysTyr: 0.165 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
6.607AspAla: 6.607 ± 0.523
1.046AspCys: 1.046 ± 0.286
4.405AspAsp: 4.405 ± 0.448
3.579AspGlu: 3.579 ± 0.508
1.817AspPhe: 1.817 ± 0.264
6.497AspGly: 6.497 ± 0.584
1.432AspHis: 1.432 ± 0.236
2.423AspIle: 2.423 ± 0.394
1.982AspLys: 1.982 ± 0.335
6.112AspLeu: 6.112 ± 0.544
1.156AspMet: 1.156 ± 0.277
1.652AspAsn: 1.652 ± 0.307
4.735AspPro: 4.735 ± 0.54
2.643AspGln: 2.643 ± 0.304
5.286AspArg: 5.286 ± 0.594
3.689AspSer: 3.689 ± 0.454
3.579AspThr: 3.579 ± 0.407
4.46AspVal: 4.46 ± 0.527
1.597AspTrp: 1.597 ± 0.305
1.762AspTyr: 1.762 ± 0.311
0.0AspXaa: 0.0 ± 0.0
Glu
6.662GluAla: 6.662 ± 0.746
0.881GluCys: 0.881 ± 0.239
3.854GluAsp: 3.854 ± 0.386
2.588GluGlu: 2.588 ± 0.4
1.982GluPhe: 1.982 ± 0.272
2.863GluGly: 2.863 ± 0.365
1.266GluHis: 1.266 ± 0.334
2.863GluIle: 2.863 ± 0.485
1.707GluLys: 1.707 ± 0.363
4.9GluLeu: 4.9 ± 0.607
2.037GluMet: 2.037 ± 0.361
1.982GluAsn: 1.982 ± 0.29
2.588GluPro: 2.588 ± 0.416
2.753GluGln: 2.753 ± 0.353
4.405GluArg: 4.405 ± 0.449
2.918GluSer: 2.918 ± 0.404
4.24GluThr: 4.24 ± 0.508
3.524GluVal: 3.524 ± 0.486
0.881GluTrp: 0.881 ± 0.191
1.927GluTyr: 1.927 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
3.083PheAla: 3.083 ± 0.405
0.33PheCys: 0.33 ± 0.13
2.368PheAsp: 2.368 ± 0.308
1.156PheGlu: 1.156 ± 0.278
0.881PhePhe: 0.881 ± 0.245
3.579PheGly: 3.579 ± 0.746
0.275PheHis: 0.275 ± 0.114
1.321PheIle: 1.321 ± 0.365
1.321PheLys: 1.321 ± 0.303
1.872PheLeu: 1.872 ± 0.283
0.661PheMet: 0.661 ± 0.236
1.156PheAsn: 1.156 ± 0.328
1.817PhePro: 1.817 ± 0.321
1.046PheGln: 1.046 ± 0.277
1.597PheArg: 1.597 ± 0.276
1.652PheSer: 1.652 ± 0.264
1.927PheThr: 1.927 ± 0.347
1.762PheVal: 1.762 ± 0.336
0.606PheTrp: 0.606 ± 0.176
0.991PheTyr: 0.991 ± 0.235
0.0PheXaa: 0.0 ± 0.0
Gly
9.966GlyAla: 9.966 ± 1.311
0.881GlyCys: 0.881 ± 0.225
6.332GlyAsp: 6.332 ± 0.604
3.469GlyGlu: 3.469 ± 0.452
2.753GlyPhe: 2.753 ± 0.474
11.122GlyGly: 11.122 ± 2.935
1.652GlyHis: 1.652 ± 0.296
4.68GlyIle: 4.68 ± 0.556
2.863GlyLys: 2.863 ± 0.467
5.891GlyLeu: 5.891 ± 0.588
2.257GlyMet: 2.257 ± 0.485
2.918GlyAsn: 2.918 ± 0.409
4.019GlyPro: 4.019 ± 0.509
2.037GlyGln: 2.037 ± 0.517
5.01GlyArg: 5.01 ± 0.675
5.506GlySer: 5.506 ± 0.935
6.332GlyThr: 6.332 ± 0.603
5.891GlyVal: 5.891 ± 0.594
2.698GlyTrp: 2.698 ± 0.357
2.313GlyTyr: 2.313 ± 0.455
0.0GlyXaa: 0.0 ± 0.0
His
1.872HisAla: 1.872 ± 0.326
0.385HisCys: 0.385 ± 0.171
0.936HisAsp: 0.936 ± 0.268
1.266HisGlu: 1.266 ± 0.247
0.385HisPhe: 0.385 ± 0.122
1.927HisGly: 1.927 ± 0.336
1.101HisHis: 1.101 ± 0.286
1.321HisIle: 1.321 ± 0.273
0.606HisLys: 0.606 ± 0.179
1.266HisLeu: 1.266 ± 0.265
0.661HisMet: 0.661 ± 0.19
0.661HisAsn: 0.661 ± 0.18
1.707HisPro: 1.707 ± 0.351
0.936HisGln: 0.936 ± 0.285
1.982HisArg: 1.982 ± 0.35
0.771HisSer: 0.771 ± 0.207
1.707HisThr: 1.707 ± 0.354
1.542HisVal: 1.542 ± 0.289
0.496HisTrp: 0.496 ± 0.181
0.826HisTyr: 0.826 ± 0.199
0.0HisXaa: 0.0 ± 0.0
Ile
5.341IleAla: 5.341 ± 0.559
0.716IleCys: 0.716 ± 0.254
3.524IleAsp: 3.524 ± 0.396
3.359IleGlu: 3.359 ± 0.404
0.881IlePhe: 0.881 ± 0.253
3.744IleGly: 3.744 ± 0.467
1.377IleHis: 1.377 ± 0.3
1.321IleIle: 1.321 ± 0.263
0.991IleLys: 0.991 ± 0.244
2.423IleLeu: 2.423 ± 0.367
0.385IleMet: 0.385 ± 0.142
1.982IleAsn: 1.982 ± 0.284
3.249IlePro: 3.249 ± 0.353
1.542IleGln: 1.542 ± 0.286
2.423IleArg: 2.423 ± 0.383
2.257IleSer: 2.257 ± 0.396
3.689IleThr: 3.689 ± 0.452
3.414IleVal: 3.414 ± 0.395
1.046IleTrp: 1.046 ± 0.226
0.606IleTyr: 0.606 ± 0.171
0.0IleXaa: 0.0 ± 0.0
Lys
3.469LysAla: 3.469 ± 0.381
0.496LysCys: 0.496 ± 0.195
1.432LysAsp: 1.432 ± 0.295
1.266LysGlu: 1.266 ± 0.277
1.156LysPhe: 1.156 ± 0.231
3.028LysGly: 3.028 ± 0.376
1.156LysHis: 1.156 ± 0.25
0.826LysIle: 0.826 ± 0.282
1.101LysLys: 1.101 ± 0.295
2.698LysLeu: 2.698 ± 0.42
0.661LysMet: 0.661 ± 0.142
0.826LysAsn: 0.826 ± 0.213
2.643LysPro: 2.643 ± 0.36
1.542LysGln: 1.542 ± 0.284
2.037LysArg: 2.037 ± 0.365
1.817LysSer: 1.817 ± 0.337
2.313LysThr: 2.313 ± 0.433
2.092LysVal: 2.092 ± 0.389
0.771LysTrp: 0.771 ± 0.201
0.771LysTyr: 0.771 ± 0.235
0.0LysXaa: 0.0 ± 0.0
Leu
7.763LeuAla: 7.763 ± 0.677
0.881LeuCys: 0.881 ± 0.197
5.01LeuAsp: 5.01 ± 0.568
3.744LeuGlu: 3.744 ± 0.475
2.092LeuPhe: 2.092 ± 0.273
5.616LeuGly: 5.616 ± 0.573
0.771LeuHis: 0.771 ± 0.25
3.579LeuIle: 3.579 ± 0.477
2.092LeuLys: 2.092 ± 0.379
4.405LeuLeu: 4.405 ± 0.528
1.211LeuMet: 1.211 ± 0.268
2.808LeuAsn: 2.808 ± 0.381
5.066LeuPro: 5.066 ± 0.608
2.588LeuGln: 2.588 ± 0.426
5.616LeuArg: 5.616 ± 0.515
5.616LeuSer: 5.616 ± 0.628
5.561LeuThr: 5.561 ± 0.566
5.451LeuVal: 5.451 ± 0.626
1.266LeuTrp: 1.266 ± 0.286
1.872LeuTyr: 1.872 ± 0.342
0.0LeuXaa: 0.0 ± 0.0
Met
1.707MetAla: 1.707 ± 0.329
0.165MetCys: 0.165 ± 0.091
1.211MetAsp: 1.211 ± 0.251
0.771MetGlu: 0.771 ± 0.158
0.716MetPhe: 0.716 ± 0.219
2.037MetGly: 2.037 ± 0.352
0.22MetHis: 0.22 ± 0.098
1.101MetIle: 1.101 ± 0.235
0.716MetLys: 0.716 ± 0.218
1.652MetLeu: 1.652 ± 0.23
0.496MetMet: 0.496 ± 0.199
1.211MetAsn: 1.211 ± 0.249
1.707MetPro: 1.707 ± 0.293
0.661MetGln: 0.661 ± 0.173
1.321MetArg: 1.321 ± 0.239
2.973MetSer: 2.973 ± 0.437
1.872MetThr: 1.872 ± 0.271
1.377MetVal: 1.377 ± 0.347
0.606MetTrp: 0.606 ± 0.185
0.33MetTyr: 0.33 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
3.634AsnAla: 3.634 ± 0.443
0.275AsnCys: 0.275 ± 0.117
1.707AsnAsp: 1.707 ± 0.318
1.762AsnGlu: 1.762 ± 0.306
0.881AsnPhe: 0.881 ± 0.287
3.799AsnGly: 3.799 ± 0.56
1.156AsnHis: 1.156 ± 0.253
1.432AsnIle: 1.432 ± 0.443
1.101AsnLys: 1.101 ± 0.236
2.698AsnLeu: 2.698 ± 0.349
0.716AsnMet: 0.716 ± 0.18
1.707AsnAsn: 1.707 ± 0.358
2.643AsnPro: 2.643 ± 0.39
1.156AsnGln: 1.156 ± 0.308
2.037AsnArg: 2.037 ± 0.344
1.707AsnSer: 1.707 ± 0.28
2.423AsnThr: 2.423 ± 0.37
1.707AsnVal: 1.707 ± 0.288
0.826AsnTrp: 0.826 ± 0.16
0.44AsnTyr: 0.44 ± 0.141
0.0AsnXaa: 0.0 ± 0.0
Pro
6.057ProAla: 6.057 ± 0.614
1.046ProCys: 1.046 ± 0.308
4.735ProAsp: 4.735 ± 0.542
4.295ProGlu: 4.295 ± 0.481
1.982ProPhe: 1.982 ± 0.309
6.607ProGly: 6.607 ± 0.685
1.487ProHis: 1.487 ± 0.278
2.092ProIle: 2.092 ± 0.26
1.872ProLys: 1.872 ± 0.332
4.68ProLeu: 4.68 ± 0.493
1.266ProMet: 1.266 ± 0.29
2.257ProAsn: 2.257 ± 0.326
3.799ProPro: 3.799 ± 0.576
2.533ProGln: 2.533 ± 0.397
3.359ProArg: 3.359 ± 0.482
2.973ProSer: 2.973 ± 0.46
3.359ProThr: 3.359 ± 0.419
4.955ProVal: 4.955 ± 0.483
1.156ProTrp: 1.156 ± 0.237
1.707ProTyr: 1.707 ± 0.277
0.0ProXaa: 0.0 ± 0.0
Gln
5.066GlnAla: 5.066 ± 0.604
0.275GlnCys: 0.275 ± 0.134
1.542GlnAsp: 1.542 ± 0.285
1.707GlnGlu: 1.707 ± 0.381
0.991GlnPhe: 0.991 ± 0.2
2.257GlnGly: 2.257 ± 0.47
0.716GlnHis: 0.716 ± 0.205
2.092GlnIle: 2.092 ± 0.36
1.487GlnLys: 1.487 ± 0.261
3.304GlnLeu: 3.304 ± 0.483
0.551GlnMet: 0.551 ± 0.16
0.991GlnAsn: 0.991 ± 0.232
2.478GlnPro: 2.478 ± 0.416
1.707GlnGln: 1.707 ± 0.424
2.423GlnArg: 2.423 ± 0.34
2.698GlnSer: 2.698 ± 0.347
1.487GlnThr: 1.487 ± 0.346
2.643GlnVal: 2.643 ± 0.456
0.606GlnTrp: 0.606 ± 0.165
1.101GlnTyr: 1.101 ± 0.293
0.0GlnXaa: 0.0 ± 0.0
Arg
5.946ArgAla: 5.946 ± 0.59
1.542ArgCys: 1.542 ± 0.405
4.405ArgAsp: 4.405 ± 0.677
5.341ArgGlu: 5.341 ± 0.631
2.533ArgPhe: 2.533 ± 0.377
4.35ArgGly: 4.35 ± 0.459
1.266ArgHis: 1.266 ± 0.288
3.469ArgIle: 3.469 ± 0.386
1.927ArgLys: 1.927 ± 0.326
4.955ArgLeu: 4.955 ± 0.549
2.588ArgMet: 2.588 ± 0.42
2.202ArgAsn: 2.202 ± 0.427
3.524ArgPro: 3.524 ± 0.458
2.423ArgGln: 2.423 ± 0.402
5.286ArgArg: 5.286 ± 0.768
3.909ArgSer: 3.909 ± 0.448
3.359ArgThr: 3.359 ± 0.516
4.625ArgVal: 4.625 ± 0.538
1.542ArgTrp: 1.542 ± 0.298
1.927ArgTyr: 1.927 ± 0.32
0.0ArgXaa: 0.0 ± 0.0
Ser
6.772SerAla: 6.772 ± 0.993
0.606SerCys: 0.606 ± 0.194
3.909SerAsp: 3.909 ± 0.458
3.359SerGlu: 3.359 ± 0.496
2.257SerPhe: 2.257 ± 0.463
6.552SerGly: 6.552 ± 0.821
1.321SerHis: 1.321 ± 0.247
2.973SerIle: 2.973 ± 0.398
2.313SerLys: 2.313 ± 0.372
3.634SerLeu: 3.634 ± 0.478
1.377SerMet: 1.377 ± 0.262
2.037SerAsn: 2.037 ± 0.371
3.744SerPro: 3.744 ± 0.412
1.872SerGln: 1.872 ± 0.259
3.689SerArg: 3.689 ± 0.498
4.019SerSer: 4.019 ± 0.652
3.304SerThr: 3.304 ± 0.453
4.625SerVal: 4.625 ± 0.546
1.432SerTrp: 1.432 ± 0.288
1.597SerTyr: 1.597 ± 0.257
0.0SerXaa: 0.0 ± 0.0
Thr
5.671ThrAla: 5.671 ± 0.567
0.826ThrCys: 0.826 ± 0.216
3.909ThrAsp: 3.909 ± 0.562
3.634ThrGlu: 3.634 ± 0.475
1.652ThrPhe: 1.652 ± 0.314
5.506ThrGly: 5.506 ± 0.69
1.652ThrHis: 1.652 ± 0.282
3.083ThrIle: 3.083 ± 0.483
2.257ThrLys: 2.257 ± 0.359
4.515ThrLeu: 4.515 ± 0.528
1.211ThrMet: 1.211 ± 0.265
2.423ThrAsn: 2.423 ± 0.372
4.9ThrPro: 4.9 ± 0.488
1.817ThrGln: 1.817 ± 0.262
4.405ThrArg: 4.405 ± 0.463
3.909ThrSer: 3.909 ± 0.416
5.341ThrThr: 5.341 ± 0.689
5.286ThrVal: 5.286 ± 0.644
1.046ThrTrp: 1.046 ± 0.265
2.037ThrTyr: 2.037 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
7.598ValAla: 7.598 ± 0.663
1.046ValCys: 1.046 ± 0.232
5.396ValAsp: 5.396 ± 0.58
4.24ValGlu: 4.24 ± 0.514
2.037ValPhe: 2.037 ± 0.328
5.726ValGly: 5.726 ± 0.643
1.266ValHis: 1.266 ± 0.301
3.028ValIle: 3.028 ± 0.524
2.423ValLys: 2.423 ± 0.349
5.286ValLeu: 5.286 ± 0.545
1.321ValMet: 1.321 ± 0.212
2.092ValAsn: 2.092 ± 0.321
4.35ValPro: 4.35 ± 0.389
2.423ValGln: 2.423 ± 0.327
4.24ValArg: 4.24 ± 0.558
5.396ValSer: 5.396 ± 0.552
4.405ValThr: 4.405 ± 0.449
5.891ValVal: 5.891 ± 0.556
1.542ValTrp: 1.542 ± 0.3
1.321ValTyr: 1.321 ± 0.306
0.0ValXaa: 0.0 ± 0.0
Trp
1.872TrpAla: 1.872 ± 0.306
0.275TrpCys: 0.275 ± 0.127
1.707TrpAsp: 1.707 ± 0.296
0.936TrpGlu: 0.936 ± 0.228
0.881TrpPhe: 0.881 ± 0.211
1.046TrpGly: 1.046 ± 0.251
0.551TrpHis: 0.551 ± 0.199
0.991TrpIle: 0.991 ± 0.202
0.716TrpLys: 0.716 ± 0.177
1.707TrpLeu: 1.707 ± 0.325
0.991TrpMet: 0.991 ± 0.269
0.496TrpAsn: 0.496 ± 0.217
1.046TrpPro: 1.046 ± 0.265
1.101TrpGln: 1.101 ± 0.248
2.202TrpArg: 2.202 ± 0.362
1.762TrpSer: 1.762 ± 0.401
1.432TrpThr: 1.432 ± 0.305
1.927TrpVal: 1.927 ± 0.398
0.771TrpTrp: 0.771 ± 0.173
0.496TrpTyr: 0.496 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.368TyrAla: 2.368 ± 0.372
0.275TyrCys: 0.275 ± 0.133
1.927TyrAsp: 1.927 ± 0.403
1.707TyrGlu: 1.707 ± 0.322
0.661TyrPhe: 0.661 ± 0.188
1.817TyrGly: 1.817 ± 0.344
0.661TyrHis: 0.661 ± 0.186
0.991TyrIle: 0.991 ± 0.215
0.661TyrLys: 0.661 ± 0.198
1.982TyrLeu: 1.982 ± 0.357
0.165TyrMet: 0.165 ± 0.078
0.716TyrAsn: 0.716 ± 0.203
1.266TyrPro: 1.266 ± 0.184
1.046TyrGln: 1.046 ± 0.228
2.037TyrArg: 2.037 ± 0.378
1.266TyrSer: 1.266 ± 0.255
1.707TyrThr: 1.707 ± 0.402
2.368TyrVal: 2.368 ± 0.331
0.771TyrTrp: 0.771 ± 0.215
0.716TyrTyr: 0.716 ± 0.166
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 108 proteins (18163 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski