Amino acid dipepetide frequency for Pseudomonas phage PMBT14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.005AlaAla: 9.005 ± 1.399
1.126AlaCys: 1.126 ± 0.355
4.039AlaAsp: 4.039 ± 0.432
6.026AlaGlu: 6.026 ± 0.73
2.516AlaPhe: 2.516 ± 0.445
6.092AlaGly: 6.092 ± 0.696
1.457AlaHis: 1.457 ± 0.363
4.966AlaIle: 4.966 ± 0.616
5.893AlaLys: 5.893 ± 0.773
7.35AlaLeu: 7.35 ± 0.777
2.516AlaMet: 2.516 ± 0.395
4.039AlaAsn: 4.039 ± 0.618
2.847AlaPro: 2.847 ± 0.479
3.509AlaGln: 3.509 ± 0.609
4.039AlaArg: 4.039 ± 0.632
5.496AlaSer: 5.496 ± 0.85
6.092AlaThr: 6.092 ± 0.928
5.562AlaVal: 5.562 ± 0.605
1.391AlaTrp: 1.391 ± 0.265
3.576AlaTyr: 3.576 ± 0.444
0.0AlaXaa: 0.0 ± 0.0
Cys
0.53CysAla: 0.53 ± 0.231
0.0CysCys: 0.0 ± 0.0
0.331CysAsp: 0.331 ± 0.132
0.596CysGlu: 0.596 ± 0.175
0.132CysPhe: 0.132 ± 0.096
1.126CysGly: 1.126 ± 0.286
0.199CysHis: 0.199 ± 0.118
0.397CysIle: 0.397 ± 0.128
0.795CysLys: 0.795 ± 0.245
0.927CysLeu: 0.927 ± 0.237
0.132CysMet: 0.132 ± 0.103
0.596CysAsn: 0.596 ± 0.235
0.397CysPro: 0.397 ± 0.172
0.199CysGln: 0.199 ± 0.104
0.795CysArg: 0.795 ± 0.221
0.728CysSer: 0.728 ± 0.243
0.861CysThr: 0.861 ± 0.309
0.397CysVal: 0.397 ± 0.192
0.331CysTrp: 0.331 ± 0.166
0.265CysTyr: 0.265 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.231AspAla: 5.231 ± 0.538
0.596AspCys: 0.596 ± 0.2
2.582AspAsp: 2.582 ± 0.389
3.245AspGlu: 3.245 ± 0.462
2.251AspPhe: 2.251 ± 0.333
5.827AspGly: 5.827 ± 0.577
0.662AspHis: 0.662 ± 0.257
2.649AspIle: 2.649 ± 0.404
2.781AspLys: 2.781 ± 0.336
5.893AspLeu: 5.893 ± 0.817
1.854AspMet: 1.854 ± 0.375
2.582AspAsn: 2.582 ± 0.428
2.98AspPro: 2.98 ± 0.477
2.251AspGln: 2.251 ± 0.397
2.847AspArg: 2.847 ± 0.479
3.178AspSer: 3.178 ± 0.403
2.781AspThr: 2.781 ± 0.457
3.576AspVal: 3.576 ± 0.616
1.655AspTrp: 1.655 ± 0.363
1.788AspTyr: 1.788 ± 0.35
0.0AspXaa: 0.0 ± 0.0
Glu
5.695GluAla: 5.695 ± 0.764
0.53GluCys: 0.53 ± 0.185
5.165GluAsp: 5.165 ± 0.548
5.099GluGlu: 5.099 ± 0.84
2.45GluPhe: 2.45 ± 0.423
3.642GluGly: 3.642 ± 0.612
1.059GluHis: 1.059 ± 0.231
3.443GluIle: 3.443 ± 0.47
4.172GluLys: 4.172 ± 0.732
7.284GluLeu: 7.284 ± 0.81
2.318GluMet: 2.318 ± 0.37
2.847GluAsn: 2.847 ± 0.474
2.649GluPro: 2.649 ± 0.426
2.649GluGln: 2.649 ± 0.453
3.576GluArg: 3.576 ± 0.413
3.443GluSer: 3.443 ± 0.425
3.112GluThr: 3.112 ± 0.424
4.569GluVal: 4.569 ± 0.607
0.662GluTrp: 0.662 ± 0.17
2.715GluTyr: 2.715 ± 0.458
0.0GluXaa: 0.0 ± 0.0
Phe
1.986PheAla: 1.986 ± 0.341
0.199PheCys: 0.199 ± 0.135
2.516PheAsp: 2.516 ± 0.33
2.582PheGlu: 2.582 ± 0.469
1.391PhePhe: 1.391 ± 0.318
2.582PheGly: 2.582 ± 0.474
0.53PheHis: 0.53 ± 0.203
1.788PheIle: 1.788 ± 0.353
1.391PheLys: 1.391 ± 0.321
2.516PheLeu: 2.516 ± 0.407
0.927PheMet: 0.927 ± 0.239
1.457PheAsn: 1.457 ± 0.29
1.655PhePro: 1.655 ± 0.275
1.589PheGln: 1.589 ± 0.269
2.516PheArg: 2.516 ± 0.395
1.457PheSer: 1.457 ± 0.287
2.45PheThr: 2.45 ± 0.381
1.192PheVal: 1.192 ± 0.264
0.464PheTrp: 0.464 ± 0.141
0.927PheTyr: 0.927 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
5.297GlyAla: 5.297 ± 0.602
0.265GlyCys: 0.265 ± 0.126
3.774GlyAsp: 3.774 ± 0.684
4.768GlyGlu: 4.768 ± 0.467
2.847GlyPhe: 2.847 ± 0.396
6.224GlyGly: 6.224 ± 0.726
1.192GlyHis: 1.192 ± 0.256
4.172GlyIle: 4.172 ± 0.429
5.364GlyLys: 5.364 ± 0.572
6.423GlyLeu: 6.423 ± 0.661
1.589GlyMet: 1.589 ± 0.275
2.649GlyAsn: 2.649 ± 0.384
3.443GlyPro: 3.443 ± 0.533
3.112GlyGln: 3.112 ± 0.445
3.377GlyArg: 3.377 ± 0.475
4.569GlySer: 4.569 ± 0.611
4.37GlyThr: 4.37 ± 0.843
5.628GlyVal: 5.628 ± 0.629
1.92GlyTrp: 1.92 ± 0.403
2.251GlyTyr: 2.251 ± 0.48
0.0GlyXaa: 0.0 ± 0.0
His
0.927HisAla: 0.927 ± 0.242
0.066HisCys: 0.066 ± 0.074
0.795HisAsp: 0.795 ± 0.252
1.192HisGlu: 1.192 ± 0.317
0.464HisPhe: 0.464 ± 0.237
1.986HisGly: 1.986 ± 0.47
0.331HisHis: 0.331 ± 0.194
1.457HisIle: 1.457 ± 0.325
0.861HisLys: 0.861 ± 0.288
1.324HisLeu: 1.324 ± 0.395
0.464HisMet: 0.464 ± 0.184
0.397HisAsn: 0.397 ± 0.149
0.662HisPro: 0.662 ± 0.17
0.861HisGln: 0.861 ± 0.248
0.861HisArg: 0.861 ± 0.258
1.391HisSer: 1.391 ± 0.292
0.795HisThr: 0.795 ± 0.244
1.258HisVal: 1.258 ± 0.272
0.596HisTrp: 0.596 ± 0.232
0.265HisTyr: 0.265 ± 0.134
0.0HisXaa: 0.0 ± 0.0
Ile
4.635IleAla: 4.635 ± 0.502
0.596IleCys: 0.596 ± 0.157
3.708IleAsp: 3.708 ± 0.457
4.304IleGlu: 4.304 ± 0.514
1.059IlePhe: 1.059 ± 0.311
3.377IleGly: 3.377 ± 0.409
1.324IleHis: 1.324 ± 0.385
2.715IleIle: 2.715 ± 0.453
3.046IleLys: 3.046 ± 0.411
4.039IleLeu: 4.039 ± 0.564
1.258IleMet: 1.258 ± 0.254
2.119IleAsn: 2.119 ± 0.376
2.251IlePro: 2.251 ± 0.358
2.384IleGln: 2.384 ± 0.371
4.039IleArg: 4.039 ± 0.545
3.907IleSer: 3.907 ± 0.484
3.509IleThr: 3.509 ± 0.5
2.582IleVal: 2.582 ± 0.367
0.464IleTrp: 0.464 ± 0.172
1.655IleTyr: 1.655 ± 0.32
0.0IleXaa: 0.0 ± 0.0
Lys
6.489LysAla: 6.489 ± 0.819
0.53LysCys: 0.53 ± 0.209
2.582LysAsp: 2.582 ± 0.537
3.973LysGlu: 3.973 ± 0.492
1.92LysPhe: 1.92 ± 0.362
3.774LysGly: 3.774 ± 0.606
0.993LysHis: 0.993 ± 0.281
2.384LysIle: 2.384 ± 0.52
3.178LysLys: 3.178 ± 0.562
5.761LysLeu: 5.761 ± 0.514
1.986LysMet: 1.986 ± 0.322
2.649LysAsn: 2.649 ± 0.404
4.569LysPro: 4.569 ± 0.788
2.384LysGln: 2.384 ± 0.409
3.973LysArg: 3.973 ± 0.574
3.046LysSer: 3.046 ± 0.448
3.377LysThr: 3.377 ± 0.542
4.436LysVal: 4.436 ± 0.484
0.464LysTrp: 0.464 ± 0.185
1.391LysTyr: 1.391 ± 0.276
0.0LysXaa: 0.0 ± 0.0
Leu
7.747LeuAla: 7.747 ± 0.733
0.993LeuCys: 0.993 ± 0.307
6.622LeuAsp: 6.622 ± 0.649
6.489LeuGlu: 6.489 ± 0.775
2.119LeuPhe: 2.119 ± 0.27
5.695LeuGly: 5.695 ± 0.585
1.722LeuHis: 1.722 ± 0.412
4.635LeuIle: 4.635 ± 0.485
5.628LeuLys: 5.628 ± 0.69
6.688LeuLeu: 6.688 ± 0.736
2.649LeuMet: 2.649 ± 0.426
3.443LeuAsn: 3.443 ± 0.528
4.834LeuPro: 4.834 ± 0.616
3.841LeuGln: 3.841 ± 0.471
6.754LeuArg: 6.754 ± 0.61
4.238LeuSer: 4.238 ± 0.682
5.099LeuThr: 5.099 ± 0.695
6.291LeuVal: 6.291 ± 0.476
1.523LeuTrp: 1.523 ± 0.334
2.582LeuTyr: 2.582 ± 0.503
0.0LeuXaa: 0.0 ± 0.0
Met
2.914MetAla: 2.914 ± 0.385
0.199MetCys: 0.199 ± 0.14
1.258MetAsp: 1.258 ± 0.31
1.324MetGlu: 1.324 ± 0.268
0.728MetPhe: 0.728 ± 0.228
1.854MetGly: 1.854 ± 0.368
0.728MetHis: 0.728 ± 0.252
1.655MetIle: 1.655 ± 0.339
2.781MetLys: 2.781 ± 0.5
2.185MetLeu: 2.185 ± 0.392
0.728MetMet: 0.728 ± 0.258
1.258MetAsn: 1.258 ± 0.289
1.258MetPro: 1.258 ± 0.29
0.861MetGln: 0.861 ± 0.293
1.126MetArg: 1.126 ± 0.308
2.053MetSer: 2.053 ± 0.483
1.788MetThr: 1.788 ± 0.331
1.324MetVal: 1.324 ± 0.273
0.397MetTrp: 0.397 ± 0.174
0.331MetTyr: 0.331 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
3.841AsnAla: 3.841 ± 0.485
0.265AsnCys: 0.265 ± 0.125
2.119AsnAsp: 2.119 ± 0.406
2.251AsnGlu: 2.251 ± 0.361
1.192AsnPhe: 1.192 ± 0.238
4.635AsnGly: 4.635 ± 0.658
0.861AsnHis: 0.861 ± 0.241
1.324AsnIle: 1.324 ± 0.269
1.986AsnLys: 1.986 ± 0.375
4.039AsnLeu: 4.039 ± 0.564
0.861AsnMet: 0.861 ± 0.224
2.053AsnAsn: 2.053 ± 0.467
2.715AsnPro: 2.715 ± 0.379
1.589AsnGln: 1.589 ± 0.357
2.516AsnArg: 2.516 ± 0.439
2.318AsnSer: 2.318 ± 0.462
2.318AsnThr: 2.318 ± 0.41
2.582AsnVal: 2.582 ± 0.452
1.324AsnTrp: 1.324 ± 0.288
1.722AsnTyr: 1.722 ± 0.347
0.0AsnXaa: 0.0 ± 0.0
Pro
4.436ProAla: 4.436 ± 0.641
0.53ProCys: 0.53 ± 0.252
3.178ProAsp: 3.178 ± 0.427
3.443ProGlu: 3.443 ± 0.593
1.788ProPhe: 1.788 ± 0.282
3.841ProGly: 3.841 ± 0.578
0.397ProHis: 0.397 ± 0.168
2.582ProIle: 2.582 ± 0.521
2.516ProLys: 2.516 ± 0.391
3.311ProLeu: 3.311 ± 0.375
0.927ProMet: 0.927 ± 0.292
2.053ProAsn: 2.053 ± 0.374
2.119ProPro: 2.119 ± 0.43
1.722ProGln: 1.722 ± 0.341
2.715ProArg: 2.715 ± 0.463
2.384ProSer: 2.384 ± 0.402
3.774ProThr: 3.774 ± 0.47
3.642ProVal: 3.642 ± 0.474
0.596ProTrp: 0.596 ± 0.182
1.192ProTyr: 1.192 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
4.635GlnAla: 4.635 ± 0.794
0.331GlnCys: 0.331 ± 0.149
1.92GlnAsp: 1.92 ± 0.36
2.582GlnGlu: 2.582 ± 0.459
1.258GlnPhe: 1.258 ± 0.255
2.98GlnGly: 2.98 ± 0.377
0.464GlnHis: 0.464 ± 0.159
2.119GlnIle: 2.119 ± 0.329
2.384GlnLys: 2.384 ± 0.406
3.112GlnLeu: 3.112 ± 0.582
1.589GlnMet: 1.589 ± 0.307
1.589GlnAsn: 1.589 ± 0.431
1.854GlnPro: 1.854 ± 0.323
2.251GlnGln: 2.251 ± 0.345
2.582GlnArg: 2.582 ± 0.347
2.185GlnSer: 2.185 ± 0.303
2.781GlnThr: 2.781 ± 0.367
3.443GlnVal: 3.443 ± 0.41
0.464GlnTrp: 0.464 ± 0.163
1.457GlnTyr: 1.457 ± 0.409
0.0GlnXaa: 0.0 ± 0.0
Arg
4.039ArgAla: 4.039 ± 0.543
0.795ArgCys: 0.795 ± 0.292
3.245ArgAsp: 3.245 ± 0.539
3.112ArgGlu: 3.112 ± 0.508
2.251ArgPhe: 2.251 ± 0.408
3.708ArgGly: 3.708 ± 0.395
1.258ArgHis: 1.258 ± 0.319
4.039ArgIle: 4.039 ± 0.495
4.238ArgLys: 4.238 ± 0.476
6.423ArgLeu: 6.423 ± 0.515
1.589ArgMet: 1.589 ± 0.322
2.914ArgAsn: 2.914 ± 0.496
1.788ArgPro: 1.788 ± 0.463
2.185ArgGln: 2.185 ± 0.354
2.781ArgArg: 2.781 ± 0.52
3.576ArgSer: 3.576 ± 0.459
2.914ArgThr: 2.914 ± 0.458
5.032ArgVal: 5.032 ± 0.62
1.192ArgTrp: 1.192 ± 0.25
1.589ArgTyr: 1.589 ± 0.381
0.0ArgXaa: 0.0 ± 0.0
Ser
5.099SerAla: 5.099 ± 0.68
0.596SerCys: 0.596 ± 0.171
2.847SerAsp: 2.847 ± 0.323
3.443SerGlu: 3.443 ± 0.436
2.053SerPhe: 2.053 ± 0.275
4.238SerGly: 4.238 ± 0.708
1.258SerHis: 1.258 ± 0.338
3.112SerIle: 3.112 ± 0.469
3.311SerLys: 3.311 ± 0.444
4.635SerLeu: 4.635 ± 0.613
1.788SerMet: 1.788 ± 0.274
1.722SerAsn: 1.722 ± 0.354
2.914SerPro: 2.914 ± 0.523
2.715SerGln: 2.715 ± 0.484
3.245SerArg: 3.245 ± 0.537
3.046SerSer: 3.046 ± 0.447
4.039SerThr: 4.039 ± 0.596
4.635SerVal: 4.635 ± 0.525
0.927SerTrp: 0.927 ± 0.249
1.722SerTyr: 1.722 ± 0.336
0.0SerXaa: 0.0 ± 0.0
Thr
6.224ThrAla: 6.224 ± 0.743
0.861ThrCys: 0.861 ± 0.233
3.576ThrAsp: 3.576 ± 0.441
3.708ThrGlu: 3.708 ± 0.453
2.185ThrPhe: 2.185 ± 0.347
4.9ThrGly: 4.9 ± 0.599
0.53ThrHis: 0.53 ± 0.155
3.112ThrIle: 3.112 ± 0.477
3.112ThrLys: 3.112 ± 0.503
5.959ThrLeu: 5.959 ± 0.676
1.324ThrMet: 1.324 ± 0.276
2.185ThrAsn: 2.185 ± 0.503
3.443ThrPro: 3.443 ± 0.383
2.781ThrGln: 2.781 ± 0.588
3.443ThrArg: 3.443 ± 0.485
3.509ThrSer: 3.509 ± 0.599
3.774ThrThr: 3.774 ± 0.53
4.37ThrVal: 4.37 ± 0.691
1.192ThrTrp: 1.192 ± 0.32
1.788ThrTyr: 1.788 ± 0.302
0.0ThrXaa: 0.0 ± 0.0
Val
5.032ValAla: 5.032 ± 0.577
0.53ValCys: 0.53 ± 0.198
3.377ValAsp: 3.377 ± 0.517
5.496ValGlu: 5.496 ± 0.616
1.92ValPhe: 1.92 ± 0.37
3.907ValGly: 3.907 ± 0.551
1.059ValHis: 1.059 ± 0.34
4.304ValIle: 4.304 ± 0.493
4.172ValLys: 4.172 ± 0.501
6.224ValLeu: 6.224 ± 0.626
1.324ValMet: 1.324 ± 0.298
3.112ValAsn: 3.112 ± 0.334
2.914ValPro: 2.914 ± 0.474
2.516ValGln: 2.516 ± 0.493
4.238ValArg: 4.238 ± 0.454
3.973ValSer: 3.973 ± 0.476
5.032ValThr: 5.032 ± 0.899
4.768ValVal: 4.768 ± 0.512
1.523ValTrp: 1.523 ± 0.275
2.384ValTyr: 2.384 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
1.523TrpAla: 1.523 ± 0.258
0.331TrpCys: 0.331 ± 0.161
1.722TrpAsp: 1.722 ± 0.366
1.126TrpGlu: 1.126 ± 0.258
0.53TrpPhe: 0.53 ± 0.198
0.861TrpGly: 0.861 ± 0.281
0.397TrpHis: 0.397 ± 0.166
0.993TrpIle: 0.993 ± 0.364
0.795TrpLys: 0.795 ± 0.204
1.788TrpLeu: 1.788 ± 0.385
0.199TrpMet: 0.199 ± 0.099
1.523TrpAsn: 1.523 ± 0.301
0.861TrpPro: 0.861 ± 0.239
0.662TrpGln: 0.662 ± 0.184
0.927TrpArg: 0.927 ± 0.236
0.861TrpSer: 0.861 ± 0.258
1.059TrpThr: 1.059 ± 0.239
0.993TrpVal: 0.993 ± 0.226
0.464TrpTrp: 0.464 ± 0.149
0.662TrpTyr: 0.662 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.251TyrAla: 2.251 ± 0.503
0.397TyrCys: 0.397 ± 0.152
1.854TyrAsp: 1.854 ± 0.295
2.318TyrGlu: 2.318 ± 0.339
1.059TyrPhe: 1.059 ± 0.271
1.986TyrGly: 1.986 ± 0.379
0.53TyrHis: 0.53 ± 0.199
1.126TyrIle: 1.126 ± 0.377
1.457TyrLys: 1.457 ± 0.309
3.774TyrLeu: 3.774 ± 0.458
0.662TyrMet: 0.662 ± 0.203
1.324TyrAsn: 1.324 ± 0.31
0.993TyrPro: 0.993 ± 0.219
1.92TyrGln: 1.92 ± 0.324
2.318TyrArg: 2.318 ± 0.39
1.986TyrSer: 1.986 ± 0.436
2.053TyrThr: 2.053 ± 0.496
1.523TyrVal: 1.523 ± 0.312
0.728TyrTrp: 0.728 ± 0.208
0.993TyrTyr: 0.993 ± 0.277
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (15103 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski