Amino acid dipepetide frequency for Faecalibacterium phage FP_oengus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.137AlaAla: 9.137 ± 1.234
0.931AlaCys: 0.931 ± 0.243
5.413AlaAsp: 5.413 ± 0.66
6.169AlaGlu: 6.169 ± 0.617
3.085AlaPhe: 3.085 ± 0.45
6.169AlaGly: 6.169 ± 0.952
0.989AlaHis: 0.989 ± 0.217
5.529AlaIle: 5.529 ± 0.636
6.984AlaLys: 6.984 ± 0.795
8.788AlaLeu: 8.788 ± 1.215
2.968AlaMet: 2.968 ± 0.371
4.249AlaAsn: 4.249 ± 0.496
1.862AlaPro: 1.862 ± 0.33
3.085AlaGln: 3.085 ± 0.321
2.968AlaArg: 2.968 ± 0.434
5.063AlaSer: 5.063 ± 0.807
4.307AlaThr: 4.307 ± 0.542
5.82AlaVal: 5.82 ± 0.806
1.28AlaTrp: 1.28 ± 0.254
3.143AlaTyr: 3.143 ± 0.456
0.0AlaXaa: 0.0 ± 0.0
Cys
0.757CysAla: 0.757 ± 0.3
0.407CysCys: 0.407 ± 0.155
1.222CysAsp: 1.222 ± 0.297
0.931CysGlu: 0.931 ± 0.221
0.466CysPhe: 0.466 ± 0.156
1.455CysGly: 1.455 ± 0.358
0.291CysHis: 0.291 ± 0.144
1.164CysIle: 1.164 ± 0.34
0.64CysLys: 0.64 ± 0.217
1.28CysLeu: 1.28 ± 0.32
0.524CysMet: 0.524 ± 0.208
0.931CysAsn: 0.931 ± 0.248
0.873CysPro: 0.873 ± 0.276
0.349CysGln: 0.349 ± 0.146
0.64CysArg: 0.64 ± 0.222
0.582CysSer: 0.582 ± 0.205
0.757CysThr: 0.757 ± 0.266
0.757CysVal: 0.757 ± 0.218
0.524CysTrp: 0.524 ± 0.186
0.582CysTyr: 0.582 ± 0.2
0.0CysXaa: 0.0 ± 0.0
Asp
5.704AspAla: 5.704 ± 0.546
0.757AspCys: 0.757 ± 0.264
3.958AspAsp: 3.958 ± 0.487
5.005AspGlu: 5.005 ± 0.535
2.794AspPhe: 2.794 ± 0.521
5.005AspGly: 5.005 ± 0.56
0.873AspHis: 0.873 ± 0.254
4.307AspIle: 4.307 ± 0.477
4.481AspLys: 4.481 ± 0.538
5.413AspLeu: 5.413 ± 0.496
1.688AspMet: 1.688 ± 0.325
2.037AspAsn: 2.037 ± 0.392
3.026AspPro: 3.026 ± 0.421
1.455AspGln: 1.455 ± 0.28
2.794AspArg: 2.794 ± 0.434
3.026AspSer: 3.026 ± 0.333
3.55AspThr: 3.55 ± 0.438
3.259AspVal: 3.259 ± 0.49
1.28AspTrp: 1.28 ± 0.274
3.259AspTyr: 3.259 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
5.936GluAla: 5.936 ± 0.638
1.106GluCys: 1.106 ± 0.346
4.889GluAsp: 4.889 ± 0.656
5.704GluGlu: 5.704 ± 0.753
2.037GluPhe: 2.037 ± 0.351
3.55GluGly: 3.55 ± 0.408
1.63GluHis: 1.63 ± 0.256
4.249GluIle: 4.249 ± 0.507
6.169GluLys: 6.169 ± 0.621
6.053GluLeu: 6.053 ± 0.712
2.852GluMet: 2.852 ± 0.496
4.598GluAsn: 4.598 ± 0.557
1.746GluPro: 1.746 ± 0.364
2.968GluGln: 2.968 ± 0.316
3.841GluArg: 3.841 ± 0.533
2.619GluSer: 2.619 ± 0.413
4.947GluThr: 4.947 ± 0.475
4.132GluVal: 4.132 ± 0.493
1.339GluTrp: 1.339 ± 0.225
3.143GluTyr: 3.143 ± 0.375
0.0GluXaa: 0.0 ± 0.0
Phe
2.852PheAla: 2.852 ± 0.398
0.757PheCys: 0.757 ± 0.209
2.561PheAsp: 2.561 ± 0.38
2.794PheGlu: 2.794 ± 0.343
0.931PhePhe: 0.931 ± 0.195
3.201PheGly: 3.201 ± 0.434
0.873PheHis: 0.873 ± 0.308
2.037PheIle: 2.037 ± 0.33
1.63PheLys: 1.63 ± 0.401
2.386PheLeu: 2.386 ± 0.413
0.989PheMet: 0.989 ± 0.255
1.862PheAsn: 1.862 ± 0.329
0.873PhePro: 0.873 ± 0.232
0.407PheGln: 0.407 ± 0.12
1.63PheArg: 1.63 ± 0.262
2.444PheSer: 2.444 ± 0.43
2.444PheThr: 2.444 ± 0.345
1.63PheVal: 1.63 ± 0.37
0.524PheTrp: 0.524 ± 0.171
1.862PheTyr: 1.862 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
5.122GlyAla: 5.122 ± 0.67
0.931GlyCys: 0.931 ± 0.257
4.365GlyAsp: 4.365 ± 0.49
4.423GlyGlu: 4.423 ± 0.535
2.503GlyPhe: 2.503 ± 0.327
4.772GlyGly: 4.772 ± 0.629
1.164GlyHis: 1.164 ± 0.304
4.074GlyIle: 4.074 ± 0.561
4.714GlyLys: 4.714 ± 0.507
5.413GlyLeu: 5.413 ± 0.806
2.677GlyMet: 2.677 ± 0.37
2.386GlyAsn: 2.386 ± 0.29
0.815GlyPro: 0.815 ± 0.175
1.63GlyGln: 1.63 ± 0.277
2.677GlyArg: 2.677 ± 0.454
4.19GlySer: 4.19 ± 0.698
5.18GlyThr: 5.18 ± 0.759
5.063GlyVal: 5.063 ± 0.57
1.571GlyTrp: 1.571 ± 0.284
2.91GlyTyr: 2.91 ± 0.387
0.0GlyXaa: 0.0 ± 0.0
His
0.873HisAla: 0.873 ± 0.189
0.466HisCys: 0.466 ± 0.202
0.989HisAsp: 0.989 ± 0.268
1.28HisGlu: 1.28 ± 0.321
0.815HisPhe: 0.815 ± 0.248
1.048HisGly: 1.048 ± 0.242
0.407HisHis: 0.407 ± 0.197
1.048HisIle: 1.048 ± 0.313
0.931HisLys: 0.931 ± 0.227
1.63HisLeu: 1.63 ± 0.328
0.466HisMet: 0.466 ± 0.146
0.989HisAsn: 0.989 ± 0.26
0.931HisPro: 0.931 ± 0.287
0.291HisGln: 0.291 ± 0.114
0.64HisArg: 0.64 ± 0.17
1.339HisSer: 1.339 ± 0.369
0.815HisThr: 0.815 ± 0.26
0.815HisVal: 0.815 ± 0.195
0.116HisTrp: 0.116 ± 0.079
1.106HisTyr: 1.106 ± 0.313
0.0HisXaa: 0.0 ± 0.0
Ile
6.227IleAla: 6.227 ± 0.935
1.222IleCys: 1.222 ± 0.339
3.899IleAsp: 3.899 ± 0.483
4.19IleGlu: 4.19 ± 0.495
1.746IlePhe: 1.746 ± 0.381
3.958IleGly: 3.958 ± 0.808
0.931IleHis: 0.931 ± 0.219
2.794IleIle: 2.794 ± 0.474
3.783IleLys: 3.783 ± 0.364
4.307IleLeu: 4.307 ± 0.463
1.921IleMet: 1.921 ± 0.315
2.444IleAsn: 2.444 ± 0.284
2.153IlePro: 2.153 ± 0.381
2.328IleGln: 2.328 ± 0.422
2.619IleArg: 2.619 ± 0.5
3.841IleSer: 3.841 ± 0.598
3.201IleThr: 3.201 ± 0.462
3.899IleVal: 3.899 ± 0.375
0.466IleTrp: 0.466 ± 0.15
2.153IleTyr: 2.153 ± 0.373
0.0IleXaa: 0.0 ± 0.0
Lys
6.518LysAla: 6.518 ± 0.667
0.873LysCys: 0.873 ± 0.259
4.307LysAsp: 4.307 ± 0.514
6.111LysGlu: 6.111 ± 0.853
1.979LysPhe: 1.979 ± 0.291
4.132LysGly: 4.132 ± 0.454
1.571LysHis: 1.571 ± 0.424
4.132LysIle: 4.132 ± 0.524
7.159LysLys: 7.159 ± 0.696
6.169LysLeu: 6.169 ± 0.628
2.735LysMet: 2.735 ± 0.463
3.434LysAsn: 3.434 ± 0.436
2.386LysPro: 2.386 ± 0.492
2.968LysGln: 2.968 ± 0.431
2.968LysArg: 2.968 ± 0.536
4.249LysSer: 4.249 ± 0.528
5.063LysThr: 5.063 ± 0.502
4.19LysVal: 4.19 ± 0.449
1.222LysTrp: 1.222 ± 0.28
2.794LysTyr: 2.794 ± 0.567
0.0LysXaa: 0.0 ± 0.0
Leu
6.693LeuAla: 6.693 ± 1.135
1.222LeuCys: 1.222 ± 0.249
5.529LeuAsp: 5.529 ± 0.427
4.947LeuGlu: 4.947 ± 0.437
2.968LeuPhe: 2.968 ± 0.397
5.587LeuGly: 5.587 ± 1.128
1.397LeuHis: 1.397 ± 0.34
4.481LeuIle: 4.481 ± 0.589
6.868LeuLys: 6.868 ± 0.756
6.227LeuLeu: 6.227 ± 0.654
2.735LeuMet: 2.735 ± 0.384
3.958LeuAsn: 3.958 ± 0.562
2.212LeuPro: 2.212 ± 0.351
2.677LeuGln: 2.677 ± 0.458
3.608LeuArg: 3.608 ± 0.462
6.577LeuSer: 6.577 ± 0.981
6.169LeuThr: 6.169 ± 0.531
4.19LeuVal: 4.19 ± 0.508
1.28LeuTrp: 1.28 ± 0.249
2.27LeuTyr: 2.27 ± 0.428
0.0LeuXaa: 0.0 ± 0.0
Met
3.143MetAla: 3.143 ± 0.407
0.291MetCys: 0.291 ± 0.131
2.037MetAsp: 2.037 ± 0.357
2.503MetGlu: 2.503 ± 0.439
0.873MetPhe: 0.873 ± 0.209
1.804MetGly: 1.804 ± 0.324
0.698MetHis: 0.698 ± 0.193
1.921MetIle: 1.921 ± 0.334
2.444MetLys: 2.444 ± 0.547
2.735MetLeu: 2.735 ± 0.298
0.698MetMet: 0.698 ± 0.282
1.688MetAsn: 1.688 ± 0.297
0.698MetPro: 0.698 ± 0.181
1.222MetGln: 1.222 ± 0.298
1.746MetArg: 1.746 ± 0.272
2.27MetSer: 2.27 ± 0.347
1.513MetThr: 1.513 ± 0.262
1.921MetVal: 1.921 ± 0.301
0.291MetTrp: 0.291 ± 0.122
1.28MetTyr: 1.28 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
5.936AsnAla: 5.936 ± 0.871
0.698AsnCys: 0.698 ± 0.203
3.201AsnAsp: 3.201 ± 0.443
3.55AsnGlu: 3.55 ± 0.504
1.28AsnPhe: 1.28 ± 0.235
4.19AsnGly: 4.19 ± 0.444
0.757AsnHis: 0.757 ± 0.188
2.328AsnIle: 2.328 ± 0.349
2.852AsnLys: 2.852 ± 0.294
3.55AsnLeu: 3.55 ± 0.479
1.28AsnMet: 1.28 ± 0.226
1.921AsnAsn: 1.921 ± 0.333
2.328AsnPro: 2.328 ± 0.407
1.339AsnGln: 1.339 ± 0.244
3.026AsnArg: 3.026 ± 0.526
1.921AsnSer: 1.921 ± 0.26
2.27AsnThr: 2.27 ± 0.37
3.085AsnVal: 3.085 ± 0.435
0.64AsnTrp: 0.64 ± 0.176
2.444AsnTyr: 2.444 ± 0.412
0.0AsnXaa: 0.0 ± 0.0
Pro
1.862ProAla: 1.862 ± 0.427
0.524ProCys: 0.524 ± 0.222
1.63ProAsp: 1.63 ± 0.363
3.434ProGlu: 3.434 ± 0.468
1.106ProPhe: 1.106 ± 0.293
2.037ProGly: 2.037 ± 0.428
0.64ProHis: 0.64 ± 0.183
2.27ProIle: 2.27 ± 0.439
2.561ProLys: 2.561 ± 0.476
1.746ProLeu: 1.746 ± 0.307
0.757ProMet: 0.757 ± 0.185
1.397ProAsn: 1.397 ± 0.277
1.397ProPro: 1.397 ± 0.293
0.64ProGln: 0.64 ± 0.204
0.815ProArg: 0.815 ± 0.201
1.106ProSer: 1.106 ± 0.222
1.921ProThr: 1.921 ± 0.358
2.968ProVal: 2.968 ± 0.485
0.698ProTrp: 0.698 ± 0.23
1.979ProTyr: 1.979 ± 0.351
0.0ProXaa: 0.0 ± 0.0
Gln
2.27GlnAla: 2.27 ± 0.331
0.407GlnCys: 0.407 ± 0.161
0.931GlnAsp: 0.931 ± 0.162
2.212GlnGlu: 2.212 ± 0.405
1.048GlnPhe: 1.048 ± 0.289
1.455GlnGly: 1.455 ± 0.267
0.582GlnHis: 0.582 ± 0.175
1.921GlnIle: 1.921 ± 0.332
2.619GlnLys: 2.619 ± 0.373
3.143GlnLeu: 3.143 ± 0.589
1.106GlnMet: 1.106 ± 0.241
2.503GlnAsn: 2.503 ± 0.353
0.64GlnPro: 0.64 ± 0.184
1.339GlnGln: 1.339 ± 0.301
2.328GlnArg: 2.328 ± 0.457
1.397GlnSer: 1.397 ± 0.278
1.746GlnThr: 1.746 ± 0.44
1.804GlnVal: 1.804 ± 0.251
0.466GlnTrp: 0.466 ± 0.18
1.339GlnTyr: 1.339 ± 0.264
0.0GlnXaa: 0.0 ± 0.0
Arg
3.55ArgAla: 3.55 ± 0.584
1.106ArgCys: 1.106 ± 0.32
2.095ArgAsp: 2.095 ± 0.422
2.968ArgGlu: 2.968 ± 0.368
2.037ArgPhe: 2.037 ± 0.442
1.746ArgGly: 1.746 ± 0.326
0.524ArgHis: 0.524 ± 0.162
2.677ArgIle: 2.677 ± 0.403
3.317ArgLys: 3.317 ± 0.395
3.725ArgLeu: 3.725 ± 0.507
1.804ArgMet: 1.804 ± 0.334
2.968ArgAsn: 2.968 ± 0.386
1.746ArgPro: 1.746 ± 0.434
1.513ArgGln: 1.513 ± 0.297
2.153ArgArg: 2.153 ± 0.443
2.328ArgSer: 2.328 ± 0.359
3.026ArgThr: 3.026 ± 0.434
3.725ArgVal: 3.725 ± 0.499
0.815ArgTrp: 0.815 ± 0.195
2.212ArgTyr: 2.212 ± 0.44
0.0ArgXaa: 0.0 ± 0.0
Ser
4.423SerAla: 4.423 ± 0.928
0.64SerCys: 0.64 ± 0.194
4.074SerAsp: 4.074 ± 0.684
4.423SerGlu: 4.423 ± 0.638
1.979SerPhe: 1.979 ± 0.387
5.063SerGly: 5.063 ± 0.684
0.931SerHis: 0.931 ± 0.254
3.085SerIle: 3.085 ± 0.555
4.54SerLys: 4.54 ± 0.734
4.772SerLeu: 4.772 ± 0.727
1.571SerMet: 1.571 ± 0.281
2.095SerAsn: 2.095 ± 0.3
1.63SerPro: 1.63 ± 0.301
1.921SerGln: 1.921 ± 0.316
2.386SerArg: 2.386 ± 0.322
3.725SerSer: 3.725 ± 0.565
3.201SerThr: 3.201 ± 0.53
3.317SerVal: 3.317 ± 0.388
0.582SerTrp: 0.582 ± 0.178
1.571SerTyr: 1.571 ± 0.331
0.0SerXaa: 0.0 ± 0.0
Thr
6.46ThrAla: 6.46 ± 0.766
0.815ThrCys: 0.815 ± 0.198
4.365ThrAsp: 4.365 ± 0.503
4.074ThrGlu: 4.074 ± 0.507
2.153ThrPhe: 2.153 ± 0.314
3.667ThrGly: 3.667 ± 0.512
0.815ThrHis: 0.815 ± 0.262
4.307ThrIle: 4.307 ± 0.545
4.656ThrLys: 4.656 ± 0.414
5.354ThrLeu: 5.354 ± 0.649
1.746ThrMet: 1.746 ± 0.311
2.735ThrAsn: 2.735 ± 0.483
1.979ThrPro: 1.979 ± 0.306
1.339ThrGln: 1.339 ± 0.256
2.794ThrArg: 2.794 ± 0.344
2.968ThrSer: 2.968 ± 0.747
3.725ThrThr: 3.725 ± 0.627
5.122ThrVal: 5.122 ± 0.576
0.582ThrTrp: 0.582 ± 0.187
2.27ThrTyr: 2.27 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
5.413ValAla: 5.413 ± 0.602
0.873ValCys: 0.873 ± 0.288
3.958ValAsp: 3.958 ± 0.512
4.656ValGlu: 4.656 ± 0.535
2.444ValPhe: 2.444 ± 0.414
3.667ValGly: 3.667 ± 0.479
0.873ValHis: 0.873 ± 0.233
3.143ValIle: 3.143 ± 0.307
5.18ValLys: 5.18 ± 0.496
4.598ValLeu: 4.598 ± 0.519
2.037ValMet: 2.037 ± 0.371
2.794ValAsn: 2.794 ± 0.363
2.619ValPro: 2.619 ± 0.38
1.571ValGln: 1.571 ± 0.272
4.016ValArg: 4.016 ± 0.597
3.725ValSer: 3.725 ± 0.584
4.249ValThr: 4.249 ± 0.681
4.016ValVal: 4.016 ± 0.635
0.407ValTrp: 0.407 ± 0.158
2.677ValTyr: 2.677 ± 0.495
0.0ValXaa: 0.0 ± 0.0
Trp
1.28TrpAla: 1.28 ± 0.327
0.233TrpCys: 0.233 ± 0.101
1.106TrpAsp: 1.106 ± 0.219
0.815TrpGlu: 0.815 ± 0.181
0.698TrpPhe: 0.698 ± 0.195
0.815TrpGly: 0.815 ± 0.238
0.233TrpHis: 0.233 ± 0.155
0.524TrpIle: 0.524 ± 0.144
1.106TrpLys: 1.106 ± 0.201
1.688TrpLeu: 1.688 ± 0.283
0.233TrpMet: 0.233 ± 0.111
1.164TrpAsn: 1.164 ± 0.327
0.291TrpPro: 0.291 ± 0.124
0.466TrpGln: 0.466 ± 0.191
0.698TrpArg: 0.698 ± 0.25
0.873TrpSer: 0.873 ± 0.254
0.815TrpThr: 0.815 ± 0.203
0.873TrpVal: 0.873 ± 0.228
0.291TrpTrp: 0.291 ± 0.132
0.873TrpTyr: 0.873 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.783TyrAla: 3.783 ± 0.592
0.873TyrCys: 0.873 ± 0.254
3.143TyrAsp: 3.143 ± 0.37
3.317TyrGlu: 3.317 ± 0.429
1.746TyrPhe: 1.746 ± 0.374
3.259TyrGly: 3.259 ± 0.41
0.815TyrHis: 0.815 ± 0.217
2.095TyrIle: 2.095 ± 0.329
2.328TyrLys: 2.328 ± 0.396
2.619TyrLeu: 2.619 ± 0.465
0.931TyrMet: 0.931 ± 0.256
2.444TyrAsn: 2.444 ± 0.355
1.28TyrPro: 1.28 ± 0.273
1.746TyrGln: 1.746 ± 0.308
1.688TyrArg: 1.688 ± 0.333
1.688TyrSer: 1.688 ± 0.269
3.026TyrThr: 3.026 ± 0.398
2.386TyrVal: 2.386 ± 0.398
0.698TyrTrp: 0.698 ± 0.21
1.979TyrTyr: 1.979 ± 0.413
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (17183 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski