Amino acid dipeptide frequency for Faecalibacterium phage FP_Toutatis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
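As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs, reports them in per mille, and compares each value with the uniform expectation of 2.5‰. It is not the code used to produce this page: the function names, the one-letter encoding, the mapping of non-standard letters to X (Xaa), and the choice to count pairs only within a protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: pairs are counted within each protein, never across proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a value as over- or under-represented relative to the expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"

# Toy input (made-up sequences, not the FP_Toutatis proteome).
toy = ["AAAK", "AKAA"]
freqs = dipeptide_permille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

With the table values, `classify(14.535)` would label AlaAla as over-represented and `classify(0.87)` would label AlaTrp as under-represented.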

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.535 ± 1.518
AlaCys: 1.118 ± 0.3
AlaAsp: 6.336 ± 0.663
AlaGlu: 9.504 ± 1.014
AlaPhe: 3.106 ± 0.431
AlaGly: 7.578 ± 1.016
AlaHis: 1.242 ± 0.342
AlaIle: 4.472 ± 0.471
AlaLys: 5.963 ± 0.675
AlaLeu: 8.261 ± 0.676
AlaMet: 2.485 ± 0.351
AlaAsn: 3.416 ± 0.585
AlaPro: 3.603 ± 0.42
AlaGln: 3.851 ± 0.57
AlaArg: 4.286 ± 0.595
AlaSer: 5.093 ± 0.546
AlaThr: 5.528 ± 0.692
AlaVal: 8.448 ± 0.821
AlaTrp: 0.87 ± 0.214
AlaTyr: 3.106 ± 0.397
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.994 ± 0.288
CysCys: 0.248 ± 0.13
CysAsp: 0.497 ± 0.157
CysGlu: 1.242 ± 0.284
CysPhe: 0.311 ± 0.146
CysGly: 1.304 ± 0.29
CysHis: 0.311 ± 0.136
CysIle: 0.808 ± 0.245
CysLys: 1.18 ± 0.268
CysLeu: 0.808 ± 0.242
CysMet: 0.435 ± 0.152
CysAsn: 0.435 ± 0.165
CysPro: 0.994 ± 0.217
CysGln: 0.559 ± 0.209
CysArg: 1.118 ± 0.336
CysSer: 0.932 ± 0.231
CysThr: 1.056 ± 0.292
CysVal: 0.683 ± 0.212
CysTrp: 0.311 ± 0.137
CysTyr: 0.435 ± 0.197
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.019 ± 0.885
AspCys: 0.87 ± 0.256
AspAsp: 4.721 ± 0.514
AspGlu: 4.659 ± 0.659
AspPhe: 2.423 ± 0.415
AspGly: 6.274 ± 0.736
AspHis: 1.118 ± 0.233
AspIle: 3.416 ± 0.462
AspLys: 3.416 ± 0.402
AspLeu: 4.597 ± 0.552
AspMet: 1.801 ± 0.327
AspAsn: 1.988 ± 0.307
AspPro: 1.988 ± 0.493
AspGln: 1.739 ± 0.318
AspArg: 2.671 ± 0.441
AspSer: 3.603 ± 0.494
AspThr: 3.416 ± 0.392
AspVal: 5.031 ± 0.526
AspTrp: 0.808 ± 0.231
AspTyr: 2.236 ± 0.418
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.336 ± 0.702
GluCys: 0.745 ± 0.235
GluAsp: 3.541 ± 0.435
GluGlu: 5.342 ± 0.65
GluPhe: 2.423 ± 0.362
GluGly: 3.478 ± 0.537
GluHis: 1.429 ± 0.304
GluIle: 4.41 ± 0.519
GluLys: 6.833 ± 0.691
GluLeu: 5.404 ± 0.55
GluMet: 2.671 ± 0.396
GluAsn: 3.292 ± 0.485
GluPro: 1.988 ± 0.387
GluGln: 2.857 ± 0.378
GluArg: 3.727 ± 0.558
GluSer: 2.609 ± 0.371
GluThr: 5.031 ± 0.749
GluVal: 5.156 ± 0.586
GluTrp: 0.745 ± 0.227
GluTyr: 3.106 ± 0.467
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.857 ± 0.465
PheCys: 0.435 ± 0.154
PheAsp: 2.174 ± 0.316
PheGlu: 1.863 ± 0.325
PhePhe: 1.367 ± 0.357
PheGly: 2.485 ± 0.547
PheHis: 0.497 ± 0.156
PheIle: 2.36 ± 0.377
PheLys: 1.677 ± 0.317
PheLeu: 2.36 ± 0.371
PheMet: 0.932 ± 0.233
PheAsn: 1.553 ± 0.327
PhePro: 1.429 ± 0.283
PheGln: 0.808 ± 0.263
PheArg: 1.242 ± 0.272
PheSer: 1.988 ± 0.321
PheThr: 2.36 ± 0.389
PheVal: 2.36 ± 0.354
PheTrp: 0.497 ± 0.161
PheTyr: 0.994 ± 0.246
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.584 ± 0.578
GlyCys: 0.994 ± 0.242
GlyAsp: 4.224 ± 0.52
GlyGlu: 4.597 ± 0.573
GlyPhe: 2.857 ± 0.491
GlyGly: 5.59 ± 0.976
GlyHis: 1.18 ± 0.275
GlyIle: 3.603 ± 0.594
GlyLys: 5.715 ± 0.597
GlyLeu: 4.659 ± 0.62
GlyMet: 2.857 ± 0.384
GlyAsn: 3.975 ± 0.599
GlyPro: 1.491 ± 0.299
GlyGln: 2.298 ± 0.562
GlyArg: 3.23 ± 0.474
GlySer: 4.472 ± 0.688
GlyThr: 5.653 ± 0.86
GlyVal: 4.783 ± 0.632
GlyTrp: 0.932 ± 0.192
GlyTyr: 2.733 ± 0.39
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.242 ± 0.311
HisCys: 0.373 ± 0.141
HisAsp: 1.18 ± 0.299
HisGlu: 1.367 ± 0.308
HisPhe: 0.745 ± 0.178
HisGly: 1.615 ± 0.281
HisHis: 0.621 ± 0.249
HisIle: 1.118 ± 0.245
HisLys: 1.367 ± 0.277
HisLeu: 1.056 ± 0.288
HisMet: 0.373 ± 0.142
HisAsn: 0.683 ± 0.206
HisPro: 0.497 ± 0.18
HisGln: 0.186 ± 0.111
HisArg: 0.932 ± 0.247
HisSer: 0.932 ± 0.231
HisThr: 1.056 ± 0.289
HisVal: 0.559 ± 0.179
HisTrp: 0.248 ± 0.132
HisTyr: 0.683 ± 0.179
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.28 ± 0.708
IleCys: 1.056 ± 0.231
IleAsp: 3.727 ± 0.505
IleGlu: 4.472 ± 0.482
IlePhe: 1.677 ± 0.319
IleGly: 3.665 ± 0.608
IleHis: 0.87 ± 0.221
IleIle: 2.919 ± 0.469
IleLys: 3.292 ± 0.479
IleLeu: 3.478 ± 0.453
IleMet: 1.429 ± 0.289
IleAsn: 1.801 ± 0.357
IlePro: 2.174 ± 0.407
IleGln: 1.926 ± 0.306
IleArg: 3.044 ± 0.367
IleSer: 2.733 ± 0.409
IleThr: 3.168 ± 0.466
IleVal: 3.975 ± 0.431
IleTrp: 0.621 ± 0.187
IleTyr: 1.739 ± 0.39
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.702 ± 0.789
LysCys: 0.435 ± 0.154
LysAsp: 4.348 ± 0.512
LysGlu: 4.348 ± 0.533
LysPhe: 2.05 ± 0.32
LysGly: 3.789 ± 0.424
LysHis: 0.994 ± 0.242
LysIle: 3.292 ± 0.585
LysLys: 6.212 ± 0.912
LysLeu: 6.398 ± 0.687
LysMet: 1.615 ± 0.318
LysAsn: 3.354 ± 0.458
LysPro: 2.733 ± 0.362
LysGln: 2.733 ± 0.495
LysArg: 4.286 ± 0.614
LysSer: 3.168 ± 0.422
LysThr: 4.534 ± 0.532
LysVal: 4.348 ± 0.513
LysTrp: 0.683 ± 0.21
LysTyr: 2.36 ± 0.386
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.516 ± 0.621
LeuCys: 1.18 ± 0.286
LeuAsp: 6.025 ± 0.624
LeuGlu: 4.162 ± 0.46
LeuPhe: 1.863 ± 0.356
LeuGly: 4.162 ± 1.018
LeuHis: 1.553 ± 0.279
LeuIle: 2.733 ± 0.562
LeuLys: 4.783 ± 0.422
LeuLeu: 5.156 ± 0.61
LeuMet: 1.491 ± 0.27
LeuAsn: 3.913 ± 0.406
LeuPro: 2.919 ± 0.429
LeuGln: 2.547 ± 0.354
LeuArg: 4.348 ± 0.586
LeuSer: 4.534 ± 0.692
LeuThr: 6.087 ± 0.578
LeuVal: 3.975 ± 0.437
LeuTrp: 1.118 ± 0.238
LeuTyr: 2.609 ± 0.466
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.292 ± 0.496
MetCys: 0.373 ± 0.149
MetAsp: 2.174 ± 0.414
MetGlu: 1.926 ± 0.297
MetPhe: 0.497 ± 0.158
MetGly: 1.491 ± 0.336
MetHis: 0.435 ± 0.15
MetIle: 1.553 ± 0.289
MetLys: 1.988 ± 0.301
MetLeu: 2.174 ± 0.318
MetMet: 0.373 ± 0.15
MetAsn: 1.18 ± 0.227
MetPro: 0.808 ± 0.207
MetGln: 1.18 ± 0.256
MetArg: 1.801 ± 0.419
MetSer: 2.174 ± 0.338
MetThr: 2.05 ± 0.338
MetVal: 1.677 ± 0.313
MetTrp: 0.124 ± 0.081
MetTyr: 0.559 ± 0.166
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.478 ± 0.498
AsnCys: 0.683 ± 0.201
AsnAsp: 1.926 ± 0.309
AsnGlu: 3.106 ± 0.514
AsnPhe: 1.304 ± 0.256
AsnGly: 4.969 ± 0.616
AsnHis: 0.932 ± 0.204
AsnIle: 2.609 ± 0.385
AsnLys: 3.044 ± 0.492
AsnLeu: 2.547 ± 0.443
AsnMet: 1.056 ± 0.232
AsnAsn: 1.863 ± 0.571
AsnPro: 2.05 ± 0.405
AsnGln: 1.118 ± 0.281
AsnArg: 2.733 ± 0.471
AsnSer: 1.988 ± 0.369
AsnThr: 2.609 ± 0.447
AsnVal: 3.106 ± 0.429
AsnTrp: 0.559 ± 0.161
AsnTyr: 1.056 ± 0.238
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.162 ± 0.524
ProCys: 0.497 ± 0.176
ProAsp: 3.23 ± 0.43
ProGlu: 3.044 ± 0.437
ProPhe: 1.118 ± 0.251
ProGly: 2.795 ± 0.482
ProHis: 0.683 ± 0.23
ProIle: 1.863 ± 0.301
ProLys: 2.547 ± 0.349
ProLeu: 2.485 ± 0.382
ProMet: 1.304 ± 0.32
ProAsn: 1.491 ± 0.333
ProPro: 0.932 ± 0.256
ProGln: 0.87 ± 0.205
ProArg: 1.677 ± 0.276
ProSer: 1.677 ± 0.287
ProThr: 2.298 ± 0.375
ProVal: 2.671 ± 0.489
ProTrp: 0.435 ± 0.167
ProTyr: 1.553 ± 0.304
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.982 ± 0.391
GlnCys: 0.497 ± 0.184
GlnAsp: 1.677 ± 0.27
GlnGlu: 1.863 ± 0.345
GlnPhe: 1.18 ± 0.294
GlnGly: 1.739 ± 0.241
GlnHis: 0.559 ± 0.188
GlnIle: 2.547 ± 0.444
GlnLys: 2.547 ± 0.376
GlnLeu: 2.423 ± 0.337
GlnMet: 1.118 ± 0.248
GlnAsn: 1.801 ± 0.346
GlnPro: 1.553 ± 0.281
GlnGln: 2.112 ± 0.293
GlnArg: 2.298 ± 0.39
GlnSer: 1.429 ± 0.305
GlnThr: 2.174 ± 0.325
GlnVal: 1.863 ± 0.343
GlnTrp: 0.745 ± 0.177
GlnTyr: 1.304 ± 0.254
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.783 ± 0.497
ArgCys: 1.118 ± 0.368
ArgAsp: 2.857 ± 0.364
ArgGlu: 3.789 ± 0.652
ArgPhe: 1.739 ± 0.36
ArgGly: 2.733 ± 0.463
ArgHis: 0.994 ± 0.281
ArgIle: 3.292 ± 0.448
ArgLys: 3.416 ± 0.567
ArgLeu: 4.1 ± 0.562
ArgMet: 1.615 ± 0.293
ArgAsn: 2.112 ± 0.369
ArgPro: 1.801 ± 0.391
ArgGln: 2.547 ± 0.326
ArgArg: 3.727 ± 0.669
ArgSer: 2.547 ± 0.424
ArgThr: 1.988 ± 0.305
ArgVal: 4.038 ± 0.523
ArgTrp: 0.994 ± 0.248
ArgTyr: 2.547 ± 0.441
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.093 ± 0.538
SerCys: 0.994 ± 0.25
SerAsp: 3.044 ± 0.428
SerGlu: 2.919 ± 0.461
SerPhe: 1.926 ± 0.396
SerGly: 5.839 ± 0.784
SerHis: 0.808 ± 0.207
SerIle: 3.292 ± 0.441
SerLys: 2.547 ± 0.516
SerLeu: 3.789 ± 0.496
SerMet: 1.677 ± 0.406
SerAsn: 2.36 ± 0.311
SerPro: 2.174 ± 0.358
SerGln: 1.491 ± 0.281
SerArg: 2.547 ± 0.457
SerSer: 3.106 ± 0.481
SerThr: 3.168 ± 0.505
SerVal: 3.541 ± 0.433
SerTrp: 0.683 ± 0.196
SerTyr: 1.926 ± 0.379
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.889 ± 0.716
ThrCys: 0.435 ± 0.158
ThrAsp: 4.41 ± 0.483
ThrGlu: 3.665 ± 0.553
ThrPhe: 2.298 ± 0.349
ThrGly: 4.845 ± 0.647
ThrHis: 0.808 ± 0.202
ThrIle: 3.23 ± 0.469
ThrLys: 4.162 ± 0.482
ThrLeu: 4.597 ± 0.54
ThrMet: 1.367 ± 0.333
ThrAsn: 1.801 ± 0.364
ThrPro: 3.416 ± 0.451
ThrGln: 1.429 ± 0.388
ThrArg: 2.36 ± 0.452
ThrSer: 3.541 ± 0.347
ThrThr: 3.727 ± 0.459
ThrVal: 6.398 ± 0.72
ThrTrp: 0.497 ± 0.177
ThrTyr: 2.982 ± 0.649
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.46 ± 0.686
ValCys: 1.18 ± 0.291
ValAsp: 4.907 ± 0.496
ValGlu: 5.59 ± 0.69
ValPhe: 1.367 ± 0.278
ValGly: 4.41 ± 0.748
ValHis: 0.808 ± 0.205
ValIle: 2.609 ± 0.427
ValLys: 5.777 ± 0.574
ValLeu: 5.404 ± 0.467
ValMet: 1.615 ± 0.305
ValAsn: 3.23 ± 0.418
ValPro: 3.168 ± 0.433
ValGln: 2.609 ± 0.335
ValArg: 3.913 ± 0.423
ValSer: 4.534 ± 0.676
ValThr: 4.038 ± 0.545
ValVal: 4.534 ± 0.573
ValTrp: 1.242 ± 0.264
ValTyr: 2.485 ± 0.457
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.677 ± 0.324
TrpCys: 0.186 ± 0.116
TrpAsp: 0.497 ± 0.163
TrpGlu: 1.056 ± 0.235
TrpPhe: 0.621 ± 0.193
TrpGly: 0.559 ± 0.234
TrpHis: 0.497 ± 0.192
TrpIle: 0.621 ± 0.275
TrpLys: 0.994 ± 0.256
TrpLeu: 0.932 ± 0.286
TrpMet: 0.435 ± 0.166
TrpAsn: 0.994 ± 0.227
TrpPro: 0.124 ± 0.081
TrpGln: 0.559 ± 0.194
TrpArg: 0.683 ± 0.228
TrpSer: 0.311 ± 0.137
TrpThr: 0.621 ± 0.169
TrpVal: 0.932 ± 0.227
TrpTrp: 0.497 ± 0.179
TrpTyr: 0.373 ± 0.186
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.919 ± 0.377
TyrCys: 1.242 ± 0.309
TyrAsp: 2.36 ± 0.375
TyrGlu: 2.609 ± 0.475
TyrPhe: 1.367 ± 0.277
TyrGly: 3.106 ± 0.438
TyrHis: 0.497 ± 0.164
TyrIle: 2.298 ± 0.446
TyrLys: 1.926 ± 0.318
TyrLeu: 2.05 ± 0.346
TyrMet: 1.118 ± 0.279
TyrAsn: 1.491 ± 0.322
TyrPro: 1.491 ± 0.321
TyrGln: 0.994 ± 0.26
TyrArg: 2.112 ± 0.424
TyrSer: 1.553 ± 0.324
TyrThr: 3.292 ± 0.44
TyrVal: 1.801 ± 0.304
TyrTrp: 0.559 ± 0.172
TyrTyr: 1.367 ± 0.328
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (16100 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
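The note above gives only the resampling scheme, so the following Python sketch is one plausible reading of it rather than the database's actual procedure: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread is reported. Taking the ± value to be the standard deviation across resamples is an assumption, as are the function names `pair_permille` and `bootstrap_error` and the fixed seed.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the example repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)

# Toy input (made-up sequences, not the FP_Toutatis proteome).
toy = ["AAAK", "AKAA", "KKKK", "AKAK"]
value = pair_permille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residue pairs keeps each protein's pairs together, so the error reflects variation between proteins.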

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski