Amino acid dipepetide frequency for Clostridium phage phiMMP01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.16AlaAla: 2.16 ± 0.67
0.617AlaCys: 0.617 ± 0.211
2.469AlaAsp: 2.469 ± 0.464
3.549AlaGlu: 3.549 ± 0.617
1.697AlaPhe: 1.697 ± 0.406
3.395AlaGly: 3.395 ± 0.68
0.309AlaHis: 0.309 ± 0.127
5.092AlaIle: 5.092 ± 0.536
4.629AlaLys: 4.629 ± 0.611
4.012AlaLeu: 4.012 ± 0.532
1.543AlaMet: 1.543 ± 0.349
2.777AlaAsn: 2.777 ± 0.47
1.08AlaPro: 1.08 ± 0.336
1.466AlaGln: 1.466 ± 0.402
2.237AlaArg: 2.237 ± 0.471
3.857AlaSer: 3.857 ± 0.499
3.626AlaThr: 3.626 ± 0.623
2.546AlaVal: 2.546 ± 0.452
0.463AlaTrp: 0.463 ± 0.172
1.466AlaTyr: 1.466 ± 0.424
0.0AlaXaa: 0.0 ± 0.0
Cys
0.154CysAla: 0.154 ± 0.105
0.309CysCys: 0.309 ± 0.153
1.003CysAsp: 1.003 ± 0.352
0.926CysGlu: 0.926 ± 0.22
0.617CysPhe: 0.617 ± 0.265
0.617CysGly: 0.617 ± 0.331
0.077CysHis: 0.077 ± 0.076
1.234CysIle: 1.234 ± 0.371
1.234CysLys: 1.234 ± 0.353
0.849CysLeu: 0.849 ± 0.228
0.077CysMet: 0.077 ± 0.076
0.849CysAsn: 0.849 ± 0.209
0.386CysPro: 0.386 ± 0.149
0.077CysGln: 0.077 ± 0.085
0.926CysArg: 0.926 ± 0.342
0.617CysSer: 0.617 ± 0.245
0.231CysThr: 0.231 ± 0.124
0.309CysVal: 0.309 ± 0.148
0.386CysTrp: 0.386 ± 0.144
0.309CysTyr: 0.309 ± 0.175
0.0CysXaa: 0.0 ± 0.0
Asp
2.546AspAla: 2.546 ± 0.515
0.771AspCys: 0.771 ± 0.242
3.549AspAsp: 3.549 ± 0.639
4.243AspGlu: 4.243 ± 0.614
3.317AspPhe: 3.317 ± 0.52
3.935AspGly: 3.935 ± 0.513
0.154AspHis: 0.154 ± 0.107
5.863AspIle: 5.863 ± 0.55
6.48AspLys: 6.48 ± 0.546
4.938AspLeu: 4.938 ± 0.528
1.312AspMet: 1.312 ± 0.296
4.32AspAsn: 4.32 ± 0.443
0.694AspPro: 0.694 ± 0.241
0.694AspGln: 0.694 ± 0.255
2.469AspArg: 2.469 ± 0.479
3.626AspSer: 3.626 ± 0.601
3.626AspThr: 3.626 ± 0.491
3.472AspVal: 3.472 ± 0.558
0.771AspTrp: 0.771 ± 0.203
3.395AspTyr: 3.395 ± 0.489
0.0AspXaa: 0.0 ± 0.0
Glu
4.397GluAla: 4.397 ± 0.577
0.694GluCys: 0.694 ± 0.206
4.938GluAsp: 4.938 ± 0.573
7.792GluGlu: 7.792 ± 0.729
3.857GluPhe: 3.857 ± 0.545
3.549GluGly: 3.549 ± 0.531
0.54GluHis: 0.54 ± 0.182
7.638GluIle: 7.638 ± 0.806
9.566GluLys: 9.566 ± 0.966
10.029GluLeu: 10.029 ± 1.046
2.7GluMet: 2.7 ± 0.539
6.326GluAsn: 6.326 ± 0.82
1.312GluPro: 1.312 ± 0.361
2.932GluGln: 2.932 ± 0.463
2.546GluArg: 2.546 ± 0.485
4.089GluSer: 4.089 ± 0.431
3.935GluThr: 3.935 ± 0.664
3.703GluVal: 3.703 ± 0.523
0.771GluTrp: 0.771 ± 0.28
4.32GluTyr: 4.32 ± 0.705
0.0GluXaa: 0.0 ± 0.0
Phe
1.852PheAla: 1.852 ± 0.457
0.617PheCys: 0.617 ± 0.223
3.24PheAsp: 3.24 ± 0.452
4.089PheGlu: 4.089 ± 0.541
1.543PhePhe: 1.543 ± 0.321
2.623PheGly: 2.623 ± 0.392
0.463PheHis: 0.463 ± 0.202
3.009PheIle: 3.009 ± 0.438
3.549PheLys: 3.549 ± 0.557
2.546PheLeu: 2.546 ± 0.414
1.234PheMet: 1.234 ± 0.302
2.854PheAsn: 2.854 ± 0.479
0.617PhePro: 0.617 ± 0.242
1.312PheGln: 1.312 ± 0.336
1.466PheArg: 1.466 ± 0.354
2.16PheSer: 2.16 ± 0.403
2.546PheThr: 2.546 ± 0.442
1.929PheVal: 1.929 ± 0.406
0.309PheTrp: 0.309 ± 0.119
1.312PheTyr: 1.312 ± 0.345
0.0PheXaa: 0.0 ± 0.0
Gly
2.546GlyAla: 2.546 ± 0.551
0.771GlyCys: 0.771 ± 0.267
2.546GlyAsp: 2.546 ± 0.5
4.86GlyGlu: 4.86 ± 0.532
2.16GlyPhe: 2.16 ± 0.459
2.546GlyGly: 2.546 ± 0.439
0.849GlyHis: 0.849 ± 0.33
4.783GlyIle: 4.783 ± 0.659
6.172GlyLys: 6.172 ± 0.72
3.472GlyLeu: 3.472 ± 0.532
1.157GlyMet: 1.157 ± 0.444
4.32GlyAsn: 4.32 ± 0.612
0.463GlyPro: 0.463 ± 0.188
1.543GlyGln: 1.543 ± 0.358
1.697GlyArg: 1.697 ± 0.41
3.24GlySer: 3.24 ± 0.467
2.16GlyThr: 2.16 ± 0.436
4.475GlyVal: 4.475 ± 0.543
1.003GlyTrp: 1.003 ± 0.278
2.854GlyTyr: 2.854 ± 0.466
0.0GlyXaa: 0.0 ± 0.0
His
0.386HisAla: 0.386 ± 0.184
0.386HisCys: 0.386 ± 0.162
0.463HisAsp: 0.463 ± 0.172
1.157HisGlu: 1.157 ± 0.254
0.694HisPhe: 0.694 ± 0.196
0.154HisGly: 0.154 ± 0.122
0.231HisHis: 0.231 ± 0.118
0.926HisIle: 0.926 ± 0.312
1.157HisLys: 1.157 ± 0.307
0.926HisLeu: 0.926 ± 0.285
0.386HisMet: 0.386 ± 0.18
0.463HisAsn: 0.463 ± 0.207
0.617HisPro: 0.617 ± 0.252
0.309HisGln: 0.309 ± 0.146
0.386HisArg: 0.386 ± 0.134
0.771HisSer: 0.771 ± 0.209
0.926HisThr: 0.926 ± 0.255
0.386HisVal: 0.386 ± 0.152
0.309HisTrp: 0.309 ± 0.135
0.617HisTyr: 0.617 ± 0.176
0.0HisXaa: 0.0 ± 0.0
Ile
5.323IleAla: 5.323 ± 0.745
1.08IleCys: 1.08 ± 0.376
6.326IleAsp: 6.326 ± 0.777
8.332IleGlu: 8.332 ± 0.843
2.854IlePhe: 2.854 ± 0.529
4.243IleGly: 4.243 ± 0.579
1.157IleHis: 1.157 ± 0.287
6.712IleIle: 6.712 ± 0.884
10.184IleLys: 10.184 ± 0.863
6.712IleLeu: 6.712 ± 0.851
1.003IleMet: 1.003 ± 0.317
7.098IleAsn: 7.098 ± 0.542
2.546IlePro: 2.546 ± 0.534
2.623IleGln: 2.623 ± 0.428
3.163IleArg: 3.163 ± 0.515
6.326IleSer: 6.326 ± 0.79
3.78IleThr: 3.78 ± 0.443
5.4IleVal: 5.4 ± 0.83
0.54IleTrp: 0.54 ± 0.172
3.472IleTyr: 3.472 ± 0.549
0.0IleXaa: 0.0 ± 0.0
Lys
4.243LysAla: 4.243 ± 0.535
1.234LysCys: 1.234 ± 0.33
6.326LysAsp: 6.326 ± 0.723
10.801LysGlu: 10.801 ± 1.159
4.012LysPhe: 4.012 ± 0.632
5.015LysGly: 5.015 ± 0.591
1.62LysHis: 1.62 ± 0.497
9.026LysIle: 9.026 ± 0.832
10.647LysLys: 10.647 ± 1.193
9.489LysLeu: 9.489 ± 0.807
3.086LysMet: 3.086 ± 0.388
7.792LysAsn: 7.792 ± 0.891
1.389LysPro: 1.389 ± 0.391
4.475LysGln: 4.475 ± 0.622
4.397LysArg: 4.397 ± 0.518
6.326LysSer: 6.326 ± 0.775
4.783LysThr: 4.783 ± 0.632
7.175LysVal: 7.175 ± 0.731
0.849LysTrp: 0.849 ± 0.235
5.786LysTyr: 5.786 ± 0.729
0.0LysXaa: 0.0 ± 0.0
Leu
3.395LeuAla: 3.395 ± 0.472
0.771LeuCys: 0.771 ± 0.232
6.172LeuAsp: 6.172 ± 0.567
8.023LeuGlu: 8.023 ± 0.743
2.16LeuPhe: 2.16 ± 0.416
4.32LeuGly: 4.32 ± 0.727
1.08LeuHis: 1.08 ± 0.223
7.561LeuIle: 7.561 ± 0.863
10.415LeuLys: 10.415 ± 0.926
5.94LeuLeu: 5.94 ± 0.65
1.697LeuMet: 1.697 ± 0.416
6.712LeuAsn: 6.712 ± 0.914
1.312LeuPro: 1.312 ± 0.341
2.546LeuGln: 2.546 ± 0.392
4.012LeuArg: 4.012 ± 0.604
5.092LeuSer: 5.092 ± 0.718
4.706LeuThr: 4.706 ± 0.568
4.706LeuVal: 4.706 ± 0.695
0.617LeuTrp: 0.617 ± 0.235
3.703LeuTyr: 3.703 ± 0.505
0.0LeuXaa: 0.0 ± 0.0
Met
2.16MetAla: 2.16 ± 0.507
0.0MetCys: 0.0 ± 0.0
1.543MetAsp: 1.543 ± 0.334
2.006MetGlu: 2.006 ± 0.435
0.154MetPhe: 0.154 ± 0.101
0.849MetGly: 0.849 ± 0.291
0.077MetHis: 0.077 ± 0.068
1.312MetIle: 1.312 ± 0.359
2.314MetLys: 2.314 ± 0.432
1.852MetLeu: 1.852 ± 0.368
0.231MetMet: 0.231 ± 0.138
2.006MetAsn: 2.006 ± 0.341
0.54MetPro: 0.54 ± 0.199
0.694MetGln: 0.694 ± 0.242
1.003MetArg: 1.003 ± 0.262
1.543MetSer: 1.543 ± 0.293
1.852MetThr: 1.852 ± 0.29
0.771MetVal: 0.771 ± 0.274
0.077MetTrp: 0.077 ± 0.071
0.849MetTyr: 0.849 ± 0.251
0.0MetXaa: 0.0 ± 0.0
Asn
3.78AsnAla: 3.78 ± 0.867
0.694AsnCys: 0.694 ± 0.231
3.857AsnAsp: 3.857 ± 0.673
5.246AsnGlu: 5.246 ± 0.654
2.7AsnPhe: 2.7 ± 0.496
3.703AsnGly: 3.703 ± 0.459
0.849AsnHis: 0.849 ± 0.257
6.558AsnIle: 6.558 ± 0.754
9.721AsnLys: 9.721 ± 0.973
6.018AsnLeu: 6.018 ± 0.544
1.466AsnMet: 1.466 ± 0.269
5.863AsnAsn: 5.863 ± 0.734
1.929AsnPro: 1.929 ± 0.429
1.852AsnGln: 1.852 ± 0.295
2.623AsnArg: 2.623 ± 0.451
4.86AsnSer: 4.86 ± 0.68
4.012AsnThr: 4.012 ± 0.489
4.089AsnVal: 4.089 ± 0.557
0.694AsnTrp: 0.694 ± 0.235
2.546AsnTyr: 2.546 ± 0.409
0.0AsnXaa: 0.0 ± 0.0
Pro
1.157ProAla: 1.157 ± 0.268
0.54ProCys: 0.54 ± 0.19
1.312ProAsp: 1.312 ± 0.356
1.543ProGlu: 1.543 ± 0.365
0.617ProPhe: 0.617 ± 0.248
0.771ProGly: 0.771 ± 0.237
0.231ProHis: 0.231 ± 0.138
1.774ProIle: 1.774 ± 0.532
2.392ProLys: 2.392 ± 0.443
1.389ProLeu: 1.389 ± 0.3
0.154ProMet: 0.154 ± 0.105
1.234ProAsn: 1.234 ± 0.273
0.463ProPro: 0.463 ± 0.293
0.463ProGln: 0.463 ± 0.172
0.231ProArg: 0.231 ± 0.144
1.389ProSer: 1.389 ± 0.354
1.389ProThr: 1.389 ± 0.391
1.697ProVal: 1.697 ± 0.368
0.309ProTrp: 0.309 ± 0.175
0.694ProTyr: 0.694 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
1.466GlnAla: 1.466 ± 0.336
0.0GlnCys: 0.0 ± 0.0
1.774GlnAsp: 1.774 ± 0.374
2.854GlnGlu: 2.854 ± 0.468
1.003GlnPhe: 1.003 ± 0.284
1.852GlnGly: 1.852 ± 0.37
0.386GlnHis: 0.386 ± 0.187
3.163GlnIle: 3.163 ± 0.532
2.314GlnLys: 2.314 ± 0.4
3.163GlnLeu: 3.163 ± 0.543
0.771GlnMet: 0.771 ± 0.222
2.469GlnAsn: 2.469 ± 0.516
0.54GlnPro: 0.54 ± 0.291
1.543GlnGln: 1.543 ± 0.386
1.003GlnArg: 1.003 ± 0.361
1.697GlnSer: 1.697 ± 0.33
2.006GlnThr: 2.006 ± 0.342
1.003GlnVal: 1.003 ± 0.298
0.077GlnTrp: 0.077 ± 0.071
1.003GlnTyr: 1.003 ± 0.25
0.0GlnXaa: 0.0 ± 0.0
Arg
2.006ArgAla: 2.006 ± 0.452
0.694ArgCys: 0.694 ± 0.236
2.392ArgAsp: 2.392 ± 0.467
3.163ArgGlu: 3.163 ± 0.459
1.62ArgPhe: 1.62 ± 0.308
2.237ArgGly: 2.237 ± 0.453
0.463ArgHis: 0.463 ± 0.2
3.549ArgIle: 3.549 ± 0.543
4.089ArgLys: 4.089 ± 0.656
3.086ArgLeu: 3.086 ± 0.422
1.003ArgMet: 1.003 ± 0.252
1.697ArgAsn: 1.697 ± 0.361
0.463ArgPro: 0.463 ± 0.188
1.003ArgGln: 1.003 ± 0.219
1.08ArgArg: 1.08 ± 0.281
1.697ArgSer: 1.697 ± 0.311
1.312ArgThr: 1.312 ± 0.317
2.546ArgVal: 2.546 ± 0.358
0.386ArgTrp: 0.386 ± 0.181
1.852ArgTyr: 1.852 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
2.932SerAla: 2.932 ± 0.638
0.463SerCys: 0.463 ± 0.148
2.392SerAsp: 2.392 ± 0.386
4.783SerGlu: 4.783 ± 0.619
3.395SerPhe: 3.395 ± 0.419
3.472SerGly: 3.472 ± 0.531
0.54SerHis: 0.54 ± 0.187
6.403SerIle: 6.403 ± 0.788
7.792SerLys: 7.792 ± 0.802
5.169SerLeu: 5.169 ± 0.599
1.157SerMet: 1.157 ± 0.288
4.629SerAsn: 4.629 ± 0.626
0.849SerPro: 0.849 ± 0.265
1.697SerGln: 1.697 ± 0.292
1.852SerArg: 1.852 ± 0.332
4.32SerSer: 4.32 ± 0.862
3.78SerThr: 3.78 ± 0.484
3.24SerVal: 3.24 ± 0.416
0.54SerTrp: 0.54 ± 0.166
2.7SerTyr: 2.7 ± 0.362
0.0SerXaa: 0.0 ± 0.0
Thr
2.314ThrAla: 2.314 ± 0.502
0.309ThrCys: 0.309 ± 0.144
2.7ThrAsp: 2.7 ± 0.502
3.626ThrGlu: 3.626 ± 0.44
2.469ThrPhe: 2.469 ± 0.43
3.935ThrGly: 3.935 ± 0.511
1.157ThrHis: 1.157 ± 0.285
6.095ThrIle: 6.095 ± 0.66
5.4ThrLys: 5.4 ± 0.701
4.629ThrLeu: 4.629 ± 0.545
0.926ThrMet: 0.926 ± 0.226
3.163ThrAsn: 3.163 ± 0.467
2.237ThrPro: 2.237 ± 0.462
1.697ThrGln: 1.697 ± 0.38
1.312ThrArg: 1.312 ± 0.338
3.163ThrSer: 3.163 ± 0.404
3.78ThrThr: 3.78 ± 0.644
2.546ThrVal: 2.546 ± 0.48
0.463ThrTrp: 0.463 ± 0.168
2.237ThrTyr: 2.237 ± 0.409
0.0ThrXaa: 0.0 ± 0.0
Val
4.012ValAla: 4.012 ± 0.749
0.54ValCys: 0.54 ± 0.221
4.012ValAsp: 4.012 ± 0.539
4.938ValGlu: 4.938 ± 0.581
2.083ValPhe: 2.083 ± 0.346
3.626ValGly: 3.626 ± 0.477
1.08ValHis: 1.08 ± 0.285
3.395ValIle: 3.395 ± 0.568
4.706ValLys: 4.706 ± 0.561
5.015ValLeu: 5.015 ± 0.618
1.08ValMet: 1.08 ± 0.317
4.938ValAsn: 4.938 ± 0.5
1.312ValPro: 1.312 ± 0.315
1.389ValGln: 1.389 ± 0.349
2.546ValArg: 2.546 ± 0.433
4.012ValSer: 4.012 ± 0.658
2.392ValThr: 2.392 ± 0.445
3.24ValVal: 3.24 ± 0.587
0.309ValTrp: 0.309 ± 0.125
2.083ValTyr: 2.083 ± 0.363
0.0ValXaa: 0.0 ± 0.0
Trp
0.309TrpAla: 0.309 ± 0.145
0.154TrpCys: 0.154 ± 0.106
0.54TrpAsp: 0.54 ± 0.193
1.003TrpGlu: 1.003 ± 0.283
0.386TrpPhe: 0.386 ± 0.145
0.771TrpGly: 0.771 ± 0.214
0.077TrpHis: 0.077 ± 0.106
0.771TrpIle: 0.771 ± 0.245
0.926TrpLys: 0.926 ± 0.342
0.849TrpLeu: 0.849 ± 0.281
0.231TrpMet: 0.231 ± 0.132
0.617TrpAsn: 0.617 ± 0.237
0.154TrpPro: 0.154 ± 0.109
0.309TrpGln: 0.309 ± 0.137
0.077TrpArg: 0.077 ± 0.081
0.463TrpSer: 0.463 ± 0.217
0.463TrpThr: 0.463 ± 0.208
0.694TrpVal: 0.694 ± 0.246
0.0TrpTrp: 0.0 ± 0.0
0.386TrpTyr: 0.386 ± 0.261
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.852TyrAla: 1.852 ± 0.505
0.54TyrCys: 0.54 ± 0.213
2.623TyrAsp: 2.623 ± 0.411
2.7TyrGlu: 2.7 ± 0.528
2.006TyrPhe: 2.006 ± 0.381
2.006TyrGly: 2.006 ± 0.397
0.309TyrHis: 0.309 ± 0.142
4.012TyrIle: 4.012 ± 0.559
4.783TyrLys: 4.783 ± 0.581
4.86TyrLeu: 4.86 ± 0.559
0.54TyrMet: 0.54 ± 0.208
2.932TyrAsn: 2.932 ± 0.515
0.849TyrPro: 0.849 ± 0.27
1.466TyrGln: 1.466 ± 0.346
1.312TyrArg: 1.312 ± 0.327
2.854TyrSer: 2.854 ± 0.481
2.854TyrThr: 2.854 ± 0.469
2.777TyrVal: 2.777 ± 0.396
0.309TyrTrp: 0.309 ± 0.145
2.392TyrTyr: 2.392 ± 0.601
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (12963 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski