Amino acid dipepetide frequency for Mycobacterium phage Fishburne

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.5AlaAla: 17.5 ± 1.729
0.983AlaCys: 0.983 ± 0.254
8.586AlaAsp: 8.586 ± 0.944
10.946AlaGlu: 10.946 ± 1.434
3.146AlaPhe: 3.146 ± 0.501
8.717AlaGly: 8.717 ± 0.993
2.36AlaHis: 2.36 ± 0.445
5.833AlaIle: 5.833 ± 0.639
4.523AlaLys: 4.523 ± 0.655
9.963AlaLeu: 9.963 ± 0.765
2.36AlaMet: 2.36 ± 0.364
3.474AlaAsn: 3.474 ± 0.526
7.079AlaPro: 7.079 ± 0.904
4.457AlaGln: 4.457 ± 0.684
8.717AlaArg: 8.717 ± 1.086
6.227AlaSer: 6.227 ± 0.676
6.489AlaThr: 6.489 ± 0.641
6.882AlaVal: 6.882 ± 0.718
2.228AlaTrp: 2.228 ± 0.486
2.818AlaTyr: 2.818 ± 0.408
0.0AlaXaa: 0.0 ± 0.0
Cys
0.721CysAla: 0.721 ± 0.249
0.066CysCys: 0.066 ± 0.066
0.59CysAsp: 0.59 ± 0.209
0.852CysGlu: 0.852 ± 0.243
0.066CysPhe: 0.066 ± 0.06
1.245CysGly: 1.245 ± 0.323
0.393CysHis: 0.393 ± 0.154
0.328CysIle: 0.328 ± 0.198
0.262CysLys: 0.262 ± 0.126
0.59CysLeu: 0.59 ± 0.183
0.262CysMet: 0.262 ± 0.13
0.131CysAsn: 0.131 ± 0.104
0.918CysPro: 0.918 ± 0.374
0.393CysGln: 0.393 ± 0.172
0.852CysArg: 0.852 ± 0.203
0.393CysSer: 0.393 ± 0.177
0.787CysThr: 0.787 ± 0.22
0.197CysVal: 0.197 ± 0.106
0.131CysTrp: 0.131 ± 0.1
0.197CysTyr: 0.197 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
8.127AspAla: 8.127 ± 1.0
0.721AspCys: 0.721 ± 0.249
5.768AspAsp: 5.768 ± 0.728
5.44AspGlu: 5.44 ± 0.703
1.376AspPhe: 1.376 ± 0.333
6.489AspGly: 6.489 ± 0.838
1.18AspHis: 1.18 ± 0.278
2.163AspIle: 2.163 ± 0.33
1.639AspLys: 1.639 ± 0.267
5.44AspLeu: 5.44 ± 0.674
1.376AspMet: 1.376 ± 0.231
1.311AspAsn: 1.311 ± 0.259
4.457AspPro: 4.457 ± 0.621
1.442AspGln: 1.442 ± 0.327
5.309AspArg: 5.309 ± 0.468
2.818AspSer: 2.818 ± 0.585
3.277AspThr: 3.277 ± 0.477
3.67AspVal: 3.67 ± 0.433
1.77AspTrp: 1.77 ± 0.286
1.573AspTyr: 1.573 ± 0.241
0.0AspXaa: 0.0 ± 0.0
Glu
8.062GluAla: 8.062 ± 0.844
0.918GluCys: 0.918 ± 0.238
3.081GluAsp: 3.081 ± 0.513
2.687GluGlu: 2.687 ± 0.411
1.966GluPhe: 1.966 ± 0.304
4.064GluGly: 4.064 ± 0.601
1.77GluHis: 1.77 ± 0.395
3.67GluIle: 3.67 ± 0.536
1.704GluLys: 1.704 ± 0.369
6.751GluLeu: 6.751 ± 0.778
1.442GluMet: 1.442 ± 0.365
1.639GluAsn: 1.639 ± 0.287
3.736GluPro: 3.736 ± 0.563
2.491GluGln: 2.491 ± 0.381
4.129GluArg: 4.129 ± 0.544
2.818GluSer: 2.818 ± 0.494
3.015GluThr: 3.015 ± 0.502
4.457GluVal: 4.457 ± 0.555
1.442GluTrp: 1.442 ± 0.345
1.704GluTyr: 1.704 ± 0.263
0.0GluXaa: 0.0 ± 0.0
Phe
2.556PheAla: 2.556 ± 0.428
0.131PheCys: 0.131 ± 0.108
2.556PheAsp: 2.556 ± 0.377
2.097PheGlu: 2.097 ± 0.378
0.59PhePhe: 0.59 ± 0.165
3.802PheGly: 3.802 ± 0.596
0.59PheHis: 0.59 ± 0.167
1.508PheIle: 1.508 ± 0.328
1.049PheLys: 1.049 ± 0.25
1.639PheLeu: 1.639 ± 0.339
0.852PheMet: 0.852 ± 0.192
1.18PheAsn: 1.18 ± 0.359
1.835PhePro: 1.835 ± 0.461
1.049PheGln: 1.049 ± 0.255
1.639PheArg: 1.639 ± 0.309
1.245PheSer: 1.245 ± 0.346
1.508PheThr: 1.508 ± 0.358
1.901PheVal: 1.901 ± 0.289
0.328PheTrp: 0.328 ± 0.147
0.655PheTyr: 0.655 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
9.7GlyAla: 9.7 ± 1.009
0.262GlyCys: 0.262 ± 0.121
5.047GlyAsp: 5.047 ± 0.57
4.916GlyGlu: 4.916 ± 0.721
3.212GlyPhe: 3.212 ± 0.536
10.028GlyGly: 10.028 ± 1.139
1.442GlyHis: 1.442 ± 0.358
3.474GlyIle: 3.474 ± 0.418
3.146GlyLys: 3.146 ± 0.407
7.603GlyLeu: 7.603 ± 0.721
1.311GlyMet: 1.311 ± 0.301
2.556GlyAsn: 2.556 ± 0.357
4.391GlyPro: 4.391 ± 0.714
3.736GlyGln: 3.736 ± 0.743
5.899GlyArg: 5.899 ± 0.591
6.227GlySer: 6.227 ± 0.801
5.375GlyThr: 5.375 ± 0.664
7.275GlyVal: 7.275 ± 0.759
1.508GlyTrp: 1.508 ± 0.35
2.36GlyTyr: 2.36 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
1.704HisAla: 1.704 ± 0.341
0.131HisCys: 0.131 ± 0.091
1.77HisAsp: 1.77 ± 0.284
1.573HisGlu: 1.573 ± 0.424
0.262HisPhe: 0.262 ± 0.139
0.918HisGly: 0.918 ± 0.21
0.459HisHis: 0.459 ± 0.198
1.442HisIle: 1.442 ± 0.281
0.852HisLys: 0.852 ± 0.219
1.311HisLeu: 1.311 ± 0.294
0.393HisMet: 0.393 ± 0.147
0.721HisAsn: 0.721 ± 0.198
1.18HisPro: 1.18 ± 0.292
0.787HisGln: 0.787 ± 0.211
1.508HisArg: 1.508 ± 0.379
0.852HisSer: 0.852 ± 0.242
1.18HisThr: 1.18 ± 0.324
1.573HisVal: 1.573 ± 0.323
0.459HisTrp: 0.459 ± 0.202
0.59HisTyr: 0.59 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
6.161IleAla: 6.161 ± 0.604
0.328IleCys: 0.328 ± 0.165
3.343IleAsp: 3.343 ± 0.413
4.064IleGlu: 4.064 ± 0.564
0.524IlePhe: 0.524 ± 0.179
3.867IleGly: 3.867 ± 0.603
0.852IleHis: 0.852 ± 0.23
0.655IleIle: 0.655 ± 0.188
1.18IleLys: 1.18 ± 0.257
1.639IleLeu: 1.639 ± 0.312
0.393IleMet: 0.393 ± 0.162
1.114IleAsn: 1.114 ± 0.252
2.556IlePro: 2.556 ± 0.297
1.508IleGln: 1.508 ± 0.322
3.933IleArg: 3.933 ± 0.583
2.294IleSer: 2.294 ± 0.39
3.539IleThr: 3.539 ± 0.582
3.277IleVal: 3.277 ± 0.511
0.524IleTrp: 0.524 ± 0.192
0.983IleTyr: 0.983 ± 0.256
0.0IleXaa: 0.0 ± 0.0
Lys
4.195LysAla: 4.195 ± 0.525
0.197LysCys: 0.197 ± 0.104
1.18LysAsp: 1.18 ± 0.283
0.983LysGlu: 0.983 ± 0.255
1.376LysPhe: 1.376 ± 0.268
2.097LysGly: 2.097 ± 0.444
0.393LysHis: 0.393 ± 0.153
0.852LysIle: 0.852 ± 0.255
0.721LysLys: 0.721 ± 0.223
3.343LysLeu: 3.343 ± 0.393
0.852LysMet: 0.852 ± 0.264
0.787LysAsn: 0.787 ± 0.259
2.949LysPro: 2.949 ± 0.399
0.852LysGln: 0.852 ± 0.265
2.425LysArg: 2.425 ± 0.413
1.966LysSer: 1.966 ± 0.344
2.097LysThr: 2.097 ± 0.401
2.36LysVal: 2.36 ± 0.365
0.066LysTrp: 0.066 ± 0.061
0.655LysTyr: 0.655 ± 0.168
0.0LysXaa: 0.0 ± 0.0
Leu
10.487LeuAla: 10.487 ± 0.852
0.983LeuCys: 0.983 ± 0.28
6.096LeuAsp: 6.096 ± 0.605
3.867LeuGlu: 3.867 ± 0.477
1.508LeuPhe: 1.508 ± 0.304
8.062LeuGly: 8.062 ± 0.804
1.245LeuHis: 1.245 ± 0.234
4.457LeuIle: 4.457 ± 0.499
2.425LeuLys: 2.425 ± 0.356
6.751LeuLeu: 6.751 ± 0.65
1.311LeuMet: 1.311 ± 0.329
1.966LeuAsn: 1.966 ± 0.485
5.506LeuPro: 5.506 ± 0.566
2.36LeuGln: 2.36 ± 0.43
6.03LeuArg: 6.03 ± 0.617
4.195LeuSer: 4.195 ± 0.697
5.309LeuThr: 5.309 ± 0.607
4.719LeuVal: 4.719 ± 0.559
1.049LeuTrp: 1.049 ± 0.223
1.245LeuTyr: 1.245 ± 0.268
0.0LeuXaa: 0.0 ± 0.0
Met
3.736MetAla: 3.736 ± 0.58
0.197MetCys: 0.197 ± 0.102
0.524MetAsp: 0.524 ± 0.2
0.393MetGlu: 0.393 ± 0.162
0.655MetPhe: 0.655 ± 0.199
0.721MetGly: 0.721 ± 0.179
0.262MetHis: 0.262 ± 0.139
0.721MetIle: 0.721 ± 0.262
0.328MetLys: 0.328 ± 0.156
1.508MetLeu: 1.508 ± 0.324
0.262MetMet: 0.262 ± 0.146
0.787MetAsn: 0.787 ± 0.224
1.18MetPro: 1.18 ± 0.27
0.59MetGln: 0.59 ± 0.232
2.163MetArg: 2.163 ± 0.376
1.835MetSer: 1.835 ± 0.386
2.163MetThr: 2.163 ± 0.332
0.918MetVal: 0.918 ± 0.235
0.197MetTrp: 0.197 ± 0.113
0.328MetTyr: 0.328 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
3.736AsnAla: 3.736 ± 0.671
0.262AsnCys: 0.262 ± 0.144
1.311AsnAsp: 1.311 ± 0.35
1.508AsnGlu: 1.508 ± 0.348
0.459AsnPhe: 0.459 ± 0.142
3.736AsnGly: 3.736 ± 0.419
0.852AsnHis: 0.852 ± 0.197
0.983AsnIle: 0.983 ± 0.245
0.393AsnLys: 0.393 ± 0.156
1.77AsnLeu: 1.77 ± 0.312
0.524AsnMet: 0.524 ± 0.212
0.918AsnAsn: 0.918 ± 0.232
3.015AsnPro: 3.015 ± 0.359
0.459AsnGln: 0.459 ± 0.197
2.753AsnArg: 2.753 ± 0.401
2.294AsnSer: 2.294 ± 0.433
1.901AsnThr: 1.901 ± 0.407
1.704AsnVal: 1.704 ± 0.37
0.721AsnTrp: 0.721 ± 0.195
0.393AsnTyr: 0.393 ± 0.169
0.0AsnXaa: 0.0 ± 0.0
Pro
7.079ProAla: 7.079 ± 0.915
0.262ProCys: 0.262 ± 0.137
4.457ProAsp: 4.457 ± 0.507
3.802ProGlu: 3.802 ± 0.561
1.901ProPhe: 1.901 ± 0.338
6.358ProGly: 6.358 ± 0.679
1.049ProHis: 1.049 ± 0.276
1.901ProIle: 1.901 ± 0.336
2.032ProLys: 2.032 ± 0.334
4.916ProLeu: 4.916 ± 0.58
0.787ProMet: 0.787 ± 0.247
2.687ProAsn: 2.687 ± 0.624
3.146ProPro: 3.146 ± 0.613
2.687ProGln: 2.687 ± 0.433
4.785ProArg: 4.785 ± 0.636
2.949ProSer: 2.949 ± 0.425
3.605ProThr: 3.605 ± 0.484
4.981ProVal: 4.981 ± 0.646
1.311ProTrp: 1.311 ± 0.315
1.573ProTyr: 1.573 ± 0.335
0.0ProXaa: 0.0 ± 0.0
Gln
4.129GlnAla: 4.129 ± 0.796
0.262GlnCys: 0.262 ± 0.144
1.442GlnAsp: 1.442 ± 0.271
1.573GlnGlu: 1.573 ± 0.297
1.966GlnPhe: 1.966 ± 0.529
3.015GlnGly: 3.015 ± 0.681
0.655GlnHis: 0.655 ± 0.175
1.966GlnIle: 1.966 ± 0.385
1.376GlnLys: 1.376 ± 0.278
2.491GlnLeu: 2.491 ± 0.355
1.049GlnMet: 1.049 ± 0.221
0.852GlnAsn: 0.852 ± 0.26
2.294GlnPro: 2.294 ± 0.372
2.949GlnGln: 2.949 ± 0.479
2.556GlnArg: 2.556 ± 0.439
2.032GlnSer: 2.032 ± 0.356
2.097GlnThr: 2.097 ± 0.276
2.491GlnVal: 2.491 ± 0.442
1.049GlnTrp: 1.049 ± 0.286
0.459GlnTyr: 0.459 ± 0.139
0.0GlnXaa: 0.0 ± 0.0
Arg
8.39ArgAla: 8.39 ± 0.806
0.852ArgCys: 0.852 ± 0.26
4.916ArgAsp: 4.916 ± 0.491
4.326ArgGlu: 4.326 ± 0.506
2.36ArgPhe: 2.36 ± 0.353
5.178ArgGly: 5.178 ± 0.663
1.901ArgHis: 1.901 ± 0.376
3.212ArgIle: 3.212 ± 0.432
3.081ArgLys: 3.081 ± 0.465
5.506ArgLeu: 5.506 ± 0.509
2.032ArgMet: 2.032 ± 0.359
2.425ArgAsn: 2.425 ± 0.469
3.736ArgPro: 3.736 ± 0.557
2.753ArgGln: 2.753 ± 0.406
6.882ArgArg: 6.882 ± 1.145
3.408ArgSer: 3.408 ± 0.564
4.523ArgThr: 4.523 ± 0.537
5.112ArgVal: 5.112 ± 0.68
1.704ArgTrp: 1.704 ± 0.357
1.835ArgTyr: 1.835 ± 0.426
0.0ArgXaa: 0.0 ± 0.0
Ser
6.882SerAla: 6.882 ± 0.639
0.459SerCys: 0.459 ± 0.152
3.015SerAsp: 3.015 ± 0.466
2.36SerGlu: 2.36 ± 0.495
1.573SerPhe: 1.573 ± 0.319
6.751SerGly: 6.751 ± 1.074
0.852SerHis: 0.852 ± 0.235
2.032SerIle: 2.032 ± 0.431
1.376SerLys: 1.376 ± 0.307
3.867SerLeu: 3.867 ± 0.541
1.442SerMet: 1.442 ± 0.354
1.835SerAsn: 1.835 ± 0.434
2.884SerPro: 2.884 ± 0.431
1.966SerGln: 1.966 ± 0.361
3.605SerArg: 3.605 ± 0.548
2.491SerSer: 2.491 ± 0.482
3.736SerThr: 3.736 ± 0.586
4.129SerVal: 4.129 ± 0.468
0.787SerTrp: 0.787 ± 0.204
1.18SerTyr: 1.18 ± 0.237
0.0SerXaa: 0.0 ± 0.0
Thr
6.685ThrAla: 6.685 ± 0.589
0.393ThrCys: 0.393 ± 0.168
4.129ThrAsp: 4.129 ± 0.461
2.884ThrGlu: 2.884 ± 0.435
1.704ThrPhe: 1.704 ± 0.458
5.375ThrGly: 5.375 ± 0.551
0.983ThrHis: 0.983 ± 0.218
3.933ThrIle: 3.933 ± 0.641
1.901ThrLys: 1.901 ± 0.32
5.44ThrLeu: 5.44 ± 0.556
0.983ThrMet: 0.983 ± 0.23
1.573ThrAsn: 1.573 ± 0.354
5.047ThrPro: 5.047 ± 0.547
2.163ThrGln: 2.163 ± 0.389
3.736ThrArg: 3.736 ± 0.493
2.622ThrSer: 2.622 ± 0.383
4.129ThrThr: 4.129 ± 0.518
5.964ThrVal: 5.964 ± 0.672
1.311ThrTrp: 1.311 ± 0.392
1.639ThrTyr: 1.639 ± 0.322
0.0ThrXaa: 0.0 ± 0.0
Val
8.586ValAla: 8.586 ± 0.795
1.049ValCys: 1.049 ± 0.287
5.375ValAsp: 5.375 ± 0.637
5.178ValGlu: 5.178 ± 0.678
2.294ValPhe: 2.294 ± 0.417
5.833ValGly: 5.833 ± 0.677
1.311ValHis: 1.311 ± 0.254
2.097ValIle: 2.097 ± 0.34
1.376ValLys: 1.376 ± 0.3
5.112ValLeu: 5.112 ± 0.621
0.983ValMet: 0.983 ± 0.228
2.556ValAsn: 2.556 ± 0.398
4.26ValPro: 4.26 ± 0.575
1.901ValGln: 1.901 ± 0.376
4.26ValArg: 4.26 ± 0.576
4.195ValSer: 4.195 ± 0.641
5.178ValThr: 5.178 ± 0.754
6.882ValVal: 6.882 ± 0.868
1.114ValTrp: 1.114 ± 0.255
1.311ValTyr: 1.311 ± 0.293
0.0ValXaa: 0.0 ± 0.0
Trp
2.36TrpAla: 2.36 ± 0.431
0.59TrpCys: 0.59 ± 0.186
0.787TrpAsp: 0.787 ± 0.232
0.655TrpGlu: 0.655 ± 0.164
1.114TrpPhe: 1.114 ± 0.299
1.245TrpGly: 1.245 ± 0.267
0.328TrpHis: 0.328 ± 0.184
0.852TrpIle: 0.852 ± 0.239
0.459TrpLys: 0.459 ± 0.16
1.573TrpLeu: 1.573 ± 0.325
0.328TrpMet: 0.328 ± 0.145
0.524TrpAsn: 0.524 ± 0.169
1.114TrpPro: 1.114 ± 0.341
1.245TrpGln: 1.245 ± 0.277
1.442TrpArg: 1.442 ± 0.333
0.983TrpSer: 0.983 ± 0.231
1.049TrpThr: 1.049 ± 0.298
1.049TrpVal: 1.049 ± 0.231
0.524TrpTrp: 0.524 ± 0.176
0.393TrpTyr: 0.393 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.556TyrAla: 2.556 ± 0.373
0.328TyrCys: 0.328 ± 0.166
1.376TyrAsp: 1.376 ± 0.322
1.442TyrGlu: 1.442 ± 0.275
0.655TyrPhe: 0.655 ± 0.173
1.966TyrGly: 1.966 ± 0.368
0.787TyrHis: 0.787 ± 0.223
0.655TyrIle: 0.655 ± 0.241
0.524TyrLys: 0.524 ± 0.191
2.36TyrLeu: 2.36 ± 0.41
0.459TyrMet: 0.459 ± 0.186
0.655TyrAsn: 0.655 ± 0.158
0.983TyrPro: 0.983 ± 0.275
0.918TyrGln: 0.918 ± 0.225
1.508TyrArg: 1.508 ± 0.315
1.376TyrSer: 1.376 ± 0.28
1.639TyrThr: 1.639 ± 0.363
1.311TyrVal: 1.311 ± 0.255
0.393TyrTrp: 0.393 ± 0.139
0.524TyrTyr: 0.524 ± 0.173
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (15258 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski