Amino acid dipepetide frequency for Mycobacterium phage Paphu

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.834AlaAla: 12.834 ± 1.286
0.748AlaCys: 0.748 ± 0.211
6.23AlaAsp: 6.23 ± 0.605
5.483AlaGlu: 5.483 ± 0.623
3.427AlaPhe: 3.427 ± 0.414
8.411AlaGly: 8.411 ± 1.062
1.371AlaHis: 1.371 ± 0.344
4.61AlaIle: 4.61 ± 0.626
3.8AlaLys: 3.8 ± 0.447
9.158AlaLeu: 9.158 ± 0.928
1.994AlaMet: 1.994 ± 0.42
2.305AlaAsn: 2.305 ± 0.388
4.86AlaPro: 4.86 ± 0.702
3.427AlaGln: 3.427 ± 0.455
5.607AlaArg: 5.607 ± 0.554
4.797AlaSer: 4.797 ± 0.541
6.479AlaThr: 6.479 ± 0.697
8.847AlaVal: 8.847 ± 0.778
2.118AlaTrp: 2.118 ± 0.33
2.741AlaTyr: 2.741 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.685CysAla: 0.685 ± 0.216
0.0CysCys: 0.0 ± 0.0
0.436CysAsp: 0.436 ± 0.161
0.748CysGlu: 0.748 ± 0.211
0.249CysPhe: 0.249 ± 0.13
0.685CysGly: 0.685 ± 0.263
0.249CysHis: 0.249 ± 0.124
0.187CysIle: 0.187 ± 0.123
0.187CysLys: 0.187 ± 0.102
0.498CysLeu: 0.498 ± 0.21
0.125CysMet: 0.125 ± 0.088
0.187CysAsn: 0.187 ± 0.102
0.249CysPro: 0.249 ± 0.109
0.187CysGln: 0.187 ± 0.114
0.436CysArg: 0.436 ± 0.177
0.436CysSer: 0.436 ± 0.159
0.249CysThr: 0.249 ± 0.129
0.187CysVal: 0.187 ± 0.096
0.249CysTrp: 0.249 ± 0.12
0.249CysTyr: 0.249 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
6.853AspAla: 6.853 ± 0.755
0.561AspCys: 0.561 ± 0.211
4.236AspAsp: 4.236 ± 0.594
3.613AspGlu: 3.613 ± 0.563
2.305AspPhe: 2.305 ± 0.337
5.919AspGly: 5.919 ± 0.591
1.121AspHis: 1.121 ± 0.305
2.554AspIle: 2.554 ± 0.482
2.367AspLys: 2.367 ± 0.376
6.729AspLeu: 6.729 ± 0.708
1.246AspMet: 1.246 ± 0.226
1.744AspAsn: 1.744 ± 0.323
4.486AspPro: 4.486 ± 0.7
1.931AspGln: 1.931 ± 0.395
3.925AspArg: 3.925 ± 0.447
3.302AspSer: 3.302 ± 0.437
3.738AspThr: 3.738 ± 0.444
4.236AspVal: 4.236 ± 0.533
1.807AspTrp: 1.807 ± 0.301
2.243AspTyr: 2.243 ± 0.398
0.0AspXaa: 0.0 ± 0.0
Glu
5.919GluAla: 5.919 ± 0.695
0.249GluCys: 0.249 ± 0.139
4.922GluAsp: 4.922 ± 0.666
4.922GluGlu: 4.922 ± 0.545
1.807GluPhe: 1.807 ± 0.357
3.863GluGly: 3.863 ± 0.45
1.433GluHis: 1.433 ± 0.348
3.24GluIle: 3.24 ± 0.475
2.492GluLys: 2.492 ± 0.41
6.853GluLeu: 6.853 ± 0.623
1.495GluMet: 1.495 ± 0.349
1.371GluAsn: 1.371 ± 0.305
2.43GluPro: 2.43 ± 0.439
2.679GluGln: 2.679 ± 0.475
3.8GluArg: 3.8 ± 0.547
3.8GluSer: 3.8 ± 0.482
3.551GluThr: 3.551 ± 0.562
6.106GluVal: 6.106 ± 0.629
1.433GluTrp: 1.433 ± 0.347
2.243GluTyr: 2.243 ± 0.411
0.0GluXaa: 0.0 ± 0.0
Phe
2.367PheAla: 2.367 ± 0.33
0.312PheCys: 0.312 ± 0.166
2.554PheAsp: 2.554 ± 0.402
2.305PheGlu: 2.305 ± 0.344
0.561PhePhe: 0.561 ± 0.192
3.613PheGly: 3.613 ± 0.574
0.748PheHis: 0.748 ± 0.237
1.308PheIle: 1.308 ± 0.238
1.184PheLys: 1.184 ± 0.249
2.741PheLeu: 2.741 ± 0.597
0.685PheMet: 0.685 ± 0.191
1.246PheAsn: 1.246 ± 0.259
1.744PhePro: 1.744 ± 0.314
0.997PheGln: 0.997 ± 0.203
1.62PheArg: 1.62 ± 0.411
1.869PheSer: 1.869 ± 0.308
2.118PheThr: 2.118 ± 0.427
2.243PheVal: 2.243 ± 0.369
0.623PheTrp: 0.623 ± 0.195
0.872PheTyr: 0.872 ± 0.19
0.0PheXaa: 0.0 ± 0.0
Gly
7.102GlyAla: 7.102 ± 0.919
0.935GlyCys: 0.935 ± 0.259
6.106GlyAsp: 6.106 ± 0.571
4.735GlyGlu: 4.735 ± 0.563
3.177GlyPhe: 3.177 ± 0.602
10.155GlyGly: 10.155 ± 2.698
1.807GlyHis: 1.807 ± 0.347
5.109GlyIle: 5.109 ± 0.839
3.551GlyLys: 3.551 ± 0.505
7.538GlyLeu: 7.538 ± 0.728
2.181GlyMet: 2.181 ± 0.355
2.866GlyAsn: 2.866 ± 0.458
3.738GlyPro: 3.738 ± 0.539
2.866GlyGln: 2.866 ± 0.377
5.233GlyArg: 5.233 ± 0.569
6.106GlySer: 6.106 ± 0.967
5.856GlyThr: 5.856 ± 0.794
5.483GlyVal: 5.483 ± 0.496
2.679GlyTrp: 2.679 ± 0.377
2.741GlyTyr: 2.741 ± 0.383
0.0GlyXaa: 0.0 ± 0.0
His
1.62HisAla: 1.62 ± 0.365
0.187HisCys: 0.187 ± 0.116
1.184HisAsp: 1.184 ± 0.237
1.62HisGlu: 1.62 ± 0.315
0.685HisPhe: 0.685 ± 0.184
1.495HisGly: 1.495 ± 0.388
0.748HisHis: 0.748 ± 0.213
0.81HisIle: 0.81 ± 0.222
0.872HisLys: 0.872 ± 0.266
1.246HisLeu: 1.246 ± 0.255
0.125HisMet: 0.125 ± 0.125
0.249HisAsn: 0.249 ± 0.122
1.371HisPro: 1.371 ± 0.268
0.872HisGln: 0.872 ± 0.229
1.433HisArg: 1.433 ± 0.309
0.81HisSer: 0.81 ± 0.2
0.997HisThr: 0.997 ± 0.233
1.682HisVal: 1.682 ± 0.344
0.374HisTrp: 0.374 ± 0.137
0.623HisTyr: 0.623 ± 0.225
0.0HisXaa: 0.0 ± 0.0
Ile
6.355IleAla: 6.355 ± 0.721
0.312IleCys: 0.312 ± 0.145
3.489IleAsp: 3.489 ± 0.411
3.8IleGlu: 3.8 ± 0.503
0.935IlePhe: 0.935 ± 0.246
3.738IleGly: 3.738 ± 0.468
0.685IleHis: 0.685 ± 0.181
1.62IleIle: 1.62 ± 0.256
1.62IleLys: 1.62 ± 0.296
3.053IleLeu: 3.053 ± 0.364
0.748IleMet: 0.748 ± 0.188
1.931IleAsn: 1.931 ± 0.31
2.804IlePro: 2.804 ± 0.388
1.558IleGln: 1.558 ± 0.347
3.177IleArg: 3.177 ± 0.48
3.053IleSer: 3.053 ± 0.447
3.302IleThr: 3.302 ± 0.396
3.053IleVal: 3.053 ± 0.523
0.685IleTrp: 0.685 ± 0.174
1.495IleTyr: 1.495 ± 0.264
0.0IleXaa: 0.0 ± 0.0
Lys
3.364LysAla: 3.364 ± 0.497
0.125LysCys: 0.125 ± 0.09
2.305LysAsp: 2.305 ± 0.355
1.869LysGlu: 1.869 ± 0.355
1.433LysPhe: 1.433 ± 0.345
2.305LysGly: 2.305 ± 0.36
1.059LysHis: 1.059 ± 0.301
1.931LysIle: 1.931 ± 0.384
2.118LysLys: 2.118 ± 0.467
3.551LysLeu: 3.551 ± 0.533
0.81LysMet: 0.81 ± 0.201
1.558LysAsn: 1.558 ± 0.36
2.804LysPro: 2.804 ± 0.471
1.371LysGln: 1.371 ± 0.283
2.928LysArg: 2.928 ± 0.455
2.617LysSer: 2.617 ± 0.403
2.056LysThr: 2.056 ± 0.347
3.053LysVal: 3.053 ± 0.436
0.748LysTrp: 0.748 ± 0.196
1.059LysTyr: 1.059 ± 0.29
0.0LysXaa: 0.0 ± 0.0
Leu
9.657LeuAla: 9.657 ± 1.113
0.312LeuCys: 0.312 ± 0.144
5.794LeuAsp: 5.794 ± 0.533
5.358LeuGlu: 5.358 ± 0.688
2.181LeuPhe: 2.181 ± 0.446
7.538LeuGly: 7.538 ± 0.677
1.558LeuHis: 1.558 ± 0.345
4.299LeuIle: 4.299 ± 0.474
3.676LeuLys: 3.676 ± 0.41
5.732LeuLeu: 5.732 ± 0.595
1.682LeuMet: 1.682 ± 0.32
2.741LeuAsn: 2.741 ± 0.399
5.42LeuPro: 5.42 ± 0.668
2.617LeuGln: 2.617 ± 0.543
6.23LeuArg: 6.23 ± 0.628
5.919LeuSer: 5.919 ± 0.704
6.23LeuThr: 6.23 ± 0.774
5.669LeuVal: 5.669 ± 0.652
1.308LeuTrp: 1.308 ± 0.299
2.243LeuTyr: 2.243 ± 0.384
0.0LeuXaa: 0.0 ± 0.0
Met
2.243MetAla: 2.243 ± 0.301
0.0MetCys: 0.0 ± 0.0
1.121MetAsp: 1.121 ± 0.303
1.308MetGlu: 1.308 ± 0.311
0.498MetPhe: 0.498 ± 0.162
1.807MetGly: 1.807 ± 0.359
0.374MetHis: 0.374 ± 0.143
0.561MetIle: 0.561 ± 0.164
0.872MetLys: 0.872 ± 0.256
0.935MetLeu: 0.935 ± 0.269
0.187MetMet: 0.187 ± 0.104
0.872MetAsn: 0.872 ± 0.208
1.059MetPro: 1.059 ± 0.248
0.748MetGln: 0.748 ± 0.216
1.682MetArg: 1.682 ± 0.39
2.056MetSer: 2.056 ± 0.431
2.118MetThr: 2.118 ± 0.31
0.935MetVal: 0.935 ± 0.264
0.374MetTrp: 0.374 ± 0.147
0.374MetTyr: 0.374 ± 0.205
0.0MetXaa: 0.0 ± 0.0
Asn
3.177AsnAla: 3.177 ± 0.502
0.062AsnCys: 0.062 ± 0.069
1.682AsnAsp: 1.682 ± 0.419
2.056AsnGlu: 2.056 ± 0.391
0.935AsnPhe: 0.935 ± 0.251
3.427AsnGly: 3.427 ± 0.532
0.623AsnHis: 0.623 ± 0.215
1.433AsnIle: 1.433 ± 0.278
0.685AsnLys: 0.685 ± 0.172
2.181AsnLeu: 2.181 ± 0.332
0.81AsnMet: 0.81 ± 0.177
0.997AsnAsn: 0.997 ± 0.281
2.804AsnPro: 2.804 ± 0.354
0.935AsnGln: 0.935 ± 0.233
1.682AsnArg: 1.682 ± 0.379
1.558AsnSer: 1.558 ± 0.33
1.869AsnThr: 1.869 ± 0.386
2.679AsnVal: 2.679 ± 0.411
0.748AsnTrp: 0.748 ± 0.224
1.059AsnTyr: 1.059 ± 0.257
0.0AsnXaa: 0.0 ± 0.0
Pro
4.86ProAla: 4.86 ± 0.509
0.312ProCys: 0.312 ± 0.15
4.299ProAsp: 4.299 ± 0.463
4.112ProGlu: 4.112 ± 0.51
2.243ProPhe: 2.243 ± 0.403
5.109ProGly: 5.109 ± 0.59
0.685ProHis: 0.685 ± 0.204
2.243ProIle: 2.243 ± 0.379
2.056ProLys: 2.056 ± 0.265
4.486ProLeu: 4.486 ± 0.482
0.872ProMet: 0.872 ± 0.236
1.682ProAsn: 1.682 ± 0.381
2.679ProPro: 2.679 ± 0.44
1.682ProGln: 1.682 ± 0.338
2.99ProArg: 2.99 ± 0.475
4.299ProSer: 4.299 ± 0.513
4.112ProThr: 4.112 ± 0.555
4.112ProVal: 4.112 ± 0.48
0.623ProTrp: 0.623 ± 0.243
1.371ProTyr: 1.371 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
3.24GlnAla: 3.24 ± 0.709
0.187GlnCys: 0.187 ± 0.117
1.371GlnAsp: 1.371 ± 0.345
1.931GlnGlu: 1.931 ± 0.417
1.308GlnPhe: 1.308 ± 0.38
2.804GlnGly: 2.804 ± 0.404
0.685GlnHis: 0.685 ± 0.205
2.554GlnIle: 2.554 ± 0.547
1.246GlnLys: 1.246 ± 0.293
4.174GlnLeu: 4.174 ± 0.698
0.748GlnMet: 0.748 ± 0.188
0.561GlnAsn: 0.561 ± 0.142
1.62GlnPro: 1.62 ± 0.303
1.931GlnGln: 1.931 ± 0.401
2.118GlnArg: 2.118 ± 0.382
1.682GlnSer: 1.682 ± 0.303
1.744GlnThr: 1.744 ± 0.311
2.181GlnVal: 2.181 ± 0.335
0.748GlnTrp: 0.748 ± 0.209
0.498GlnTyr: 0.498 ± 0.178
0.0GlnXaa: 0.0 ± 0.0
Arg
5.233ArgAla: 5.233 ± 0.536
0.748ArgCys: 0.748 ± 0.25
3.177ArgAsp: 3.177 ± 0.514
4.797ArgGlu: 4.797 ± 0.577
1.807ArgPhe: 1.807 ± 0.361
5.483ArgGly: 5.483 ± 0.658
1.121ArgHis: 1.121 ± 0.277
3.177ArgIle: 3.177 ± 0.53
3.053ArgLys: 3.053 ± 0.51
6.168ArgLeu: 6.168 ± 0.754
1.433ArgMet: 1.433 ± 0.313
2.554ArgAsn: 2.554 ± 0.466
2.741ArgPro: 2.741 ± 0.404
1.807ArgGln: 1.807 ± 0.369
5.233ArgArg: 5.233 ± 0.684
4.05ArgSer: 4.05 ± 0.515
2.804ArgThr: 2.804 ± 0.482
5.109ArgVal: 5.109 ± 0.605
1.495ArgTrp: 1.495 ± 0.349
1.869ArgTyr: 1.869 ± 0.25
0.0ArgXaa: 0.0 ± 0.0
Ser
6.292SerAla: 6.292 ± 0.78
0.436SerCys: 0.436 ± 0.179
3.427SerAsp: 3.427 ± 0.446
3.863SerGlu: 3.863 ± 0.526
1.869SerPhe: 1.869 ± 0.407
7.227SerGly: 7.227 ± 0.849
1.371SerHis: 1.371 ± 0.269
2.741SerIle: 2.741 ± 0.388
2.056SerLys: 2.056 ± 0.371
5.046SerLeu: 5.046 ± 0.581
1.62SerMet: 1.62 ± 0.288
2.118SerAsn: 2.118 ± 0.38
3.427SerPro: 3.427 ± 0.475
2.056SerGln: 2.056 ± 0.351
3.115SerArg: 3.115 ± 0.506
3.925SerSer: 3.925 ± 0.875
3.613SerThr: 3.613 ± 0.56
3.676SerVal: 3.676 ± 0.462
1.308SerTrp: 1.308 ± 0.317
1.433SerTyr: 1.433 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
6.355ThrAla: 6.355 ± 0.837
0.249ThrCys: 0.249 ± 0.115
3.987ThrAsp: 3.987 ± 0.601
4.112ThrGlu: 4.112 ± 0.477
2.056ThrPhe: 2.056 ± 0.352
6.915ThrGly: 6.915 ± 0.699
0.748ThrHis: 0.748 ± 0.232
2.492ThrIle: 2.492 ± 0.475
2.492ThrLys: 2.492 ± 0.38
6.292ThrLeu: 6.292 ± 0.676
1.059ThrMet: 1.059 ± 0.244
1.744ThrAsn: 1.744 ± 0.33
4.174ThrPro: 4.174 ± 0.489
1.682ThrGln: 1.682 ± 0.342
3.364ThrArg: 3.364 ± 0.591
3.302ThrSer: 3.302 ± 0.596
4.361ThrThr: 4.361 ± 0.521
5.856ThrVal: 5.856 ± 0.642
1.371ThrTrp: 1.371 ± 0.296
1.931ThrTyr: 1.931 ± 0.35
0.0ThrXaa: 0.0 ± 0.0
Val
7.289ValAla: 7.289 ± 0.75
0.374ValCys: 0.374 ± 0.15
5.794ValAsp: 5.794 ± 0.589
4.797ValGlu: 4.797 ± 0.639
2.741ValPhe: 2.741 ± 0.437
5.109ValGly: 5.109 ± 0.739
1.308ValHis: 1.308 ± 0.248
3.925ValIle: 3.925 ± 0.516
3.177ValLys: 3.177 ± 0.494
5.794ValLeu: 5.794 ± 0.599
1.184ValMet: 1.184 ± 0.293
2.741ValAsn: 2.741 ± 0.362
4.05ValPro: 4.05 ± 0.537
2.118ValGln: 2.118 ± 0.339
5.171ValArg: 5.171 ± 0.737
4.361ValSer: 4.361 ± 0.543
5.669ValThr: 5.669 ± 0.609
5.296ValVal: 5.296 ± 0.607
1.121ValTrp: 1.121 ± 0.292
1.931ValTyr: 1.931 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
1.558TrpAla: 1.558 ± 0.288
0.187TrpCys: 0.187 ± 0.086
1.62TrpAsp: 1.62 ± 0.31
0.935TrpGlu: 0.935 ± 0.244
0.872TrpPhe: 0.872 ± 0.237
1.869TrpGly: 1.869 ± 0.351
0.685TrpHis: 0.685 ± 0.193
1.121TrpIle: 1.121 ± 0.271
0.374TrpLys: 0.374 ± 0.192
1.807TrpLeu: 1.807 ± 0.313
0.312TrpMet: 0.312 ± 0.158
0.498TrpAsn: 0.498 ± 0.176
0.997TrpPro: 0.997 ± 0.263
0.935TrpGln: 0.935 ± 0.218
1.558TrpArg: 1.558 ± 0.314
1.184TrpSer: 1.184 ± 0.236
1.558TrpThr: 1.558 ± 0.389
1.931TrpVal: 1.931 ± 0.347
0.561TrpTrp: 0.561 ± 0.202
0.312TrpTyr: 0.312 ± 0.134
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.305TyrAla: 2.305 ± 0.373
0.187TyrCys: 0.187 ± 0.144
1.184TyrAsp: 1.184 ± 0.269
2.243TyrGlu: 2.243 ± 0.342
0.623TyrPhe: 0.623 ± 0.183
2.492TyrGly: 2.492 ± 0.383
0.623TyrHis: 0.623 ± 0.183
1.495TyrIle: 1.495 ± 0.346
1.184TyrLys: 1.184 ± 0.234
2.181TyrLeu: 2.181 ± 0.414
0.748TyrMet: 0.748 ± 0.195
1.433TyrAsn: 1.433 ± 0.332
1.371TyrPro: 1.371 ± 0.276
0.935TyrGln: 0.935 ± 0.242
2.554TyrArg: 2.554 ± 0.466
1.495TyrSer: 1.495 ± 0.261
2.118TyrThr: 2.118 ± 0.341
1.62TyrVal: 1.62 ± 0.324
0.498TyrTrp: 0.498 ± 0.202
0.623TyrTyr: 0.623 ± 0.175
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (16052 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski