Amino acid dipepetide frequency for Aeromonas phage phiARM81mr

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.771AlaAla: 15.771 ± 1.139
1.305AlaCys: 1.305 ± 0.386
7.124AlaAsp: 7.124 ± 0.736
9.789AlaGlu: 9.789 ± 0.827
2.774AlaPhe: 2.774 ± 0.35
9.136AlaGly: 9.136 ± 0.975
1.795AlaHis: 1.795 ± 0.313
6.037AlaIle: 6.037 ± 0.606
7.124AlaLys: 7.124 ± 0.834
9.136AlaLeu: 9.136 ± 0.645
4.351AlaMet: 4.351 ± 0.624
5.873AlaAsn: 5.873 ± 0.671
3.861AlaPro: 3.861 ± 0.407
7.342AlaGln: 7.342 ± 0.889
7.07AlaArg: 7.07 ± 0.745
6.308AlaSer: 6.308 ± 1.218
5.873AlaThr: 5.873 ± 0.67
5.275AlaVal: 5.275 ± 0.507
1.414AlaTrp: 1.414 ± 0.289
2.828AlaTyr: 2.828 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
1.033CysAla: 1.033 ± 0.24
0.326CysCys: 0.326 ± 0.173
0.489CysAsp: 0.489 ± 0.199
0.435CysGlu: 0.435 ± 0.161
0.109CysPhe: 0.109 ± 0.092
0.979CysGly: 0.979 ± 0.286
0.272CysHis: 0.272 ± 0.167
0.544CysIle: 0.544 ± 0.229
0.326CysLys: 0.326 ± 0.182
0.761CysLeu: 0.761 ± 0.255
0.272CysMet: 0.272 ± 0.14
0.435CysAsn: 0.435 ± 0.161
0.979CysPro: 0.979 ± 0.34
0.544CysGln: 0.544 ± 0.216
1.251CysArg: 1.251 ± 0.337
0.544CysSer: 0.544 ± 0.21
0.435CysThr: 0.435 ± 0.184
0.435CysVal: 0.435 ± 0.199
0.163CysTrp: 0.163 ± 0.112
0.218CysTyr: 0.218 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
7.015AspAla: 7.015 ± 0.831
0.707AspCys: 0.707 ± 0.195
2.991AspAsp: 2.991 ± 0.369
4.623AspGlu: 4.623 ± 0.799
1.849AspPhe: 1.849 ± 0.276
4.677AspGly: 4.677 ± 0.8
1.305AspHis: 1.305 ± 0.312
2.828AspIle: 2.828 ± 0.319
3.372AspLys: 3.372 ± 0.385
4.568AspLeu: 4.568 ± 0.515
1.414AspMet: 1.414 ± 0.239
1.631AspAsn: 1.631 ± 0.317
3.481AspPro: 3.481 ± 0.489
2.882AspGln: 2.882 ± 0.326
3.861AspArg: 3.861 ± 0.449
2.719AspSer: 2.719 ± 0.371
2.719AspThr: 2.719 ± 0.433
2.665AspVal: 2.665 ± 0.312
1.414AspTrp: 1.414 ± 0.332
1.74AspTyr: 1.74 ± 0.309
0.0AspXaa: 0.0 ± 0.0
Glu
7.668GluAla: 7.668 ± 0.815
0.598GluCys: 0.598 ± 0.249
1.903GluAsp: 1.903 ± 0.277
3.698GluGlu: 3.698 ± 0.428
2.393GluPhe: 2.393 ± 0.31
4.133GluGly: 4.133 ± 0.532
1.468GluHis: 1.468 ± 0.323
2.882GluIle: 2.882 ± 0.483
2.991GluLys: 2.991 ± 0.455
8.756GluLeu: 8.756 ± 0.78
2.447GluMet: 2.447 ± 0.417
1.523GluAsn: 1.523 ± 0.416
2.719GluPro: 2.719 ± 0.424
5.765GluGln: 5.765 ± 0.924
5.71GluArg: 5.71 ± 0.584
3.426GluSer: 3.426 ± 0.514
2.61GluThr: 2.61 ± 0.434
5.112GluVal: 5.112 ± 0.457
1.305GluTrp: 1.305 ± 0.278
1.631GluTyr: 1.631 ± 0.373
0.0GluXaa: 0.0 ± 0.0
Phe
2.882PheAla: 2.882 ± 0.354
0.598PheCys: 0.598 ± 0.208
1.196PheAsp: 1.196 ± 0.226
1.414PheGlu: 1.414 ± 0.289
0.598PhePhe: 0.598 ± 0.256
2.23PheGly: 2.23 ± 0.293
0.816PheHis: 0.816 ± 0.249
1.142PheIle: 1.142 ± 0.217
1.523PheLys: 1.523 ± 0.305
1.958PheLeu: 1.958 ± 0.329
0.544PheMet: 0.544 ± 0.147
1.468PheAsn: 1.468 ± 0.239
1.088PhePro: 1.088 ± 0.278
0.87PheGln: 0.87 ± 0.202
1.849PheArg: 1.849 ± 0.364
1.795PheSer: 1.795 ± 0.299
1.142PheThr: 1.142 ± 0.213
2.61PheVal: 2.61 ± 0.424
0.435PheTrp: 0.435 ± 0.165
0.87PheTyr: 0.87 ± 0.178
0.0PheXaa: 0.0 ± 0.0
Gly
7.015GlyAla: 7.015 ± 0.975
0.707GlyCys: 0.707 ± 0.254
4.296GlyAsp: 4.296 ± 0.632
5.547GlyGlu: 5.547 ± 0.723
2.338GlyPhe: 2.338 ± 0.317
6.2GlyGly: 6.2 ± 0.674
1.849GlyHis: 1.849 ± 0.331
3.045GlyIle: 3.045 ± 0.473
4.786GlyLys: 4.786 ± 0.563
6.091GlyLeu: 6.091 ± 0.551
2.61GlyMet: 2.61 ± 0.35
2.502GlyAsn: 2.502 ± 0.368
2.338GlyPro: 2.338 ± 0.3
3.644GlyGln: 3.644 ± 0.399
4.949GlyArg: 4.949 ± 0.502
4.405GlySer: 4.405 ± 0.521
3.861GlyThr: 3.861 ± 0.509
4.786GlyVal: 4.786 ± 0.547
1.523GlyTrp: 1.523 ± 0.268
2.175GlyTyr: 2.175 ± 0.343
0.0GlyXaa: 0.0 ± 0.0
His
2.284HisAla: 2.284 ± 0.272
0.381HisCys: 0.381 ± 0.197
1.468HisAsp: 1.468 ± 0.315
0.87HisGlu: 0.87 ± 0.249
0.761HisPhe: 0.761 ± 0.191
2.393HisGly: 2.393 ± 0.396
0.598HisHis: 0.598 ± 0.175
1.142HisIle: 1.142 ± 0.383
0.489HisLys: 0.489 ± 0.212
1.903HisLeu: 1.903 ± 0.287
0.272HisMet: 0.272 ± 0.108
0.87HisAsn: 0.87 ± 0.207
0.816HisPro: 0.816 ± 0.337
0.598HisGln: 0.598 ± 0.198
1.305HisArg: 1.305 ± 0.281
0.925HisSer: 0.925 ± 0.253
1.196HisThr: 1.196 ± 0.285
1.142HisVal: 1.142 ± 0.265
0.489HisTrp: 0.489 ± 0.174
0.653HisTyr: 0.653 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
5.438IleAla: 5.438 ± 0.413
0.544IleCys: 0.544 ± 0.237
3.807IleAsp: 3.807 ± 0.369
3.1IleGlu: 3.1 ± 0.455
0.925IlePhe: 0.925 ± 0.227
3.589IleGly: 3.589 ± 0.36
0.816IleHis: 0.816 ± 0.265
1.958IleIle: 1.958 ± 0.353
2.556IleLys: 2.556 ± 0.362
2.556IleLeu: 2.556 ± 0.395
0.87IleMet: 0.87 ± 0.198
2.067IleAsn: 2.067 ± 0.294
1.468IlePro: 1.468 ± 0.374
1.903IleGln: 1.903 ± 0.355
2.774IleArg: 2.774 ± 0.329
2.937IleSer: 2.937 ± 0.547
3.263IleThr: 3.263 ± 0.536
2.067IleVal: 2.067 ± 0.388
0.272IleTrp: 0.272 ± 0.119
1.033IleTyr: 1.033 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
7.015LysAla: 7.015 ± 0.607
0.489LysCys: 0.489 ± 0.184
2.61LysAsp: 2.61 ± 0.386
2.828LysGlu: 2.828 ± 0.492
0.816LysPhe: 0.816 ± 0.248
4.079LysGly: 4.079 ± 0.533
1.196LysHis: 1.196 ± 0.262
2.284LysIle: 2.284 ± 0.418
1.577LysLys: 1.577 ± 0.406
5.058LysLeu: 5.058 ± 0.567
1.251LysMet: 1.251 ± 0.315
1.795LysAsn: 1.795 ± 0.376
2.937LysPro: 2.937 ± 0.427
2.991LysGln: 2.991 ± 0.476
3.317LysArg: 3.317 ± 0.406
2.774LysSer: 2.774 ± 0.413
3.263LysThr: 3.263 ± 0.538
3.045LysVal: 3.045 ± 0.519
0.544LysTrp: 0.544 ± 0.205
1.414LysTyr: 1.414 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
10.224LeuAla: 10.224 ± 0.779
0.707LeuCys: 0.707 ± 0.231
5.601LeuAsp: 5.601 ± 0.51
6.417LeuGlu: 6.417 ± 0.726
2.719LeuPhe: 2.719 ± 0.479
5.873LeuGly: 5.873 ± 0.552
1.36LeuHis: 1.36 ± 0.245
2.447LeuIle: 2.447 ± 0.413
4.188LeuLys: 4.188 ± 0.587
6.798LeuLeu: 6.798 ± 0.627
2.556LeuMet: 2.556 ± 0.492
3.154LeuAsn: 3.154 ± 0.48
4.133LeuPro: 4.133 ± 0.517
3.589LeuGln: 3.589 ± 0.493
6.037LeuArg: 6.037 ± 0.736
4.133LeuSer: 4.133 ± 0.647
5.384LeuThr: 5.384 ± 0.512
5.438LeuVal: 5.438 ± 0.533
0.707LeuTrp: 0.707 ± 0.171
1.686LeuTyr: 1.686 ± 0.31
0.0LeuXaa: 0.0 ± 0.0
Met
4.024MetAla: 4.024 ± 0.499
0.272MetCys: 0.272 ± 0.133
1.577MetAsp: 1.577 ± 0.28
1.468MetGlu: 1.468 ± 0.202
0.435MetPhe: 0.435 ± 0.129
2.067MetGly: 2.067 ± 0.379
0.272MetHis: 0.272 ± 0.119
1.196MetIle: 1.196 ± 0.237
1.196MetLys: 1.196 ± 0.209
2.175MetLeu: 2.175 ± 0.344
0.707MetMet: 0.707 ± 0.201
1.033MetAsn: 1.033 ± 0.26
0.87MetPro: 0.87 ± 0.186
1.414MetGln: 1.414 ± 0.274
2.012MetArg: 2.012 ± 0.275
2.393MetSer: 2.393 ± 0.367
2.719MetThr: 2.719 ± 0.383
1.577MetVal: 1.577 ± 0.282
0.381MetTrp: 0.381 ± 0.143
0.272MetTyr: 0.272 ± 0.122
0.0MetXaa: 0.0 ± 0.0
Asn
4.623AsnAla: 4.623 ± 0.577
0.163AsnCys: 0.163 ± 0.098
2.067AsnAsp: 2.067 ± 0.403
2.338AsnGlu: 2.338 ± 0.294
0.87AsnPhe: 0.87 ± 0.202
3.154AsnGly: 3.154 ± 0.477
0.816AsnHis: 0.816 ± 0.177
1.414AsnIle: 1.414 ± 0.213
2.012AsnLys: 2.012 ± 0.411
3.263AsnLeu: 3.263 ± 0.405
0.707AsnMet: 0.707 ± 0.196
1.251AsnAsn: 1.251 ± 0.223
2.284AsnPro: 2.284 ± 0.408
1.958AsnGln: 1.958 ± 0.356
2.175AsnArg: 2.175 ± 0.342
1.849AsnSer: 1.849 ± 0.376
2.175AsnThr: 2.175 ± 0.387
1.631AsnVal: 1.631 ± 0.317
0.544AsnTrp: 0.544 ± 0.202
0.87AsnTyr: 0.87 ± 0.219
0.0AsnXaa: 0.0 ± 0.0
Pro
5.003ProAla: 5.003 ± 0.375
0.544ProCys: 0.544 ± 0.224
3.861ProAsp: 3.861 ± 0.531
3.97ProGlu: 3.97 ± 0.57
1.305ProPhe: 1.305 ± 0.252
3.154ProGly: 3.154 ± 0.42
1.033ProHis: 1.033 ± 0.304
1.958ProIle: 1.958 ± 0.391
2.502ProLys: 2.502 ± 0.314
3.154ProLeu: 3.154 ± 0.538
0.489ProMet: 0.489 ± 0.194
1.142ProAsn: 1.142 ± 0.22
1.631ProPro: 1.631 ± 0.342
1.958ProGln: 1.958 ± 0.37
2.665ProArg: 2.665 ± 0.406
2.338ProSer: 2.338 ± 0.473
2.991ProThr: 2.991 ± 0.379
3.263ProVal: 3.263 ± 0.42
0.381ProTrp: 0.381 ± 0.151
0.925ProTyr: 0.925 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
8.212GlnAla: 8.212 ± 0.926
0.218GlnCys: 0.218 ± 0.132
2.991GlnAsp: 2.991 ± 0.353
3.698GlnGlu: 3.698 ± 0.706
1.849GlnPhe: 1.849 ± 0.265
2.937GlnGly: 2.937 ± 0.448
1.142GlnHis: 1.142 ± 0.251
2.338GlnIle: 2.338 ± 0.439
2.121GlnLys: 2.121 ± 0.34
5.112GlnLeu: 5.112 ± 0.701
1.142GlnMet: 1.142 ± 0.201
0.87GlnAsn: 0.87 ± 0.206
2.175GlnPro: 2.175 ± 0.394
4.351GlnGln: 4.351 ± 0.797
3.807GlnArg: 3.807 ± 0.558
3.589GlnSer: 3.589 ± 0.511
2.447GlnThr: 2.447 ± 0.381
3.1GlnVal: 3.1 ± 0.369
0.707GlnTrp: 0.707 ± 0.217
0.653GlnTyr: 0.653 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
7.777ArgAla: 7.777 ± 0.597
0.761ArgCys: 0.761 ± 0.235
4.623ArgAsp: 4.623 ± 0.56
4.242ArgGlu: 4.242 ± 0.56
2.23ArgPhe: 2.23 ± 0.375
3.97ArgGly: 3.97 ± 0.376
1.523ArgHis: 1.523 ± 0.338
2.882ArgIle: 2.882 ± 0.386
3.861ArgLys: 3.861 ± 0.406
5.819ArgLeu: 5.819 ± 0.637
1.795ArgMet: 1.795 ± 0.25
2.012ArgAsn: 2.012 ± 0.382
2.991ArgPro: 2.991 ± 0.53
3.426ArgGln: 3.426 ± 0.612
4.514ArgArg: 4.514 ± 0.641
3.97ArgSer: 3.97 ± 0.555
3.589ArgThr: 3.589 ± 0.38
3.807ArgVal: 3.807 ± 0.415
1.523ArgTrp: 1.523 ± 0.263
2.067ArgTyr: 2.067 ± 0.362
0.0ArgXaa: 0.0 ± 0.0
Ser
7.396SerAla: 7.396 ± 1.043
0.598SerCys: 0.598 ± 0.209
2.828SerAsp: 2.828 ± 0.33
3.698SerGlu: 3.698 ± 0.383
1.577SerPhe: 1.577 ± 0.345
5.33SerGly: 5.33 ± 0.719
1.033SerHis: 1.033 ± 0.239
2.61SerIle: 2.61 ± 0.635
3.045SerLys: 3.045 ± 0.311
4.405SerLeu: 4.405 ± 0.482
1.958SerMet: 1.958 ± 0.315
2.393SerAsn: 2.393 ± 0.3
2.61SerPro: 2.61 ± 0.437
3.045SerGln: 3.045 ± 0.541
3.1SerArg: 3.1 ± 0.358
3.154SerSer: 3.154 ± 0.534
2.991SerThr: 2.991 ± 0.404
3.263SerVal: 3.263 ± 0.48
0.925SerTrp: 0.925 ± 0.164
0.87SerTyr: 0.87 ± 0.207
0.0SerXaa: 0.0 ± 0.0
Thr
6.744ThrAla: 6.744 ± 0.732
0.218ThrCys: 0.218 ± 0.124
2.882ThrAsp: 2.882 ± 0.392
4.405ThrGlu: 4.405 ± 0.425
1.36ThrPhe: 1.36 ± 0.27
4.351ThrGly: 4.351 ± 0.531
1.36ThrHis: 1.36 ± 0.234
2.719ThrIle: 2.719 ± 0.353
3.1ThrLys: 3.1 ± 0.584
4.024ThrLeu: 4.024 ± 0.475
1.686ThrMet: 1.686 ± 0.304
1.849ThrAsn: 1.849 ± 0.271
3.426ThrPro: 3.426 ± 0.561
2.556ThrGln: 2.556 ± 0.4
2.882ThrArg: 2.882 ± 0.401
3.426ThrSer: 3.426 ± 0.501
3.045ThrThr: 3.045 ± 0.535
3.916ThrVal: 3.916 ± 0.773
0.544ThrTrp: 0.544 ± 0.21
1.577ThrTyr: 1.577 ± 0.279
0.0ThrXaa: 0.0 ± 0.0
Val
6.635ValAla: 6.635 ± 0.681
0.489ValCys: 0.489 ± 0.203
3.807ValAsp: 3.807 ± 0.553
4.242ValGlu: 4.242 ± 0.644
0.979ValPhe: 0.979 ± 0.254
3.861ValGly: 3.861 ± 0.549
0.653ValHis: 0.653 ± 0.162
3.317ValIle: 3.317 ± 0.423
2.61ValLys: 2.61 ± 0.345
4.188ValLeu: 4.188 ± 0.51
1.849ValMet: 1.849 ± 0.352
2.774ValAsn: 2.774 ± 0.365
2.665ValPro: 2.665 ± 0.393
2.067ValGln: 2.067 ± 0.313
3.861ValArg: 3.861 ± 0.617
4.079ValSer: 4.079 ± 0.462
4.894ValThr: 4.894 ± 0.732
5.058ValVal: 5.058 ± 0.653
1.033ValTrp: 1.033 ± 0.194
1.414ValTyr: 1.414 ± 0.233
0.0ValXaa: 0.0 ± 0.0
Trp
1.196TrpAla: 1.196 ± 0.257
0.653TrpCys: 0.653 ± 0.195
0.653TrpAsp: 0.653 ± 0.201
0.653TrpGlu: 0.653 ± 0.178
0.598TrpPhe: 0.598 ± 0.153
0.761TrpGly: 0.761 ± 0.213
0.707TrpHis: 0.707 ± 0.175
0.326TrpIle: 0.326 ± 0.119
0.816TrpLys: 0.816 ± 0.211
1.36TrpLeu: 1.36 ± 0.322
0.381TrpMet: 0.381 ± 0.136
0.761TrpAsn: 0.761 ± 0.212
0.435TrpPro: 0.435 ± 0.155
0.979TrpGln: 0.979 ± 0.243
1.577TrpArg: 1.577 ± 0.294
0.87TrpSer: 0.87 ± 0.239
0.435TrpThr: 0.435 ± 0.164
1.088TrpVal: 1.088 ± 0.268
0.163TrpTrp: 0.163 ± 0.142
0.435TrpTyr: 0.435 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.23TyrAla: 2.23 ± 0.326
0.272TyrCys: 0.272 ± 0.124
1.686TyrAsp: 1.686 ± 0.312
1.305TyrGlu: 1.305 ± 0.315
0.544TyrPhe: 0.544 ± 0.153
1.795TyrGly: 1.795 ± 0.32
0.544TyrHis: 0.544 ± 0.226
0.87TyrIle: 0.87 ± 0.215
1.088TyrLys: 1.088 ± 0.245
2.067TyrLeu: 2.067 ± 0.371
0.707TyrMet: 0.707 ± 0.206
0.816TyrAsn: 0.816 ± 0.272
1.36TyrPro: 1.36 ± 0.29
1.577TyrGln: 1.577 ± 0.302
2.665TyrArg: 2.665 ± 0.419
1.196TyrSer: 1.196 ± 0.323
1.033TyrThr: 1.033 ± 0.196
1.196TyrVal: 1.196 ± 0.281
0.381TyrTrp: 0.381 ± 0.194
0.544TyrTyr: 0.544 ± 0.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (18389 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski