Amino acid dipeptide frequency for Aeromonas phage 13AhydR10PP

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
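The calculation can be sketched as follows. This is only a minimal illustration, assuming that dipeptides are counted as overlapping pairs of adjacent residues within each protein (never across protein boundaries) and normalised by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy "proteome" (one-letter amino acid codes), not real phage data.
toy_proteome = ["MAAK", "MAL"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

The toy input contains five pairs in total (MA, AA, AK, MA, AL), so the frequencies always sum to 1000‰ regardless of how many proteins are supplied.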

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
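The comparison against the uniform expectation can be sketched as below. This assumes that the threshold used for the colouring is exactly the uniform expectation of 1/400; the page does not state the precise rule, so the function `classify` is illustrative only. The three example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation over 20 x 20 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)

def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"    # would be marked in red
    if freq_permille < expected:
        return "under"   # would be marked in blue
    return "as expected"

# Values copied from the table (per mille).
examples = {"AlaAla": 11.484, "AlaLeu": 10.083, "TrpTrp": 0.07}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```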

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
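As a small worked example of the unit conversion, the snippet below converts a per mille value into a fraction and a percentage. The helper names are hypothetical; the value 11.484 is the AlaAla entry from the table.

```python
def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0

ala_ala = 11.484  # AlaAla frequency from the table, in per mille
print(f"fraction: {permille_to_fraction(ala_ala):.6f}")
print(f"percent:  {permille_to_percent(ala_ala):.4f}")
```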

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.484AlaAla: 11.484 ± 1.423
1.12AlaCys: 1.12 ± 0.263
5.392AlaAsp: 5.392 ± 0.651
6.372AlaGlu: 6.372 ± 0.761
2.871AlaPhe: 2.871 ± 0.472
8.123AlaGly: 8.123 ± 1.0
1.4AlaHis: 1.4 ± 0.327
6.092AlaIle: 6.092 ± 0.706
6.232AlaLys: 6.232 ± 0.747
10.083AlaLeu: 10.083 ± 0.871
4.061AlaMet: 4.061 ± 0.656
3.011AlaAsn: 3.011 ± 0.634
4.061AlaPro: 4.061 ± 0.482
3.571AlaGln: 3.571 ± 0.547
5.112AlaArg: 5.112 ± 0.645
6.022AlaSer: 6.022 ± 0.749
6.582AlaThr: 6.582 ± 0.797
5.952AlaVal: 5.952 ± 0.75
0.56AlaTrp: 0.56 ± 0.236
2.241AlaTyr: 2.241 ± 0.384
0.0AlaXaa: 0.0 ± 0.0
Cys
1.26CysAla: 1.26 ± 0.305
0.21CysCys: 0.21 ± 0.118
0.63CysAsp: 0.63 ± 0.228
0.98CysGlu: 0.98 ± 0.248
0.35CysPhe: 0.35 ± 0.149
1.47CysGly: 1.47 ± 0.347
0.35CysHis: 0.35 ± 0.165
0.98CysIle: 0.98 ± 0.316
0.63CysLys: 0.63 ± 0.174
1.611CysLeu: 1.611 ± 0.275
0.21CysMet: 0.21 ± 0.115
0.42CysAsn: 0.42 ± 0.142
1.05CysPro: 1.05 ± 0.307
0.42CysGln: 0.42 ± 0.182
1.05CysArg: 1.05 ± 0.301
0.7CysSer: 0.7 ± 0.208
0.7CysThr: 0.7 ± 0.2
0.7CysVal: 0.7 ± 0.218
0.42CysTrp: 0.42 ± 0.193
0.56CysTyr: 0.56 ± 0.194
0.0CysXaa: 0.0 ± 0.0
Asp
5.952AspAla: 5.952 ± 0.756
0.91AspCys: 0.91 ± 0.237
2.521AspAsp: 2.521 ± 0.401
2.941AspGlu: 2.941 ± 0.489
1.821AspPhe: 1.821 ± 0.387
5.392AspGly: 5.392 ± 0.667
1.19AspHis: 1.19 ± 0.282
2.871AspIle: 2.871 ± 0.442
2.521AspLys: 2.521 ± 0.522
5.322AspLeu: 5.322 ± 0.603
1.26AspMet: 1.26 ± 0.283
2.171AspAsn: 2.171 ± 0.397
2.451AspPro: 2.451 ± 0.523
2.101AspGln: 2.101 ± 0.403
3.571AspArg: 3.571 ± 0.517
2.031AspSer: 2.031 ± 0.325
2.521AspThr: 2.521 ± 0.478
4.622AspVal: 4.622 ± 0.693
1.12AspTrp: 1.12 ± 0.399
1.26AspTyr: 1.26 ± 0.3
0.0AspXaa: 0.0 ± 0.0
Glu
6.232GluAla: 6.232 ± 0.76
0.7GluCys: 0.7 ± 0.206
2.591GluAsp: 2.591 ± 0.5
3.921GluGlu: 3.921 ± 0.64
1.961GluPhe: 1.961 ± 0.362
4.622GluGly: 4.622 ± 0.587
1.33GluHis: 1.33 ± 0.29
3.011GluIle: 3.011 ± 0.375
3.711GluLys: 3.711 ± 0.535
6.092GluLeu: 6.092 ± 0.681
2.241GluMet: 2.241 ± 0.447
1.4GluAsn: 1.4 ± 0.328
2.031GluPro: 2.031 ± 0.351
2.311GluGln: 2.311 ± 0.495
4.411GluArg: 4.411 ± 0.672
3.851GluSer: 3.851 ± 0.429
3.151GluThr: 3.151 ± 0.474
5.252GluVal: 5.252 ± 0.583
1.4GluTrp: 1.4 ± 0.333
2.101GluTyr: 2.101 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
2.661PheAla: 2.661 ± 0.426
0.42PheCys: 0.42 ± 0.179
2.031PheAsp: 2.031 ± 0.323
1.751PheGlu: 1.751 ± 0.309
0.35PhePhe: 0.35 ± 0.157
3.641PheGly: 3.641 ± 0.715
1.12PheHis: 1.12 ± 0.255
2.311PheIle: 2.311 ± 0.361
1.891PheLys: 1.891 ± 0.415
2.591PheLeu: 2.591 ± 0.384
0.42PheMet: 0.42 ± 0.143
0.77PheAsn: 0.77 ± 0.251
1.33PhePro: 1.33 ± 0.263
1.19PheGln: 1.19 ± 0.311
1.961PheArg: 1.961 ± 0.306
1.33PheSer: 1.33 ± 0.299
2.171PheThr: 2.171 ± 0.344
1.541PheVal: 1.541 ± 0.328
0.63PheTrp: 0.63 ± 0.215
0.7PheTyr: 0.7 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
6.092GlyAla: 6.092 ± 0.807
1.19GlyCys: 1.19 ± 0.253
3.431GlyAsp: 3.431 ± 0.506
5.812GlyGlu: 5.812 ± 0.616
3.641GlyPhe: 3.641 ± 0.578
6.162GlyGly: 6.162 ± 0.953
1.891GlyHis: 1.891 ± 0.383
4.481GlyIle: 4.481 ± 0.426
5.392GlyLys: 5.392 ± 0.578
6.582GlyLeu: 6.582 ± 0.787
2.941GlyMet: 2.941 ± 0.404
2.941GlyAsn: 2.941 ± 0.48
2.871GlyPro: 2.871 ± 0.465
2.871GlyGln: 2.871 ± 0.473
3.991GlyArg: 3.991 ± 0.553
4.972GlySer: 4.972 ± 0.597
3.921GlyThr: 3.921 ± 0.699
5.462GlyVal: 5.462 ± 0.502
1.19GlyTrp: 1.19 ± 0.295
2.311GlyTyr: 2.311 ± 0.56
0.0GlyXaa: 0.0 ± 0.0
His
1.681HisAla: 1.681 ± 0.287
0.49HisCys: 0.49 ± 0.151
1.26HisAsp: 1.26 ± 0.321
1.26HisGlu: 1.26 ± 0.301
1.05HisPhe: 1.05 ± 0.27
1.47HisGly: 1.47 ± 0.347
0.98HisHis: 0.98 ± 0.269
1.12HisIle: 1.12 ± 0.274
0.91HisLys: 0.91 ± 0.264
1.961HisLeu: 1.961 ± 0.401
0.49HisMet: 0.49 ± 0.192
0.56HisAsn: 0.56 ± 0.215
1.12HisPro: 1.12 ± 0.227
1.05HisGln: 1.05 ± 0.278
1.961HisArg: 1.961 ± 0.427
1.05HisSer: 1.05 ± 0.296
0.91HisThr: 0.91 ± 0.26
1.26HisVal: 1.26 ± 0.34
0.63HisTrp: 0.63 ± 0.241
1.05HisTyr: 1.05 ± 0.263
0.0HisXaa: 0.0 ± 0.0
Ile
4.762IleAla: 4.762 ± 0.541
0.56IleCys: 0.56 ± 0.189
3.011IleAsp: 3.011 ± 0.433
4.131IleGlu: 4.131 ± 0.592
0.91IlePhe: 0.91 ± 0.232
2.801IleGly: 2.801 ± 0.464
1.12IleHis: 1.12 ± 0.284
2.381IleIle: 2.381 ± 0.558
3.711IleLys: 3.711 ± 0.487
3.711IleLeu: 3.711 ± 0.464
1.611IleMet: 1.611 ± 0.268
2.171IleAsn: 2.171 ± 0.322
1.891IlePro: 1.891 ± 0.389
1.611IleGln: 1.611 ± 0.456
3.641IleArg: 3.641 ± 0.509
3.011IleSer: 3.011 ± 0.466
4.131IleThr: 4.131 ± 0.448
2.871IleVal: 2.871 ± 0.422
0.98IleTrp: 0.98 ± 0.281
0.98IleTyr: 0.98 ± 0.251
0.0IleXaa: 0.0 ± 0.0
Lys
9.033LysAla: 9.033 ± 0.708
0.56LysCys: 0.56 ± 0.191
3.291LysAsp: 3.291 ± 0.472
3.221LysGlu: 3.221 ± 0.451
1.681LysPhe: 1.681 ± 0.372
6.022LysGly: 6.022 ± 0.598
1.05LysHis: 1.05 ± 0.235
2.241LysIle: 2.241 ± 0.465
2.381LysLys: 2.381 ± 0.394
4.061LysLeu: 4.061 ± 0.672
2.801LysMet: 2.801 ± 0.499
1.19LysAsn: 1.19 ± 0.29
2.871LysPro: 2.871 ± 0.452
2.731LysGln: 2.731 ± 0.606
3.711LysArg: 3.711 ± 0.481
2.451LysSer: 2.451 ± 0.418
3.571LysThr: 3.571 ± 0.353
4.271LysVal: 4.271 ± 0.489
1.05LysTrp: 1.05 ± 0.274
0.98LysTyr: 0.98 ± 0.278
0.0LysXaa: 0.0 ± 0.0
Leu
7.142LeuAla: 7.142 ± 0.791
1.05LeuCys: 1.05 ± 0.289
5.742LeuAsp: 5.742 ± 0.926
6.372LeuGlu: 6.372 ± 0.59
1.681LeuPhe: 1.681 ± 0.353
4.832LeuGly: 4.832 ± 0.573
1.891LeuHis: 1.891 ± 0.373
3.711LeuIle: 3.711 ± 0.46
4.692LeuLys: 4.692 ± 0.502
6.162LeuLeu: 6.162 ± 0.765
2.591LeuMet: 2.591 ± 0.458
3.361LeuAsn: 3.361 ± 0.473
4.271LeuPro: 4.271 ± 0.478
2.311LeuGln: 2.311 ± 0.362
5.672LeuArg: 5.672 ± 0.656
6.722LeuSer: 6.722 ± 0.697
7.562LeuThr: 7.562 ± 0.845
5.952LeuVal: 5.952 ± 0.612
1.19LeuTrp: 1.19 ± 0.358
1.961LeuTyr: 1.961 ± 0.293
0.0LeuXaa: 0.0 ± 0.0
Met
4.201MetAla: 4.201 ± 0.767
0.42MetCys: 0.42 ± 0.153
2.101MetAsp: 2.101 ± 0.364
1.751MetGlu: 1.751 ± 0.371
0.56MetPhe: 0.56 ± 0.195
2.171MetGly: 2.171 ± 0.4
0.49MetHis: 0.49 ± 0.204
0.84MetIle: 0.84 ± 0.211
3.081MetLys: 3.081 ± 0.523
2.031MetLeu: 2.031 ± 0.439
0.84MetMet: 0.84 ± 0.229
1.4MetAsn: 1.4 ± 0.373
1.05MetPro: 1.05 ± 0.249
1.26MetGln: 1.26 ± 0.304
1.12MetArg: 1.12 ± 0.349
2.661MetSer: 2.661 ± 0.402
2.801MetThr: 2.801 ± 0.442
2.311MetVal: 2.311 ± 0.387
0.42MetTrp: 0.42 ± 0.189
0.63MetTyr: 0.63 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
2.451AsnAla: 2.451 ± 0.57
0.42AsnCys: 0.42 ± 0.176
1.26AsnAsp: 1.26 ± 0.261
2.311AsnGlu: 2.311 ± 0.396
0.91AsnPhe: 0.91 ± 0.263
3.011AsnGly: 3.011 ± 0.56
1.05AsnHis: 1.05 ± 0.302
1.47AsnIle: 1.47 ± 0.313
3.081AsnLys: 3.081 ± 0.45
3.221AsnLeu: 3.221 ± 0.493
0.91AsnMet: 0.91 ± 0.286
1.611AsnAsn: 1.611 ± 0.453
2.101AsnPro: 2.101 ± 0.326
2.031AsnGln: 2.031 ± 0.32
1.681AsnArg: 1.681 ± 0.348
1.4AsnSer: 1.4 ± 0.308
2.381AsnThr: 2.381 ± 0.474
2.591AsnVal: 2.591 ± 0.464
0.63AsnTrp: 0.63 ± 0.192
0.77AsnTyr: 0.77 ± 0.235
0.0AsnXaa: 0.0 ± 0.0
Pro
4.411ProAla: 4.411 ± 0.577
0.63ProCys: 0.63 ± 0.205
3.291ProAsp: 3.291 ± 0.523
3.571ProGlu: 3.571 ± 0.553
1.751ProPhe: 1.751 ± 0.359
2.731ProGly: 2.731 ± 0.397
0.77ProHis: 0.77 ± 0.227
1.611ProIle: 1.611 ± 0.307
2.521ProLys: 2.521 ± 0.432
3.291ProLeu: 3.291 ± 0.511
0.42ProMet: 0.42 ± 0.157
1.26ProAsn: 1.26 ± 0.296
1.891ProPro: 1.891 ± 0.529
1.47ProGln: 1.47 ± 0.334
1.751ProArg: 1.751 ± 0.399
2.731ProSer: 2.731 ± 0.395
3.571ProThr: 3.571 ± 0.838
3.991ProVal: 3.991 ± 0.525
0.49ProTrp: 0.49 ± 0.189
1.26ProTyr: 1.26 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
5.042GlnAla: 5.042 ± 0.597
1.05GlnCys: 1.05 ± 0.277
1.681GlnAsp: 1.681 ± 0.338
2.171GlnGlu: 2.171 ± 0.425
0.7GlnPhe: 0.7 ± 0.252
2.801GlnGly: 2.801 ± 0.382
1.33GlnHis: 1.33 ± 0.328
2.031GlnIle: 2.031 ± 0.393
1.751GlnLys: 1.751 ± 0.54
3.011GlnLeu: 3.011 ± 0.417
1.33GlnMet: 1.33 ± 0.401
1.33GlnAsn: 1.33 ± 0.269
0.98GlnPro: 0.98 ± 0.251
1.05GlnGln: 1.05 ± 0.282
1.961GlnArg: 1.961 ± 0.414
2.521GlnSer: 2.521 ± 0.383
2.031GlnThr: 2.031 ± 0.343
2.731GlnVal: 2.731 ± 0.416
0.7GlnTrp: 0.7 ± 0.192
0.77GlnTyr: 0.77 ± 0.226
0.0GlnXaa: 0.0 ± 0.0
Arg
5.882ArgAla: 5.882 ± 0.582
0.91ArgCys: 0.91 ± 0.25
2.871ArgAsp: 2.871 ± 0.384
2.591ArgGlu: 2.591 ± 0.497
2.521ArgPhe: 2.521 ± 0.427
4.061ArgGly: 4.061 ± 0.482
1.4ArgHis: 1.4 ± 0.35
2.241ArgIle: 2.241 ± 0.392
3.431ArgLys: 3.431 ± 0.437
6.652ArgLeu: 6.652 ± 0.816
2.031ArgMet: 2.031 ± 0.269
3.011ArgAsn: 3.011 ± 0.464
3.011ArgPro: 3.011 ± 0.519
2.101ArgGln: 2.101 ± 0.452
3.641ArgArg: 3.641 ± 0.737
3.571ArgSer: 3.571 ± 0.679
3.151ArgThr: 3.151 ± 0.482
4.552ArgVal: 4.552 ± 0.594
1.19ArgTrp: 1.19 ± 0.249
1.47ArgTyr: 1.47 ± 0.275
0.0ArgXaa: 0.0 ± 0.0
Ser
5.672SerAla: 5.672 ± 0.693
1.19SerCys: 1.19 ± 0.31
3.431SerAsp: 3.431 ± 0.529
2.731SerGlu: 2.731 ± 0.388
1.681SerPhe: 1.681 ± 0.28
5.462SerGly: 5.462 ± 0.73
1.12SerHis: 1.12 ± 0.287
3.221SerIle: 3.221 ± 0.514
2.661SerLys: 2.661 ± 0.352
6.022SerLeu: 6.022 ± 0.544
2.661SerMet: 2.661 ± 0.414
1.681SerAsn: 1.681 ± 0.384
2.311SerPro: 2.311 ± 0.502
2.171SerGln: 2.171 ± 0.34
3.361SerArg: 3.361 ± 0.477
3.151SerSer: 3.151 ± 0.49
3.501SerThr: 3.501 ± 0.556
3.431SerVal: 3.431 ± 0.465
1.12SerTrp: 1.12 ± 0.255
1.681SerTyr: 1.681 ± 0.36
0.0SerXaa: 0.0 ± 0.0
Thr
6.302ThrAla: 6.302 ± 0.899
1.19ThrCys: 1.19 ± 0.303
3.081ThrAsp: 3.081 ± 0.425
3.501ThrGlu: 3.501 ± 0.531
2.311ThrPhe: 2.311 ± 0.383
5.532ThrGly: 5.532 ± 0.523
0.91ThrHis: 0.91 ± 0.234
3.361ThrIle: 3.361 ± 0.662
4.271ThrLys: 4.271 ± 0.703
5.532ThrLeu: 5.532 ± 0.663
2.031ThrMet: 2.031 ± 0.403
1.681ThrAsn: 1.681 ± 0.489
3.711ThrPro: 3.711 ± 0.675
2.031ThrGln: 2.031 ± 0.281
4.061ThrArg: 4.061 ± 0.669
2.871ThrSer: 2.871 ± 0.54
4.201ThrThr: 4.201 ± 0.68
4.271ThrVal: 4.271 ± 0.556
0.98ThrTrp: 0.98 ± 0.294
1.751ThrTyr: 1.751 ± 0.352
0.0ThrXaa: 0.0 ± 0.0
Val
6.302ValAla: 6.302 ± 0.739
1.05ValCys: 1.05 ± 0.301
4.341ValAsp: 4.341 ± 0.597
4.762ValGlu: 4.762 ± 0.571
2.171ValPhe: 2.171 ± 0.36
4.552ValGly: 4.552 ± 0.507
1.541ValHis: 1.541 ± 0.374
4.341ValIle: 4.341 ± 0.516
4.692ValLys: 4.692 ± 0.518
3.711ValLeu: 3.711 ± 0.503
2.451ValMet: 2.451 ± 0.414
3.291ValAsn: 3.291 ± 0.548
2.731ValPro: 2.731 ± 0.448
2.731ValGln: 2.731 ± 0.317
3.921ValArg: 3.921 ± 0.421
4.972ValSer: 4.972 ± 0.508
4.481ValThr: 4.481 ± 0.616
6.232ValVal: 6.232 ± 0.781
1.12ValTrp: 1.12 ± 0.244
1.821ValTyr: 1.821 ± 0.466
0.0ValXaa: 0.0 ± 0.0
Trp
1.751TrpAla: 1.751 ± 0.389
0.14TrpCys: 0.14 ± 0.093
1.26TrpAsp: 1.26 ± 0.339
0.84TrpGlu: 0.84 ± 0.301
1.05TrpPhe: 1.05 ± 0.235
1.05TrpGly: 1.05 ± 0.247
0.49TrpHis: 0.49 ± 0.162
0.63TrpIle: 0.63 ± 0.207
0.77TrpLys: 0.77 ± 0.208
1.19TrpLeu: 1.19 ± 0.304
0.35TrpMet: 0.35 ± 0.122
0.91TrpAsn: 0.91 ± 0.233
0.49TrpPro: 0.49 ± 0.163
0.84TrpGln: 0.84 ± 0.238
1.05TrpArg: 1.05 ± 0.271
0.77TrpSer: 0.77 ± 0.234
0.77TrpThr: 0.77 ± 0.249
1.19TrpVal: 1.19 ± 0.21
0.07TrpTrp: 0.07 ± 0.072
0.49TrpTyr: 0.49 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.031TyrAla: 2.031 ± 0.385
0.49TyrCys: 0.49 ± 0.179
1.33TyrAsp: 1.33 ± 0.269
1.33TyrGlu: 1.33 ± 0.278
0.84TyrPhe: 0.84 ± 0.205
2.241TyrGly: 2.241 ± 0.445
0.84TyrHis: 0.84 ± 0.208
1.05TyrIle: 1.05 ± 0.408
0.84TyrLys: 0.84 ± 0.246
2.031TyrLeu: 2.031 ± 0.366
0.42TyrMet: 0.42 ± 0.178
1.12TyrAsn: 1.12 ± 0.288
1.19TyrPro: 1.19 ± 0.247
1.19TyrGln: 1.19 ± 0.298
2.661TyrArg: 2.661 ± 0.383
1.47TyrSer: 1.47 ± 0.506
1.47TyrThr: 1.47 ± 0.332
2.031TyrVal: 2.031 ± 0.383
0.21TyrTrp: 0.21 ± 0.119
0.77TyrTyr: 0.77 ± 0.189
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (14282 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
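A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement (as many as the proteome contains), the dipeptide frequency is recomputed for each of the 100 replicates, and the standard deviation of the replicates is taken as the error. The page does not say which statistic is reported after ±, and the names `pair_counts` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Count overlapping residue pairs in a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Std. dev. of the per mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Hypothetical toy sequences (one-letter codes), not the phage proteome.
toy_proteome = ["MAAK", "MAL", "AAAA", "KLMA"]
print(f"AA error: {bootstrap_error(toy_proteome, 'AA'):.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, so the estimate reflects variation between proteins.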

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski