Amino acid dipeptide frequency for Aeromonas phage LAh3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
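As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the mapping of non-standard letters to `X` are assumptions made for illustration; this is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are overlapping pairs of adjacent residues; a pair never
    spans two different proteins. Non-standard letters are mapped to 'X'
    (an assumption mirroring the Xaa class).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MAAK", "AAL"])
print(freqs)
```

Counting per protein rather than over a concatenated string avoids creating artificial dipeptides at protein boundaries.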

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
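The comparison against the uniform expectation can be sketched as follows. The three values are taken from the table on this page; the helper `classify` and the simple above/below threshold are assumptions, since the exact rule used for the red and blue marking is not stated here.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"

# Values copied from the table (per mille).
observed = {"AlaAla": 15.951, "CysTrp": 0.075, "GluThr": 3.67}
for name, value in observed.items():
    fold = value / EXPECTED_PER_MILLE
    print(f"{name}: {value} per mille, {fold:.2f}x expected, {classify(value)}")
```

A uniform baseline ignores the fact that single amino acids are themselves unequally frequent; a baseline built from the product of the two single-residue frequencies would be an alternative, but it is not what the text above describes.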

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
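As a small illustration of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, using the AlaAla entry from the table. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0

ala_ala = 15.951  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```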

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first amino acid. Order of the second amino acid: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.951 ± 2.208
AlaCys: 1.123 ± 0.309
AlaAsp: 5.692 ± 0.525
AlaGlu: 8.762 ± 1.188
AlaPhe: 2.696 ± 0.422
AlaGly: 9.736 ± 0.768
AlaHis: 1.648 ± 0.422
AlaIle: 4.718 ± 0.603
AlaLys: 5.841 ± 0.89
AlaLeu: 10.934 ± 1.316
AlaMet: 3.295 ± 0.464
AlaAsn: 3.894 ± 0.642
AlaPro: 3.37 ± 0.415
AlaGln: 6.066 ± 1.054
AlaArg: 5.916 ± 0.828
AlaSer: 5.242 ± 0.478
AlaThr: 7.04 ± 0.996
AlaVal: 6.515 ± 0.905
AlaTrp: 1.048 ± 0.299
AlaTyr: 3.67 ± 0.446
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.048 ± 0.324
CysCys: 0.449 ± 0.198
CysAsp: 0.449 ± 0.228
CysGlu: 0.599 ± 0.203
CysPhe: 0.524 ± 0.231
CysGly: 1.498 ± 0.396
CysHis: 0.374 ± 0.184
CysIle: 0.599 ± 0.234
CysLys: 0.599 ± 0.206
CysLeu: 0.824 ± 0.252
CysMet: 0.599 ± 0.198
CysAsn: 0.225 ± 0.13
CysPro: 0.674 ± 0.196
CysGln: 0.3 ± 0.145
CysArg: 0.674 ± 0.198
CysSer: 0.899 ± 0.294
CysThr: 1.123 ± 0.378
CysVal: 0.824 ± 0.226
CysTrp: 0.075 ± 0.076
CysTyr: 0.449 ± 0.207
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.841 ± 0.55
AspCys: 0.899 ± 0.266
AspAsp: 4.418 ± 0.614
AspGlu: 3.819 ± 0.522
AspPhe: 2.921 ± 0.451
AspGly: 4.493 ± 0.696
AspHis: 0.974 ± 0.294
AspIle: 3.22 ± 0.455
AspLys: 3.37 ± 0.464
AspLeu: 4.793 ± 0.689
AspMet: 2.022 ± 0.456
AspAsn: 1.573 ± 0.318
AspPro: 1.947 ± 0.336
AspGln: 1.947 ± 0.288
AspArg: 3.37 ± 0.412
AspSer: 3.52 ± 0.406
AspThr: 2.921 ± 0.498
AspVal: 3.295 ± 0.441
AspTrp: 0.899 ± 0.296
AspTyr: 2.247 ± 0.433
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.264 ± 0.853
GluCys: 0.749 ± 0.225
GluAsp: 3.445 ± 0.646
GluGlu: 3.67 ± 0.521
GluPhe: 1.947 ± 0.38
GluGly: 3.744 ± 0.586
GluHis: 1.573 ± 0.341
GluIle: 1.273 ± 0.287
GluLys: 3.445 ± 0.53
GluLeu: 6.74 ± 0.704
GluMet: 1.048 ± 0.252
GluAsn: 1.348 ± 0.294
GluPro: 1.423 ± 0.326
GluGln: 4.044 ± 0.799
GluArg: 3.295 ± 0.51
GluSer: 3.07 ± 0.399
GluThr: 3.67 ± 0.469
GluVal: 4.119 ± 0.446
GluTrp: 0.974 ± 0.228
GluTyr: 1.722 ± 0.28
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.696 ± 0.532
PheCys: 0.374 ± 0.178
PheAsp: 2.921 ± 0.382
PheGlu: 1.648 ± 0.312
PhePhe: 0.899 ± 0.244
PheGly: 2.996 ± 0.427
PheHis: 0.225 ± 0.123
PheIle: 2.172 ± 0.402
PheLys: 2.097 ± 0.445
PheLeu: 2.396 ± 0.445
PheMet: 1.498 ± 0.228
PheAsn: 1.573 ± 0.306
PhePro: 1.123 ± 0.237
PheGln: 1.722 ± 0.313
PheArg: 1.648 ± 0.296
PheSer: 1.648 ± 0.277
PheThr: 1.273 ± 0.347
PheVal: 2.097 ± 0.346
PheTrp: 0.374 ± 0.169
PheTyr: 1.198 ± 0.293
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.639 ± 0.947
GlyCys: 1.573 ± 0.362
GlyAsp: 5.242 ± 0.681
GlyGlu: 3.295 ± 0.365
GlyPhe: 3.22 ± 0.407
GlyGly: 7.115 ± 1.229
GlyHis: 1.498 ± 0.328
GlyIle: 3.744 ± 0.444
GlyLys: 4.868 ± 0.5
GlyLeu: 6.216 ± 0.761
GlyMet: 1.498 ± 0.488
GlyAsn: 2.921 ± 0.446
GlyPro: 2.247 ± 0.377
GlyGln: 2.921 ± 0.477
GlyArg: 5.242 ± 0.605
GlySer: 4.718 ± 0.699
GlyThr: 5.092 ± 0.762
GlyVal: 5.092 ± 0.569
GlyTrp: 1.648 ± 0.299
GlyTyr: 3.07 ± 0.523
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.198 ± 0.395
HisCys: 0.3 ± 0.167
HisAsp: 1.198 ± 0.266
HisGlu: 1.198 ± 0.346
HisPhe: 0.674 ± 0.202
HisGly: 2.097 ± 0.427
HisHis: 0.524 ± 0.199
HisIle: 0.899 ± 0.283
HisLys: 0.824 ± 0.27
HisLeu: 1.048 ± 0.241
HisMet: 0.824 ± 0.311
HisAsn: 0.974 ± 0.267
HisPro: 1.423 ± 0.298
HisGln: 0.749 ± 0.199
HisArg: 1.947 ± 0.529
HisSer: 1.423 ± 0.306
HisThr: 1.123 ± 0.283
HisVal: 1.423 ± 0.359
HisTrp: 0.15 ± 0.111
HisTyr: 1.123 ± 0.319
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.317 ± 0.665
IleCys: 0.374 ± 0.146
IleAsp: 2.621 ± 0.465
IleGlu: 2.097 ± 0.34
IlePhe: 0.974 ± 0.265
IleGly: 3.819 ± 0.503
IleHis: 1.348 ± 0.266
IleIle: 1.722 ± 0.312
IleLys: 2.097 ± 0.472
IleLeu: 3.145 ± 0.552
IleMet: 1.423 ± 0.268
IleAsn: 1.797 ± 0.456
IlePro: 1.722 ± 0.292
IleGln: 1.872 ± 0.377
IleArg: 2.771 ± 0.36
IleSer: 2.097 ± 0.369
IleThr: 2.621 ± 0.393
IleVal: 2.471 ± 0.359
IleTrp: 0.3 ± 0.147
IleTyr: 1.348 ± 0.3
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.163 ± 0.838
LysCys: 0.3 ± 0.121
LysAsp: 3.445 ± 0.365
LysGlu: 4.418 ± 0.574
LysPhe: 1.273 ± 0.256
LysGly: 3.52 ± 0.549
LysHis: 1.198 ± 0.246
LysIle: 1.722 ± 0.359
LysLys: 2.846 ± 0.532
LysLeu: 5.841 ± 0.772
LysMet: 1.947 ± 0.493
LysAsn: 1.947 ± 0.348
LysPro: 2.771 ± 0.589
LysGln: 2.696 ± 0.567
LysArg: 3.37 ± 0.38
LysSer: 2.696 ± 0.432
LysThr: 2.546 ± 0.376
LysVal: 3.295 ± 0.399
LysTrp: 1.423 ± 0.396
LysTyr: 1.872 ± 0.314
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.912 ± 0.865
LeuCys: 1.273 ± 0.367
LeuAsp: 5.841 ± 0.52
LeuGlu: 4.269 ± 0.662
LeuPhe: 2.996 ± 0.482
LeuGly: 5.991 ± 0.627
LeuHis: 2.322 ± 0.397
LeuIle: 3.37 ± 0.598
LeuLys: 5.092 ± 0.496
LeuLeu: 6.815 ± 0.774
LeuMet: 3.145 ± 0.399
LeuAsn: 3.819 ± 0.462
LeuPro: 4.718 ± 0.539
LeuGln: 3.894 ± 0.591
LeuArg: 4.643 ± 0.716
LeuSer: 5.991 ± 0.685
LeuThr: 5.542 ± 0.802
LeuVal: 5.766 ± 0.8
LeuTrp: 1.048 ± 0.272
LeuTyr: 2.696 ± 0.383
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.493 ± 0.493
MetCys: 0.449 ± 0.162
MetAsp: 2.097 ± 0.376
MetGlu: 1.423 ± 0.274
MetPhe: 0.824 ± 0.278
MetGly: 1.498 ± 0.402
MetHis: 0.674 ± 0.258
MetIle: 0.824 ± 0.216
MetLys: 2.097 ± 0.51
MetLeu: 3.295 ± 0.525
MetMet: 0.899 ± 0.225
MetAsn: 1.797 ± 0.382
MetPro: 1.498 ± 0.324
MetGln: 1.648 ± 0.515
MetArg: 1.648 ± 0.39
MetSer: 1.872 ± 0.397
MetThr: 1.423 ± 0.324
MetVal: 1.947 ± 0.493
MetTrp: 0.15 ± 0.1
MetTyr: 0.524 ± 0.211
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.568 ± 0.708
AsnCys: 0.3 ± 0.164
AsnAsp: 1.947 ± 0.486
AsnGlu: 1.648 ± 0.324
AsnPhe: 0.974 ± 0.297
AsnGly: 4.493 ± 0.647
AsnHis: 1.198 ± 0.353
AsnIle: 1.273 ± 0.248
AsnLys: 2.471 ± 0.398
AsnLeu: 2.696 ± 0.455
AsnMet: 1.348 ± 0.321
AsnAsn: 1.498 ± 0.323
AsnPro: 2.396 ± 0.376
AsnGln: 1.498 ± 0.296
AsnArg: 2.771 ± 0.454
AsnSer: 1.947 ± 0.335
AsnThr: 1.872 ± 0.308
AsnVal: 2.471 ± 0.534
AsnTrp: 0.524 ± 0.155
AsnTyr: 1.048 ± 0.329
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.242 ± 0.798
ProCys: 0.599 ± 0.199
ProAsp: 2.921 ± 0.359
ProGlu: 3.894 ± 0.715
ProPhe: 1.348 ± 0.356
ProGly: 3.37 ± 0.506
ProHis: 0.524 ± 0.237
ProIle: 1.797 ± 0.305
ProLys: 2.322 ± 0.35
ProLeu: 2.921 ± 0.439
ProMet: 1.048 ± 0.278
ProAsn: 1.797 ± 0.404
ProPro: 1.573 ± 0.487
ProGln: 1.123 ± 0.233
ProArg: 1.573 ± 0.296
ProSer: 2.247 ± 0.442
ProThr: 1.273 ± 0.228
ProVal: 2.846 ± 0.455
ProTrp: 0.374 ± 0.217
ProTyr: 1.198 ± 0.415
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.216 ± 0.832
GlnCys: 0.674 ± 0.219
GlnAsp: 1.498 ± 0.296
GlnGlu: 2.396 ± 0.464
GlnPhe: 1.722 ± 0.361
GlnGly: 3.52 ± 0.507
GlnHis: 0.824 ± 0.261
GlnIle: 1.947 ± 0.373
GlnLys: 2.621 ± 0.45
GlnLeu: 4.868 ± 0.716
GlnMet: 1.573 ± 0.286
GlnAsn: 1.573 ± 0.341
GlnPro: 1.573 ± 0.318
GlnGln: 3.295 ± 0.534
GlnArg: 2.696 ± 0.508
GlnSer: 2.471 ± 0.582
GlnThr: 2.696 ± 0.556
GlnVal: 2.322 ± 0.329
GlnTrp: 0.449 ± 0.183
GlnTyr: 1.872 ± 0.479
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.291 ± 0.711
ArgCys: 0.749 ± 0.26
ArgAsp: 3.07 ± 0.462
ArgGlu: 3.67 ± 0.456
ArgPhe: 1.797 ± 0.413
ArgGly: 3.744 ± 0.605
ArgHis: 1.198 ± 0.266
ArgIle: 2.921 ± 0.568
ArgLys: 3.52 ± 0.535
ArgLeu: 5.841 ± 0.65
ArgMet: 2.322 ± 0.599
ArgAsn: 3.07 ± 0.382
ArgPro: 1.947 ± 0.357
ArgGln: 2.996 ± 0.531
ArgArg: 3.445 ± 0.533
ArgSer: 2.471 ± 0.396
ArgThr: 2.846 ± 0.496
ArgVal: 2.921 ± 0.485
ArgTrp: 1.648 ± 0.388
ArgTyr: 2.022 ± 0.379
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.841 ± 0.676
SerCys: 0.674 ± 0.208
SerAsp: 2.322 ± 0.309
SerGlu: 2.396 ± 0.53
SerPhe: 1.423 ± 0.203
SerGly: 4.718 ± 0.621
SerHis: 0.599 ± 0.258
SerIle: 2.546 ± 0.335
SerLys: 3.595 ± 0.459
SerLeu: 4.568 ± 0.577
SerMet: 1.573 ± 0.26
SerAsn: 2.396 ± 0.489
SerPro: 2.247 ± 0.39
SerGln: 2.996 ± 0.464
SerArg: 3.52 ± 0.507
SerSer: 3.22 ± 0.457
SerThr: 3.969 ± 0.7
SerVal: 3.67 ± 0.449
SerTrp: 0.974 ± 0.293
SerTyr: 1.423 ± 0.321
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.991 ± 0.848
ThrCys: 0.449 ± 0.175
ThrAsp: 3.145 ± 0.418
ThrGlu: 3.145 ± 0.463
ThrPhe: 2.022 ± 0.35
ThrGly: 4.493 ± 0.595
ThrHis: 0.899 ± 0.323
ThrIle: 2.471 ± 0.419
ThrLys: 4.119 ± 0.434
ThrLeu: 5.018 ± 0.621
ThrMet: 1.872 ± 0.383
ThrAsn: 1.648 ± 0.423
ThrPro: 2.921 ± 0.392
ThrGln: 1.797 ± 0.436
ThrArg: 3.22 ± 0.436
ThrSer: 4.044 ± 0.542
ThrThr: 4.269 ± 0.725
ThrVal: 3.969 ± 0.533
ThrTrp: 0.524 ± 0.188
ThrTyr: 2.172 ± 0.401
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.441 ± 0.546
ValCys: 0.749 ± 0.255
ValAsp: 2.921 ± 0.541
ValGlu: 3.894 ± 0.632
ValPhe: 2.396 ± 0.438
ValGly: 4.643 ± 0.692
ValHis: 2.247 ± 0.449
ValIle: 2.696 ± 0.419
ValLys: 3.295 ± 0.525
ValLeu: 5.167 ± 0.562
ValMet: 1.947 ± 0.388
ValAsn: 2.696 ± 0.377
ValPro: 2.696 ± 0.553
ValGln: 2.921 ± 0.531
ValArg: 3.819 ± 0.533
ValSer: 2.921 ± 0.297
ValThr: 4.418 ± 0.609
ValVal: 5.092 ± 0.615
ValTrp: 0.674 ± 0.261
ValTyr: 1.198 ± 0.322
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.573 ± 0.241
TrpCys: 0.3 ± 0.178
TrpAsp: 1.048 ± 0.313
TrpGlu: 0.524 ± 0.18
TrpPhe: 1.198 ± 0.247
TrpGly: 0.974 ± 0.271
TrpHis: 0.225 ± 0.133
TrpIle: 0.075 ± 0.073
TrpLys: 0.374 ± 0.131
TrpLeu: 2.022 ± 0.393
TrpMet: 0.449 ± 0.169
TrpAsn: 0.674 ± 0.263
TrpPro: 0.449 ± 0.186
TrpGln: 0.524 ± 0.17
TrpArg: 0.674 ± 0.203
TrpSer: 0.449 ± 0.173
TrpThr: 0.749 ± 0.203
TrpVal: 0.749 ± 0.31
TrpTrp: 0.599 ± 0.266
TrpTyr: 0.524 ± 0.193
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.471 ± 0.458
TyrCys: 0.449 ± 0.163
TyrAsp: 2.247 ± 0.458
TyrGlu: 1.797 ± 0.456
TyrPhe: 0.749 ± 0.292
TyrGly: 2.471 ± 0.425
TyrHis: 1.048 ± 0.322
TyrIle: 1.947 ± 0.465
TyrLys: 1.797 ± 0.33
TyrLeu: 2.996 ± 0.633
TyrMet: 0.674 ± 0.25
TyrAsn: 1.722 ± 0.317
TyrPro: 1.273 ± 0.37
TyrGln: 1.797 ± 0.311
TyrArg: 2.247 ± 0.395
TyrSer: 1.722 ± 0.305
TyrThr: 1.722 ± 0.317
TyrVal: 1.947 ± 0.402
TyrTrp: 0.3 ± 0.131
TyrTyr: 0.524 ± 0.187
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (13354 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
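A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the error measure are assumptions; the page does not state which statistic is reported after the ± sign.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of a dipeptide's per mille frequency across
    bootstrap resamples of whole proteins (sampling with replacement)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy proteome (hypothetical one-letter sequences).
proteins = ["MAAKAA", "MKLV", "AAGAA", "MSTAAL"]
err = bootstrap_error(proteins, "AA")
print(err)
```

Resampling proteins rather than individual dipeptides keeps the pairs within one protein together, so the error reflects variation between proteins.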

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski