Amino acid dipepetide frequency for Yersinia phage vB_YenP_ISAO8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.462AlaAla: 12.462 ± 1.752
0.705AlaCys: 0.705 ± 0.339
5.643AlaAsp: 5.643 ± 0.654
7.289AlaGlu: 7.289 ± 1.029
2.743AlaPhe: 2.743 ± 0.48
6.662AlaGly: 6.662 ± 0.872
1.959AlaHis: 1.959 ± 0.444
3.213AlaIle: 3.213 ± 0.616
7.211AlaLys: 7.211 ± 0.834
7.367AlaLeu: 7.367 ± 0.97
3.135AlaMet: 3.135 ± 0.561
3.527AlaAsn: 3.527 ± 0.599
3.057AlaPro: 3.057 ± 0.547
5.094AlaGln: 5.094 ± 1.396
6.192AlaArg: 6.192 ± 0.641
4.624AlaSer: 4.624 ± 0.593
5.486AlaThr: 5.486 ± 0.563
7.211AlaVal: 7.211 ± 0.733
1.646AlaTrp: 1.646 ± 0.276
2.978AlaTyr: 2.978 ± 0.429
0.0AlaXaa: 0.0 ± 0.0
Cys
0.392CysAla: 0.392 ± 0.131
0.157CysCys: 0.157 ± 0.1
0.549CysAsp: 0.549 ± 0.214
0.627CysGlu: 0.627 ± 0.268
0.549CysPhe: 0.549 ± 0.176
0.47CysGly: 0.47 ± 0.188
0.392CysHis: 0.392 ± 0.206
0.157CysIle: 0.157 ± 0.09
0.078CysLys: 0.078 ± 0.09
0.392CysLeu: 0.392 ± 0.176
0.549CysMet: 0.549 ± 0.16
0.314CysAsn: 0.314 ± 0.133
0.314CysPro: 0.314 ± 0.142
0.235CysGln: 0.235 ± 0.14
0.627CysArg: 0.627 ± 0.193
0.549CysSer: 0.549 ± 0.182
0.627CysThr: 0.627 ± 0.263
1.097CysVal: 1.097 ± 0.304
0.157CysTrp: 0.157 ± 0.146
0.392CysTyr: 0.392 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
4.938AspAla: 4.938 ± 0.518
0.47AspCys: 0.47 ± 0.179
3.605AspAsp: 3.605 ± 0.72
4.389AspGlu: 4.389 ± 0.563
2.978AspPhe: 2.978 ± 0.506
5.565AspGly: 5.565 ± 0.806
0.862AspHis: 0.862 ± 0.299
3.135AspIle: 3.135 ± 0.461
4.467AspLys: 4.467 ± 0.486
4.938AspLeu: 4.938 ± 0.811
1.959AspMet: 1.959 ± 0.365
2.586AspAsn: 2.586 ± 0.418
2.116AspPro: 2.116 ± 0.455
1.724AspGln: 1.724 ± 0.424
3.527AspArg: 3.527 ± 0.485
3.997AspSer: 3.997 ± 0.66
2.743AspThr: 2.743 ± 0.39
4.703AspVal: 4.703 ± 0.678
0.941AspTrp: 0.941 ± 0.403
2.038AspTyr: 2.038 ± 0.548
0.0AspXaa: 0.0 ± 0.0
Glu
6.505GluAla: 6.505 ± 0.883
0.862GluCys: 0.862 ± 0.349
4.624GluAsp: 4.624 ± 0.65
4.546GluGlu: 4.546 ± 0.813
2.9GluPhe: 2.9 ± 0.422
4.311GluGly: 4.311 ± 0.537
1.489GluHis: 1.489 ± 0.394
2.822GluIle: 2.822 ± 0.425
2.978GluLys: 2.978 ± 0.382
4.467GluLeu: 4.467 ± 0.533
2.508GluMet: 2.508 ± 0.459
2.195GluAsn: 2.195 ± 0.512
1.803GluPro: 1.803 ± 0.33
4.624GluGln: 4.624 ± 0.741
3.292GluArg: 3.292 ± 0.497
2.743GluSer: 2.743 ± 0.371
3.997GluThr: 3.997 ± 0.412
3.997GluVal: 3.997 ± 0.5
1.803GluTrp: 1.803 ± 0.474
3.057GluTyr: 3.057 ± 0.408
0.0GluXaa: 0.0 ± 0.0
Phe
3.292PheAla: 3.292 ± 0.392
0.627PheCys: 0.627 ± 0.196
2.508PheAsp: 2.508 ± 0.342
1.568PheGlu: 1.568 ± 0.383
1.489PhePhe: 1.489 ± 0.384
2.9PheGly: 2.9 ± 0.406
0.705PheHis: 0.705 ± 0.243
1.881PheIle: 1.881 ± 0.409
1.724PheLys: 1.724 ± 0.383
2.665PheLeu: 2.665 ± 0.395
1.568PheMet: 1.568 ± 0.356
1.724PheAsn: 1.724 ± 0.372
1.097PhePro: 1.097 ± 0.343
1.489PheGln: 1.489 ± 0.301
2.351PheArg: 2.351 ± 0.369
2.273PheSer: 2.273 ± 0.482
1.881PheThr: 1.881 ± 0.464
2.351PheVal: 2.351 ± 0.362
0.549PheTrp: 0.549 ± 0.248
1.332PheTyr: 1.332 ± 0.365
0.0PheXaa: 0.0 ± 0.0
Gly
6.427GlyAla: 6.427 ± 1.09
0.705GlyCys: 0.705 ± 0.239
5.251GlyAsp: 5.251 ± 0.713
4.624GlyGlu: 4.624 ± 0.579
2.586GlyPhe: 2.586 ± 0.472
6.348GlyGly: 6.348 ± 0.906
0.862GlyHis: 0.862 ± 0.311
3.37GlyIle: 3.37 ± 0.489
5.486GlyLys: 5.486 ± 0.675
6.662GlyLeu: 6.662 ± 0.805
2.038GlyMet: 2.038 ± 0.328
3.919GlyAsn: 3.919 ± 0.643
1.959GlyPro: 1.959 ± 0.356
2.978GlyGln: 2.978 ± 0.563
4.546GlyArg: 4.546 ± 0.703
4.154GlySer: 4.154 ± 0.853
4.781GlyThr: 4.781 ± 0.616
4.624GlyVal: 4.624 ± 0.763
1.176GlyTrp: 1.176 ± 0.281
2.978GlyTyr: 2.978 ± 0.57
0.0GlyXaa: 0.0 ± 0.0
His
1.411HisAla: 1.411 ± 0.251
0.235HisCys: 0.235 ± 0.147
1.724HisAsp: 1.724 ± 0.31
0.941HisGlu: 0.941 ± 0.234
0.705HisPhe: 0.705 ± 0.226
2.038HisGly: 2.038 ± 0.456
0.549HisHis: 0.549 ± 0.294
1.097HisIle: 1.097 ± 0.347
1.254HisLys: 1.254 ± 0.249
1.332HisLeu: 1.332 ± 0.326
0.784HisMet: 0.784 ± 0.219
1.176HisAsn: 1.176 ± 0.327
1.019HisPro: 1.019 ± 0.287
0.784HisGln: 0.784 ± 0.301
1.176HisArg: 1.176 ± 0.386
0.784HisSer: 0.784 ± 0.206
0.47HisThr: 0.47 ± 0.244
1.097HisVal: 1.097 ± 0.272
0.235HisTrp: 0.235 ± 0.109
0.941HisTyr: 0.941 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
3.919IleAla: 3.919 ± 0.511
0.392IleCys: 0.392 ± 0.186
2.9IleAsp: 2.9 ± 0.407
2.822IleGlu: 2.822 ± 0.484
1.411IlePhe: 1.411 ± 0.321
3.527IleGly: 3.527 ± 0.44
1.176IleHis: 1.176 ± 0.317
2.351IleIle: 2.351 ± 0.395
2.743IleLys: 2.743 ± 0.46
3.37IleLeu: 3.37 ± 0.547
1.097IleMet: 1.097 ± 0.177
2.038IleAsn: 2.038 ± 0.3
2.273IlePro: 2.273 ± 0.396
1.803IleGln: 1.803 ± 0.47
3.057IleArg: 3.057 ± 0.432
3.37IleSer: 3.37 ± 0.437
3.057IleThr: 3.057 ± 0.647
2.116IleVal: 2.116 ± 0.295
0.392IleTrp: 0.392 ± 0.181
1.489IleTyr: 1.489 ± 0.336
0.0IleXaa: 0.0 ± 0.0
Lys
7.132LysAla: 7.132 ± 0.816
0.47LysCys: 0.47 ± 0.212
3.997LysAsp: 3.997 ± 0.429
5.565LysGlu: 5.565 ± 0.509
2.508LysPhe: 2.508 ± 0.426
3.527LysGly: 3.527 ± 0.484
1.803LysHis: 1.803 ± 0.357
2.508LysIle: 2.508 ± 0.526
3.449LysLys: 3.449 ± 0.595
5.957LysLeu: 5.957 ± 0.856
1.959LysMet: 1.959 ± 0.365
2.351LysAsn: 2.351 ± 0.442
2.822LysPro: 2.822 ± 0.493
2.43LysGln: 2.43 ± 0.369
3.057LysArg: 3.057 ± 0.502
2.116LysSer: 2.116 ± 0.453
3.684LysThr: 3.684 ± 0.545
3.84LysVal: 3.84 ± 0.545
1.489LysTrp: 1.489 ± 0.363
1.646LysTyr: 1.646 ± 0.354
0.0LysXaa: 0.0 ± 0.0
Leu
9.484LeuAla: 9.484 ± 1.086
0.549LeuCys: 0.549 ± 0.291
4.076LeuAsp: 4.076 ± 0.57
4.624LeuGlu: 4.624 ± 0.589
2.195LeuPhe: 2.195 ± 0.346
5.094LeuGly: 5.094 ± 0.653
1.568LeuHis: 1.568 ± 0.345
3.997LeuIle: 3.997 ± 0.629
4.703LeuLys: 4.703 ± 0.603
5.33LeuLeu: 5.33 ± 0.751
2.273LeuMet: 2.273 ± 0.468
4.232LeuAsn: 4.232 ± 0.715
3.057LeuPro: 3.057 ± 0.435
3.449LeuGln: 3.449 ± 0.478
4.389LeuArg: 4.389 ± 0.681
5.8LeuSer: 5.8 ± 0.693
6.192LeuThr: 6.192 ± 0.672
4.703LeuVal: 4.703 ± 0.63
0.941LeuTrp: 0.941 ± 0.269
2.43LeuTyr: 2.43 ± 0.338
0.0LeuXaa: 0.0 ± 0.0
Met
3.449MetAla: 3.449 ± 0.507
0.157MetCys: 0.157 ± 0.1
2.822MetAsp: 2.822 ± 0.54
2.038MetGlu: 2.038 ± 0.406
0.941MetPhe: 0.941 ± 0.27
2.038MetGly: 2.038 ± 0.592
0.627MetHis: 0.627 ± 0.26
1.097MetIle: 1.097 ± 0.257
1.568MetLys: 1.568 ± 0.326
2.743MetLeu: 2.743 ± 0.475
1.176MetMet: 1.176 ± 0.426
1.803MetAsn: 1.803 ± 0.368
1.568MetPro: 1.568 ± 0.294
1.724MetGln: 1.724 ± 0.524
2.038MetArg: 2.038 ± 0.316
2.038MetSer: 2.038 ± 0.432
2.116MetThr: 2.116 ± 0.455
1.881MetVal: 1.881 ± 0.443
0.627MetTrp: 0.627 ± 0.283
1.019MetTyr: 1.019 ± 0.288
0.0MetXaa: 0.0 ± 0.0
Asn
4.389AsnAla: 4.389 ± 0.46
0.314AsnCys: 0.314 ± 0.186
1.724AsnAsp: 1.724 ± 0.331
3.292AsnGlu: 3.292 ± 0.532
1.724AsnPhe: 1.724 ± 0.46
4.389AsnGly: 4.389 ± 0.556
1.019AsnHis: 1.019 ± 0.241
2.351AsnIle: 2.351 ± 0.329
2.978AsnLys: 2.978 ± 0.47
3.37AsnLeu: 3.37 ± 0.457
2.116AsnMet: 2.116 ± 0.459
1.881AsnAsn: 1.881 ± 0.488
2.351AsnPro: 2.351 ± 0.498
1.724AsnGln: 1.724 ± 0.546
2.9AsnArg: 2.9 ± 0.528
1.646AsnSer: 1.646 ± 0.366
1.803AsnThr: 1.803 ± 0.412
2.978AsnVal: 2.978 ± 0.496
0.705AsnTrp: 0.705 ± 0.364
1.568AsnTyr: 1.568 ± 0.305
0.0AsnXaa: 0.0 ± 0.0
Pro
4.232ProAla: 4.232 ± 0.686
0.235ProCys: 0.235 ± 0.132
3.213ProAsp: 3.213 ± 0.563
3.684ProGlu: 3.684 ± 0.501
1.646ProPhe: 1.646 ± 0.387
3.684ProGly: 3.684 ± 0.515
0.549ProHis: 0.549 ± 0.14
1.489ProIle: 1.489 ± 0.297
2.273ProLys: 2.273 ± 0.349
3.527ProLeu: 3.527 ± 0.56
0.862ProMet: 0.862 ± 0.224
1.724ProAsn: 1.724 ± 0.328
1.411ProPro: 1.411 ± 0.364
1.803ProGln: 1.803 ± 0.339
1.411ProArg: 1.411 ± 0.357
1.568ProSer: 1.568 ± 0.257
1.881ProThr: 1.881 ± 0.395
3.057ProVal: 3.057 ± 0.518
0.627ProTrp: 0.627 ± 0.195
1.254ProTyr: 1.254 ± 0.419
0.0ProXaa: 0.0 ± 0.0
Gln
4.703GlnAla: 4.703 ± 1.406
0.314GlnCys: 0.314 ± 0.14
2.195GlnAsp: 2.195 ± 0.297
2.273GlnGlu: 2.273 ± 0.481
1.724GlnPhe: 1.724 ± 0.415
3.135GlnGly: 3.135 ± 0.615
1.176GlnHis: 1.176 ± 0.292
1.803GlnIle: 1.803 ± 0.413
2.508GlnLys: 2.508 ± 0.443
3.84GlnLeu: 3.84 ± 0.558
2.116GlnMet: 2.116 ± 0.335
1.803GlnAsn: 1.803 ± 0.328
1.646GlnPro: 1.646 ± 0.4
2.508GlnGln: 2.508 ± 0.648
2.822GlnArg: 2.822 ± 0.627
2.743GlnSer: 2.743 ± 0.614
2.195GlnThr: 2.195 ± 0.429
2.9GlnVal: 2.9 ± 0.429
0.862GlnTrp: 0.862 ± 0.27
1.881GlnTyr: 1.881 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
5.016ArgAla: 5.016 ± 0.591
0.314ArgCys: 0.314 ± 0.156
3.37ArgAsp: 3.37 ± 0.508
3.84ArgGlu: 3.84 ± 0.525
2.273ArgPhe: 2.273 ± 0.456
3.84ArgGly: 3.84 ± 0.469
1.019ArgHis: 1.019 ± 0.228
2.586ArgIle: 2.586 ± 0.514
4.154ArgLys: 4.154 ± 0.638
4.859ArgLeu: 4.859 ± 0.524
1.803ArgMet: 1.803 ± 0.461
3.684ArgAsn: 3.684 ± 0.516
1.881ArgPro: 1.881 ± 0.309
2.038ArgGln: 2.038 ± 0.365
3.997ArgArg: 3.997 ± 0.628
2.508ArgSer: 2.508 ± 0.334
2.822ArgThr: 2.822 ± 0.519
3.605ArgVal: 3.605 ± 0.487
1.646ArgTrp: 1.646 ± 0.398
2.195ArgTyr: 2.195 ± 0.369
0.0ArgXaa: 0.0 ± 0.0
Ser
5.878SerAla: 5.878 ± 0.706
0.47SerCys: 0.47 ± 0.208
2.9SerAsp: 2.9 ± 0.493
2.351SerGlu: 2.351 ± 0.504
1.176SerPhe: 1.176 ± 0.296
4.311SerGly: 4.311 ± 0.586
1.097SerHis: 1.097 ± 0.301
2.273SerIle: 2.273 ± 0.378
3.37SerLys: 3.37 ± 0.517
4.859SerLeu: 4.859 ± 0.645
1.724SerMet: 1.724 ± 0.37
2.43SerAsn: 2.43 ± 0.534
3.37SerPro: 3.37 ± 0.623
2.508SerGln: 2.508 ± 0.443
2.586SerArg: 2.586 ± 0.611
1.959SerSer: 1.959 ± 0.451
3.449SerThr: 3.449 ± 0.446
3.605SerVal: 3.605 ± 0.428
0.941SerTrp: 0.941 ± 0.234
1.568SerTyr: 1.568 ± 0.374
0.0SerXaa: 0.0 ± 0.0
Thr
5.33ThrAla: 5.33 ± 0.674
0.235ThrCys: 0.235 ± 0.157
3.37ThrAsp: 3.37 ± 0.527
2.665ThrGlu: 2.665 ± 0.451
2.351ThrPhe: 2.351 ± 0.407
4.938ThrGly: 4.938 ± 0.821
0.784ThrHis: 0.784 ± 0.235
2.586ThrIle: 2.586 ± 0.47
3.919ThrLys: 3.919 ± 0.53
4.624ThrLeu: 4.624 ± 0.6
1.489ThrMet: 1.489 ± 0.405
2.351ThrAsn: 2.351 ± 0.541
3.527ThrPro: 3.527 ± 0.483
1.724ThrGln: 1.724 ± 0.271
2.351ThrArg: 2.351 ± 0.401
3.135ThrSer: 3.135 ± 0.519
3.684ThrThr: 3.684 ± 0.491
4.624ThrVal: 4.624 ± 0.913
0.941ThrTrp: 0.941 ± 0.26
2.038ThrTyr: 2.038 ± 0.529
0.0ThrXaa: 0.0 ± 0.0
Val
4.859ValAla: 4.859 ± 0.679
0.549ValCys: 0.549 ± 0.217
3.527ValAsp: 3.527 ± 0.433
4.546ValGlu: 4.546 ± 0.505
1.724ValPhe: 1.724 ± 0.317
4.389ValGly: 4.389 ± 0.659
1.097ValHis: 1.097 ± 0.26
4.232ValIle: 4.232 ± 0.743
4.311ValLys: 4.311 ± 0.626
4.154ValLeu: 4.154 ± 0.523
3.057ValMet: 3.057 ± 0.548
3.057ValAsn: 3.057 ± 0.846
2.822ValPro: 2.822 ± 0.501
4.311ValGln: 4.311 ± 0.498
4.546ValArg: 4.546 ± 0.591
3.762ValSer: 3.762 ± 0.598
3.449ValThr: 3.449 ± 0.605
4.311ValVal: 4.311 ± 0.655
0.627ValTrp: 0.627 ± 0.204
2.116ValTyr: 2.116 ± 0.338
0.0ValXaa: 0.0 ± 0.0
Trp
1.254TrpAla: 1.254 ± 0.291
0.157TrpCys: 0.157 ± 0.1
1.411TrpAsp: 1.411 ± 0.36
1.254TrpGlu: 1.254 ± 0.387
1.176TrpPhe: 1.176 ± 0.197
1.176TrpGly: 1.176 ± 0.363
0.627TrpHis: 0.627 ± 0.19
0.314TrpIle: 0.314 ± 0.138
0.941TrpLys: 0.941 ± 0.265
2.116TrpLeu: 2.116 ± 0.474
0.235TrpMet: 0.235 ± 0.124
0.627TrpAsn: 0.627 ± 0.298
0.627TrpPro: 0.627 ± 0.22
0.549TrpGln: 0.549 ± 0.185
0.47TrpArg: 0.47 ± 0.171
0.941TrpSer: 0.941 ± 0.293
1.019TrpThr: 1.019 ± 0.25
1.332TrpVal: 1.332 ± 0.301
0.549TrpTrp: 0.549 ± 0.231
0.627TrpTyr: 0.627 ± 0.239
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.135TyrAla: 3.135 ± 0.387
0.705TyrCys: 0.705 ± 0.235
2.273TyrAsp: 2.273 ± 0.492
2.586TyrGlu: 2.586 ± 0.459
1.176TyrPhe: 1.176 ± 0.362
3.135TyrGly: 3.135 ± 0.443
0.392TyrHis: 0.392 ± 0.157
2.038TyrIle: 2.038 ± 0.422
2.195TyrLys: 2.195 ± 0.371
2.508TyrLeu: 2.508 ± 0.392
0.941TyrMet: 0.941 ± 0.23
1.803TyrAsn: 1.803 ± 0.452
1.411TyrPro: 1.411 ± 0.367
1.646TyrGln: 1.646 ± 0.351
2.195TyrArg: 2.195 ± 0.467
2.116TyrSer: 2.116 ± 0.409
1.332TyrThr: 1.332 ± 0.284
1.411TyrVal: 1.411 ± 0.358
0.549TyrTrp: 0.549 ± 0.237
0.784TyrTyr: 0.784 ± 0.362
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (12760 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski