Amino acid dipepetide frequency for Yersinia phage vB_YenP_AP10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.257AlaAla: 10.257 ± 1.169
0.763AlaCys: 0.763 ± 0.253
6.358AlaAsp: 6.358 ± 0.749
5.001AlaGlu: 5.001 ± 0.481
3.645AlaPhe: 3.645 ± 0.644
7.46AlaGly: 7.46 ± 1.132
1.102AlaHis: 1.102 ± 0.238
4.747AlaIle: 4.747 ± 0.713
6.442AlaLys: 6.442 ± 0.724
7.799AlaLeu: 7.799 ± 0.988
3.306AlaMet: 3.306 ± 0.555
4.408AlaAsn: 4.408 ± 0.534
3.306AlaPro: 3.306 ± 0.548
4.238AlaGln: 4.238 ± 0.808
5.934AlaArg: 5.934 ± 0.662
5.425AlaSer: 5.425 ± 0.63
4.069AlaThr: 4.069 ± 0.58
5.934AlaVal: 5.934 ± 0.698
1.102AlaTrp: 1.102 ± 0.405
2.797AlaTyr: 2.797 ± 0.394
0.0AlaXaa: 0.0 ± 0.0
Cys
0.593CysAla: 0.593 ± 0.229
0.17CysCys: 0.17 ± 0.108
0.678CysAsp: 0.678 ± 0.301
0.678CysGlu: 0.678 ± 0.254
0.509CysPhe: 0.509 ± 0.235
0.593CysGly: 0.593 ± 0.292
0.339CysHis: 0.339 ± 0.177
0.339CysIle: 0.339 ± 0.172
0.339CysLys: 0.339 ± 0.155
0.763CysLeu: 0.763 ± 0.227
0.17CysMet: 0.17 ± 0.165
0.0CysAsn: 0.0 ± 0.0
0.339CysPro: 0.339 ± 0.163
0.424CysGln: 0.424 ± 0.201
0.593CysArg: 0.593 ± 0.361
0.339CysSer: 0.339 ± 0.181
0.509CysThr: 0.509 ± 0.209
0.763CysVal: 0.763 ± 0.304
0.085CysTrp: 0.085 ± 0.075
0.339CysTyr: 0.339 ± 0.207
0.0CysXaa: 0.0 ± 0.0
Asp
5.086AspAla: 5.086 ± 0.87
0.593AspCys: 0.593 ± 0.237
4.408AspAsp: 4.408 ± 0.722
3.56AspGlu: 3.56 ± 0.489
2.797AspPhe: 2.797 ± 0.426
6.358AspGly: 6.358 ± 0.898
1.272AspHis: 1.272 ± 0.306
3.73AspIle: 3.73 ± 0.386
4.154AspLys: 4.154 ± 0.783
4.493AspLeu: 4.493 ± 0.709
2.543AspMet: 2.543 ± 0.426
2.373AspAsn: 2.373 ± 0.325
2.373AspPro: 2.373 ± 0.582
2.119AspGln: 2.119 ± 0.382
3.136AspArg: 3.136 ± 0.496
4.069AspSer: 4.069 ± 0.547
3.815AspThr: 3.815 ± 0.55
4.238AspVal: 4.238 ± 0.475
0.678AspTrp: 0.678 ± 0.242
2.628AspTyr: 2.628 ± 0.46
0.0AspXaa: 0.0 ± 0.0
Glu
7.799GluAla: 7.799 ± 0.768
0.593GluCys: 0.593 ± 0.22
4.154GluAsp: 4.154 ± 0.619
5.934GluGlu: 5.934 ± 1.178
2.713GluPhe: 2.713 ± 0.355
4.662GluGly: 4.662 ± 0.761
2.034GluHis: 2.034 ± 0.604
2.458GluIle: 2.458 ± 0.33
3.645GluLys: 3.645 ± 0.545
6.866GluLeu: 6.866 ± 0.631
2.373GluMet: 2.373 ± 0.624
2.882GluAsn: 2.882 ± 0.303
2.034GluPro: 2.034 ± 0.49
3.052GluGln: 3.052 ± 0.509
3.984GluArg: 3.984 ± 0.674
5.001GluSer: 5.001 ± 0.8
2.458GluThr: 2.458 ± 0.456
3.645GluVal: 3.645 ± 0.551
0.848GluTrp: 0.848 ± 0.217
3.391GluTyr: 3.391 ± 0.509
0.0GluXaa: 0.0 ± 0.0
Phe
2.797PheAla: 2.797 ± 0.447
0.424PheCys: 0.424 ± 0.216
2.543PheAsp: 2.543 ± 0.452
1.78PheGlu: 1.78 ± 0.354
1.102PhePhe: 1.102 ± 0.215
2.967PheGly: 2.967 ± 0.501
0.678PheHis: 0.678 ± 0.309
1.611PheIle: 1.611 ± 0.493
2.628PheLys: 2.628 ± 0.487
3.645PheLeu: 3.645 ± 0.607
1.187PheMet: 1.187 ± 0.25
2.797PheAsn: 2.797 ± 0.445
1.611PhePro: 1.611 ± 0.439
1.017PheGln: 1.017 ± 0.31
2.204PheArg: 2.204 ± 0.429
1.95PheSer: 1.95 ± 0.339
2.967PheThr: 2.967 ± 0.459
2.373PheVal: 2.373 ± 0.472
0.339PheTrp: 0.339 ± 0.126
1.102PheTyr: 1.102 ± 0.28
0.0PheXaa: 0.0 ± 0.0
Gly
6.273GlyAla: 6.273 ± 0.902
0.678GlyCys: 0.678 ± 0.299
5.425GlyAsp: 5.425 ± 0.74
5.001GlyGlu: 5.001 ± 0.65
3.815GlyPhe: 3.815 ± 0.498
6.527GlyGly: 6.527 ± 1.038
1.102GlyHis: 1.102 ± 0.322
4.917GlyIle: 4.917 ± 0.698
5.595GlyLys: 5.595 ± 0.647
6.612GlyLeu: 6.612 ± 0.889
1.611GlyMet: 1.611 ± 0.351
3.306GlyAsn: 3.306 ± 0.884
1.272GlyPro: 1.272 ± 0.324
2.967GlyGln: 2.967 ± 0.48
4.238GlyArg: 4.238 ± 0.624
4.832GlySer: 4.832 ± 0.636
3.645GlyThr: 3.645 ± 0.715
5.595GlyVal: 5.595 ± 0.798
1.95GlyTrp: 1.95 ± 0.398
2.797GlyTyr: 2.797 ± 0.541
0.0GlyXaa: 0.0 ± 0.0
His
1.526HisAla: 1.526 ± 0.344
0.254HisCys: 0.254 ± 0.135
1.187HisAsp: 1.187 ± 0.388
1.441HisGlu: 1.441 ± 0.355
0.848HisPhe: 0.848 ± 0.25
1.356HisGly: 1.356 ± 0.318
0.424HisHis: 0.424 ± 0.213
1.441HisIle: 1.441 ± 0.308
1.695HisLys: 1.695 ± 0.388
2.034HisLeu: 2.034 ± 0.359
0.509HisMet: 0.509 ± 0.197
0.593HisAsn: 0.593 ± 0.206
0.254HisPro: 0.254 ± 0.144
0.678HisGln: 0.678 ± 0.291
0.424HisArg: 0.424 ± 0.198
1.102HisSer: 1.102 ± 0.324
0.509HisThr: 0.509 ± 0.186
1.356HisVal: 1.356 ± 0.319
0.254HisTrp: 0.254 ± 0.147
0.848HisTyr: 0.848 ± 0.265
0.0HisXaa: 0.0 ± 0.0
Ile
4.408IleAla: 4.408 ± 0.564
0.424IleCys: 0.424 ± 0.157
3.899IleAsp: 3.899 ± 0.571
3.136IleGlu: 3.136 ± 0.51
1.356IlePhe: 1.356 ± 0.365
3.73IleGly: 3.73 ± 0.65
1.526IleHis: 1.526 ± 0.344
2.713IleIle: 2.713 ± 0.563
3.73IleLys: 3.73 ± 0.428
4.238IleLeu: 4.238 ± 0.603
1.272IleMet: 1.272 ± 0.429
1.78IleAsn: 1.78 ± 0.531
3.306IlePro: 3.306 ± 0.432
1.272IleGln: 1.272 ± 0.377
3.306IleArg: 3.306 ± 0.519
2.543IleSer: 2.543 ± 0.37
2.628IleThr: 2.628 ± 0.479
2.713IleVal: 2.713 ± 0.505
0.509IleTrp: 0.509 ± 0.198
1.441IleTyr: 1.441 ± 0.276
0.0IleXaa: 0.0 ± 0.0
Lys
7.799LysAla: 7.799 ± 0.775
0.593LysCys: 0.593 ± 0.226
3.475LysAsp: 3.475 ± 0.626
4.662LysGlu: 4.662 ± 0.823
2.628LysPhe: 2.628 ± 0.427
5.679LysGly: 5.679 ± 0.853
1.187LysHis: 1.187 ± 0.274
1.865LysIle: 1.865 ± 0.304
4.408LysLys: 4.408 ± 0.865
5.425LysLeu: 5.425 ± 0.943
1.695LysMet: 1.695 ± 0.416
2.204LysAsn: 2.204 ± 0.354
3.052LysPro: 3.052 ± 0.591
2.543LysGln: 2.543 ± 0.644
3.73LysArg: 3.73 ± 0.567
3.645LysSer: 3.645 ± 0.594
2.967LysThr: 2.967 ± 0.494
5.679LysVal: 5.679 ± 0.761
0.678LysTrp: 0.678 ± 0.239
1.865LysTyr: 1.865 ± 0.333
0.0LysXaa: 0.0 ± 0.0
Leu
8.222LeuAla: 8.222 ± 1.131
0.424LeuCys: 0.424 ± 0.193
4.069LeuAsp: 4.069 ± 0.445
7.12LeuGlu: 7.12 ± 1.198
2.713LeuPhe: 2.713 ± 0.481
5.001LeuGly: 5.001 ± 0.79
1.187LeuHis: 1.187 ± 0.272
3.645LeuIle: 3.645 ± 0.435
5.934LeuLys: 5.934 ± 0.669
5.51LeuLeu: 5.51 ± 0.99
2.628LeuMet: 2.628 ± 0.551
4.408LeuAsn: 4.408 ± 0.569
3.221LeuPro: 3.221 ± 0.625
3.221LeuGln: 3.221 ± 0.513
4.832LeuArg: 4.832 ± 0.607
4.238LeuSer: 4.238 ± 0.572
5.171LeuThr: 5.171 ± 0.628
4.917LeuVal: 4.917 ± 0.54
1.526LeuTrp: 1.526 ± 0.38
2.289LeuTyr: 2.289 ± 0.44
0.0LeuXaa: 0.0 ± 0.0
Met
3.221MetAla: 3.221 ± 0.488
0.339MetCys: 0.339 ± 0.2
2.373MetAsp: 2.373 ± 0.565
1.95MetGlu: 1.95 ± 0.419
1.272MetPhe: 1.272 ± 0.322
1.865MetGly: 1.865 ± 0.345
0.848MetHis: 0.848 ± 0.353
1.272MetIle: 1.272 ± 0.285
0.932MetLys: 0.932 ± 0.285
3.221MetLeu: 3.221 ± 0.408
0.678MetMet: 0.678 ± 0.219
1.187MetAsn: 1.187 ± 0.274
1.102MetPro: 1.102 ± 0.244
1.526MetGln: 1.526 ± 0.405
1.017MetArg: 1.017 ± 0.244
1.95MetSer: 1.95 ± 0.467
1.865MetThr: 1.865 ± 0.333
1.526MetVal: 1.526 ± 0.335
0.17MetTrp: 0.17 ± 0.115
0.593MetTyr: 0.593 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
3.815AsnAla: 3.815 ± 0.494
0.17AsnCys: 0.17 ± 0.144
2.289AsnAsp: 2.289 ± 0.444
2.797AsnGlu: 2.797 ± 0.542
1.356AsnPhe: 1.356 ± 0.317
4.577AsnGly: 4.577 ± 0.641
0.763AsnHis: 0.763 ± 0.332
3.136AsnIle: 3.136 ± 0.857
2.628AsnLys: 2.628 ± 0.463
2.543AsnLeu: 2.543 ± 0.673
1.356AsnMet: 1.356 ± 0.327
1.695AsnAsn: 1.695 ± 0.427
2.628AsnPro: 2.628 ± 0.414
2.119AsnGln: 2.119 ± 0.518
2.034AsnArg: 2.034 ± 0.587
2.967AsnSer: 2.967 ± 0.847
2.373AsnThr: 2.373 ± 0.55
2.628AsnVal: 2.628 ± 0.421
0.593AsnTrp: 0.593 ± 0.139
1.441AsnTyr: 1.441 ± 0.269
0.0AsnXaa: 0.0 ± 0.0
Pro
3.645ProAla: 3.645 ± 0.618
0.254ProCys: 0.254 ± 0.148
3.052ProAsp: 3.052 ± 0.384
3.306ProGlu: 3.306 ± 0.465
1.356ProPhe: 1.356 ± 0.343
2.204ProGly: 2.204 ± 0.357
0.509ProHis: 0.509 ± 0.186
1.356ProIle: 1.356 ± 0.334
2.628ProLys: 2.628 ± 0.525
2.204ProLeu: 2.204 ± 0.372
1.187ProMet: 1.187 ± 0.247
2.119ProAsn: 2.119 ± 0.414
1.102ProPro: 1.102 ± 0.399
1.187ProGln: 1.187 ± 0.317
1.611ProArg: 1.611 ± 0.36
2.119ProSer: 2.119 ± 0.379
1.695ProThr: 1.695 ± 0.436
2.119ProVal: 2.119 ± 0.391
1.102ProTrp: 1.102 ± 0.277
1.695ProTyr: 1.695 ± 0.519
0.0ProXaa: 0.0 ± 0.0
Gln
4.747GlnAla: 4.747 ± 0.797
0.254GlnCys: 0.254 ± 0.188
2.373GlnAsp: 2.373 ± 0.384
3.306GlnGlu: 3.306 ± 0.708
2.204GlnPhe: 2.204 ± 0.415
3.391GlnGly: 3.391 ± 0.519
0.339GlnHis: 0.339 ± 0.161
1.526GlnIle: 1.526 ± 0.369
2.119GlnLys: 2.119 ± 0.55
3.899GlnLeu: 3.899 ± 0.559
0.763GlnMet: 0.763 ± 0.252
1.102GlnAsn: 1.102 ± 0.26
1.356GlnPro: 1.356 ± 0.268
2.543GlnGln: 2.543 ± 0.531
2.797GlnArg: 2.797 ± 0.64
2.458GlnSer: 2.458 ± 0.66
1.102GlnThr: 1.102 ± 0.284
2.628GlnVal: 2.628 ± 0.463
0.848GlnTrp: 0.848 ± 0.259
1.695GlnTyr: 1.695 ± 0.616
0.0GlnXaa: 0.0 ± 0.0
Arg
4.917ArgAla: 4.917 ± 0.603
0.678ArgCys: 0.678 ± 0.283
3.815ArgAsp: 3.815 ± 0.538
5.001ArgGlu: 5.001 ± 0.796
1.611ArgPhe: 1.611 ± 0.406
4.069ArgGly: 4.069 ± 0.495
1.187ArgHis: 1.187 ± 0.335
3.221ArgIle: 3.221 ± 0.447
3.815ArgLys: 3.815 ± 0.644
5.51ArgLeu: 5.51 ± 0.623
1.356ArgMet: 1.356 ± 0.275
2.289ArgAsn: 2.289 ± 0.517
1.78ArgPro: 1.78 ± 0.42
2.204ArgGln: 2.204 ± 0.342
2.458ArgArg: 2.458 ± 0.345
3.645ArgSer: 3.645 ± 0.493
2.119ArgThr: 2.119 ± 0.424
2.967ArgVal: 2.967 ± 0.541
0.763ArgTrp: 0.763 ± 0.254
1.695ArgTyr: 1.695 ± 0.301
0.0ArgXaa: 0.0 ± 0.0
Ser
5.001SerAla: 5.001 ± 0.601
0.593SerCys: 0.593 ± 0.24
4.662SerAsp: 4.662 ± 0.622
3.899SerGlu: 3.899 ± 0.492
2.373SerPhe: 2.373 ± 0.478
4.577SerGly: 4.577 ± 0.429
1.611SerHis: 1.611 ± 0.405
3.391SerIle: 3.391 ± 0.502
3.899SerLys: 3.899 ± 0.59
3.56SerLeu: 3.56 ± 0.473
1.441SerMet: 1.441 ± 0.338
2.373SerAsn: 2.373 ± 0.474
1.441SerPro: 1.441 ± 0.241
2.458SerGln: 2.458 ± 0.339
3.475SerArg: 3.475 ± 0.576
3.052SerSer: 3.052 ± 0.541
3.306SerThr: 3.306 ± 0.491
3.899SerVal: 3.899 ± 0.456
0.932SerTrp: 0.932 ± 0.259
2.797SerTyr: 2.797 ± 0.546
0.0SerXaa: 0.0 ± 0.0
Thr
4.662ThrAla: 4.662 ± 0.581
0.424ThrCys: 0.424 ± 0.209
3.645ThrAsp: 3.645 ± 0.531
3.645ThrGlu: 3.645 ± 0.391
1.441ThrPhe: 1.441 ± 0.298
5.171ThrGly: 5.171 ± 0.757
0.848ThrHis: 0.848 ± 0.202
3.052ThrIle: 3.052 ± 0.454
3.73ThrLys: 3.73 ± 0.462
4.832ThrLeu: 4.832 ± 0.646
1.526ThrMet: 1.526 ± 0.31
1.441ThrAsn: 1.441 ± 0.358
2.458ThrPro: 2.458 ± 0.41
2.458ThrGln: 2.458 ± 0.412
1.611ThrArg: 1.611 ± 0.313
2.967ThrSer: 2.967 ± 0.523
2.882ThrThr: 2.882 ± 0.603
3.56ThrVal: 3.56 ± 0.65
0.678ThrTrp: 0.678 ± 0.262
1.356ThrTyr: 1.356 ± 0.252
0.0ThrXaa: 0.0 ± 0.0
Val
5.425ValAla: 5.425 ± 0.582
0.339ValCys: 0.339 ± 0.158
2.713ValAsp: 2.713 ± 0.527
4.323ValGlu: 4.323 ± 0.855
2.543ValPhe: 2.543 ± 0.517
4.493ValGly: 4.493 ± 0.565
1.102ValHis: 1.102 ± 0.29
3.475ValIle: 3.475 ± 0.704
4.662ValLys: 4.662 ± 0.614
3.984ValLeu: 3.984 ± 0.441
1.526ValMet: 1.526 ± 0.336
3.815ValAsn: 3.815 ± 0.773
2.204ValPro: 2.204 ± 0.416
3.052ValGln: 3.052 ± 0.49
4.493ValArg: 4.493 ± 0.689
4.238ValSer: 4.238 ± 0.498
5.171ValThr: 5.171 ± 0.768
5.086ValVal: 5.086 ± 0.647
1.187ValTrp: 1.187 ± 0.399
2.034ValTyr: 2.034 ± 0.462
0.0ValXaa: 0.0 ± 0.0
Trp
0.848TrpAla: 0.848 ± 0.222
0.339TrpCys: 0.339 ± 0.192
0.763TrpAsp: 0.763 ± 0.255
1.187TrpGlu: 1.187 ± 0.258
0.678TrpPhe: 0.678 ± 0.289
0.678TrpGly: 0.678 ± 0.236
0.17TrpHis: 0.17 ± 0.121
0.678TrpIle: 0.678 ± 0.318
1.017TrpLys: 1.017 ± 0.271
1.102TrpLeu: 1.102 ± 0.3
0.339TrpMet: 0.339 ± 0.154
1.017TrpAsn: 1.017 ± 0.255
0.339TrpPro: 0.339 ± 0.123
0.763TrpGln: 0.763 ± 0.269
0.763TrpArg: 0.763 ± 0.206
1.017TrpSer: 1.017 ± 0.385
1.272TrpThr: 1.272 ± 0.294
1.695TrpVal: 1.695 ± 0.505
0.339TrpTrp: 0.339 ± 0.136
0.17TrpTyr: 0.17 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.052TyrAla: 3.052 ± 0.559
0.254TyrCys: 0.254 ± 0.146
2.543TyrAsp: 2.543 ± 0.501
2.797TyrGlu: 2.797 ± 0.443
0.763TyrPhe: 0.763 ± 0.213
2.797TyrGly: 2.797 ± 0.362
0.509TyrHis: 0.509 ± 0.234
1.356TyrIle: 1.356 ± 0.314
1.78TyrLys: 1.78 ± 0.302
2.034TyrLeu: 2.034 ± 0.349
1.356TyrMet: 1.356 ± 0.293
2.119TyrAsn: 2.119 ± 0.4
1.272TyrPro: 1.272 ± 0.358
1.611TyrGln: 1.611 ± 0.42
2.543TyrArg: 2.543 ± 0.604
1.187TyrSer: 1.187 ± 0.306
1.865TyrThr: 1.865 ± 0.457
2.543TyrVal: 2.543 ± 0.528
0.593TyrTrp: 0.593 ± 0.224
0.848TyrTyr: 0.848 ± 0.328
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (11798 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski