Amino acid dipepetide frequency for Escherichia phage vB_EcoM-Ro157c2YLVW

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.745AlaAla: 9.745 ± 0.822
0.717AlaCys: 0.717 ± 0.162
5.695AlaAsp: 5.695 ± 0.565
6.455AlaGlu: 6.455 ± 0.636
4.177AlaPhe: 4.177 ± 0.455
6.412AlaGly: 6.412 ± 0.641
1.392AlaHis: 1.392 ± 0.289
5.991AlaIle: 5.991 ± 0.527
5.273AlaLys: 5.273 ± 0.502
8.986AlaLeu: 8.986 ± 0.78
2.658AlaMet: 2.658 ± 0.264
4.134AlaAsn: 4.134 ± 0.404
3.291AlaPro: 3.291 ± 0.373
4.092AlaGln: 4.092 ± 0.409
5.611AlaArg: 5.611 ± 0.624
5.316AlaSer: 5.316 ± 0.447
5.231AlaThr: 5.231 ± 0.48
5.78AlaVal: 5.78 ± 0.569
1.012AlaTrp: 1.012 ± 0.261
2.616AlaTyr: 2.616 ± 0.302
0.0AlaXaa: 0.0 ± 0.0
Cys
0.591CysAla: 0.591 ± 0.135
0.253CysCys: 0.253 ± 0.124
0.717CysAsp: 0.717 ± 0.17
0.548CysGlu: 0.548 ± 0.14
0.422CysPhe: 0.422 ± 0.137
0.802CysGly: 0.802 ± 0.175
0.295CysHis: 0.295 ± 0.111
0.548CysIle: 0.548 ± 0.165
0.506CysLys: 0.506 ± 0.154
0.844CysLeu: 0.844 ± 0.202
0.127CysMet: 0.127 ± 0.079
0.253CysAsn: 0.253 ± 0.11
0.422CysPro: 0.422 ± 0.122
0.295CysGln: 0.295 ± 0.121
0.802CysArg: 0.802 ± 0.185
0.548CysSer: 0.548 ± 0.145
0.633CysThr: 0.633 ± 0.175
0.422CysVal: 0.422 ± 0.132
0.211CysTrp: 0.211 ± 0.105
0.717CysTyr: 0.717 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
6.201AspAla: 6.201 ± 0.627
0.38AspCys: 0.38 ± 0.144
3.797AspAsp: 3.797 ± 0.591
4.134AspGlu: 4.134 ± 0.536
2.362AspPhe: 2.362 ± 0.266
5.062AspGly: 5.062 ± 0.406
0.717AspHis: 0.717 ± 0.149
3.417AspIle: 3.417 ± 0.409
3.037AspLys: 3.037 ± 0.346
4.683AspLeu: 4.683 ± 0.502
2.573AspMet: 2.573 ± 0.354
2.616AspAsn: 2.616 ± 0.396
2.362AspPro: 2.362 ± 0.281
1.73AspGln: 1.73 ± 0.298
2.658AspArg: 2.658 ± 0.346
4.092AspSer: 4.092 ± 0.449
3.586AspThr: 3.586 ± 0.414
4.514AspVal: 4.514 ± 0.393
0.928AspTrp: 0.928 ± 0.198
2.109AspTyr: 2.109 ± 0.318
0.0AspXaa: 0.0 ± 0.0
Glu
5.991GluAla: 5.991 ± 0.471
0.464GluCys: 0.464 ± 0.144
3.628GluAsp: 3.628 ± 0.421
4.514GluGlu: 4.514 ± 0.368
2.869GluPhe: 2.869 ± 0.384
4.008GluGly: 4.008 ± 0.473
1.434GluHis: 1.434 ± 0.232
4.472GluIle: 4.472 ± 0.484
4.134GluLys: 4.134 ± 0.551
7.256GluLeu: 7.256 ± 0.512
2.531GluMet: 2.531 ± 0.331
2.742GluAsn: 2.742 ± 0.328
2.32GluPro: 2.32 ± 0.317
3.375GluGln: 3.375 ± 0.41
3.544GluArg: 3.544 ± 0.417
3.586GluSer: 3.586 ± 0.383
2.953GluThr: 2.953 ± 0.351
3.923GluVal: 3.923 ± 0.332
1.181GluTrp: 1.181 ± 0.226
1.898GluTyr: 1.898 ± 0.303
0.0GluXaa: 0.0 ± 0.0
Phe
2.573PheAla: 2.573 ± 0.33
0.464PheCys: 0.464 ± 0.164
2.953PheAsp: 2.953 ± 0.301
2.194PheGlu: 2.194 ± 0.265
1.687PhePhe: 1.687 ± 0.284
2.278PheGly: 2.278 ± 0.347
0.506PheHis: 0.506 ± 0.125
2.784PheIle: 2.784 ± 0.415
1.856PheLys: 1.856 ± 0.274
2.995PheLeu: 2.995 ± 0.345
0.717PheMet: 0.717 ± 0.143
2.489PheAsn: 2.489 ± 0.256
1.434PhePro: 1.434 ± 0.267
1.519PheGln: 1.519 ± 0.245
1.983PheArg: 1.983 ± 0.295
3.08PheSer: 3.08 ± 0.403
2.109PheThr: 2.109 ± 0.259
2.447PheVal: 2.447 ± 0.314
0.422PheTrp: 0.422 ± 0.135
1.097PheTyr: 1.097 ± 0.297
0.0PheXaa: 0.0 ± 0.0
Gly
4.641GlyAla: 4.641 ± 0.591
0.675GlyCys: 0.675 ± 0.191
5.02GlyAsp: 5.02 ± 0.456
4.261GlyGlu: 4.261 ± 0.354
2.784GlyPhe: 2.784 ± 0.258
5.02GlyGly: 5.02 ± 0.609
1.35GlyHis: 1.35 ± 0.223
4.345GlyIle: 4.345 ± 0.416
4.767GlyLys: 4.767 ± 0.501
5.062GlyLeu: 5.062 ± 0.408
2.32GlyMet: 2.32 ± 0.251
3.628GlyAsn: 3.628 ± 0.444
1.266GlyPro: 1.266 ± 0.201
2.405GlyGln: 2.405 ± 0.315
3.797GlyArg: 3.797 ± 0.437
3.881GlySer: 3.881 ± 0.337
4.894GlyThr: 4.894 ± 0.575
5.189GlyVal: 5.189 ± 0.455
1.139GlyTrp: 1.139 ± 0.255
2.152GlyTyr: 2.152 ± 0.351
0.0GlyXaa: 0.0 ± 0.0
His
1.645HisAla: 1.645 ± 0.289
0.253HisCys: 0.253 ± 0.127
0.506HisAsp: 0.506 ± 0.143
0.886HisGlu: 0.886 ± 0.172
0.802HisPhe: 0.802 ± 0.185
1.35HisGly: 1.35 ± 0.194
0.38HisHis: 0.38 ± 0.127
0.886HisIle: 0.886 ± 0.198
0.802HisLys: 0.802 ± 0.196
1.308HisLeu: 1.308 ± 0.229
0.253HisMet: 0.253 ± 0.097
0.928HisAsn: 0.928 ± 0.198
0.886HisPro: 0.886 ± 0.166
0.886HisGln: 0.886 ± 0.218
1.181HisArg: 1.181 ± 0.242
1.35HisSer: 1.35 ± 0.236
1.181HisThr: 1.181 ± 0.218
1.223HisVal: 1.223 ± 0.197
0.127HisTrp: 0.127 ± 0.07
0.548HisTyr: 0.548 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
6.075IleAla: 6.075 ± 0.499
0.844IleCys: 0.844 ± 0.152
4.134IleAsp: 4.134 ± 0.452
4.345IleGlu: 4.345 ± 0.42
1.856IlePhe: 1.856 ± 0.245
4.345IleGly: 4.345 ± 0.575
0.886IleHis: 0.886 ± 0.185
2.531IleIle: 2.531 ± 0.33
3.459IleLys: 3.459 ± 0.368
4.134IleLeu: 4.134 ± 0.406
0.844IleMet: 0.844 ± 0.177
3.417IleAsn: 3.417 ± 0.413
2.911IlePro: 2.911 ± 0.398
2.025IleGln: 2.025 ± 0.313
3.291IleArg: 3.291 ± 0.297
5.273IleSer: 5.273 ± 0.466
4.43IleThr: 4.43 ± 0.472
2.531IleVal: 2.531 ± 0.288
0.295IleTrp: 0.295 ± 0.13
1.687IleTyr: 1.687 ± 0.287
0.0IleXaa: 0.0 ± 0.0
Lys
6.623LysAla: 6.623 ± 0.634
0.38LysCys: 0.38 ± 0.122
3.164LysAsp: 3.164 ± 0.302
4.092LysGlu: 4.092 ± 0.435
1.941LysPhe: 1.941 ± 0.263
3.881LysGly: 3.881 ± 0.461
1.35LysHis: 1.35 ± 0.267
3.122LysIle: 3.122 ± 0.412
4.219LysLys: 4.219 ± 0.583
4.978LysLeu: 4.978 ± 0.438
1.645LysMet: 1.645 ± 0.275
2.953LysAsn: 2.953 ± 0.335
2.236LysPro: 2.236 ± 0.241
2.32LysGln: 2.32 ± 0.326
2.911LysArg: 2.911 ± 0.356
3.923LysSer: 3.923 ± 0.41
3.797LysThr: 3.797 ± 0.349
3.375LysVal: 3.375 ± 0.447
0.759LysTrp: 0.759 ± 0.145
2.025LysTyr: 2.025 ± 0.326
0.0LysXaa: 0.0 ± 0.0
Leu
8.733LeuAla: 8.733 ± 0.67
1.012LeuCys: 1.012 ± 0.213
5.273LeuAsp: 5.273 ± 0.379
5.906LeuGlu: 5.906 ± 0.579
2.784LeuPhe: 2.784 ± 0.246
4.725LeuGly: 4.725 ± 0.524
1.603LeuHis: 1.603 ± 0.249
4.092LeuIle: 4.092 ± 0.477
6.244LeuLys: 6.244 ± 0.558
7.425LeuLeu: 7.425 ± 0.789
2.362LeuMet: 2.362 ± 0.312
4.219LeuAsn: 4.219 ± 0.36
4.345LeuPro: 4.345 ± 0.387
3.417LeuGln: 3.417 ± 0.389
4.894LeuArg: 4.894 ± 0.47
6.37LeuSer: 6.37 ± 0.426
5.358LeuThr: 5.358 ± 0.571
5.062LeuVal: 5.062 ± 0.414
0.717LeuTrp: 0.717 ± 0.207
2.025LeuTyr: 2.025 ± 0.3
0.0LeuXaa: 0.0 ± 0.0
Met
2.447MetAla: 2.447 ± 0.332
0.295MetCys: 0.295 ± 0.13
1.519MetAsp: 1.519 ± 0.316
1.266MetGlu: 1.266 ± 0.207
0.886MetPhe: 0.886 ± 0.157
1.477MetGly: 1.477 ± 0.22
0.169MetHis: 0.169 ± 0.077
1.73MetIle: 1.73 ± 0.27
1.856MetLys: 1.856 ± 0.207
2.32MetLeu: 2.32 ± 0.25
0.591MetMet: 0.591 ± 0.153
1.266MetAsn: 1.266 ± 0.206
1.012MetPro: 1.012 ± 0.167
1.434MetGln: 1.434 ± 0.204
1.645MetArg: 1.645 ± 0.239
2.025MetSer: 2.025 ± 0.28
1.73MetThr: 1.73 ± 0.259
1.223MetVal: 1.223 ± 0.188
0.337MetTrp: 0.337 ± 0.129
0.337MetTyr: 0.337 ± 0.101
0.0MetXaa: 0.0 ± 0.0
Asn
5.02AsnAla: 5.02 ± 0.486
0.506AsnCys: 0.506 ± 0.181
2.278AsnAsp: 2.278 ± 0.353
3.333AsnGlu: 3.333 ± 0.398
0.886AsnPhe: 0.886 ± 0.19
4.514AsnGly: 4.514 ± 0.378
0.802AsnHis: 0.802 ± 0.16
3.037AsnIle: 3.037 ± 0.314
3.122AsnLys: 3.122 ± 0.368
3.586AsnLeu: 3.586 ± 0.317
0.717AsnMet: 0.717 ± 0.157
2.025AsnAsn: 2.025 ± 0.272
1.856AsnPro: 1.856 ± 0.306
2.025AsnGln: 2.025 ± 0.299
2.194AsnArg: 2.194 ± 0.334
3.206AsnSer: 3.206 ± 0.328
2.869AsnThr: 2.869 ± 0.343
3.291AsnVal: 3.291 ± 0.368
0.38AsnTrp: 0.38 ± 0.118
1.477AsnTyr: 1.477 ± 0.263
0.0AsnXaa: 0.0 ± 0.0
Pro
3.755ProAla: 3.755 ± 0.319
0.464ProCys: 0.464 ± 0.171
2.995ProAsp: 2.995 ± 0.351
4.05ProGlu: 4.05 ± 0.359
1.856ProPhe: 1.856 ± 0.317
2.995ProGly: 2.995 ± 0.353
0.633ProHis: 0.633 ± 0.155
1.983ProIle: 1.983 ± 0.266
1.772ProLys: 1.772 ± 0.282
2.827ProLeu: 2.827 ± 0.297
0.464ProMet: 0.464 ± 0.13
1.561ProAsn: 1.561 ± 0.272
1.603ProPro: 1.603 ± 0.266
1.266ProGln: 1.266 ± 0.27
1.35ProArg: 1.35 ± 0.266
2.573ProSer: 2.573 ± 0.311
2.405ProThr: 2.405 ± 0.29
3.206ProVal: 3.206 ± 0.313
0.506ProTrp: 0.506 ± 0.174
1.603ProTyr: 1.603 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
3.291GlnAla: 3.291 ± 0.429
0.337GlnCys: 0.337 ± 0.098
1.814GlnAsp: 1.814 ± 0.261
3.164GlnGlu: 3.164 ± 0.367
1.308GlnPhe: 1.308 ± 0.282
1.898GlnGly: 1.898 ± 0.229
0.759GlnHis: 0.759 ± 0.176
2.405GlnIle: 2.405 ± 0.296
2.658GlnLys: 2.658 ± 0.376
3.966GlnLeu: 3.966 ± 0.447
1.392GlnMet: 1.392 ± 0.239
2.025GlnAsn: 2.025 ± 0.313
1.181GlnPro: 1.181 ± 0.19
2.236GlnGln: 2.236 ± 0.341
2.489GlnArg: 2.489 ± 0.328
2.152GlnSer: 2.152 ± 0.31
1.941GlnThr: 1.941 ± 0.293
2.405GlnVal: 2.405 ± 0.325
0.633GlnTrp: 0.633 ± 0.201
1.519GlnTyr: 1.519 ± 0.241
0.0GlnXaa: 0.0 ± 0.0
Arg
5.189ArgAla: 5.189 ± 0.401
0.464ArgCys: 0.464 ± 0.135
2.784ArgAsp: 2.784 ± 0.27
3.502ArgGlu: 3.502 ± 0.366
2.194ArgPhe: 2.194 ± 0.266
2.995ArgGly: 2.995 ± 0.352
1.012ArgHis: 1.012 ± 0.192
3.67ArgIle: 3.67 ± 0.363
4.303ArgLys: 4.303 ± 0.529
5.695ArgLeu: 5.695 ± 0.501
1.561ArgMet: 1.561 ± 0.212
2.531ArgAsn: 2.531 ± 0.322
2.067ArgPro: 2.067 ± 0.308
1.941ArgGln: 1.941 ± 0.273
3.839ArgArg: 3.839 ± 0.501
2.362ArgSer: 2.362 ± 0.325
2.911ArgThr: 2.911 ± 0.309
3.291ArgVal: 3.291 ± 0.393
0.844ArgTrp: 0.844 ± 0.252
1.73ArgTyr: 1.73 ± 0.241
0.0ArgXaa: 0.0 ± 0.0
Ser
6.033SerAla: 6.033 ± 0.611
0.675SerCys: 0.675 ± 0.142
4.683SerAsp: 4.683 ± 0.521
3.839SerGlu: 3.839 ± 0.375
2.447SerPhe: 2.447 ± 0.372
5.822SerGly: 5.822 ± 0.482
0.97SerHis: 0.97 ± 0.185
4.345SerIle: 4.345 ± 0.345
3.164SerLys: 3.164 ± 0.342
5.991SerLeu: 5.991 ± 0.527
1.561SerMet: 1.561 ± 0.276
2.573SerAsn: 2.573 ± 0.31
2.236SerPro: 2.236 ± 0.283
2.784SerGln: 2.784 ± 0.308
3.375SerArg: 3.375 ± 0.34
4.219SerSer: 4.219 ± 0.468
3.333SerThr: 3.333 ± 0.383
4.641SerVal: 4.641 ± 0.391
0.675SerTrp: 0.675 ± 0.172
2.152SerTyr: 2.152 ± 0.312
0.0SerXaa: 0.0 ± 0.0
Thr
6.201ThrAla: 6.201 ± 0.788
0.422ThrCys: 0.422 ± 0.128
4.008ThrAsp: 4.008 ± 0.475
3.628ThrGlu: 3.628 ± 0.368
2.489ThrPhe: 2.489 ± 0.364
4.809ThrGly: 4.809 ± 0.452
1.055ThrHis: 1.055 ± 0.157
3.797ThrIle: 3.797 ± 0.475
2.7ThrLys: 2.7 ± 0.361
5.442ThrLeu: 5.442 ± 0.455
1.055ThrMet: 1.055 ± 0.192
2.362ThrAsn: 2.362 ± 0.378
3.712ThrPro: 3.712 ± 0.337
1.983ThrGln: 1.983 ± 0.333
2.658ThrArg: 2.658 ± 0.319
4.05ThrSer: 4.05 ± 0.687
3.839ThrThr: 3.839 ± 0.567
3.712ThrVal: 3.712 ± 0.422
0.337ThrTrp: 0.337 ± 0.154
1.856ThrTyr: 1.856 ± 0.316
0.0ThrXaa: 0.0 ± 0.0
Val
5.653ValAla: 5.653 ± 0.547
0.717ValCys: 0.717 ± 0.217
4.05ValAsp: 4.05 ± 0.423
3.923ValGlu: 3.923 ± 0.377
2.067ValPhe: 2.067 ± 0.246
3.628ValGly: 3.628 ± 0.379
0.844ValHis: 0.844 ± 0.162
3.417ValIle: 3.417 ± 0.356
3.923ValLys: 3.923 ± 0.422
5.316ValLeu: 5.316 ± 0.407
1.223ValMet: 1.223 ± 0.218
3.459ValAsn: 3.459 ± 0.32
2.911ValPro: 2.911 ± 0.363
2.152ValGln: 2.152 ± 0.307
3.586ValArg: 3.586 ± 0.356
4.978ValSer: 4.978 ± 0.435
4.767ValThr: 4.767 ± 0.441
3.881ValVal: 3.881 ± 0.435
1.012ValTrp: 1.012 ± 0.225
1.814ValTyr: 1.814 ± 0.303
0.0ValXaa: 0.0 ± 0.0
Trp
1.35TrpAla: 1.35 ± 0.237
0.253TrpCys: 0.253 ± 0.092
0.506TrpAsp: 0.506 ± 0.107
0.717TrpGlu: 0.717 ± 0.157
0.675TrpPhe: 0.675 ± 0.18
0.633TrpGly: 0.633 ± 0.163
0.38TrpHis: 0.38 ± 0.14
0.886TrpIle: 0.886 ± 0.206
0.506TrpLys: 0.506 ± 0.142
1.35TrpLeu: 1.35 ± 0.353
0.169TrpMet: 0.169 ± 0.081
0.591TrpAsn: 0.591 ± 0.199
0.422TrpPro: 0.422 ± 0.122
0.38TrpGln: 0.38 ± 0.133
0.844TrpArg: 0.844 ± 0.233
0.759TrpSer: 0.759 ± 0.215
0.464TrpThr: 0.464 ± 0.117
0.97TrpVal: 0.97 ± 0.19
0.253TrpTrp: 0.253 ± 0.095
0.169TrpTyr: 0.169 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.827TyrAla: 2.827 ± 0.291
0.38TyrCys: 0.38 ± 0.135
1.434TyrAsp: 1.434 ± 0.238
2.067TyrGlu: 2.067 ± 0.283
1.097TyrPhe: 1.097 ± 0.21
1.941TyrGly: 1.941 ± 0.309
0.844TyrHis: 0.844 ± 0.152
1.856TyrIle: 1.856 ± 0.276
1.392TyrLys: 1.392 ± 0.226
2.405TyrLeu: 2.405 ± 0.299
0.759TyrMet: 0.759 ± 0.17
1.35TyrAsn: 1.35 ± 0.19
1.392TyrPro: 1.392 ± 0.269
1.308TyrGln: 1.308 ± 0.212
2.236TyrArg: 2.236 ± 0.397
1.772TyrSer: 1.772 ± 0.247
1.687TyrThr: 1.687 ± 0.304
2.236TyrVal: 2.236 ± 0.323
0.506TyrTrp: 0.506 ± 0.134
0.802TyrTyr: 0.802 ± 0.191
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (23705 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski