Amino acid dipepetide frequency for Pantoea phage vB_PagS_Vid5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.183AlaAla: 11.183 ± 1.445
1.322AlaCys: 1.322 ± 0.26
4.626AlaAsp: 4.626 ± 0.699
4.423AlaGlu: 4.423 ± 0.474
3.304AlaPhe: 3.304 ± 0.468
6.608AlaGly: 6.608 ± 0.577
1.068AlaHis: 1.068 ± 0.2
5.338AlaIle: 5.338 ± 0.494
4.982AlaLys: 4.982 ± 0.739
7.879AlaLeu: 7.879 ± 0.746
2.643AlaMet: 2.643 ± 0.468
4.778AlaAsn: 4.778 ± 0.528
4.372AlaPro: 4.372 ± 0.928
4.473AlaGln: 4.473 ± 0.753
4.168AlaArg: 4.168 ± 0.421
5.388AlaSer: 5.388 ± 0.775
5.846AlaThr: 5.846 ± 0.779
6.608AlaVal: 6.608 ± 0.518
1.322AlaTrp: 1.322 ± 0.297
3.304AlaTyr: 3.304 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
1.068CysAla: 1.068 ± 0.235
0.153CysCys: 0.153 ± 0.066
1.271CysAsp: 1.271 ± 0.308
0.712CysGlu: 0.712 ± 0.205
0.559CysPhe: 0.559 ± 0.268
1.423CysGly: 1.423 ± 0.304
0.458CysHis: 0.458 ± 0.138
0.763CysIle: 0.763 ± 0.301
0.661CysLys: 0.661 ± 0.178
0.813CysLeu: 0.813 ± 0.239
0.254CysMet: 0.254 ± 0.12
1.118CysAsn: 1.118 ± 0.275
0.61CysPro: 0.61 ± 0.17
0.407CysGln: 0.407 ± 0.125
0.661CysArg: 0.661 ± 0.187
0.813CysSer: 0.813 ± 0.232
1.068CysThr: 1.068 ± 0.263
0.864CysVal: 0.864 ± 0.233
0.153CysTrp: 0.153 ± 0.076
0.712CysTyr: 0.712 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
5.846AspAla: 5.846 ± 0.521
0.864AspCys: 0.864 ± 0.21
2.237AspAsp: 2.237 ± 0.353
3.406AspGlu: 3.406 ± 0.381
2.44AspPhe: 2.44 ± 0.335
4.88AspGly: 4.88 ± 0.429
0.813AspHis: 0.813 ± 0.188
2.999AspIle: 2.999 ± 0.405
3.711AspLys: 3.711 ± 0.394
4.168AspLeu: 4.168 ± 0.369
1.728AspMet: 1.728 ± 0.3
1.881AspAsn: 1.881 ± 0.266
2.643AspPro: 2.643 ± 0.306
1.474AspGln: 1.474 ± 0.279
2.44AspArg: 2.44 ± 0.308
2.491AspSer: 2.491 ± 0.306
2.898AspThr: 2.898 ± 0.388
3.457AspVal: 3.457 ± 0.353
1.118AspTrp: 1.118 ± 0.188
2.288AspTyr: 2.288 ± 0.285
0.0AspXaa: 0.0 ± 0.0
Glu
5.388GluAla: 5.388 ± 0.533
0.813GluCys: 0.813 ± 0.212
2.948GluAsp: 2.948 ± 0.443
3.711GluGlu: 3.711 ± 0.457
2.593GluPhe: 2.593 ± 0.401
3.558GluGly: 3.558 ± 0.474
1.169GluHis: 1.169 ± 0.25
2.948GluIle: 2.948 ± 0.402
2.796GluLys: 2.796 ± 0.335
5.948GluLeu: 5.948 ± 0.508
2.186GluMet: 2.186 ± 0.308
3.05GluAsn: 3.05 ± 0.337
2.237GluPro: 2.237 ± 0.388
2.796GluGln: 2.796 ± 0.386
3.05GluArg: 3.05 ± 0.442
3.101GluSer: 3.101 ± 0.486
2.796GluThr: 2.796 ± 0.416
3.813GluVal: 3.813 ± 0.437
1.22GluTrp: 1.22 ± 0.207
1.983GluTyr: 1.983 ± 0.344
0.0GluXaa: 0.0 ± 0.0
Phe
3.457PheAla: 3.457 ± 0.404
0.61PheCys: 0.61 ± 0.189
2.694PheAsp: 2.694 ± 0.41
2.033PheGlu: 2.033 ± 0.354
1.169PhePhe: 1.169 ± 0.211
2.999PheGly: 2.999 ± 0.467
1.068PheHis: 1.068 ± 0.271
2.084PheIle: 2.084 ± 0.289
1.932PheLys: 1.932 ± 0.359
2.338PheLeu: 2.338 ± 0.435
0.763PheMet: 0.763 ± 0.166
2.033PheAsn: 2.033 ± 0.217
1.678PhePro: 1.678 ± 0.322
1.118PheGln: 1.118 ± 0.257
2.338PheArg: 2.338 ± 0.408
2.389PheSer: 2.389 ± 0.391
2.44PheThr: 2.44 ± 0.341
2.542PheVal: 2.542 ± 0.395
0.254PheTrp: 0.254 ± 0.099
1.068PheTyr: 1.068 ± 0.258
0.0PheXaa: 0.0 ± 0.0
Gly
5.846GlyAla: 5.846 ± 0.676
1.169GlyCys: 1.169 ± 0.241
3.457GlyAsp: 3.457 ± 0.452
4.728GlyGlu: 4.728 ± 0.583
2.796GlyPhe: 2.796 ± 0.395
4.473GlyGly: 4.473 ± 0.558
1.22GlyHis: 1.22 ± 0.289
3.558GlyIle: 3.558 ± 0.463
5.287GlyLys: 5.287 ± 0.541
5.033GlyLeu: 5.033 ± 0.446
1.83GlyMet: 1.83 ± 0.301
2.643GlyAsn: 2.643 ± 0.352
2.796GlyPro: 2.796 ± 0.343
3.66GlyGln: 3.66 ± 0.467
3.965GlyArg: 3.965 ± 0.376
5.49GlySer: 5.49 ± 0.553
4.423GlyThr: 4.423 ± 0.362
6.863GlyVal: 6.863 ± 0.438
1.068GlyTrp: 1.068 ± 0.239
2.847GlyTyr: 2.847 ± 0.369
0.0GlyXaa: 0.0 ± 0.0
His
1.271HisAla: 1.271 ± 0.219
0.407HisCys: 0.407 ± 0.195
0.712HisAsp: 0.712 ± 0.217
1.118HisGlu: 1.118 ± 0.238
0.966HisPhe: 0.966 ± 0.226
1.373HisGly: 1.373 ± 0.253
0.356HisHis: 0.356 ± 0.161
1.017HisIle: 1.017 ± 0.207
0.864HisLys: 0.864 ± 0.204
1.678HisLeu: 1.678 ± 0.303
0.61HisMet: 0.61 ± 0.209
1.169HisAsn: 1.169 ± 0.236
0.915HisPro: 0.915 ± 0.204
0.915HisGln: 0.915 ± 0.209
1.017HisArg: 1.017 ± 0.213
1.423HisSer: 1.423 ± 0.246
1.271HisThr: 1.271 ± 0.264
1.474HisVal: 1.474 ± 0.333
0.407HisTrp: 0.407 ± 0.154
0.458HisTyr: 0.458 ± 0.198
0.0HisXaa: 0.0 ± 0.0
Ile
4.321IleAla: 4.321 ± 0.393
0.915IleCys: 0.915 ± 0.191
3.914IleAsp: 3.914 ± 0.461
3.355IleGlu: 3.355 ± 0.357
1.728IlePhe: 1.728 ± 0.421
3.101IleGly: 3.101 ± 0.421
1.169IleHis: 1.169 ± 0.25
2.643IleIle: 2.643 ± 0.451
3.152IleLys: 3.152 ± 0.329
2.643IleLeu: 2.643 ± 0.358
1.068IleMet: 1.068 ± 0.216
2.847IleAsn: 2.847 ± 0.326
3.253IlePro: 3.253 ± 0.356
2.643IleGln: 2.643 ± 0.348
2.593IleArg: 2.593 ± 0.419
3.355IleSer: 3.355 ± 0.428
4.473IleThr: 4.473 ± 0.605
3.965IleVal: 3.965 ± 0.457
0.458IleTrp: 0.458 ± 0.152
1.423IleTyr: 1.423 ± 0.253
0.0IleXaa: 0.0 ± 0.0
Lys
5.033LysAla: 5.033 ± 0.908
0.712LysCys: 0.712 ± 0.209
2.643LysAsp: 2.643 ± 0.432
4.168LysGlu: 4.168 ± 0.505
1.83LysPhe: 1.83 ± 0.29
3.609LysGly: 3.609 ± 0.514
1.525LysHis: 1.525 ± 0.328
2.745LysIle: 2.745 ± 0.348
2.796LysLys: 2.796 ± 0.406
5.134LysLeu: 5.134 ± 0.475
1.983LysMet: 1.983 ± 0.351
2.389LysAsn: 2.389 ± 0.333
3.304LysPro: 3.304 ± 0.447
2.796LysGln: 2.796 ± 0.342
2.796LysArg: 2.796 ± 0.394
3.253LysSer: 3.253 ± 0.396
2.491LysThr: 2.491 ± 0.375
3.304LysVal: 3.304 ± 0.408
0.712LysTrp: 0.712 ± 0.179
1.779LysTyr: 1.779 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
7.168LeuAla: 7.168 ± 0.594
1.83LeuCys: 1.83 ± 0.33
4.88LeuAsp: 4.88 ± 0.486
4.575LeuGlu: 4.575 ± 0.544
2.847LeuPhe: 2.847 ± 0.469
4.982LeuGly: 4.982 ± 0.634
1.83LeuHis: 1.83 ± 0.329
3.863LeuIle: 3.863 ± 0.429
4.524LeuLys: 4.524 ± 0.625
6.303LeuLeu: 6.303 ± 0.675
2.389LeuMet: 2.389 ± 0.384
4.219LeuAsn: 4.219 ± 0.524
3.152LeuPro: 3.152 ± 0.36
3.355LeuGln: 3.355 ± 0.502
4.626LeuArg: 4.626 ± 0.435
4.829LeuSer: 4.829 ± 0.567
5.134LeuThr: 5.134 ± 0.548
6.1LeuVal: 6.1 ± 0.555
0.915LeuTrp: 0.915 ± 0.234
2.491LeuTyr: 2.491 ± 0.389
0.0LeuXaa: 0.0 ± 0.0
Met
2.338MetAla: 2.338 ± 0.412
0.254MetCys: 0.254 ± 0.103
1.169MetAsp: 1.169 ± 0.235
1.779MetGlu: 1.779 ± 0.262
1.068MetPhe: 1.068 ± 0.234
2.237MetGly: 2.237 ± 0.359
0.508MetHis: 0.508 ± 0.149
1.728MetIle: 1.728 ± 0.308
1.627MetLys: 1.627 ± 0.363
2.338MetLeu: 2.338 ± 0.338
0.559MetMet: 0.559 ± 0.173
0.763MetAsn: 0.763 ± 0.219
1.118MetPro: 1.118 ± 0.243
1.779MetGln: 1.779 ± 0.282
2.033MetArg: 2.033 ± 0.301
2.338MetSer: 2.338 ± 0.353
2.033MetThr: 2.033 ± 0.34
1.83MetVal: 1.83 ± 0.253
0.051MetTrp: 0.051 ± 0.06
0.966MetTyr: 0.966 ± 0.233
0.0MetXaa: 0.0 ± 0.0
Asn
4.372AsnAla: 4.372 ± 0.574
0.763AsnCys: 0.763 ± 0.24
3.05AsnAsp: 3.05 ± 0.354
2.745AsnGlu: 2.745 ± 0.293
1.576AsnPhe: 1.576 ± 0.373
5.795AsnGly: 5.795 ± 0.634
0.966AsnHis: 0.966 ± 0.213
2.542AsnIle: 2.542 ± 0.454
2.44AsnLys: 2.44 ± 0.374
3.762AsnLeu: 3.762 ± 0.303
1.068AsnMet: 1.068 ± 0.245
2.593AsnAsn: 2.593 ± 0.476
2.338AsnPro: 2.338 ± 0.415
1.322AsnGln: 1.322 ± 0.29
1.83AsnArg: 1.83 ± 0.298
2.135AsnSer: 2.135 ± 0.342
2.338AsnThr: 2.338 ± 0.344
3.152AsnVal: 3.152 ± 0.394
1.118AsnTrp: 1.118 ± 0.257
1.932AsnTyr: 1.932 ± 0.267
0.0AsnXaa: 0.0 ± 0.0
Pro
5.134ProAla: 5.134 ± 0.753
0.356ProCys: 0.356 ± 0.143
2.593ProAsp: 2.593 ± 0.362
2.847ProGlu: 2.847 ± 0.443
1.881ProPhe: 1.881 ± 0.341
3.558ProGly: 3.558 ± 0.521
0.966ProHis: 0.966 ± 0.217
1.932ProIle: 1.932 ± 0.359
1.983ProLys: 1.983 ± 0.392
3.203ProLeu: 3.203 ± 0.373
1.118ProMet: 1.118 ± 0.227
1.627ProAsn: 1.627 ± 0.334
2.338ProPro: 2.338 ± 0.44
1.983ProGln: 1.983 ± 0.511
1.779ProArg: 1.779 ± 0.317
2.593ProSer: 2.593 ± 0.333
2.44ProThr: 2.44 ± 0.276
4.473ProVal: 4.473 ± 0.769
0.712ProTrp: 0.712 ± 0.224
1.678ProTyr: 1.678 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
4.423GlnAla: 4.423 ± 0.868
0.61GlnCys: 0.61 ± 0.176
1.423GlnAsp: 1.423 ± 0.267
2.542GlnGlu: 2.542 ± 0.343
1.525GlnPhe: 1.525 ± 0.38
2.847GlnGly: 2.847 ± 0.376
0.813GlnHis: 0.813 ± 0.22
1.881GlnIle: 1.881 ± 0.304
2.288GlnLys: 2.288 ± 0.442
4.168GlnLeu: 4.168 ± 0.486
1.474GlnMet: 1.474 ± 0.287
2.135GlnAsn: 2.135 ± 0.507
2.186GlnPro: 2.186 ± 0.603
3.558GlnGln: 3.558 ± 1.417
2.338GlnArg: 2.338 ± 0.411
3.05GlnSer: 3.05 ± 0.411
2.186GlnThr: 2.186 ± 0.326
2.338GlnVal: 2.338 ± 0.438
0.61GlnTrp: 0.61 ± 0.168
1.169GlnTyr: 1.169 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
4.626ArgAla: 4.626 ± 0.667
0.763ArgCys: 0.763 ± 0.231
2.796ArgAsp: 2.796 ± 0.32
2.847ArgGlu: 2.847 ± 0.484
2.135ArgPhe: 2.135 ± 0.367
2.999ArgGly: 2.999 ± 0.346
1.118ArgHis: 1.118 ± 0.296
3.05ArgIle: 3.05 ± 0.465
2.745ArgLys: 2.745 ± 0.382
4.88ArgLeu: 4.88 ± 0.49
1.525ArgMet: 1.525 ± 0.264
2.44ArgAsn: 2.44 ± 0.399
2.186ArgPro: 2.186 ± 0.371
1.83ArgGln: 1.83 ± 0.282
2.186ArgArg: 2.186 ± 0.344
2.999ArgSer: 2.999 ± 0.385
3.253ArgThr: 3.253 ± 0.343
3.762ArgVal: 3.762 ± 0.426
0.61ArgTrp: 0.61 ± 0.158
2.084ArgTyr: 2.084 ± 0.438
0.0ArgXaa: 0.0 ± 0.0
Ser
6.049SerAla: 6.049 ± 0.741
0.813SerCys: 0.813 ± 0.205
2.796SerAsp: 2.796 ± 0.415
3.05SerGlu: 3.05 ± 0.363
2.033SerPhe: 2.033 ± 0.378
5.795SerGly: 5.795 ± 0.682
0.712SerHis: 0.712 ± 0.173
3.813SerIle: 3.813 ± 0.463
3.203SerLys: 3.203 ± 0.357
4.677SerLeu: 4.677 ± 0.475
1.983SerMet: 1.983 ± 0.278
2.948SerAsn: 2.948 ± 0.412
2.288SerPro: 2.288 ± 0.357
1.881SerGln: 1.881 ± 0.446
3.203SerArg: 3.203 ± 0.377
4.016SerSer: 4.016 ± 0.556
3.152SerThr: 3.152 ± 0.396
5.185SerVal: 5.185 ± 0.518
1.525SerTrp: 1.525 ± 0.304
2.186SerTyr: 2.186 ± 0.272
0.0SerXaa: 0.0 ± 0.0
Thr
5.948ThrAla: 5.948 ± 0.678
0.407ThrCys: 0.407 ± 0.143
3.101ThrAsp: 3.101 ± 0.348
3.203ThrGlu: 3.203 ± 0.388
2.186ThrPhe: 2.186 ± 0.287
5.083ThrGly: 5.083 ± 0.511
1.068ThrHis: 1.068 ± 0.278
3.101ThrIle: 3.101 ± 0.376
2.44ThrLys: 2.44 ± 0.367
5.49ThrLeu: 5.49 ± 0.474
1.373ThrMet: 1.373 ± 0.297
3.101ThrAsn: 3.101 ± 0.354
3.152ThrPro: 3.152 ± 0.394
2.237ThrGln: 2.237 ± 0.312
2.643ThrArg: 2.643 ± 0.385
3.813ThrSer: 3.813 ± 0.424
3.965ThrThr: 3.965 ± 0.483
4.575ThrVal: 4.575 ± 0.562
0.763ThrTrp: 0.763 ± 0.206
1.881ThrTyr: 1.881 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
5.948ValAla: 5.948 ± 0.639
0.966ValCys: 0.966 ± 0.239
4.321ValAsp: 4.321 ± 0.59
4.27ValGlu: 4.27 ± 0.424
2.542ValPhe: 2.542 ± 0.325
4.931ValGly: 4.931 ± 0.37
1.118ValHis: 1.118 ± 0.226
3.914ValIle: 3.914 ± 0.388
4.728ValLys: 4.728 ± 0.404
5.744ValLeu: 5.744 ± 0.611
2.593ValMet: 2.593 ± 0.368
3.253ValAsn: 3.253 ± 0.412
2.643ValPro: 2.643 ± 0.351
3.66ValGln: 3.66 ± 0.624
4.016ValArg: 4.016 ± 0.463
5.083ValSer: 5.083 ± 0.571
4.626ValThr: 4.626 ± 0.411
6.151ValVal: 6.151 ± 0.461
1.322ValTrp: 1.322 ± 0.298
2.847ValTyr: 2.847 ± 0.362
0.0ValXaa: 0.0 ± 0.0
Trp
1.322TrpAla: 1.322 ± 0.253
0.305TrpCys: 0.305 ± 0.124
0.915TrpAsp: 0.915 ± 0.209
0.661TrpGlu: 0.661 ± 0.188
0.458TrpPhe: 0.458 ± 0.148
0.763TrpGly: 0.763 ± 0.209
0.61TrpHis: 0.61 ± 0.212
0.915TrpIle: 0.915 ± 0.257
1.118TrpLys: 1.118 ± 0.243
1.373TrpLeu: 1.373 ± 0.222
0.356TrpMet: 0.356 ± 0.138
0.508TrpAsn: 0.508 ± 0.134
0.661TrpPro: 0.661 ± 0.179
0.407TrpGln: 0.407 ± 0.131
0.661TrpArg: 0.661 ± 0.177
0.864TrpSer: 0.864 ± 0.201
0.763TrpThr: 0.763 ± 0.194
1.271TrpVal: 1.271 ± 0.263
0.407TrpTrp: 0.407 ± 0.121
0.915TrpTyr: 0.915 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.948TyrAla: 2.948 ± 0.328
0.508TyrCys: 0.508 ± 0.182
2.237TyrAsp: 2.237 ± 0.32
1.932TyrGlu: 1.932 ± 0.324
1.373TyrPhe: 1.373 ± 0.226
2.135TyrGly: 2.135 ± 0.403
0.864TyrHis: 0.864 ± 0.207
2.186TyrIle: 2.186 ± 0.295
1.728TyrLys: 1.728 ± 0.292
2.44TyrLeu: 2.44 ± 0.401
0.966TyrMet: 0.966 ± 0.214
2.338TyrAsn: 2.338 ± 0.338
1.322TyrPro: 1.322 ± 0.279
1.271TyrGln: 1.271 ± 0.216
2.491TyrArg: 2.491 ± 0.424
1.881TyrSer: 1.881 ± 0.398
1.932TyrThr: 1.932 ± 0.371
2.948TyrVal: 2.948 ± 0.499
0.508TyrTrp: 0.508 ± 0.18
1.627TyrTyr: 1.627 ± 0.321
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 99 proteins (19673 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski