Amino acid dipepetide frequency for Pseudomonas phage Littlefix

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.255AlaAla: 11.255 ± 1.482
0.435AlaCys: 0.435 ± 0.133
6.301AlaAsp: 6.301 ± 0.605
6.258AlaGlu: 6.258 ± 0.582
3.085AlaPhe: 3.085 ± 0.339
8.083AlaGly: 8.083 ± 0.921
1.695AlaHis: 1.695 ± 0.28
5.084AlaIle: 5.084 ± 0.578
6.301AlaLys: 6.301 ± 0.678
8.604AlaLeu: 8.604 ± 0.56
3.259AlaMet: 3.259 ± 0.405
3.824AlaAsn: 3.824 ± 0.43
3.042AlaPro: 3.042 ± 0.46
4.824AlaGln: 4.824 ± 0.727
4.432AlaArg: 4.432 ± 0.522
5.171AlaSer: 5.171 ± 0.744
5.519AlaThr: 5.519 ± 0.776
6.562AlaVal: 6.562 ± 0.563
1.173AlaTrp: 1.173 ± 0.216
3.433AlaTyr: 3.433 ± 0.409
0.0AlaXaa: 0.0 ± 0.0
Cys
0.739CysAla: 0.739 ± 0.204
0.304CysCys: 0.304 ± 0.117
0.652CysAsp: 0.652 ± 0.206
0.478CysGlu: 0.478 ± 0.175
0.13CysPhe: 0.13 ± 0.067
0.348CysGly: 0.348 ± 0.135
0.174CysHis: 0.174 ± 0.081
0.391CysIle: 0.391 ± 0.148
0.435CysLys: 0.435 ± 0.167
0.478CysLeu: 0.478 ± 0.166
0.174CysMet: 0.174 ± 0.112
0.739CysAsn: 0.739 ± 0.255
0.217CysPro: 0.217 ± 0.1
0.261CysGln: 0.261 ± 0.131
0.478CysArg: 0.478 ± 0.151
0.348CysSer: 0.348 ± 0.135
0.826CysThr: 0.826 ± 0.25
0.739CysVal: 0.739 ± 0.234
0.043CysTrp: 0.043 ± 0.051
0.391CysTyr: 0.391 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
6.127AspAla: 6.127 ± 0.939
0.565AspCys: 0.565 ± 0.159
3.216AspAsp: 3.216 ± 0.429
4.215AspGlu: 4.215 ± 0.527
1.999AspPhe: 1.999 ± 0.345
4.867AspGly: 4.867 ± 0.57
1.173AspHis: 1.173 ± 0.245
2.868AspIle: 2.868 ± 0.324
3.781AspLys: 3.781 ± 0.432
5.345AspLeu: 5.345 ± 0.404
1.651AspMet: 1.651 ± 0.262
2.868AspAsn: 2.868 ± 0.291
3.433AspPro: 3.433 ± 0.345
2.651AspGln: 2.651 ± 0.382
3.303AspArg: 3.303 ± 0.326
2.998AspSer: 2.998 ± 0.394
3.694AspThr: 3.694 ± 0.374
4.215AspVal: 4.215 ± 0.469
0.652AspTrp: 0.652 ± 0.203
1.912AspTyr: 1.912 ± 0.308
0.0AspXaa: 0.0 ± 0.0
Glu
7.083GluAla: 7.083 ± 0.714
0.478GluCys: 0.478 ± 0.177
3.911GluAsp: 3.911 ± 0.381
4.389GluGlu: 4.389 ± 0.441
1.869GluPhe: 1.869 ± 0.316
3.303GluGly: 3.303 ± 0.382
1.043GluHis: 1.043 ± 0.202
2.781GluIle: 2.781 ± 0.352
3.085GluLys: 3.085 ± 0.411
6.692GluLeu: 6.692 ± 0.548
3.085GluMet: 3.085 ± 0.489
2.303GluAsn: 2.303 ± 0.318
2.216GluPro: 2.216 ± 0.349
3.607GluGln: 3.607 ± 0.398
3.042GluArg: 3.042 ± 0.399
3.39GluSer: 3.39 ± 0.334
3.476GluThr: 3.476 ± 0.382
4.563GluVal: 4.563 ± 0.462
1.086GluTrp: 1.086 ± 0.282
2.347GluTyr: 2.347 ± 0.368
0.0GluXaa: 0.0 ± 0.0
Phe
2.912PheAla: 2.912 ± 0.332
0.348PheCys: 0.348 ± 0.123
2.303PheAsp: 2.303 ± 0.3
1.869PheGlu: 1.869 ± 0.298
1.043PhePhe: 1.043 ± 0.203
2.738PheGly: 2.738 ± 0.406
0.739PheHis: 0.739 ± 0.152
1.695PheIle: 1.695 ± 0.259
2.825PheLys: 2.825 ± 0.346
1.869PheLeu: 1.869 ± 0.287
1.347PheMet: 1.347 ± 0.344
1.695PheAsn: 1.695 ± 0.265
1.217PhePro: 1.217 ± 0.272
1.564PheGln: 1.564 ± 0.221
1.695PheArg: 1.695 ± 0.219
2.129PheSer: 2.129 ± 0.29
1.999PheThr: 1.999 ± 0.288
2.781PheVal: 2.781 ± 0.331
0.608PheTrp: 0.608 ± 0.164
0.739PheTyr: 0.739 ± 0.194
0.0PheXaa: 0.0 ± 0.0
Gly
6.345GlyAla: 6.345 ± 0.669
0.608GlyCys: 0.608 ± 0.174
3.824GlyAsp: 3.824 ± 0.345
4.041GlyGlu: 4.041 ± 0.38
3.52GlyPhe: 3.52 ± 0.402
3.911GlyGly: 3.911 ± 0.493
1.086GlyHis: 1.086 ± 0.256
3.824GlyIle: 3.824 ± 0.434
4.563GlyLys: 4.563 ± 0.469
6.04GlyLeu: 6.04 ± 0.504
1.825GlyMet: 1.825 ± 0.298
3.607GlyAsn: 3.607 ± 0.403
2.173GlyPro: 2.173 ± 0.336
3.129GlyGln: 3.129 ± 0.355
2.26GlyArg: 2.26 ± 0.323
4.041GlySer: 4.041 ± 0.572
4.997GlyThr: 4.997 ± 0.453
4.954GlyVal: 4.954 ± 0.387
0.826GlyTrp: 0.826 ± 0.205
2.738GlyTyr: 2.738 ± 0.317
0.0GlyXaa: 0.0 ± 0.0
His
1.651HisAla: 1.651 ± 0.308
0.13HisCys: 0.13 ± 0.074
1.304HisAsp: 1.304 ± 0.21
1.434HisGlu: 1.434 ± 0.29
0.652HisPhe: 0.652 ± 0.166
0.956HisGly: 0.956 ± 0.192
0.435HisHis: 0.435 ± 0.147
1.13HisIle: 1.13 ± 0.228
1.26HisLys: 1.26 ± 0.239
1.347HisLeu: 1.347 ± 0.295
0.608HisMet: 0.608 ± 0.135
0.695HisAsn: 0.695 ± 0.178
0.826HisPro: 0.826 ± 0.228
0.956HisGln: 0.956 ± 0.165
1.173HisArg: 1.173 ± 0.249
1.26HisSer: 1.26 ± 0.189
1.043HisThr: 1.043 ± 0.201
1.477HisVal: 1.477 ± 0.285
0.304HisTrp: 0.304 ± 0.121
0.739HisTyr: 0.739 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
4.824IleAla: 4.824 ± 0.458
0.348IleCys: 0.348 ± 0.112
4.041IleAsp: 4.041 ± 0.445
3.694IleGlu: 3.694 ± 0.425
1.173IlePhe: 1.173 ± 0.266
3.52IleGly: 3.52 ± 0.427
1.217IleHis: 1.217 ± 0.265
2.564IleIle: 2.564 ± 0.566
3.65IleLys: 3.65 ± 0.399
3.52IleLeu: 3.52 ± 0.425
1.782IleMet: 1.782 ± 0.246
2.607IleAsn: 2.607 ± 0.375
2.694IlePro: 2.694 ± 0.333
2.347IleGln: 2.347 ± 0.35
1.956IleArg: 1.956 ± 0.244
2.781IleSer: 2.781 ± 0.285
3.52IleThr: 3.52 ± 0.445
3.303IleVal: 3.303 ± 0.345
0.174IleTrp: 0.174 ± 0.08
1.608IleTyr: 1.608 ± 0.238
0.0IleXaa: 0.0 ± 0.0
Lys
6.475LysAla: 6.475 ± 0.733
0.739LysCys: 0.739 ± 0.203
4.215LysAsp: 4.215 ± 0.543
4.302LysGlu: 4.302 ± 0.586
2.042LysPhe: 2.042 ± 0.283
3.781LysGly: 3.781 ± 0.392
1.26LysHis: 1.26 ± 0.224
2.998LysIle: 2.998 ± 0.411
4.476LysLys: 4.476 ± 0.609
6.779LysLeu: 6.779 ± 0.676
1.912LysMet: 1.912 ± 0.253
2.52LysAsn: 2.52 ± 0.351
4.085LysPro: 4.085 ± 0.596
2.868LysGln: 2.868 ± 0.369
2.694LysArg: 2.694 ± 0.356
3.042LysSer: 3.042 ± 0.449
2.998LysThr: 2.998 ± 0.352
3.911LysVal: 3.911 ± 0.498
0.739LysTrp: 0.739 ± 0.179
1.738LysTyr: 1.738 ± 0.326
0.0LysXaa: 0.0 ± 0.0
Leu
8.474LeuAla: 8.474 ± 0.562
0.478LeuCys: 0.478 ± 0.158
5.649LeuAsp: 5.649 ± 0.46
4.302LeuGlu: 4.302 ± 0.429
3.303LeuPhe: 3.303 ± 0.406
5.432LeuGly: 5.432 ± 0.397
1.869LeuHis: 1.869 ± 0.29
4.041LeuIle: 4.041 ± 0.473
5.649LeuLys: 5.649 ± 0.622
6.605LeuLeu: 6.605 ± 0.708
2.477LeuMet: 2.477 ± 0.294
3.954LeuAsn: 3.954 ± 0.436
4.476LeuPro: 4.476 ± 0.477
4.215LeuGln: 4.215 ± 0.434
3.607LeuArg: 3.607 ± 0.407
5.302LeuSer: 5.302 ± 0.401
5.562LeuThr: 5.562 ± 0.579
5.649LeuVal: 5.649 ± 0.475
0.652LeuTrp: 0.652 ± 0.157
2.39LeuTyr: 2.39 ± 0.293
0.0LeuXaa: 0.0 ± 0.0
Met
3.216MetAla: 3.216 ± 0.372
0.217MetCys: 0.217 ± 0.11
1.13MetAsp: 1.13 ± 0.245
1.869MetGlu: 1.869 ± 0.236
0.999MetPhe: 0.999 ± 0.221
1.651MetGly: 1.651 ± 0.275
0.826MetHis: 0.826 ± 0.178
2.26MetIle: 2.26 ± 0.287
1.391MetLys: 1.391 ± 0.219
2.607MetLeu: 2.607 ± 0.376
0.956MetMet: 0.956 ± 0.21
1.434MetAsn: 1.434 ± 0.224
1.347MetPro: 1.347 ± 0.232
1.521MetGln: 1.521 ± 0.297
1.564MetArg: 1.564 ± 0.268
2.694MetSer: 2.694 ± 0.394
1.738MetThr: 1.738 ± 0.257
2.086MetVal: 2.086 ± 0.341
0.304MetTrp: 0.304 ± 0.136
1.043MetTyr: 1.043 ± 0.219
0.0MetXaa: 0.0 ± 0.0
Asn
3.781AsnAla: 3.781 ± 0.451
0.304AsnCys: 0.304 ± 0.14
2.129AsnAsp: 2.129 ± 0.289
3.259AsnGlu: 3.259 ± 0.377
1.608AsnPhe: 1.608 ± 0.222
3.65AsnGly: 3.65 ± 0.591
0.739AsnHis: 0.739 ± 0.191
2.042AsnIle: 2.042 ± 0.294
2.607AsnLys: 2.607 ± 0.312
4.215AsnLeu: 4.215 ± 0.401
1.564AsnMet: 1.564 ± 0.312
2.347AsnAsn: 2.347 ± 0.305
2.564AsnPro: 2.564 ± 0.381
2.216AsnGln: 2.216 ± 0.314
2.52AsnArg: 2.52 ± 0.282
2.694AsnSer: 2.694 ± 0.375
2.825AsnThr: 2.825 ± 0.539
2.868AsnVal: 2.868 ± 0.47
0.739AsnTrp: 0.739 ± 0.198
1.347AsnTyr: 1.347 ± 0.197
0.0AsnXaa: 0.0 ± 0.0
Pro
3.65ProAla: 3.65 ± 0.382
0.565ProCys: 0.565 ± 0.185
3.39ProAsp: 3.39 ± 0.364
3.476ProGlu: 3.476 ± 0.43
1.608ProPhe: 1.608 ± 0.281
2.912ProGly: 2.912 ± 0.405
0.565ProHis: 0.565 ± 0.15
1.782ProIle: 1.782 ± 0.301
2.651ProLys: 2.651 ± 0.409
3.346ProLeu: 3.346 ± 0.346
1.13ProMet: 1.13 ± 0.259
2.216ProAsn: 2.216 ± 0.281
1.738ProPro: 1.738 ± 0.411
1.869ProGln: 1.869 ± 0.345
1.13ProArg: 1.13 ± 0.244
2.52ProSer: 2.52 ± 0.362
2.912ProThr: 2.912 ± 0.353
3.998ProVal: 3.998 ± 0.482
0.435ProTrp: 0.435 ± 0.165
1.086ProTyr: 1.086 ± 0.205
0.0ProXaa: 0.0 ± 0.0
Gln
5.171GlnAla: 5.171 ± 0.722
0.261GlnCys: 0.261 ± 0.13
1.738GlnAsp: 1.738 ± 0.283
2.998GlnGlu: 2.998 ± 0.334
1.608GlnPhe: 1.608 ± 0.314
3.129GlnGly: 3.129 ± 0.346
0.782GlnHis: 0.782 ± 0.158
2.607GlnIle: 2.607 ± 0.339
2.347GlnLys: 2.347 ± 0.426
4.65GlnLeu: 4.65 ± 0.37
1.26GlnMet: 1.26 ± 0.27
1.391GlnAsn: 1.391 ± 0.218
1.782GlnPro: 1.782 ± 0.31
2.477GlnGln: 2.477 ± 0.337
2.129GlnArg: 2.129 ± 0.381
2.52GlnSer: 2.52 ± 0.297
2.738GlnThr: 2.738 ± 0.36
3.346GlnVal: 3.346 ± 0.371
0.521GlnTrp: 0.521 ± 0.177
1.825GlnTyr: 1.825 ± 0.244
0.0GlnXaa: 0.0 ± 0.0
Arg
4.302ArgAla: 4.302 ± 0.438
0.348ArgCys: 0.348 ± 0.142
2.651ArgAsp: 2.651 ± 0.296
2.651ArgGlu: 2.651 ± 0.504
2.042ArgPhe: 2.042 ± 0.335
2.347ArgGly: 2.347 ± 0.303
0.826ArgHis: 0.826 ± 0.192
3.129ArgIle: 3.129 ± 0.419
3.085ArgLys: 3.085 ± 0.418
3.868ArgLeu: 3.868 ± 0.513
1.217ArgMet: 1.217 ± 0.226
2.868ArgAsn: 2.868 ± 0.406
0.956ArgPro: 0.956 ± 0.24
2.173ArgGln: 2.173 ± 0.376
2.042ArgArg: 2.042 ± 0.335
2.477ArgSer: 2.477 ± 0.359
3.042ArgThr: 3.042 ± 0.301
3.39ArgVal: 3.39 ± 0.421
0.348ArgTrp: 0.348 ± 0.138
1.738ArgTyr: 1.738 ± 0.367
0.0ArgXaa: 0.0 ± 0.0
Ser
5.258SerAla: 5.258 ± 0.738
0.565SerCys: 0.565 ± 0.205
3.824SerAsp: 3.824 ± 0.403
3.346SerGlu: 3.346 ± 0.336
1.608SerPhe: 1.608 ± 0.237
4.302SerGly: 4.302 ± 0.453
1.086SerHis: 1.086 ± 0.207
2.912SerIle: 2.912 ± 0.279
3.781SerLys: 3.781 ± 0.296
5.084SerLeu: 5.084 ± 0.509
1.477SerMet: 1.477 ± 0.26
2.347SerAsn: 2.347 ± 0.369
2.26SerPro: 2.26 ± 0.291
1.869SerGln: 1.869 ± 0.235
2.52SerArg: 2.52 ± 0.321
3.259SerSer: 3.259 ± 0.387
3.998SerThr: 3.998 ± 0.377
3.998SerVal: 3.998 ± 0.479
0.956SerTrp: 0.956 ± 0.199
2.216SerTyr: 2.216 ± 0.29
0.0SerXaa: 0.0 ± 0.0
Thr
7.257ThrAla: 7.257 ± 0.957
0.304ThrCys: 0.304 ± 0.117
4.041ThrAsp: 4.041 ± 0.362
3.954ThrGlu: 3.954 ± 0.475
2.086ThrPhe: 2.086 ± 0.285
5.041ThrGly: 5.041 ± 0.55
1.347ThrHis: 1.347 ± 0.226
3.346ThrIle: 3.346 ± 0.46
4.041ThrLys: 4.041 ± 0.481
3.954ThrLeu: 3.954 ± 0.447
1.521ThrMet: 1.521 ± 0.291
3.085ThrAsn: 3.085 ± 0.422
3.172ThrPro: 3.172 ± 0.339
2.129ThrGln: 2.129 ± 0.284
2.434ThrArg: 2.434 ± 0.327
3.476ThrSer: 3.476 ± 0.313
4.085ThrThr: 4.085 ± 0.692
4.954ThrVal: 4.954 ± 0.472
0.695ThrTrp: 0.695 ± 0.197
2.129ThrTyr: 2.129 ± 0.422
0.0ThrXaa: 0.0 ± 0.0
Val
6.127ValAla: 6.127 ± 0.566
0.695ValCys: 0.695 ± 0.213
4.78ValAsp: 4.78 ± 0.398
4.737ValGlu: 4.737 ± 0.47
2.086ValPhe: 2.086 ± 0.28
4.085ValGly: 4.085 ± 0.478
1.434ValHis: 1.434 ± 0.211
3.607ValIle: 3.607 ± 0.458
4.737ValLys: 4.737 ± 0.445
5.171ValLeu: 5.171 ± 0.468
2.042ValMet: 2.042 ± 0.331
3.52ValAsn: 3.52 ± 0.454
3.085ValPro: 3.085 ± 0.36
3.607ValGln: 3.607 ± 0.462
3.607ValArg: 3.607 ± 0.471
4.302ValSer: 4.302 ± 0.397
5.128ValThr: 5.128 ± 0.575
4.954ValVal: 4.954 ± 0.607
1.13ValTrp: 1.13 ± 0.255
2.477ValTyr: 2.477 ± 0.338
0.0ValXaa: 0.0 ± 0.0
Trp
0.956TrpAla: 0.956 ± 0.167
0.304TrpCys: 0.304 ± 0.109
0.652TrpAsp: 0.652 ± 0.201
0.869TrpGlu: 0.869 ± 0.217
0.565TrpPhe: 0.565 ± 0.183
1.173TrpGly: 1.173 ± 0.243
0.174TrpHis: 0.174 ± 0.099
0.521TrpIle: 0.521 ± 0.14
0.826TrpLys: 0.826 ± 0.239
0.826TrpLeu: 0.826 ± 0.166
0.391TrpMet: 0.391 ± 0.146
0.435TrpAsn: 0.435 ± 0.165
0.521TrpPro: 0.521 ± 0.145
0.304TrpGln: 0.304 ± 0.123
0.695TrpArg: 0.695 ± 0.238
0.391TrpSer: 0.391 ± 0.107
0.695TrpThr: 0.695 ± 0.195
0.913TrpVal: 0.913 ± 0.226
0.174TrpTrp: 0.174 ± 0.092
0.652TrpTyr: 0.652 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.738TyrAla: 2.738 ± 0.311
0.348TyrCys: 0.348 ± 0.139
1.782TyrAsp: 1.782 ± 0.238
1.608TyrGlu: 1.608 ± 0.268
1.086TyrPhe: 1.086 ± 0.199
3.085TyrGly: 3.085 ± 0.421
0.913TyrHis: 0.913 ± 0.24
1.738TyrIle: 1.738 ± 0.288
2.086TyrLys: 2.086 ± 0.336
3.085TyrLeu: 3.085 ± 0.352
1.13TyrMet: 1.13 ± 0.244
1.564TyrAsn: 1.564 ± 0.218
1.173TyrPro: 1.173 ± 0.25
0.695TyrGln: 0.695 ± 0.167
2.216TyrArg: 2.216 ± 0.342
1.869TyrSer: 1.869 ± 0.284
2.26TyrThr: 2.26 ± 0.332
2.651TyrVal: 2.651 ± 0.375
0.478TyrTrp: 0.478 ± 0.173
1.043TyrTyr: 1.043 ± 0.288
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (23013 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski