Amino acid dipepetide frequency for Rheinheimera phage vB_RspM_Barba9A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.938AlaAla: 4.938 ± 0.654
0.913AlaCys: 0.913 ± 0.202
4.316AlaAsp: 4.316 ± 0.511
4.482AlaGlu: 4.482 ± 0.496
2.531AlaPhe: 2.531 ± 0.262
3.901AlaGly: 3.901 ± 0.459
0.871AlaHis: 0.871 ± 0.218
4.025AlaIle: 4.025 ± 0.396
4.316AlaLys: 4.316 ± 0.431
5.934AlaLeu: 5.934 ± 0.725
1.452AlaMet: 1.452 ± 0.225
3.112AlaAsn: 3.112 ± 0.372
1.701AlaPro: 1.701 ± 0.296
2.365AlaGln: 2.365 ± 0.294
1.909AlaArg: 1.909 ± 0.328
3.859AlaSer: 3.859 ± 0.455
3.901AlaThr: 3.901 ± 0.449
3.942AlaVal: 3.942 ± 0.462
0.498AlaTrp: 0.498 ± 0.134
2.78AlaTyr: 2.78 ± 0.358
0.0AlaXaa: 0.0 ± 0.0
Cys
0.581CysAla: 0.581 ± 0.163
0.373CysCys: 0.373 ± 0.116
1.079CysAsp: 1.079 ± 0.213
0.871CysGlu: 0.871 ± 0.217
0.913CysPhe: 0.913 ± 0.213
1.12CysGly: 1.12 ± 0.215
0.166CysHis: 0.166 ± 0.113
0.871CysIle: 0.871 ± 0.226
1.162CysLys: 1.162 ± 0.206
0.788CysLeu: 0.788 ± 0.198
0.332CysMet: 0.332 ± 0.133
0.996CysAsn: 0.996 ± 0.244
0.498CysPro: 0.498 ± 0.133
0.373CysGln: 0.373 ± 0.119
0.747CysArg: 0.747 ± 0.177
0.83CysSer: 0.83 ± 0.199
0.871CysThr: 0.871 ± 0.179
1.286CysVal: 1.286 ± 0.199
0.166CysTrp: 0.166 ± 0.076
0.581CysTyr: 0.581 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
2.739AspAla: 2.739 ± 0.365
0.954AspCys: 0.954 ± 0.18
4.15AspAsp: 4.15 ± 0.482
3.486AspGlu: 3.486 ± 0.414
2.614AspPhe: 2.614 ± 0.317
5.394AspGly: 5.394 ± 0.465
0.954AspHis: 0.954 ± 0.193
5.062AspIle: 5.062 ± 0.421
5.104AspLys: 5.104 ± 0.511
4.647AspLeu: 4.647 ± 0.493
1.411AspMet: 1.411 ± 0.26
3.693AspAsn: 3.693 ± 0.371
1.577AspPro: 1.577 ± 0.298
1.328AspGln: 1.328 ± 0.242
2.365AspArg: 2.365 ± 0.381
3.818AspSer: 3.818 ± 0.459
4.44AspThr: 4.44 ± 0.422
4.689AspVal: 4.689 ± 0.503
1.452AspTrp: 1.452 ± 0.255
3.237AspTyr: 3.237 ± 0.38
0.0AspXaa: 0.0 ± 0.0
Glu
4.067GluAla: 4.067 ± 0.484
1.079GluCys: 1.079 ± 0.236
3.652GluAsp: 3.652 ± 0.468
3.527GluGlu: 3.527 ± 0.425
2.78GluPhe: 2.78 ± 0.306
3.361GluGly: 3.361 ± 0.436
1.494GluHis: 1.494 ± 0.226
4.482GluIle: 4.482 ± 0.405
4.399GluLys: 4.399 ± 0.504
6.971GluLeu: 6.971 ± 0.514
1.826GluMet: 1.826 ± 0.242
3.444GluAsn: 3.444 ± 0.36
1.411GluPro: 1.411 ± 0.235
2.905GluGln: 2.905 ± 0.426
2.988GluArg: 2.988 ± 0.361
3.527GluSer: 3.527 ± 0.415
2.822GluThr: 2.822 ± 0.296
4.772GluVal: 4.772 ± 0.414
1.203GluTrp: 1.203 ± 0.233
3.652GluTyr: 3.652 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
2.407PheAla: 2.407 ± 0.289
0.788PheCys: 0.788 ± 0.165
3.237PheAsp: 3.237 ± 0.341
2.988PheGlu: 2.988 ± 0.332
1.784PhePhe: 1.784 ± 0.288
2.324PheGly: 2.324 ± 0.319
0.622PheHis: 0.622 ± 0.159
3.112PheIle: 3.112 ± 0.309
2.531PheLys: 2.531 ± 0.302
3.154PheLeu: 3.154 ± 0.348
1.079PheMet: 1.079 ± 0.189
3.154PheAsn: 3.154 ± 0.408
1.286PhePro: 1.286 ± 0.223
1.411PheGln: 1.411 ± 0.239
1.577PheArg: 1.577 ± 0.258
3.486PheSer: 3.486 ± 0.421
3.527PheThr: 3.527 ± 0.35
2.656PheVal: 2.656 ± 0.317
0.498PheTrp: 0.498 ± 0.158
1.162PheTyr: 1.162 ± 0.253
0.0PheXaa: 0.0 ± 0.0
Gly
3.818GlyAla: 3.818 ± 0.513
0.913GlyCys: 0.913 ± 0.194
4.316GlyAsp: 4.316 ± 0.535
3.901GlyGlu: 3.901 ± 0.38
3.071GlyPhe: 3.071 ± 0.34
4.647GlyGly: 4.647 ± 0.514
0.747GlyHis: 0.747 ± 0.184
3.195GlyIle: 3.195 ± 0.296
4.606GlyLys: 4.606 ± 0.443
4.896GlyLeu: 4.896 ± 0.51
1.95GlyMet: 1.95 ± 0.302
4.647GlyAsn: 4.647 ± 0.715
0.249GlyPro: 0.249 ± 0.093
2.033GlyGln: 2.033 ± 0.277
2.324GlyArg: 2.324 ± 0.321
5.768GlySer: 5.768 ± 0.571
4.482GlyThr: 4.482 ± 0.683
5.643GlyVal: 5.643 ± 0.486
1.411GlyTrp: 1.411 ± 0.276
3.569GlyTyr: 3.569 ± 0.75
0.0GlyXaa: 0.0 ± 0.0
His
0.664HisAla: 0.664 ± 0.152
0.207HisCys: 0.207 ± 0.085
0.456HisAsp: 0.456 ± 0.138
0.954HisGlu: 0.954 ± 0.232
0.498HisPhe: 0.498 ± 0.155
0.788HisGly: 0.788 ± 0.191
0.29HisHis: 0.29 ± 0.13
1.245HisIle: 1.245 ± 0.268
1.162HisLys: 1.162 ± 0.211
1.909HisLeu: 1.909 ± 0.293
0.373HisMet: 0.373 ± 0.111
0.871HisAsn: 0.871 ± 0.188
0.871HisPro: 0.871 ± 0.222
0.539HisGln: 0.539 ± 0.142
0.747HisArg: 0.747 ± 0.171
1.286HisSer: 1.286 ± 0.203
0.996HisThr: 0.996 ± 0.192
0.996HisVal: 0.996 ± 0.177
0.166HisTrp: 0.166 ± 0.082
0.788HisTyr: 0.788 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
4.025IleAla: 4.025 ± 0.399
0.871IleCys: 0.871 ± 0.233
4.399IleAsp: 4.399 ± 0.496
3.444IleGlu: 3.444 ± 0.39
1.95IlePhe: 1.95 ± 0.31
3.901IleGly: 3.901 ± 0.397
1.079IleHis: 1.079 ± 0.192
2.946IleIle: 2.946 ± 0.357
5.353IleLys: 5.353 ± 0.481
3.901IleLeu: 3.901 ± 0.368
1.369IleMet: 1.369 ± 0.244
3.735IleAsn: 3.735 ± 0.403
2.365IlePro: 2.365 ± 0.364
2.531IleGln: 2.531 ± 0.295
2.697IleArg: 2.697 ± 0.305
4.938IleSer: 4.938 ± 0.553
4.979IleThr: 4.979 ± 0.523
3.652IleVal: 3.652 ± 0.364
0.871IleTrp: 0.871 ± 0.16
2.739IleTyr: 2.739 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
4.357LysAla: 4.357 ± 0.559
0.913LysCys: 0.913 ± 0.207
3.901LysAsp: 3.901 ± 0.4
5.021LysGlu: 5.021 ± 0.491
2.863LysPhe: 2.863 ± 0.325
3.361LysGly: 3.361 ± 0.349
1.826LysHis: 1.826 ± 0.32
4.606LysIle: 4.606 ± 0.425
3.984LysLys: 3.984 ± 0.444
5.851LysLeu: 5.851 ± 0.549
2.033LysMet: 2.033 ± 0.305
2.946LysAsn: 2.946 ± 0.419
2.822LysPro: 2.822 ± 0.323
3.901LysGln: 3.901 ± 0.407
3.112LysArg: 3.112 ± 0.41
3.693LysSer: 3.693 ± 0.396
3.942LysThr: 3.942 ± 0.461
5.311LysVal: 5.311 ± 0.425
1.12LysTrp: 1.12 ± 0.205
2.822LysTyr: 2.822 ± 0.341
0.0LysXaa: 0.0 ± 0.0
Leu
5.892LeuAla: 5.892 ± 0.652
1.12LeuCys: 1.12 ± 0.192
4.772LeuAsp: 4.772 ± 0.426
5.809LeuGlu: 5.809 ± 0.389
4.108LeuPhe: 4.108 ± 0.512
4.938LeuGly: 4.938 ± 0.499
1.411LeuHis: 1.411 ± 0.234
4.647LeuIle: 4.647 ± 0.394
5.228LeuLys: 5.228 ± 0.503
6.598LeuLeu: 6.598 ± 0.54
1.743LeuMet: 1.743 ± 0.272
4.896LeuAsn: 4.896 ± 0.463
3.361LeuPro: 3.361 ± 0.432
3.527LeuGln: 3.527 ± 0.364
3.735LeuArg: 3.735 ± 0.367
6.432LeuSer: 6.432 ± 0.576
5.685LeuThr: 5.685 ± 0.551
5.643LeuVal: 5.643 ± 0.524
0.705LeuTrp: 0.705 ± 0.164
3.32LeuTyr: 3.32 ± 0.358
0.0LeuXaa: 0.0 ± 0.0
Met
1.784MetAla: 1.784 ± 0.305
0.498MetCys: 0.498 ± 0.134
0.954MetAsp: 0.954 ± 0.184
1.743MetGlu: 1.743 ± 0.303
1.369MetPhe: 1.369 ± 0.231
0.954MetGly: 0.954 ± 0.232
0.041MetHis: 0.041 ± 0.041
1.411MetIle: 1.411 ± 0.259
1.909MetLys: 1.909 ± 0.336
2.365MetLeu: 2.365 ± 0.32
0.788MetMet: 0.788 ± 0.162
1.203MetAsn: 1.203 ± 0.204
0.664MetPro: 0.664 ± 0.149
1.286MetGln: 1.286 ± 0.21
0.705MetArg: 0.705 ± 0.155
2.573MetSer: 2.573 ± 0.293
1.452MetThr: 1.452 ± 0.273
1.66MetVal: 1.66 ± 0.3
0.29MetTrp: 0.29 ± 0.122
0.954MetTyr: 0.954 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
3.154AsnAla: 3.154 ± 0.447
0.664AsnCys: 0.664 ± 0.17
2.78AsnAsp: 2.78 ± 0.295
2.739AsnGlu: 2.739 ± 0.344
2.158AsnPhe: 2.158 ± 0.292
4.813AsnGly: 4.813 ± 0.532
0.664AsnHis: 0.664 ± 0.201
3.527AsnIle: 3.527 ± 0.41
4.482AsnLys: 4.482 ± 0.48
4.813AsnLeu: 4.813 ± 0.455
1.452AsnMet: 1.452 ± 0.229
3.486AsnAsn: 3.486 ± 0.391
2.324AsnPro: 2.324 ± 0.278
2.656AsnGln: 2.656 ± 0.303
2.573AsnArg: 2.573 ± 0.405
3.112AsnSer: 3.112 ± 0.346
4.855AsnThr: 4.855 ± 1.2
3.984AsnVal: 3.984 ± 0.388
0.83AsnTrp: 0.83 ± 0.229
2.448AsnTyr: 2.448 ± 0.333
0.0AsnXaa: 0.0 ± 0.0
Pro
2.116ProAla: 2.116 ± 0.288
0.29ProCys: 0.29 ± 0.085
2.905ProAsp: 2.905 ± 0.373
2.531ProGlu: 2.531 ± 0.357
1.494ProPhe: 1.494 ± 0.234
0.041ProGly: 0.041 ± 0.039
0.456ProHis: 0.456 ± 0.137
2.324ProIle: 2.324 ± 0.295
1.743ProLys: 1.743 ± 0.313
2.407ProLeu: 2.407 ± 0.283
0.415ProMet: 0.415 ± 0.134
2.075ProAsn: 2.075 ± 0.306
0.996ProPro: 0.996 ± 0.224
1.037ProGln: 1.037 ± 0.241
1.079ProArg: 1.079 ± 0.253
2.324ProSer: 2.324 ± 0.35
3.071ProThr: 3.071 ± 0.425
2.822ProVal: 2.822 ± 0.36
0.498ProTrp: 0.498 ± 0.133
1.701ProTyr: 1.701 ± 0.22
0.0ProXaa: 0.0 ± 0.0
Gln
2.697GlnAla: 2.697 ± 0.448
0.415GlnCys: 0.415 ± 0.114
2.033GlnAsp: 2.033 ± 0.283
3.32GlnGlu: 3.32 ± 0.467
1.577GlnPhe: 1.577 ± 0.239
1.826GlnGly: 1.826 ± 0.296
0.498GlnHis: 0.498 ± 0.145
2.407GlnIle: 2.407 ± 0.322
2.282GlnLys: 2.282 ± 0.277
3.569GlnLeu: 3.569 ± 0.383
1.203GlnMet: 1.203 ± 0.315
1.992GlnAsn: 1.992 ± 0.364
1.66GlnPro: 1.66 ± 0.24
1.909GlnGln: 1.909 ± 0.367
1.909GlnArg: 1.909 ± 0.267
2.448GlnSer: 2.448 ± 0.352
1.95GlnThr: 1.95 ± 0.262
2.241GlnVal: 2.241 ± 0.284
0.456GlnTrp: 0.456 ± 0.141
2.241GlnTyr: 2.241 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
2.448ArgAla: 2.448 ± 0.423
0.664ArgCys: 0.664 ± 0.162
1.992ArgAsp: 1.992 ± 0.276
2.158ArgGlu: 2.158 ± 0.328
1.784ArgPhe: 1.784 ± 0.276
2.49ArgGly: 2.49 ± 0.324
0.747ArgHis: 0.747 ± 0.179
2.739ArgIle: 2.739 ± 0.372
2.905ArgLys: 2.905 ± 0.406
3.776ArgLeu: 3.776 ± 0.368
1.286ArgMet: 1.286 ± 0.246
2.531ArgAsn: 2.531 ± 0.527
1.203ArgPro: 1.203 ± 0.179
1.701ArgGln: 1.701 ± 0.244
1.701ArgArg: 1.701 ± 0.236
2.49ArgSer: 2.49 ± 0.317
2.365ArgThr: 2.365 ± 0.287
2.946ArgVal: 2.946 ± 0.44
0.373ArgTrp: 0.373 ± 0.099
2.033ArgTyr: 2.033 ± 0.271
0.0ArgXaa: 0.0 ± 0.0
Ser
3.984SerAla: 3.984 ± 0.419
1.162SerCys: 1.162 ± 0.243
5.311SerAsp: 5.311 ± 0.452
3.984SerGlu: 3.984 ± 0.434
2.946SerPhe: 2.946 ± 0.34
5.851SerGly: 5.851 ± 0.611
0.913SerHis: 0.913 ± 0.185
3.61SerIle: 3.61 ± 0.441
4.938SerLys: 4.938 ± 0.441
5.519SerLeu: 5.519 ± 0.682
1.784SerMet: 1.784 ± 0.25
3.486SerAsn: 3.486 ± 0.399
2.448SerPro: 2.448 ± 0.382
2.241SerGln: 2.241 ± 0.297
2.573SerArg: 2.573 ± 0.351
3.901SerSer: 3.901 ± 0.54
3.901SerThr: 3.901 ± 0.424
5.062SerVal: 5.062 ± 0.511
0.913SerTrp: 0.913 ± 0.178
3.029SerTyr: 3.029 ± 0.406
0.0SerXaa: 0.0 ± 0.0
Thr
4.73ThrAla: 4.73 ± 0.659
0.664ThrCys: 0.664 ± 0.184
4.191ThrAsp: 4.191 ± 0.348
4.025ThrGlu: 4.025 ± 0.41
2.697ThrPhe: 2.697 ± 0.309
6.722ThrGly: 6.722 ± 1.156
0.83ThrHis: 0.83 ± 0.201
3.984ThrIle: 3.984 ± 0.463
4.15ThrLys: 4.15 ± 0.466
5.145ThrLeu: 5.145 ± 0.627
1.079ThrMet: 1.079 ± 0.167
3.652ThrAsn: 3.652 ± 0.395
2.905ThrPro: 2.905 ± 0.409
2.116ThrGln: 2.116 ± 0.309
1.992ThrArg: 1.992 ± 0.292
4.191ThrSer: 4.191 ± 0.48
5.021ThrThr: 5.021 ± 0.548
4.772ThrVal: 4.772 ± 0.408
0.664ThrTrp: 0.664 ± 0.172
2.282ThrTyr: 2.282 ± 0.341
0.0ThrXaa: 0.0 ± 0.0
Val
4.316ValAla: 4.316 ± 0.596
0.996ValCys: 0.996 ± 0.179
4.73ValAsp: 4.73 ± 0.409
5.726ValGlu: 5.726 ± 0.498
3.237ValPhe: 3.237 ± 0.298
4.855ValGly: 4.855 ± 0.396
1.079ValHis: 1.079 ± 0.243
3.984ValIle: 3.984 ± 0.301
4.482ValLys: 4.482 ± 0.463
5.104ValLeu: 5.104 ± 0.469
1.867ValMet: 1.867 ± 0.246
3.901ValAsn: 3.901 ± 0.457
2.365ValPro: 2.365 ± 0.293
2.241ValGln: 2.241 ± 0.326
3.154ValArg: 3.154 ± 0.347
5.228ValSer: 5.228 ± 0.569
4.108ValThr: 4.108 ± 0.48
6.307ValVal: 6.307 ± 0.563
1.411ValTrp: 1.411 ± 0.224
2.863ValTyr: 2.863 ± 0.283
0.0ValXaa: 0.0 ± 0.0
Trp
0.581TrpAla: 0.581 ± 0.148
0.207TrpCys: 0.207 ± 0.093
0.498TrpAsp: 0.498 ± 0.143
0.83TrpGlu: 0.83 ± 0.168
0.664TrpPhe: 0.664 ± 0.183
0.83TrpGly: 0.83 ± 0.172
0.166TrpHis: 0.166 ± 0.086
0.871TrpIle: 0.871 ± 0.179
1.162TrpLys: 1.162 ± 0.214
1.743TrpLeu: 1.743 ± 0.257
0.332TrpMet: 0.332 ± 0.127
0.788TrpAsn: 0.788 ± 0.376
0.373TrpPro: 0.373 ± 0.126
0.913TrpGln: 0.913 ± 0.162
0.747TrpArg: 0.747 ± 0.201
0.747TrpSer: 0.747 ± 0.193
0.747TrpThr: 0.747 ± 0.174
0.954TrpVal: 0.954 ± 0.178
0.373TrpTrp: 0.373 ± 0.114
0.539TrpTyr: 0.539 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.448TyrAla: 2.448 ± 0.301
0.913TyrCys: 0.913 ± 0.201
3.444TyrAsp: 3.444 ± 0.388
2.822TyrGlu: 2.822 ± 0.349
1.743TyrPhe: 1.743 ± 0.259
4.067TyrGly: 4.067 ± 0.81
0.954TyrHis: 0.954 ± 0.19
2.697TyrIle: 2.697 ± 0.378
2.49TyrLys: 2.49 ± 0.331
4.482TyrLeu: 4.482 ± 0.421
0.788TyrMet: 0.788 ± 0.192
2.905TyrAsn: 2.905 ± 0.422
1.286TyrPro: 1.286 ± 0.253
1.618TyrGln: 1.618 ± 0.277
1.701TyrArg: 1.701 ± 0.257
2.905TyrSer: 2.905 ± 0.261
2.697TyrThr: 2.697 ± 0.379
2.614TyrVal: 2.614 ± 0.303
0.166TyrTrp: 0.166 ± 0.085
2.158TyrTyr: 2.158 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 138 proteins (24100 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski