Amino acid dipeptide frequency for Sinorhizobium phage phiLM21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
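As a rough illustration of how such frequencies could be obtained, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter input format, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of counted pairs are all assumptions made for this example; the page does not state how Proteome-pI performs the calculation.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}

# Uniform expectation for the 400 standard dipeptides: 2.5 per mille (0.25%).
UNIFORM_EXPECTATION = 1000.0 / 400

# Toy input, not taken from the phage proteome.
freqs = dipeptide_per_mille(["MAAK", "AAXG"])
```

In this toy input there are six pairs in total and `AA` occurs twice, so `freqs["AA"]` is one third of 1000‰.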

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
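The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `to_fraction` and `classify` are hypothetical, and comparing each value directly with 2.5‰ is an assumption; the page does not say what threshold or significance criterion is used for the colouring.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25% per dipeptide

def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3

def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"

# Two values taken from the table: AlaAla and CysCys.
ala_ala = 15.647
cys_cys = 0.197
labels = {"AlaAla": classify(ala_ala), "CysCys": classify(cys_cys)}
```

With these two table values, AlaAla is labelled "over" and CysCys "under" relative to 2.5‰.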

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.647 ± 1.366
AlaCys: 1.446 ± 0.265
AlaAsp: 6.311 ± 0.636
AlaGlu: 8.086 ± 1.029
AlaPhe: 3.682 ± 0.554
AlaGly: 7.495 ± 0.597
AlaHis: 2.301 ± 0.423
AlaIle: 6.377 ± 0.597
AlaLys: 6.706 ± 0.661
AlaLeu: 8.349 ± 0.839
AlaMet: 4.01 ± 0.517
AlaAsn: 4.931 ± 0.672
AlaPro: 4.142 ± 0.444
AlaGln: 4.668 ± 0.579
AlaArg: 7.429 ± 0.916
AlaSer: 6.245 ± 0.927
AlaThr: 6.771 ± 0.721
AlaVal: 6.574 ± 0.516
AlaTrp: 2.367 ± 0.323
AlaTyr: 2.695 ± 0.489
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.92 ± 0.248
CysCys: 0.197 ± 0.117
CysAsp: 0.723 ± 0.201
CysGlu: 0.657 ± 0.182
CysPhe: 0.394 ± 0.197
CysGly: 0.986 ± 0.251
CysHis: 0.197 ± 0.132
CysIle: 0.592 ± 0.201
CysLys: 0.329 ± 0.125
CysLeu: 0.592 ± 0.201
CysMet: 0.263 ± 0.134
CysAsn: 0.329 ± 0.142
CysPro: 0.526 ± 0.177
CysGln: 0.394 ± 0.145
CysArg: 0.657 ± 0.181
CysSer: 0.592 ± 0.19
CysThr: 0.394 ± 0.194
CysVal: 0.723 ± 0.233
CysTrp: 0.131 ± 0.078
CysTyr: 0.263 ± 0.125
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.166 ± 0.808
AspCys: 0.329 ± 0.146
AspAsp: 4.207 ± 0.579
AspGlu: 4.076 ± 0.522
AspPhe: 2.63 ± 0.43
AspGly: 4.733 ± 0.589
AspHis: 1.315 ± 0.3
AspIle: 3.747 ± 0.524
AspLys: 2.301 ± 0.47
AspLeu: 4.799 ± 0.431
AspMet: 1.446 ± 0.318
AspAsn: 1.446 ± 0.31
AspPro: 2.761 ± 0.423
AspGln: 1.644 ± 0.28
AspArg: 4.207 ± 0.731
AspSer: 2.695 ± 0.489
AspThr: 2.761 ± 0.449
AspVal: 4.142 ± 0.474
AspTrp: 2.235 ± 0.37
AspTyr: 1.841 ± 0.386
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.218 ± 0.81
GluCys: 0.789 ± 0.275
GluAsp: 3.156 ± 0.556
GluGlu: 4.01 ± 0.66
GluPhe: 2.235 ± 0.346
GluGly: 3.747 ± 0.475
GluHis: 1.512 ± 0.376
GluIle: 4.076 ± 0.609
GluLys: 4.47 ± 0.557
GluLeu: 4.931 ± 0.711
GluMet: 2.235 ± 0.356
GluAsn: 1.709 ± 0.346
GluPro: 2.893 ± 0.487
GluGln: 3.55 ± 0.58
GluArg: 6.114 ± 0.923
GluSer: 1.972 ± 0.346
GluThr: 3.419 ± 0.553
GluVal: 3.682 ± 0.471
GluTrp: 1.249 ± 0.303
GluTyr: 1.183 ± 0.326
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.287 ± 0.403
PheCys: 0.394 ± 0.151
PheAsp: 2.564 ± 0.361
PheGlu: 1.972 ± 0.362
PhePhe: 1.052 ± 0.256
PheGly: 2.893 ± 0.423
PheHis: 0.657 ± 0.211
PheIle: 1.315 ± 0.302
PheLys: 1.775 ± 0.377
PheLeu: 1.907 ± 0.342
PheMet: 0.329 ± 0.143
PheAsn: 1.578 ± 0.298
PhePro: 1.709 ± 0.328
PheGln: 1.644 ± 0.309
PheArg: 2.301 ± 0.399
PheSer: 1.972 ± 0.351
PheThr: 2.695 ± 0.346
PheVal: 2.63 ± 0.394
PheTrp: 0.526 ± 0.155
PheTyr: 0.723 ± 0.205
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.875 ± 0.789
GlyCys: 0.723 ± 0.221
GlyAsp: 4.602 ± 0.561
GlyGlu: 4.273 ± 0.551
GlyPhe: 3.55 ± 0.57
GlyGly: 7.297 ± 1.06
GlyHis: 0.986 ± 0.27
GlyIle: 3.616 ± 0.447
GlyLys: 4.668 ± 0.537
GlyLeu: 6.311 ± 0.671
GlyMet: 1.841 ± 0.355
GlyAsn: 2.893 ± 0.421
GlyPro: 2.958 ± 0.365
GlyGln: 2.827 ± 0.476
GlyArg: 4.799 ± 0.68
GlySer: 4.931 ± 0.729
GlyThr: 5.654 ± 0.965
GlyVal: 5.259 ± 0.684
GlyTrp: 1.315 ± 0.338
GlyTyr: 3.484 ± 0.456
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.644 ± 0.358
HisCys: 0.131 ± 0.092
HisAsp: 1.381 ± 0.34
HisGlu: 1.446 ± 0.288
HisPhe: 0.92 ± 0.289
HisGly: 1.183 ± 0.296
HisHis: 0.46 ± 0.224
HisIle: 0.657 ± 0.218
HisLys: 0.789 ± 0.219
HisLeu: 1.381 ± 0.336
HisMet: 0.394 ± 0.174
HisAsn: 0.263 ± 0.114
HisPro: 1.249 ± 0.249
HisGln: 1.249 ± 0.309
HisArg: 1.972 ± 0.415
HisSer: 0.46 ± 0.174
HisThr: 0.592 ± 0.238
HisVal: 1.315 ± 0.314
HisTrp: 0.394 ± 0.175
HisTyr: 0.46 ± 0.16
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.048 ± 0.679
IleCys: 0.394 ± 0.157
IleAsp: 4.01 ± 0.509
IleGlu: 4.076 ± 0.539
IlePhe: 1.381 ± 0.328
IleGly: 4.996 ± 0.591
IleHis: 0.723 ± 0.206
IleIle: 2.761 ± 0.417
IleLys: 2.367 ± 0.415
IleLeu: 3.221 ± 0.508
IleMet: 0.855 ± 0.219
IleAsn: 2.564 ± 0.436
IlePro: 2.301 ± 0.368
IleGln: 1.578 ± 0.364
IleArg: 3.221 ± 0.461
IleSer: 2.695 ± 0.53
IleThr: 3.682 ± 0.44
IleVal: 3.682 ± 0.469
IleTrp: 0.46 ± 0.173
IleTyr: 0.723 ± 0.197
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.969 ± 0.768
LysCys: 0.329 ± 0.135
LysAsp: 2.038 ± 0.583
LysGlu: 3.156 ± 0.422
LysPhe: 1.644 ± 0.283
LysGly: 4.207 ± 0.658
LysHis: 1.249 ± 0.304
LysIle: 3.353 ± 0.447
LysLys: 1.907 ± 0.386
LysLeu: 3.813 ± 0.465
LysMet: 1.052 ± 0.333
LysAsn: 1.315 ± 0.321
LysPro: 2.235 ± 0.345
LysGln: 1.578 ± 0.358
LysArg: 3.55 ± 0.492
LysSer: 2.761 ± 0.37
LysThr: 2.827 ± 0.399
LysVal: 3.156 ± 0.464
LysTrp: 0.592 ± 0.225
LysTyr: 1.183 ± 0.271
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.481 ± 0.699
LeuCys: 1.052 ± 0.264
LeuAsp: 5.128 ± 0.559
LeuGlu: 4.142 ± 0.461
LeuPhe: 2.104 ± 0.429
LeuGly: 5.325 ± 0.849
LeuHis: 1.315 ± 0.301
LeuIle: 3.813 ± 0.423
LeuLys: 3.221 ± 0.562
LeuLeu: 4.865 ± 0.513
LeuMet: 1.315 ± 0.296
LeuAsn: 2.63 ± 0.478
LeuPro: 3.287 ± 0.434
LeuGln: 2.235 ± 0.318
LeuArg: 5.522 ± 0.683
LeuSer: 5.588 ± 0.612
LeuThr: 4.733 ± 0.512
LeuVal: 5.72 ± 0.631
LeuTrp: 1.118 ± 0.279
LeuTyr: 1.972 ± 0.429
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.63 ± 0.356
MetCys: 0.197 ± 0.119
MetAsp: 1.578 ± 0.271
MetGlu: 1.183 ± 0.292
MetPhe: 1.052 ± 0.282
MetGly: 1.709 ± 0.334
MetHis: 0.329 ± 0.191
MetIle: 0.986 ± 0.211
MetLys: 1.052 ± 0.31
MetLeu: 1.446 ± 0.32
MetMet: 0.263 ± 0.135
MetAsn: 0.789 ± 0.208
MetPro: 1.249 ± 0.296
MetGln: 1.118 ± 0.264
MetArg: 1.709 ± 0.478
MetSer: 2.498 ± 0.374
MetThr: 1.644 ± 0.383
MetVal: 0.92 ± 0.264
MetTrp: 0.46 ± 0.192
MetTyr: 0.46 ± 0.176
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.207 ± 0.486
AsnCys: 0.263 ± 0.125
AsnAsp: 2.038 ± 0.371
AsnGlu: 2.235 ± 0.328
AsnPhe: 0.657 ± 0.212
AsnGly: 4.01 ± 0.658
AsnHis: 0.526 ± 0.181
AsnIle: 2.038 ± 0.463
AsnLys: 1.841 ± 0.361
AsnLeu: 2.63 ± 0.371
AsnMet: 0.657 ± 0.255
AsnAsn: 1.052 ± 0.249
AsnPro: 1.775 ± 0.372
AsnGln: 1.118 ± 0.274
AsnArg: 2.564 ± 0.429
AsnSer: 2.038 ± 0.382
AsnThr: 2.301 ± 0.453
AsnVal: 2.169 ± 0.421
AsnTrp: 0.263 ± 0.122
AsnTyr: 0.855 ± 0.23
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.522 ± 0.612
ProCys: 0.329 ± 0.147
ProAsp: 3.419 ± 0.532
ProGlu: 2.695 ± 0.441
ProPhe: 1.709 ± 0.301
ProGly: 2.761 ± 0.393
ProHis: 0.92 ± 0.259
ProIle: 2.235 ± 0.438
ProLys: 2.301 ± 0.384
ProLeu: 3.024 ± 0.384
ProMet: 1.118 ± 0.26
ProAsn: 1.578 ± 0.333
ProPro: 2.104 ± 0.439
ProGln: 0.789 ± 0.252
ProArg: 2.104 ± 0.398
ProSer: 3.616 ± 0.471
ProThr: 2.498 ± 0.461
ProVal: 4.602 ± 0.522
ProTrp: 0.657 ± 0.267
ProTyr: 0.657 ± 0.204
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.405 ± 0.652
GlnCys: 0.197 ± 0.102
GlnAsp: 1.907 ± 0.352
GlnGlu: 1.907 ± 0.385
GlnPhe: 1.512 ± 0.376
GlnGly: 3.221 ± 0.446
GlnHis: 0.394 ± 0.171
GlnIle: 1.315 ± 0.312
GlnLys: 1.578 ± 0.298
GlnLeu: 2.893 ± 0.409
GlnMet: 0.92 ± 0.255
GlnAsn: 1.315 ± 0.316
GlnPro: 1.578 ± 0.378
GlnGln: 2.038 ± 0.383
GlnArg: 2.893 ± 0.483
GlnSer: 1.907 ± 0.438
GlnThr: 1.907 ± 0.374
GlnVal: 2.827 ± 0.449
GlnTrp: 0.986 ± 0.267
GlnTyr: 1.118 ± 0.295
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.574 ± 0.836
ArgCys: 0.723 ± 0.202
ArgAsp: 4.536 ± 0.545
ArgGlu: 5.72 ± 0.901
ArgPhe: 2.432 ± 0.361
ArgGly: 4.931 ± 0.606
ArgHis: 1.249 ± 0.315
ArgIle: 3.156 ± 0.463
ArgLys: 3.55 ± 0.626
ArgLeu: 6.706 ± 0.64
ArgMet: 1.907 ± 0.374
ArgAsn: 2.498 ± 0.428
ArgPro: 2.63 ± 0.618
ArgGln: 2.235 ± 0.415
ArgArg: 5.983 ± 0.759
ArgSer: 3.945 ± 0.538
ArgThr: 3.879 ± 0.442
ArgVal: 4.602 ± 0.503
ArgTrp: 0.789 ± 0.216
ArgTyr: 2.235 ± 0.384
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.048 ± 0.718
SerCys: 0.657 ± 0.216
SerAsp: 3.156 ± 0.439
SerGlu: 3.616 ± 0.513
SerPhe: 2.367 ± 0.356
SerGly: 6.706 ± 1.045
SerHis: 1.512 ± 0.396
SerIle: 3.419 ± 0.492
SerLys: 2.695 ± 0.486
SerLeu: 4.668 ± 0.534
SerMet: 1.446 ± 0.336
SerAsn: 1.512 ± 0.31
SerPro: 2.432 ± 0.436
SerGln: 2.169 ± 0.384
SerArg: 3.221 ± 0.482
SerSer: 4.142 ± 0.558
SerThr: 3.484 ± 0.569
SerVal: 4.142 ± 0.565
SerTrp: 0.723 ± 0.185
SerTyr: 1.512 ± 0.318
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.626 ± 0.924
ThrCys: 0.329 ± 0.137
ThrAsp: 3.55 ± 0.434
ThrGlu: 3.616 ± 0.451
ThrPhe: 1.841 ± 0.308
ThrGly: 5.588 ± 0.6
ThrHis: 0.789 ± 0.291
ThrIle: 3.156 ± 0.687
ThrLys: 2.958 ± 0.469
ThrLeu: 4.01 ± 0.602
ThrMet: 1.315 ± 0.249
ThrAsn: 2.169 ± 0.424
ThrPro: 2.761 ± 0.616
ThrGln: 1.841 ± 0.387
ThrArg: 3.484 ± 0.455
ThrSer: 3.616 ± 0.477
ThrThr: 4.47 ± 0.752
ThrVal: 4.47 ± 0.799
ThrTrp: 1.052 ± 0.295
ThrTyr: 1.315 ± 0.25
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.56 ± 0.677
ValCys: 0.526 ± 0.181
ValAsp: 3.747 ± 0.453
ValGlu: 5.259 ± 0.58
ValPhe: 1.381 ± 0.26
ValGly: 5.522 ± 0.743
ValHis: 1.118 ± 0.265
ValIle: 3.353 ± 0.51
ValLys: 2.958 ± 0.415
ValLeu: 4.799 ± 0.668
ValMet: 1.249 ± 0.33
ValAsn: 3.221 ± 0.392
ValPro: 4.339 ± 0.581
ValGln: 2.235 ± 0.363
ValArg: 4.339 ± 0.463
ValSer: 4.668 ± 0.623
ValThr: 4.207 ± 0.672
ValVal: 4.405 ± 0.623
ValTrp: 1.052 ± 0.288
ValTyr: 2.038 ± 0.352
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.972 ± 0.373
TrpCys: 0.394 ± 0.149
TrpAsp: 0.986 ± 0.245
TrpGlu: 1.052 ± 0.293
TrpPhe: 0.592 ± 0.209
TrpGly: 1.118 ± 0.23
TrpHis: 0.329 ± 0.13
TrpIle: 0.789 ± 0.196
TrpLys: 0.986 ± 0.253
TrpLeu: 1.446 ± 0.347
TrpMet: 0.131 ± 0.085
TrpAsn: 0.46 ± 0.19
TrpPro: 0.657 ± 0.234
TrpGln: 0.789 ± 0.218
TrpArg: 1.709 ± 0.453
TrpSer: 1.315 ± 0.286
TrpThr: 0.92 ± 0.289
TrpVal: 1.183 ± 0.289
TrpTrp: 0.263 ± 0.13
TrpTyr: 0.263 ± 0.126
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.432 ± 0.396
TyrCys: 0.46 ± 0.168
TyrAsp: 1.446 ± 0.278
TyrGlu: 1.972 ± 0.358
TyrPhe: 0.789 ± 0.227
TyrGly: 2.432 ± 0.384
TyrHis: 0.46 ± 0.139
TyrIle: 0.986 ± 0.219
TyrLys: 0.46 ± 0.172
TyrLeu: 1.775 ± 0.317
TyrMet: 0.46 ± 0.18
TyrAsn: 0.986 ± 0.282
TyrPro: 1.118 ± 0.283
TyrGln: 1.118 ± 0.313
TyrArg: 2.498 ± 0.421
TyrSer: 1.972 ± 0.411
TyrThr: 1.118 ± 0.247
TyrVal: 1.775 ± 0.259
TyrTrp: 0.723 ± 0.242
TyrTyr: 0.789 ± 0.223
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (15212 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
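One way such a protein-level bootstrap could be carried out is sketched below: whole proteins are resampled with replacement 100 times, the frequency of a given dipeptide is recomputed for each resample, and the spread of those estimates is reported. The function names, the use of the sample standard deviation as the error, the fixed random seed, and the one-letter input format are assumptions for this example; the page gives no further details of the procedure.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. "AA") in per mille over all proteins."""
    hits = total = 0
    for seq in proteins:
        # Overlapping adjacent pairs; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == pair)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Assumed error estimate: standard deviation over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not taken from the phage proteome.
toy_proteins = ["MAAK", "AAGA", "MKLV", "GAAV"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling at the protein level rather than at the residue level keeps each sequence intact, so the estimate reflects variation between proteins.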

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski