Amino acid dipeptide frequency for Pseudomonas phage MR4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
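As a rough illustration of the idea, the sketch below counts overlapping dipeptides in a list of one-letter protein sequences and compares each frequency with the uniform expectation of 2.5‰. The function names, the toy sequences, and the normalisation by the total number of dipeptides are assumptions made for this example; the database may compute its values differently.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq, expected=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the phage proteome.
toy_proteins = ["MAAGL", "MALU"]
freqs = dipeptide_per_mille(toy_proteins)
for dp, f in sorted(freqs.items()):
    print(dp, round(f, 3), classify(f))
```

With real data, the same comparison would mark a value such as 9.014‰ as overrepresented and 0.071‰ as underrepresented relative to 2.5‰.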

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.014AlaAla: 9.014 ± 1.565
0.71AlaCys: 0.71 ± 0.286
5.394AlaAsp: 5.394 ± 0.52
6.175AlaGlu: 6.175 ± 0.794
3.052AlaPhe: 3.052 ± 0.579
6.743AlaGly: 6.743 ± 0.91
1.349AlaHis: 1.349 ± 0.291
5.323AlaIle: 5.323 ± 0.676
5.891AlaLys: 5.891 ± 0.702
5.252AlaLeu: 5.252 ± 0.608
3.265AlaMet: 3.265 ± 0.435
4.188AlaAsn: 4.188 ± 0.495
2.413AlaPro: 2.413 ± 0.365
4.543AlaGln: 4.543 ± 0.701
3.549AlaArg: 3.549 ± 0.577
4.897AlaSer: 4.897 ± 0.527
5.394AlaThr: 5.394 ± 0.654
6.53AlaVal: 6.53 ± 0.733
1.207AlaTrp: 1.207 ± 0.261
3.265AlaTyr: 3.265 ± 0.401
0.0AlaXaa: 0.0 ± 0.0
Cys
0.355CysAla: 0.355 ± 0.169
0.426CysCys: 0.426 ± 0.299
0.71CysAsp: 0.71 ± 0.249
0.497CysGlu: 0.497 ± 0.164
0.568CysPhe: 0.568 ± 0.247
0.781CysGly: 0.781 ± 0.296
0.355CysHis: 0.355 ± 0.173
0.639CysIle: 0.639 ± 0.198
0.852CysLys: 0.852 ± 0.224
0.994CysLeu: 0.994 ± 0.252
0.426CysMet: 0.426 ± 0.201
0.426CysAsn: 0.426 ± 0.208
0.355CysPro: 0.355 ± 0.157
0.497CysGln: 0.497 ± 0.191
0.568CysArg: 0.568 ± 0.173
0.852CysSer: 0.852 ± 0.273
0.71CysThr: 0.71 ± 0.257
0.568CysVal: 0.568 ± 0.225
0.142CysTrp: 0.142 ± 0.104
0.497CysTyr: 0.497 ± 0.226
0.0CysXaa: 0.0 ± 0.0
Asp
5.607AspAla: 5.607 ± 0.51
0.71AspCys: 0.71 ± 0.234
4.117AspAsp: 4.117 ± 0.537
3.478AspGlu: 3.478 ± 0.478
2.839AspPhe: 2.839 ± 0.425
4.755AspGly: 4.755 ± 0.627
0.568AspHis: 0.568 ± 0.196
2.768AspIle: 2.768 ± 0.422
3.62AspLys: 3.62 ± 0.482
5.323AspLeu: 5.323 ± 0.518
1.632AspMet: 1.632 ± 0.364
2.91AspAsn: 2.91 ± 0.524
2.413AspPro: 2.413 ± 0.345
1.703AspGln: 1.703 ± 0.374
2.342AspArg: 2.342 ± 0.551
3.407AspSer: 3.407 ± 0.517
3.691AspThr: 3.691 ± 0.5
4.046AspVal: 4.046 ± 0.485
1.207AspTrp: 1.207 ± 0.346
3.052AspTyr: 3.052 ± 0.392
0.0AspXaa: 0.0 ± 0.0
Glu
6.601GluAla: 6.601 ± 1.09
0.568GluCys: 0.568 ± 0.203
3.336GluAsp: 3.336 ± 0.574
4.401GluGlu: 4.401 ± 0.657
2.2GluPhe: 2.2 ± 0.478
4.188GluGly: 4.188 ± 0.534
1.136GluHis: 1.136 ± 0.324
2.129GluIle: 2.129 ± 0.473
3.833GluLys: 3.833 ± 0.565
7.24GluLeu: 7.24 ± 0.867
1.916GluMet: 1.916 ± 0.292
1.491GluAsn: 1.491 ± 0.231
1.987GluPro: 1.987 ± 0.438
4.472GluGln: 4.472 ± 0.555
2.91GluArg: 2.91 ± 0.484
3.478GluSer: 3.478 ± 0.447
2.839GluThr: 2.839 ± 0.534
4.826GluVal: 4.826 ± 0.561
1.42GluTrp: 1.42 ± 0.414
2.342GluTyr: 2.342 ± 0.362
0.0GluXaa: 0.0 ± 0.0
Phe
2.2PheAla: 2.2 ± 0.422
0.71PheCys: 0.71 ± 0.217
2.271PheAsp: 2.271 ± 0.399
2.342PheGlu: 2.342 ± 0.42
1.774PhePhe: 1.774 ± 0.353
2.981PheGly: 2.981 ± 0.506
0.497PheHis: 0.497 ± 0.186
2.2PheIle: 2.2 ± 0.303
2.697PheLys: 2.697 ± 0.338
2.768PheLeu: 2.768 ± 0.413
1.207PheMet: 1.207 ± 0.336
2.697PheAsn: 2.697 ± 0.441
0.923PhePro: 0.923 ± 0.296
1.065PheGln: 1.065 ± 0.249
1.774PheArg: 1.774 ± 0.313
1.703PheSer: 1.703 ± 0.365
2.129PheThr: 2.129 ± 0.418
2.342PheVal: 2.342 ± 0.423
0.213PheTrp: 0.213 ± 0.118
1.278PheTyr: 1.278 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
5.11GlyAla: 5.11 ± 0.745
0.852GlyCys: 0.852 ± 0.25
5.323GlyAsp: 5.323 ± 0.665
4.401GlyGlu: 4.401 ± 0.56
2.839GlyPhe: 2.839 ± 0.317
4.968GlyGly: 4.968 ± 0.725
1.207GlyHis: 1.207 ± 0.369
4.188GlyIle: 4.188 ± 0.819
6.246GlyLys: 6.246 ± 0.886
6.53GlyLeu: 6.53 ± 0.753
2.2GlyMet: 2.2 ± 0.426
2.91GlyAsn: 2.91 ± 0.608
0.071GlyPro: 0.071 ± 0.068
2.555GlyGln: 2.555 ± 0.398
3.62GlyArg: 3.62 ± 0.511
4.259GlySer: 4.259 ± 0.64
4.259GlyThr: 4.259 ± 0.508
4.897GlyVal: 4.897 ± 0.661
1.278GlyTrp: 1.278 ± 0.253
3.336GlyTyr: 3.336 ± 0.566
0.0GlyXaa: 0.0 ± 0.0
His
0.781HisAla: 0.781 ± 0.219
0.355HisCys: 0.355 ± 0.163
1.774HisAsp: 1.774 ± 0.383
1.065HisGlu: 1.065 ± 0.26
0.355HisPhe: 0.355 ± 0.162
1.491HisGly: 1.491 ± 0.254
0.639HisHis: 0.639 ± 0.283
0.852HisIle: 0.852 ± 0.299
1.632HisLys: 1.632 ± 0.4
1.845HisLeu: 1.845 ± 0.384
0.852HisMet: 0.852 ± 0.277
0.568HisAsn: 0.568 ± 0.238
0.426HisPro: 0.426 ± 0.196
0.852HisGln: 0.852 ± 0.234
1.065HisArg: 1.065 ± 0.296
0.781HisSer: 0.781 ± 0.236
1.491HisThr: 1.491 ± 0.345
1.207HisVal: 1.207 ± 0.359
0.355HisTrp: 0.355 ± 0.157
0.923HisTyr: 0.923 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
4.826IleAla: 4.826 ± 0.581
0.355IleCys: 0.355 ± 0.171
2.839IleAsp: 2.839 ± 0.449
2.484IleGlu: 2.484 ± 0.369
1.632IlePhe: 1.632 ± 0.379
3.407IleGly: 3.407 ± 0.531
1.987IleHis: 1.987 ± 0.376
1.987IleIle: 1.987 ± 0.421
3.62IleLys: 3.62 ± 0.55
3.62IleLeu: 3.62 ± 0.455
1.278IleMet: 1.278 ± 0.261
2.484IleAsn: 2.484 ± 0.387
2.2IlePro: 2.2 ± 0.369
1.774IleGln: 1.774 ± 0.379
3.265IleArg: 3.265 ± 0.518
3.549IleSer: 3.549 ± 0.536
3.904IleThr: 3.904 ± 0.732
2.768IleVal: 2.768 ± 0.542
0.426IleTrp: 0.426 ± 0.17
1.42IleTyr: 1.42 ± 0.321
0.0IleXaa: 0.0 ± 0.0
Lys
6.53LysAla: 6.53 ± 0.915
0.781LysCys: 0.781 ± 0.235
4.188LysAsp: 4.188 ± 0.56
3.975LysGlu: 3.975 ± 0.528
2.342LysPhe: 2.342 ± 0.368
4.472LysGly: 4.472 ± 0.632
0.852LysHis: 0.852 ± 0.241
2.129LysIle: 2.129 ± 0.419
3.478LysLys: 3.478 ± 0.609
6.743LysLeu: 6.743 ± 0.807
2.484LysMet: 2.484 ± 0.467
2.413LysAsn: 2.413 ± 0.443
3.407LysPro: 3.407 ± 0.57
3.336LysGln: 3.336 ± 0.416
3.407LysArg: 3.407 ± 0.528
3.762LysSer: 3.762 ± 0.524
2.697LysThr: 2.697 ± 0.346
5.181LysVal: 5.181 ± 0.668
1.278LysTrp: 1.278 ± 0.281
2.484LysTyr: 2.484 ± 0.383
0.0LysXaa: 0.0 ± 0.0
Leu
9.298LeuAla: 9.298 ± 0.797
0.568LeuCys: 0.568 ± 0.213
5.11LeuAsp: 5.11 ± 0.502
6.246LeuGlu: 6.246 ± 0.842
2.555LeuPhe: 2.555 ± 0.409
5.749LeuGly: 5.749 ± 0.558
1.491LeuHis: 1.491 ± 0.37
3.904LeuIle: 3.904 ± 0.493
5.11LeuLys: 5.11 ± 0.455
5.962LeuLeu: 5.962 ± 0.886
3.123LeuMet: 3.123 ± 0.433
3.123LeuAsn: 3.123 ± 0.422
3.691LeuPro: 3.691 ± 0.418
3.691LeuGln: 3.691 ± 0.557
5.039LeuArg: 5.039 ± 0.643
6.814LeuSer: 6.814 ± 0.804
5.607LeuThr: 5.607 ± 0.642
6.104LeuVal: 6.104 ± 0.706
0.923LeuTrp: 0.923 ± 0.262
2.626LeuTyr: 2.626 ± 0.441
0.0LeuXaa: 0.0 ± 0.0
Met
2.839MetAla: 2.839 ± 0.295
0.284MetCys: 0.284 ± 0.155
1.562MetAsp: 1.562 ± 0.275
2.2MetGlu: 2.2 ± 0.575
1.136MetPhe: 1.136 ± 0.3
2.058MetGly: 2.058 ± 0.322
0.852MetHis: 0.852 ± 0.308
1.207MetIle: 1.207 ± 0.264
2.342MetLys: 2.342 ± 0.325
2.555MetLeu: 2.555 ± 0.366
0.994MetMet: 0.994 ± 0.237
1.349MetAsn: 1.349 ± 0.265
1.349MetPro: 1.349 ± 0.281
1.845MetGln: 1.845 ± 0.381
1.703MetArg: 1.703 ± 0.36
2.484MetSer: 2.484 ± 0.429
1.987MetThr: 1.987 ± 0.416
1.774MetVal: 1.774 ± 0.362
0.639MetTrp: 0.639 ± 0.239
1.207MetTyr: 1.207 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
2.484AsnAla: 2.484 ± 0.37
0.213AsnCys: 0.213 ± 0.152
1.916AsnAsp: 1.916 ± 0.358
2.2AsnGlu: 2.2 ± 0.354
1.491AsnPhe: 1.491 ± 0.364
4.046AsnGly: 4.046 ± 0.856
0.497AsnHis: 0.497 ± 0.223
2.697AsnIle: 2.697 ± 0.412
3.549AsnLys: 3.549 ± 0.569
4.046AsnLeu: 4.046 ± 0.621
1.703AsnMet: 1.703 ± 0.308
2.2AsnAsn: 2.2 ± 0.432
2.129AsnPro: 2.129 ± 0.397
1.774AsnGln: 1.774 ± 0.301
2.91AsnArg: 2.91 ± 0.447
2.342AsnSer: 2.342 ± 0.433
2.484AsnThr: 2.484 ± 0.409
2.839AsnVal: 2.839 ± 0.591
0.781AsnTrp: 0.781 ± 0.22
1.845AsnTyr: 1.845 ± 0.283
0.0AsnXaa: 0.0 ± 0.0
Pro
3.904ProAla: 3.904 ± 0.463
0.497ProCys: 0.497 ± 0.206
2.413ProAsp: 2.413 ± 0.363
3.478ProGlu: 3.478 ± 0.55
0.923ProPhe: 0.923 ± 0.271
0.213ProGly: 0.213 ± 0.107
0.994ProHis: 0.994 ± 0.251
2.129ProIle: 2.129 ± 0.702
1.632ProLys: 1.632 ± 0.313
2.484ProLeu: 2.484 ± 0.392
0.923ProMet: 0.923 ± 0.261
1.632ProAsn: 1.632 ± 0.351
1.562ProPro: 1.562 ± 0.302
1.987ProGln: 1.987 ± 0.474
0.781ProArg: 0.781 ± 0.242
2.271ProSer: 2.271 ± 0.404
2.768ProThr: 2.768 ± 0.389
3.194ProVal: 3.194 ± 0.521
0.497ProTrp: 0.497 ± 0.195
1.774ProTyr: 1.774 ± 0.45
0.0ProXaa: 0.0 ± 0.0
Gln
5.323GlnAla: 5.323 ± 0.678
0.284GlnCys: 0.284 ± 0.148
1.987GlnAsp: 1.987 ± 0.393
3.194GlnGlu: 3.194 ± 0.412
1.562GlnPhe: 1.562 ± 0.296
2.484GlnGly: 2.484 ± 0.433
0.71GlnHis: 0.71 ± 0.243
1.774GlnIle: 1.774 ± 0.359
2.413GlnLys: 2.413 ± 0.458
5.891GlnLeu: 5.891 ± 0.654
1.562GlnMet: 1.562 ± 0.368
1.562GlnAsn: 1.562 ± 0.399
1.562GlnPro: 1.562 ± 0.322
3.549GlnGln: 3.549 ± 0.586
1.916GlnArg: 1.916 ± 0.381
3.194GlnSer: 3.194 ± 0.665
1.845GlnThr: 1.845 ± 0.353
3.265GlnVal: 3.265 ± 0.465
0.639GlnTrp: 0.639 ± 0.235
1.207GlnTyr: 1.207 ± 0.264
0.0GlnXaa: 0.0 ± 0.0
Arg
3.975ArgAla: 3.975 ± 0.73
0.497ArgCys: 0.497 ± 0.202
2.697ArgAsp: 2.697 ± 0.486
3.336ArgGlu: 3.336 ± 0.586
1.916ArgPhe: 1.916 ± 0.284
4.259ArgGly: 4.259 ± 0.476
1.065ArgHis: 1.065 ± 0.267
2.91ArgIle: 2.91 ± 0.37
3.194ArgLys: 3.194 ± 0.454
4.33ArgLeu: 4.33 ± 0.482
1.703ArgMet: 1.703 ± 0.305
1.987ArgAsn: 1.987 ± 0.334
1.774ArgPro: 1.774 ± 0.31
1.987ArgGln: 1.987 ± 0.452
2.129ArgArg: 2.129 ± 0.417
3.052ArgSer: 3.052 ± 0.444
1.845ArgThr: 1.845 ± 0.307
2.839ArgVal: 2.839 ± 0.505
0.852ArgTrp: 0.852 ± 0.26
2.2ArgTyr: 2.2 ± 0.336
0.0ArgXaa: 0.0 ± 0.0
Ser
4.046SerAla: 4.046 ± 0.68
0.852SerCys: 0.852 ± 0.255
3.691SerAsp: 3.691 ± 0.509
3.904SerGlu: 3.904 ± 0.476
1.987SerPhe: 1.987 ± 0.334
5.394SerGly: 5.394 ± 0.491
0.994SerHis: 0.994 ± 0.242
3.336SerIle: 3.336 ± 0.485
3.833SerLys: 3.833 ± 0.531
6.175SerLeu: 6.175 ± 0.766
2.129SerMet: 2.129 ± 0.365
2.626SerAsn: 2.626 ± 0.538
2.626SerPro: 2.626 ± 0.545
2.555SerGln: 2.555 ± 0.322
3.549SerArg: 3.549 ± 0.539
4.33SerSer: 4.33 ± 0.817
3.052SerThr: 3.052 ± 0.444
3.833SerVal: 3.833 ± 0.553
1.065SerTrp: 1.065 ± 0.318
2.271SerTyr: 2.271 ± 0.352
0.0SerXaa: 0.0 ± 0.0
Thr
5.252ThrAla: 5.252 ± 0.571
0.568ThrCys: 0.568 ± 0.187
3.975ThrAsp: 3.975 ± 0.507
3.336ThrGlu: 3.336 ± 0.601
1.987ThrPhe: 1.987 ± 0.29
4.259ThrGly: 4.259 ± 0.538
1.42ThrHis: 1.42 ± 0.294
2.839ThrIle: 2.839 ± 0.415
4.188ThrLys: 4.188 ± 0.591
4.826ThrLeu: 4.826 ± 0.609
1.774ThrMet: 1.774 ± 0.335
2.2ThrAsn: 2.2 ± 0.344
2.626ThrPro: 2.626 ± 0.473
2.91ThrGln: 2.91 ± 0.475
1.987ThrArg: 1.987 ± 0.337
3.478ThrSer: 3.478 ± 0.449
4.33ThrThr: 4.33 ± 0.522
3.478ThrVal: 3.478 ± 0.39
0.426ThrTrp: 0.426 ± 0.157
2.626ThrTyr: 2.626 ± 0.517
0.0ThrXaa: 0.0 ± 0.0
Val
5.678ValAla: 5.678 ± 0.63
0.923ValCys: 0.923 ± 0.273
3.691ValAsp: 3.691 ± 0.468
4.188ValGlu: 4.188 ± 0.52
2.413ValPhe: 2.413 ± 0.502
5.039ValGly: 5.039 ± 0.578
1.562ValHis: 1.562 ± 0.359
3.762ValIle: 3.762 ± 0.521
4.755ValLys: 4.755 ± 0.569
5.891ValLeu: 5.891 ± 0.631
1.916ValMet: 1.916 ± 0.403
4.543ValAsn: 4.543 ± 0.572
2.484ValPro: 2.484 ± 0.395
2.555ValGln: 2.555 ± 0.461
2.981ValArg: 2.981 ± 0.598
4.259ValSer: 4.259 ± 0.512
4.685ValThr: 4.685 ± 0.756
4.685ValVal: 4.685 ± 0.576
0.781ValTrp: 0.781 ± 0.322
1.845ValTyr: 1.845 ± 0.466
0.0ValXaa: 0.0 ± 0.0
Trp
1.065TrpAla: 1.065 ± 0.311
0.426TrpCys: 0.426 ± 0.169
0.923TrpAsp: 0.923 ± 0.308
0.639TrpGlu: 0.639 ± 0.182
0.639TrpPhe: 0.639 ± 0.2
0.852TrpGly: 0.852 ± 0.266
0.497TrpHis: 0.497 ± 0.156
0.568TrpIle: 0.568 ± 0.195
0.923TrpLys: 0.923 ± 0.235
1.774TrpLeu: 1.774 ± 0.317
0.426TrpMet: 0.426 ± 0.189
0.71TrpAsn: 0.71 ± 0.206
0.497TrpPro: 0.497 ± 0.228
0.71TrpGln: 0.71 ± 0.201
0.923TrpArg: 0.923 ± 0.249
1.136TrpSer: 1.136 ± 0.289
0.568TrpThr: 0.568 ± 0.186
0.923TrpVal: 0.923 ± 0.272
0.071TrpTrp: 0.071 ± 0.069
0.284TrpTyr: 0.284 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.336TyrAla: 3.336 ± 0.409
0.639TyrCys: 0.639 ± 0.23
2.271TyrAsp: 2.271 ± 0.414
1.774TyrGlu: 1.774 ± 0.315
1.562TyrPhe: 1.562 ± 0.303
2.91TyrGly: 2.91 ± 0.475
0.639TyrHis: 0.639 ± 0.255
2.484TyrIle: 2.484 ± 0.498
2.129TyrLys: 2.129 ± 0.438
2.484TyrLeu: 2.484 ± 0.391
0.781TyrMet: 0.781 ± 0.285
2.129TyrAsn: 2.129 ± 0.403
1.562TyrPro: 1.562 ± 0.426
1.562TyrGln: 1.562 ± 0.274
2.129TyrArg: 2.129 ± 0.435
2.2TyrSer: 2.2 ± 0.43
2.2TyrThr: 2.2 ± 0.335
3.336TyrVal: 3.336 ± 0.639
0.355TyrTrp: 0.355 ± 0.17
1.065TyrTyr: 1.065 ± 0.319
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (14090 amino acids)
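Each entry in the table above carries a dipeptide name, a per mille value, and its error in the form "AlaAla: 9.014 ± 1.565". The following is a minimal sketch of how such entries could be read and converted to plain fractions; the regular expression and the helper names are assumptions for this example and are not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_entry(line):
    """Return (first, second, per_mille, error) or None if the line is not an entry."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction by multiplying by 10^-3."""
    return value * 1e-3


# A few lines copied from the table above.
lines = [
    "Glu",
    "7.24GluLeu: 7.24 ± 0.867",
    "0.071GlyPro: 0.071 ± 0.068",
]
entries = [e for e in map(parse_entry, lines) if e is not None]
for first, second, value, error in entries:
    print(first + second, per_mille_to_fraction(value))
```

Row labels such as "Glu" contain no colon and are skipped, so the same loop can run over the whole table.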

Note: The error has been estimated by bootstrapping (×100) at the protein level.
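Bootstrapping at the protein level can be pictured as resampling whole proteins with replacement and recomputing the statistic on each resample. The sketch below does this 100 times for a single dipeptide and reports the standard deviation of the replicates. Treating the error as that standard deviation, as well as the function names, the fixed seed, and the toy sequences, are assumptions of this example; the source does not specify these details.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy input, not taken from the phage proteome.
toy_proteins = ["MAAGLAA", "MKLVAA", "MGGSTK", "MAALLE"]
point = dipeptide_frequency(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error.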

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski