Amino acid dipepetide frequency for Escherichia phage phiEcoM-GJ1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.502AlaAla: 8.502 ± 1.016
1.023AlaCys: 1.023 ± 0.279
4.091AlaAsp: 4.091 ± 0.434
5.306AlaGlu: 5.306 ± 0.586
2.877AlaPhe: 2.877 ± 0.409
5.945AlaGly: 5.945 ± 0.796
1.342AlaHis: 1.342 ± 0.377
5.625AlaIle: 5.625 ± 0.787
5.689AlaLys: 5.689 ± 0.629
6.2AlaLeu: 6.2 ± 0.713
2.621AlaMet: 2.621 ± 0.45
4.283AlaAsn: 4.283 ± 0.447
2.301AlaPro: 2.301 ± 0.437
2.94AlaGln: 2.94 ± 0.54
4.411AlaArg: 4.411 ± 0.857
5.114AlaSer: 5.114 ± 0.599
5.114AlaThr: 5.114 ± 0.562
5.433AlaVal: 5.433 ± 0.551
0.895AlaTrp: 0.895 ± 0.171
2.94AlaTyr: 2.94 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.511CysAla: 0.511 ± 0.181
0.064CysCys: 0.064 ± 0.054
0.639CysAsp: 0.639 ± 0.304
0.511CysGlu: 0.511 ± 0.186
0.511CysPhe: 0.511 ± 0.153
0.831CysGly: 0.831 ± 0.19
0.384CysHis: 0.384 ± 0.174
0.639CysIle: 0.639 ± 0.209
0.703CysLys: 0.703 ± 0.246
0.639CysLeu: 0.639 ± 0.213
0.064CysMet: 0.064 ± 0.063
0.703CysAsn: 0.703 ± 0.241
0.447CysPro: 0.447 ± 0.198
0.384CysGln: 0.384 ± 0.165
0.703CysArg: 0.703 ± 0.226
0.767CysSer: 0.767 ± 0.214
1.023CysThr: 1.023 ± 0.26
0.895CysVal: 0.895 ± 0.275
0.192CysTrp: 0.192 ± 0.115
0.32CysTyr: 0.32 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
4.986AspAla: 4.986 ± 0.654
1.023AspCys: 1.023 ± 0.278
2.813AspAsp: 2.813 ± 0.483
3.771AspGlu: 3.771 ± 0.609
2.813AspPhe: 2.813 ± 0.391
5.369AspGly: 5.369 ± 0.522
1.087AspHis: 1.087 ± 0.237
3.388AspIle: 3.388 ± 0.599
3.58AspLys: 3.58 ± 0.612
4.73AspLeu: 4.73 ± 0.541
1.342AspMet: 1.342 ± 0.297
2.493AspAsn: 2.493 ± 0.401
2.749AspPro: 2.749 ± 0.472
0.703AspGln: 0.703 ± 0.164
2.301AspArg: 2.301 ± 0.425
4.73AspSer: 4.73 ± 0.611
2.877AspThr: 2.877 ± 0.332
4.538AspVal: 4.538 ± 0.494
1.215AspTrp: 1.215 ± 0.288
2.301AspTyr: 2.301 ± 0.411
0.0AspXaa: 0.0 ± 0.0
Glu
6.648GluAla: 6.648 ± 0.79
0.511GluCys: 0.511 ± 0.216
3.26GluAsp: 3.26 ± 0.552
4.666GluGlu: 4.666 ± 0.482
2.749GluPhe: 2.749 ± 0.312
3.324GluGly: 3.324 ± 0.4
1.47GluHis: 1.47 ± 0.252
3.963GluIle: 3.963 ± 0.534
3.068GluLys: 3.068 ± 0.45
6.2GluLeu: 6.2 ± 0.699
1.342GluMet: 1.342 ± 0.287
1.854GluAsn: 1.854 ± 0.361
1.534GluPro: 1.534 ± 0.309
2.685GluGln: 2.685 ± 0.404
3.58GluArg: 3.58 ± 0.6
3.707GluSer: 3.707 ± 0.483
3.58GluThr: 3.58 ± 0.463
4.666GluVal: 4.666 ± 0.674
0.767GluTrp: 0.767 ± 0.184
2.749GluTyr: 2.749 ± 0.436
0.0GluXaa: 0.0 ± 0.0
Phe
3.132PheAla: 3.132 ± 0.433
0.384PheCys: 0.384 ± 0.152
3.324PheAsp: 3.324 ± 0.399
2.749PheGlu: 2.749 ± 0.348
1.278PhePhe: 1.278 ± 0.349
3.132PheGly: 3.132 ± 0.42
0.767PheHis: 0.767 ± 0.193
2.301PheIle: 2.301 ± 0.353
3.132PheLys: 3.132 ± 0.476
2.365PheLeu: 2.365 ± 0.408
1.151PheMet: 1.151 ± 0.244
2.237PheAsn: 2.237 ± 0.436
1.406PhePro: 1.406 ± 0.266
1.087PheGln: 1.087 ± 0.28
2.109PheArg: 2.109 ± 0.381
2.685PheSer: 2.685 ± 0.392
2.621PheThr: 2.621 ± 0.39
2.749PheVal: 2.749 ± 0.399
0.511PheTrp: 0.511 ± 0.174
1.215PheTyr: 1.215 ± 0.31
0.0PheXaa: 0.0 ± 0.0
Gly
5.433GlyAla: 5.433 ± 0.731
0.767GlyCys: 0.767 ± 0.236
4.602GlyAsp: 4.602 ± 0.669
4.411GlyGlu: 4.411 ± 0.415
3.196GlyPhe: 3.196 ± 0.424
5.689GlyGly: 5.689 ± 0.823
1.406GlyHis: 1.406 ± 0.326
5.178GlyIle: 5.178 ± 0.505
4.475GlyLys: 4.475 ± 0.546
4.794GlyLeu: 4.794 ± 0.575
1.726GlyMet: 1.726 ± 0.502
3.771GlyAsn: 3.771 ± 0.491
1.726GlyPro: 1.726 ± 0.318
2.237GlyGln: 2.237 ± 0.296
3.452GlyArg: 3.452 ± 0.421
5.753GlySer: 5.753 ± 0.833
5.242GlyThr: 5.242 ± 0.857
5.242GlyVal: 5.242 ± 0.541
1.342GlyTrp: 1.342 ± 0.298
3.26GlyTyr: 3.26 ± 0.508
0.0GlyXaa: 0.0 ± 0.0
His
1.278HisAla: 1.278 ± 0.244
0.192HisCys: 0.192 ± 0.105
1.342HisAsp: 1.342 ± 0.313
1.406HisGlu: 1.406 ± 0.259
0.767HisPhe: 0.767 ± 0.194
1.598HisGly: 1.598 ± 0.322
0.639HisHis: 0.639 ± 0.207
1.151HisIle: 1.151 ± 0.32
1.151HisLys: 1.151 ± 0.36
0.895HisLeu: 0.895 ± 0.229
0.831HisMet: 0.831 ± 0.261
1.278HisAsn: 1.278 ± 0.317
0.767HisPro: 0.767 ± 0.209
0.959HisGln: 0.959 ± 0.236
1.023HisArg: 1.023 ± 0.256
1.215HisSer: 1.215 ± 0.361
1.342HisThr: 1.342 ± 0.573
1.151HisVal: 1.151 ± 0.245
0.256HisTrp: 0.256 ± 0.132
1.023HisTyr: 1.023 ± 0.282
0.0HisXaa: 0.0 ± 0.0
Ile
4.283IleAla: 4.283 ± 0.559
0.575IleCys: 0.575 ± 0.197
4.027IleAsp: 4.027 ± 0.608
4.347IleGlu: 4.347 ± 0.602
2.173IlePhe: 2.173 ± 0.43
4.283IleGly: 4.283 ± 0.658
1.087IleHis: 1.087 ± 0.285
3.196IleIle: 3.196 ± 0.457
5.178IleLys: 5.178 ± 0.721
3.963IleLeu: 3.963 ± 0.581
2.046IleMet: 2.046 ± 0.342
3.004IleAsn: 3.004 ± 0.577
2.301IlePro: 2.301 ± 0.463
2.94IleGln: 2.94 ± 0.413
2.94IleArg: 2.94 ± 0.426
4.027IleSer: 4.027 ± 0.484
4.538IleThr: 4.538 ± 0.462
3.899IleVal: 3.899 ± 0.571
0.703IleTrp: 0.703 ± 0.179
2.046IleTyr: 2.046 ± 0.391
0.0IleXaa: 0.0 ± 0.0
Lys
5.05LysAla: 5.05 ± 0.613
1.023LysCys: 1.023 ± 0.286
3.644LysAsp: 3.644 ± 0.483
3.644LysGlu: 3.644 ± 0.565
2.621LysPhe: 2.621 ± 0.434
3.388LysGly: 3.388 ± 0.444
1.854LysHis: 1.854 ± 0.384
3.068LysIle: 3.068 ± 0.438
2.877LysLys: 2.877 ± 0.453
6.073LysLeu: 6.073 ± 0.731
2.046LysMet: 2.046 ± 0.359
2.493LysAsn: 2.493 ± 0.473
2.813LysPro: 2.813 ± 0.514
2.685LysGln: 2.685 ± 0.366
2.493LysArg: 2.493 ± 0.36
4.219LysSer: 4.219 ± 0.429
3.26LysThr: 3.26 ± 0.574
4.986LysVal: 4.986 ± 0.732
0.959LysTrp: 0.959 ± 0.24
2.109LysTyr: 2.109 ± 0.352
0.0LysXaa: 0.0 ± 0.0
Leu
6.584LeuAla: 6.584 ± 0.53
0.831LeuCys: 0.831 ± 0.264
4.283LeuAsp: 4.283 ± 0.617
5.753LeuGlu: 5.753 ± 0.736
2.94LeuPhe: 2.94 ± 0.401
5.242LeuGly: 5.242 ± 0.676
1.534LeuHis: 1.534 ± 0.314
3.707LeuIle: 3.707 ± 0.451
4.73LeuLys: 4.73 ± 0.674
5.242LeuLeu: 5.242 ± 0.603
2.429LeuMet: 2.429 ± 0.318
4.666LeuAsn: 4.666 ± 0.586
3.452LeuPro: 3.452 ± 0.473
3.388LeuGln: 3.388 ± 0.503
3.835LeuArg: 3.835 ± 0.447
5.306LeuSer: 5.306 ± 0.619
4.73LeuThr: 4.73 ± 0.455
5.114LeuVal: 5.114 ± 0.603
0.767LeuTrp: 0.767 ± 0.182
2.301LeuTyr: 2.301 ± 0.329
0.0LeuXaa: 0.0 ± 0.0
Met
3.324MetAla: 3.324 ± 0.515
0.128MetCys: 0.128 ± 0.101
1.215MetAsp: 1.215 ± 0.225
1.534MetGlu: 1.534 ± 0.316
1.215MetPhe: 1.215 ± 0.244
1.662MetGly: 1.662 ± 0.383
0.639MetHis: 0.639 ± 0.195
1.151MetIle: 1.151 ± 0.215
2.237MetLys: 2.237 ± 0.413
2.046MetLeu: 2.046 ± 0.371
0.767MetMet: 0.767 ± 0.219
1.534MetAsn: 1.534 ± 0.461
1.406MetPro: 1.406 ± 0.346
1.726MetGln: 1.726 ± 0.356
1.215MetArg: 1.215 ± 0.309
2.173MetSer: 2.173 ± 0.352
1.47MetThr: 1.47 ± 0.32
1.534MetVal: 1.534 ± 0.319
0.447MetTrp: 0.447 ± 0.179
1.087MetTyr: 1.087 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
4.219AsnAla: 4.219 ± 0.509
0.703AsnCys: 0.703 ± 0.214
2.877AsnAsp: 2.877 ± 0.463
2.557AsnGlu: 2.557 ± 0.399
1.918AsnPhe: 1.918 ± 0.417
4.091AsnGly: 4.091 ± 0.632
0.959AsnHis: 0.959 ± 0.28
3.068AsnIle: 3.068 ± 0.461
2.813AsnLys: 2.813 ± 0.385
3.707AsnLeu: 3.707 ± 0.398
1.215AsnMet: 1.215 ± 0.292
2.685AsnAsn: 2.685 ± 0.439
3.004AsnPro: 3.004 ± 0.378
1.918AsnGln: 1.918 ± 0.349
2.429AsnArg: 2.429 ± 0.391
3.388AsnSer: 3.388 ± 0.514
3.196AsnThr: 3.196 ± 0.456
3.068AsnVal: 3.068 ± 0.489
0.447AsnTrp: 0.447 ± 0.162
1.598AsnTyr: 1.598 ± 0.326
0.0AsnXaa: 0.0 ± 0.0
Pro
3.068ProAla: 3.068 ± 0.406
0.32ProCys: 0.32 ± 0.131
2.173ProAsp: 2.173 ± 0.383
3.388ProGlu: 3.388 ± 0.382
1.215ProPhe: 1.215 ± 0.23
2.877ProGly: 2.877 ± 0.494
0.831ProHis: 0.831 ± 0.25
1.726ProIle: 1.726 ± 0.399
1.982ProLys: 1.982 ± 0.377
2.621ProLeu: 2.621 ± 0.446
0.831ProMet: 0.831 ± 0.185
2.365ProAsn: 2.365 ± 0.366
0.831ProPro: 0.831 ± 0.208
1.215ProGln: 1.215 ± 0.322
1.79ProArg: 1.79 ± 0.378
3.388ProSer: 3.388 ± 0.409
2.237ProThr: 2.237 ± 0.328
2.749ProVal: 2.749 ± 0.37
0.575ProTrp: 0.575 ± 0.162
1.406ProTyr: 1.406 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
3.196GlnAla: 3.196 ± 0.394
0.384GlnCys: 0.384 ± 0.193
1.662GlnAsp: 1.662 ± 0.233
2.237GlnGlu: 2.237 ± 0.358
1.598GlnPhe: 1.598 ± 0.313
1.726GlnGly: 1.726 ± 0.315
0.639GlnHis: 0.639 ± 0.225
3.26GlnIle: 3.26 ± 0.45
1.726GlnLys: 1.726 ± 0.312
3.516GlnLeu: 3.516 ± 0.449
1.215GlnMet: 1.215 ± 0.262
1.534GlnAsn: 1.534 ± 0.275
1.598GlnPro: 1.598 ± 0.296
2.365GlnGln: 2.365 ± 0.646
2.429GlnArg: 2.429 ± 0.395
2.109GlnSer: 2.109 ± 0.329
1.79GlnThr: 1.79 ± 0.377
2.94GlnVal: 2.94 ± 0.419
0.703GlnTrp: 0.703 ± 0.201
1.726GlnTyr: 1.726 ± 0.258
0.0GlnXaa: 0.0 ± 0.0
Arg
3.26ArgAla: 3.26 ± 0.397
0.639ArgCys: 0.639 ± 0.236
2.813ArgAsp: 2.813 ± 0.357
2.877ArgGlu: 2.877 ± 0.367
2.173ArgPhe: 2.173 ± 0.428
3.004ArgGly: 3.004 ± 0.558
0.511ArgHis: 0.511 ± 0.196
3.58ArgIle: 3.58 ± 0.443
3.324ArgLys: 3.324 ± 0.384
4.027ArgLeu: 4.027 ± 0.477
1.278ArgMet: 1.278 ± 0.35
2.493ArgAsn: 2.493 ± 0.479
1.982ArgPro: 1.982 ± 0.388
1.726ArgGln: 1.726 ± 0.327
1.918ArgArg: 1.918 ± 0.422
3.132ArgSer: 3.132 ± 0.439
2.365ArgThr: 2.365 ± 0.358
3.771ArgVal: 3.771 ± 0.507
0.831ArgTrp: 0.831 ± 0.214
2.237ArgTyr: 2.237 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
5.369SerAla: 5.369 ± 0.716
0.384SerCys: 0.384 ± 0.144
5.178SerAsp: 5.178 ± 0.422
3.58SerGlu: 3.58 ± 0.488
2.877SerPhe: 2.877 ± 0.416
6.392SerGly: 6.392 ± 0.852
1.726SerHis: 1.726 ± 0.348
5.242SerIle: 5.242 ± 0.762
3.707SerLys: 3.707 ± 0.563
5.753SerLeu: 5.753 ± 0.542
1.918SerMet: 1.918 ± 0.317
2.749SerAsn: 2.749 ± 0.518
2.493SerPro: 2.493 ± 0.453
2.749SerGln: 2.749 ± 0.502
3.004SerArg: 3.004 ± 0.383
4.73SerSer: 4.73 ± 0.7
3.899SerThr: 3.899 ± 0.649
3.963SerVal: 3.963 ± 0.478
0.575SerTrp: 0.575 ± 0.206
2.493SerTyr: 2.493 ± 0.379
0.0SerXaa: 0.0 ± 0.0
Thr
4.858ThrAla: 4.858 ± 0.65
0.447ThrCys: 0.447 ± 0.14
2.94ThrAsp: 2.94 ± 0.369
2.877ThrGlu: 2.877 ± 0.362
2.877ThrPhe: 2.877 ± 0.494
6.009ThrGly: 6.009 ± 0.681
1.151ThrHis: 1.151 ± 0.384
4.283ThrIle: 4.283 ± 0.644
3.132ThrLys: 3.132 ± 0.511
4.922ThrLeu: 4.922 ± 0.61
1.598ThrMet: 1.598 ± 0.235
2.557ThrAsn: 2.557 ± 0.519
2.109ThrPro: 2.109 ± 0.319
2.429ThrGln: 2.429 ± 0.318
2.685ThrArg: 2.685 ± 0.387
4.283ThrSer: 4.283 ± 0.707
3.644ThrThr: 3.644 ± 0.53
5.306ThrVal: 5.306 ± 0.628
0.959ThrTrp: 0.959 ± 0.247
1.918ThrTyr: 1.918 ± 0.362
0.0ThrXaa: 0.0 ± 0.0
Val
5.945ValAla: 5.945 ± 0.494
1.151ValCys: 1.151 ± 0.236
4.666ValAsp: 4.666 ± 0.527
3.58ValGlu: 3.58 ± 0.509
2.749ValPhe: 2.749 ± 0.45
5.178ValGly: 5.178 ± 0.456
1.278ValHis: 1.278 ± 0.322
4.027ValIle: 4.027 ± 0.57
4.602ValLys: 4.602 ± 0.636
5.05ValLeu: 5.05 ± 0.537
2.109ValMet: 2.109 ± 0.298
3.899ValAsn: 3.899 ± 0.512
2.877ValPro: 2.877 ± 0.48
2.046ValGln: 2.046 ± 0.381
3.388ValArg: 3.388 ± 0.479
4.475ValSer: 4.475 ± 0.604
4.794ValThr: 4.794 ± 0.607
4.091ValVal: 4.091 ± 0.633
1.278ValTrp: 1.278 ± 0.305
1.918ValTyr: 1.918 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
0.895TrpAla: 0.895 ± 0.181
0.192TrpCys: 0.192 ± 0.108
0.511TrpAsp: 0.511 ± 0.241
1.534TrpGlu: 1.534 ± 0.348
0.447TrpPhe: 0.447 ± 0.139
0.575TrpGly: 0.575 ± 0.156
0.192TrpHis: 0.192 ± 0.111
0.575TrpIle: 0.575 ± 0.223
0.895TrpLys: 0.895 ± 0.263
0.959TrpLeu: 0.959 ± 0.233
0.32TrpMet: 0.32 ± 0.122
0.831TrpAsn: 0.831 ± 0.186
0.256TrpPro: 0.256 ± 0.107
0.767TrpGln: 0.767 ± 0.189
0.511TrpArg: 0.511 ± 0.186
1.215TrpSer: 1.215 ± 0.238
1.023TrpThr: 1.023 ± 0.247
1.215TrpVal: 1.215 ± 0.301
0.128TrpTrp: 0.128 ± 0.089
0.767TrpTyr: 0.767 ± 0.235
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.109TyrAla: 2.109 ± 0.302
0.192TyrCys: 0.192 ± 0.095
2.749TyrAsp: 2.749 ± 0.39
1.598TyrGlu: 1.598 ± 0.321
1.342TyrPhe: 1.342 ± 0.314
3.388TyrGly: 3.388 ± 0.436
0.767TyrHis: 0.767 ± 0.247
2.429TyrIle: 2.429 ± 0.468
2.301TyrLys: 2.301 ± 0.489
3.196TyrLeu: 3.196 ± 0.549
1.598TyrMet: 1.598 ± 0.387
2.429TyrAsn: 2.429 ± 0.305
1.534TyrPro: 1.534 ± 0.256
1.406TyrGln: 1.406 ± 0.244
1.726TyrArg: 1.726 ± 0.318
2.365TyrSer: 2.365 ± 0.494
2.237TyrThr: 2.237 ± 0.326
1.79TyrVal: 1.79 ± 0.346
0.256TyrTrp: 0.256 ± 0.119
0.959TyrTyr: 0.959 ± 0.214
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (15645 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski