Amino acid dipepetide frequency for Cellulophaga phage phi40:1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.414AlaAla: 4.414 ± 0.824
0.501AlaCys: 0.501 ± 0.151
3.731AlaAsp: 3.731 ± 0.411
3.822AlaGlu: 3.822 ± 0.448
2.73AlaPhe: 2.73 ± 0.345
3.367AlaGly: 3.367 ± 0.434
0.865AlaHis: 0.865 ± 0.297
4.323AlaIle: 4.323 ± 0.409
5.824AlaLys: 5.824 ± 0.857
4.505AlaLeu: 4.505 ± 0.355
1.092AlaMet: 1.092 ± 0.234
3.231AlaAsn: 3.231 ± 0.371
1.866AlaPro: 1.866 ± 0.219
1.638AlaGln: 1.638 ± 0.318
2.548AlaArg: 2.548 ± 0.41
3.959AlaSer: 3.959 ± 0.532
3.549AlaThr: 3.549 ± 0.368
2.548AlaVal: 2.548 ± 0.374
0.546AlaTrp: 0.546 ± 0.142
3.14AlaTyr: 3.14 ± 0.406
0.0AlaXaa: 0.0 ± 0.0
Cys
0.319CysAla: 0.319 ± 0.13
0.228CysCys: 0.228 ± 0.109
0.455CysAsp: 0.455 ± 0.185
0.546CysGlu: 0.546 ± 0.155
0.546CysPhe: 0.546 ± 0.164
0.683CysGly: 0.683 ± 0.187
0.364CysHis: 0.364 ± 0.135
0.956CysIle: 0.956 ± 0.274
0.501CysLys: 0.501 ± 0.174
0.728CysLeu: 0.728 ± 0.205
0.228CysMet: 0.228 ± 0.1
0.455CysAsn: 0.455 ± 0.157
0.228CysPro: 0.228 ± 0.099
0.364CysGln: 0.364 ± 0.126
0.319CysArg: 0.319 ± 0.128
0.819CysSer: 0.819 ± 0.259
0.455CysThr: 0.455 ± 0.13
0.774CysVal: 0.774 ± 0.168
0.091CysTrp: 0.091 ± 0.061
0.319CysTyr: 0.319 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
3.868AspAla: 3.868 ± 0.407
0.501AspCys: 0.501 ± 0.184
4.55AspAsp: 4.55 ± 0.461
5.005AspGlu: 5.005 ± 0.598
3.094AspPhe: 3.094 ± 0.349
5.506AspGly: 5.506 ± 0.643
0.774AspHis: 0.774 ± 0.2
6.006AspIle: 6.006 ± 0.639
5.415AspLys: 5.415 ± 0.481
5.506AspLeu: 5.506 ± 0.595
1.547AspMet: 1.547 ± 0.278
3.549AspAsn: 3.549 ± 0.41
2.184AspPro: 2.184 ± 0.37
1.502AspGln: 1.502 ± 0.244
2.912AspArg: 2.912 ± 0.371
4.641AspSer: 4.641 ± 0.576
3.276AspThr: 3.276 ± 0.344
4.414AspVal: 4.414 ± 0.47
1.183AspTrp: 1.183 ± 0.204
3.777AspTyr: 3.777 ± 0.393
0.0AspXaa: 0.0 ± 0.0
Glu
5.551GluAla: 5.551 ± 0.544
0.683GluCys: 0.683 ± 0.183
6.052GluAsp: 6.052 ± 0.619
6.416GluGlu: 6.416 ± 0.604
2.412GluPhe: 2.412 ± 0.296
5.688GluGly: 5.688 ± 0.421
0.956GluHis: 0.956 ± 0.204
4.55GluIle: 4.55 ± 0.438
5.415GluLys: 5.415 ± 0.466
7.053GluLeu: 7.053 ± 0.635
1.411GluMet: 1.411 ± 0.234
3.549GluAsn: 3.549 ± 0.355
1.775GluPro: 1.775 ± 0.504
2.002GluGln: 2.002 ± 0.249
3.231GluArg: 3.231 ± 0.342
5.278GluSer: 5.278 ± 0.439
3.276GluThr: 3.276 ± 0.304
6.507GluVal: 6.507 ± 0.56
0.683GluTrp: 0.683 ± 0.172
3.276GluTyr: 3.276 ± 0.421
0.0GluXaa: 0.0 ± 0.0
Phe
2.23PheAla: 2.23 ± 0.335
0.364PheCys: 0.364 ± 0.147
2.776PheAsp: 2.776 ± 0.32
2.73PheGlu: 2.73 ± 0.279
1.411PhePhe: 1.411 ± 0.297
2.73PheGly: 2.73 ± 0.347
0.592PheHis: 0.592 ± 0.159
2.275PheIle: 2.275 ± 0.305
4.459PheLys: 4.459 ± 0.425
3.504PheLeu: 3.504 ± 0.465
1.274PheMet: 1.274 ± 0.25
2.275PheAsn: 2.275 ± 0.425
1.001PhePro: 1.001 ± 0.233
1.047PheGln: 1.047 ± 0.195
1.547PheArg: 1.547 ± 0.277
3.64PheSer: 3.64 ± 0.41
2.184PheThr: 2.184 ± 0.378
2.139PheVal: 2.139 ± 0.382
0.455PheTrp: 0.455 ± 0.163
1.684PheTyr: 1.684 ± 0.312
0.0PheXaa: 0.0 ± 0.0
Gly
3.913GlyAla: 3.913 ± 0.524
0.455GlyCys: 0.455 ± 0.172
5.551GlyAsp: 5.551 ± 0.676
5.278GlyGlu: 5.278 ± 0.45
2.958GlyPhe: 2.958 ± 0.384
4.823GlyGly: 4.823 ± 0.439
0.683GlyHis: 0.683 ± 0.193
4.96GlyIle: 4.96 ± 0.415
5.688GlyLys: 5.688 ± 0.573
6.097GlyLeu: 6.097 ± 0.565
1.593GlyMet: 1.593 ± 0.235
3.276GlyAsn: 3.276 ± 0.434
0.592GlyPro: 0.592 ± 0.152
1.866GlyGln: 1.866 ± 0.243
3.913GlyArg: 3.913 ± 0.373
6.325GlySer: 6.325 ± 0.762
3.731GlyThr: 3.731 ± 0.564
5.278GlyVal: 5.278 ± 0.541
0.956GlyTrp: 0.956 ± 0.225
3.231GlyTyr: 3.231 ± 0.484
0.0GlyXaa: 0.0 ± 0.0
His
0.501HisAla: 0.501 ± 0.161
0.273HisCys: 0.273 ± 0.107
1.001HisAsp: 1.001 ± 0.231
0.91HisGlu: 0.91 ± 0.193
0.546HisPhe: 0.546 ± 0.156
1.092HisGly: 1.092 ± 0.254
0.501HisHis: 0.501 ± 0.157
0.865HisIle: 0.865 ± 0.203
1.138HisLys: 1.138 ± 0.263
1.32HisLeu: 1.32 ± 0.246
0.455HisMet: 0.455 ± 0.159
0.819HisAsn: 0.819 ± 0.201
0.501HisPro: 0.501 ± 0.203
0.319HisGln: 0.319 ± 0.128
0.592HisArg: 0.592 ± 0.165
0.819HisSer: 0.819 ± 0.194
0.728HisThr: 0.728 ± 0.208
0.592HisVal: 0.592 ± 0.15
0.228HisTrp: 0.228 ± 0.093
0.546HisTyr: 0.546 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
3.731IleAla: 3.731 ± 0.405
0.728IleCys: 0.728 ± 0.171
4.505IleAsp: 4.505 ± 0.418
6.643IleGlu: 6.643 ± 0.552
2.139IlePhe: 2.139 ± 0.371
4.186IleGly: 4.186 ± 0.48
0.774IleHis: 0.774 ± 0.188
3.777IleIle: 3.777 ± 0.432
6.325IleLys: 6.325 ± 0.592
5.551IleLeu: 5.551 ± 0.452
1.593IleMet: 1.593 ± 0.284
3.868IleAsn: 3.868 ± 0.43
2.275IlePro: 2.275 ± 0.371
2.139IleGln: 2.139 ± 0.412
3.003IleArg: 3.003 ± 0.383
6.143IleSer: 6.143 ± 0.516
4.186IleThr: 4.186 ± 0.419
3.367IleVal: 3.367 ± 0.415
0.546IleTrp: 0.546 ± 0.17
2.912IleTyr: 2.912 ± 0.354
0.0IleXaa: 0.0 ± 0.0
Lys
5.642LysAla: 5.642 ± 0.811
0.41LysCys: 0.41 ± 0.148
6.37LysAsp: 6.37 ± 0.626
7.235LysGlu: 7.235 ± 0.647
2.73LysPhe: 2.73 ± 0.383
7.69LysGly: 7.69 ± 0.714
0.319LysHis: 0.319 ± 0.134
5.597LysIle: 5.597 ± 0.512
7.508LysLys: 7.508 ± 1.031
6.325LysLeu: 6.325 ± 0.645
2.685LysMet: 2.685 ± 0.314
4.186LysAsn: 4.186 ± 0.436
2.639LysPro: 2.639 ± 0.314
2.275LysGln: 2.275 ± 0.338
3.094LysArg: 3.094 ± 0.45
5.278LysSer: 5.278 ± 0.454
4.596LysThr: 4.596 ± 0.514
5.688LysVal: 5.688 ± 0.566
0.91LysTrp: 0.91 ± 0.229
3.367LysTyr: 3.367 ± 0.318
0.0LysXaa: 0.0 ± 0.0
Leu
3.959LeuAla: 3.959 ± 0.672
0.728LeuCys: 0.728 ± 0.203
6.598LeuAsp: 6.598 ± 0.631
6.734LeuGlu: 6.734 ± 0.513
3.003LeuPhe: 3.003 ± 0.422
5.642LeuGly: 5.642 ± 0.471
1.411LeuHis: 1.411 ± 0.271
5.278LeuIle: 5.278 ± 0.579
6.962LeuLys: 6.962 ± 0.729
5.733LeuLeu: 5.733 ± 0.587
1.638LeuMet: 1.638 ± 0.281
5.051LeuAsn: 5.051 ± 0.488
2.594LeuPro: 2.594 ± 0.333
2.275LeuGln: 2.275 ± 0.29
4.232LeuArg: 4.232 ± 0.431
6.507LeuSer: 6.507 ± 0.45
4.55LeuThr: 4.55 ± 0.447
4.277LeuVal: 4.277 ± 0.461
0.319LeuTrp: 0.319 ± 0.142
3.276LeuTyr: 3.276 ± 0.386
0.0LeuXaa: 0.0 ± 0.0
Met
2.594MetAla: 2.594 ± 0.368
0.41MetCys: 0.41 ± 0.146
1.138MetAsp: 1.138 ± 0.246
1.593MetGlu: 1.593 ± 0.229
0.819MetPhe: 0.819 ± 0.216
1.638MetGly: 1.638 ± 0.279
0.364MetHis: 0.364 ± 0.142
1.411MetIle: 1.411 ± 0.245
1.638MetLys: 1.638 ± 0.301
1.411MetLeu: 1.411 ± 0.317
0.501MetMet: 0.501 ± 0.139
1.32MetAsn: 1.32 ± 0.23
0.728MetPro: 0.728 ± 0.149
0.501MetGln: 0.501 ± 0.148
1.001MetArg: 1.001 ± 0.225
2.002MetSer: 2.002 ± 0.273
1.229MetThr: 1.229 ± 0.252
1.32MetVal: 1.32 ± 0.231
0.228MetTrp: 0.228 ± 0.109
1.32MetTyr: 1.32 ± 0.271
0.0MetXaa: 0.0 ± 0.0
Asn
2.548AsnAla: 2.548 ± 0.304
0.637AsnCys: 0.637 ± 0.191
2.639AsnAsp: 2.639 ± 0.377
2.958AsnGlu: 2.958 ± 0.404
2.73AsnPhe: 2.73 ± 0.334
4.186AsnGly: 4.186 ± 0.52
0.956AsnHis: 0.956 ± 0.198
3.822AsnIle: 3.822 ± 0.582
4.141AsnLys: 4.141 ± 0.443
4.732AsnLeu: 4.732 ± 0.459
1.32AsnMet: 1.32 ± 0.27
3.094AsnAsn: 3.094 ± 0.348
2.139AsnPro: 2.139 ± 0.304
1.502AsnGln: 1.502 ± 0.272
2.457AsnArg: 2.457 ± 0.326
3.64AsnSer: 3.64 ± 0.53
3.64AsnThr: 3.64 ± 0.466
2.685AsnVal: 2.685 ± 0.349
0.865AsnTrp: 0.865 ± 0.191
2.23AsnTyr: 2.23 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
1.092ProAla: 1.092 ± 0.19
0.41ProCys: 0.41 ± 0.178
2.093ProAsp: 2.093 ± 0.374
1.911ProGlu: 1.911 ± 0.364
1.638ProPhe: 1.638 ± 0.238
0.364ProGly: 0.364 ± 0.136
0.592ProHis: 0.592 ± 0.177
1.866ProIle: 1.866 ± 0.294
2.685ProLys: 2.685 ± 0.368
2.594ProLeu: 2.594 ± 0.326
0.637ProMet: 0.637 ± 0.186
1.32ProAsn: 1.32 ± 0.311
0.819ProPro: 0.819 ± 0.246
0.819ProGln: 0.819 ± 0.171
1.092ProArg: 1.092 ± 0.237
2.412ProSer: 2.412 ± 0.425
2.867ProThr: 2.867 ± 0.556
2.366ProVal: 2.366 ± 0.416
0.364ProTrp: 0.364 ± 0.118
1.138ProTyr: 1.138 ± 0.275
0.0ProXaa: 0.0 ± 0.0
Gln
1.729GlnAla: 1.729 ± 0.393
0.501GlnCys: 0.501 ± 0.177
2.093GlnAsp: 2.093 ± 0.274
2.23GlnGlu: 2.23 ± 0.295
1.183GlnPhe: 1.183 ± 0.229
1.866GlnGly: 1.866 ± 0.334
0.273GlnHis: 0.273 ± 0.126
2.139GlnIle: 2.139 ± 0.286
1.866GlnLys: 1.866 ± 0.428
1.82GlnLeu: 1.82 ± 0.256
0.956GlnMet: 0.956 ± 0.137
1.138GlnAsn: 1.138 ± 0.196
0.546GlnPro: 0.546 ± 0.132
0.728GlnGln: 0.728 ± 0.268
0.91GlnArg: 0.91 ± 0.221
1.957GlnSer: 1.957 ± 0.357
1.638GlnThr: 1.638 ± 0.243
1.547GlnVal: 1.547 ± 0.278
0.137GlnTrp: 0.137 ± 0.102
1.365GlnTyr: 1.365 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
2.639ArgAla: 2.639 ± 0.382
0.546ArgCys: 0.546 ± 0.145
2.594ArgAsp: 2.594 ± 0.252
3.094ArgGlu: 3.094 ± 0.381
1.502ArgPhe: 1.502 ± 0.261
3.549ArgGly: 3.549 ± 0.423
0.865ArgHis: 0.865 ± 0.24
3.003ArgIle: 3.003 ± 0.357
2.821ArgLys: 2.821 ± 0.394
3.822ArgLeu: 3.822 ± 0.372
1.001ArgMet: 1.001 ± 0.2
2.366ArgAsn: 2.366 ± 0.308
1.593ArgPro: 1.593 ± 0.262
1.638ArgGln: 1.638 ± 0.304
2.412ArgArg: 2.412 ± 0.313
3.185ArgSer: 3.185 ± 0.376
2.412ArgThr: 2.412 ± 0.322
3.504ArgVal: 3.504 ± 0.365
0.364ArgTrp: 0.364 ± 0.118
1.82ArgTyr: 1.82 ± 0.276
0.0ArgXaa: 0.0 ± 0.0
Ser
3.959SerAla: 3.959 ± 0.428
0.455SerCys: 0.455 ± 0.165
4.823SerAsp: 4.823 ± 0.463
5.506SerGlu: 5.506 ± 0.538
3.185SerPhe: 3.185 ± 0.454
6.143SerGly: 6.143 ± 0.745
1.092SerHis: 1.092 ± 0.207
5.005SerIle: 5.005 ± 0.458
7.098SerLys: 7.098 ± 0.684
5.506SerLeu: 5.506 ± 0.556
1.911SerMet: 1.911 ± 0.314
4.778SerAsn: 4.778 ± 0.644
2.412SerPro: 2.412 ± 0.345
1.729SerGln: 1.729 ± 0.268
3.003SerArg: 3.003 ± 0.33
6.461SerSer: 6.461 ± 0.742
4.232SerThr: 4.232 ± 0.771
5.187SerVal: 5.187 ± 0.602
0.683SerTrp: 0.683 ± 0.153
3.322SerTyr: 3.322 ± 0.44
0.0SerXaa: 0.0 ± 0.0
Thr
3.595ThrAla: 3.595 ± 0.45
0.41ThrCys: 0.41 ± 0.127
3.595ThrAsp: 3.595 ± 0.387
4.414ThrGlu: 4.414 ± 0.427
2.366ThrPhe: 2.366 ± 0.323
4.505ThrGly: 4.505 ± 0.57
0.637ThrHis: 0.637 ± 0.184
4.687ThrIle: 4.687 ± 0.569
4.459ThrLys: 4.459 ± 0.393
4.414ThrLeu: 4.414 ± 0.406
0.774ThrMet: 0.774 ± 0.168
2.685ThrAsn: 2.685 ± 0.344
2.093ThrPro: 2.093 ± 0.326
1.638ThrGln: 1.638 ± 0.291
2.685ThrArg: 2.685 ± 0.295
3.731ThrSer: 3.731 ± 0.526
3.777ThrThr: 3.777 ± 0.586
3.913ThrVal: 3.913 ± 0.506
0.455ThrTrp: 0.455 ± 0.153
1.957ThrTyr: 1.957 ± 0.377
0.0ThrXaa: 0.0 ± 0.0
Val
3.549ValAla: 3.549 ± 0.469
0.455ValCys: 0.455 ± 0.147
5.142ValAsp: 5.142 ± 0.434
4.823ValGlu: 4.823 ± 0.663
3.14ValPhe: 3.14 ± 0.448
3.959ValGly: 3.959 ± 0.501
0.819ValHis: 0.819 ± 0.189
3.231ValIle: 3.231 ± 0.427
5.597ValLys: 5.597 ± 0.619
5.824ValLeu: 5.824 ± 0.573
1.047ValMet: 1.047 ± 0.21
3.094ValAsn: 3.094 ± 0.361
1.729ValPro: 1.729 ± 0.304
1.547ValGln: 1.547 ± 0.261
2.867ValArg: 2.867 ± 0.363
4.869ValSer: 4.869 ± 0.501
3.458ValThr: 3.458 ± 0.51
4.232ValVal: 4.232 ± 0.501
0.956ValTrp: 0.956 ± 0.241
3.14ValTyr: 3.14 ± 0.431
0.0ValXaa: 0.0 ± 0.0
Trp
0.637TrpAla: 0.637 ± 0.189
0.228TrpCys: 0.228 ± 0.117
0.546TrpAsp: 0.546 ± 0.189
1.138TrpGlu: 1.138 ± 0.235
0.728TrpPhe: 0.728 ± 0.186
0.683TrpGly: 0.683 ± 0.2
0.182TrpHis: 0.182 ± 0.104
0.774TrpIle: 0.774 ± 0.194
0.683TrpLys: 0.683 ± 0.155
0.683TrpLeu: 0.683 ± 0.197
0.501TrpMet: 0.501 ± 0.144
0.364TrpAsn: 0.364 ± 0.136
0.137TrpPro: 0.137 ± 0.071
0.091TrpGln: 0.091 ± 0.066
0.364TrpArg: 0.364 ± 0.123
0.819TrpSer: 0.819 ± 0.198
0.546TrpThr: 0.546 ± 0.162
0.683TrpVal: 0.683 ± 0.178
0.137TrpTrp: 0.137 ± 0.091
0.637TrpTyr: 0.637 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.775TyrAla: 1.775 ± 0.273
0.455TyrCys: 0.455 ± 0.156
2.958TyrAsp: 2.958 ± 0.335
2.776TyrGlu: 2.776 ± 0.332
1.456TyrPhe: 1.456 ± 0.229
2.867TyrGly: 2.867 ± 0.387
0.728TyrHis: 0.728 ± 0.195
3.868TyrIle: 3.868 ± 0.453
4.323TyrLys: 4.323 ± 0.472
3.64TyrLeu: 3.64 ± 0.396
0.956TyrMet: 0.956 ± 0.234
2.503TyrAsn: 2.503 ± 0.418
1.229TyrPro: 1.229 ± 0.272
1.001TyrGln: 1.001 ± 0.23
2.548TyrArg: 2.548 ± 0.35
4.004TyrSer: 4.004 ± 0.472
2.457TyrThr: 2.457 ± 0.365
2.457TyrVal: 2.457 ± 0.361
0.455TyrTrp: 0.455 ± 0.123
2.321TyrTyr: 2.321 ± 0.308
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (21979 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski