Amino acid dipepetide frequency for Roseobacter phage CRP-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.572AlaAla: 10.572 ± 1.634
0.581AlaCys: 0.581 ± 0.188
5.867AlaAsp: 5.867 ± 0.635
6.274AlaGlu: 6.274 ± 0.799
3.137AlaPhe: 3.137 ± 0.346
6.39AlaGly: 6.39 ± 0.67
0.755AlaHis: 0.755 ± 0.209
5.46AlaIle: 5.46 ± 0.617
6.854AlaLys: 6.854 ± 0.926
5.635AlaLeu: 5.635 ± 0.561
3.602AlaMet: 3.602 ± 0.569
4.473AlaAsn: 4.473 ± 0.665
3.485AlaPro: 3.485 ± 0.47
4.124AlaGln: 4.124 ± 0.845
3.427AlaArg: 3.427 ± 0.576
5.344AlaSer: 5.344 ± 0.672
7.377AlaThr: 7.377 ± 1.405
5.46AlaVal: 5.46 ± 0.491
1.568AlaTrp: 1.568 ± 0.338
2.44AlaTyr: 2.44 ± 0.383
0.0AlaXaa: 0.0 ± 0.0
Cys
0.465CysAla: 0.465 ± 0.18
0.0CysCys: 0.0 ± 0.0
0.581CysAsp: 0.581 ± 0.202
0.523CysGlu: 0.523 ± 0.205
0.29CysPhe: 0.29 ± 0.114
0.581CysGly: 0.581 ± 0.203
0.116CysHis: 0.116 ± 0.082
0.232CysIle: 0.232 ± 0.098
0.581CysLys: 0.581 ± 0.224
0.465CysLeu: 0.465 ± 0.182
0.407CysMet: 0.407 ± 0.187
0.407CysAsn: 0.407 ± 0.182
0.407CysPro: 0.407 ± 0.17
0.349CysGln: 0.349 ± 0.141
0.232CysArg: 0.232 ± 0.119
0.755CysSer: 0.755 ± 0.259
0.523CysThr: 0.523 ± 0.215
0.465CysVal: 0.465 ± 0.16
0.29CysTrp: 0.29 ± 0.116
0.407CysTyr: 0.407 ± 0.16
0.0CysXaa: 0.0 ± 0.0
Asp
5.228AspAla: 5.228 ± 0.686
0.581AspCys: 0.581 ± 0.186
4.066AspAsp: 4.066 ± 0.606
4.705AspGlu: 4.705 ± 0.698
2.44AspPhe: 2.44 ± 0.353
5.867AspGly: 5.867 ± 0.938
0.523AspHis: 0.523 ± 0.172
5.228AspIle: 5.228 ± 0.65
3.602AspLys: 3.602 ± 0.637
5.925AspLeu: 5.925 ± 0.594
2.033AspMet: 2.033 ± 0.362
3.427AspAsn: 3.427 ± 0.508
3.137AspPro: 3.137 ± 0.374
1.801AspGln: 1.801 ± 0.346
2.207AspArg: 2.207 ± 0.298
3.95AspSer: 3.95 ± 0.516
4.124AspThr: 4.124 ± 0.52
4.996AspVal: 4.996 ± 0.513
0.523AspTrp: 0.523 ± 0.175
2.788AspTyr: 2.788 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
7.493GluAla: 7.493 ± 0.994
0.581GluCys: 0.581 ± 0.179
3.776GluAsp: 3.776 ± 0.555
5.228GluGlu: 5.228 ± 0.767
2.498GluPhe: 2.498 ± 0.331
3.485GluGly: 3.485 ± 0.497
1.51GluHis: 1.51 ± 0.397
3.485GluIle: 3.485 ± 0.395
3.253GluLys: 3.253 ± 0.726
4.938GluLeu: 4.938 ± 0.659
1.743GluMet: 1.743 ± 0.371
2.556GluAsn: 2.556 ± 0.464
2.091GluPro: 2.091 ± 0.5
3.427GluGln: 3.427 ± 0.546
3.485GluArg: 3.485 ± 0.64
2.44GluSer: 2.44 ± 0.466
4.24GluThr: 4.24 ± 0.561
5.17GluVal: 5.17 ± 0.475
1.046GluTrp: 1.046 ± 0.23
2.846GluTyr: 2.846 ± 0.476
0.0GluXaa: 0.0 ± 0.0
Phe
3.021PheAla: 3.021 ± 0.466
0.581PheCys: 0.581 ± 0.169
2.846PheAsp: 2.846 ± 0.417
1.685PheGlu: 1.685 ± 0.344
1.046PhePhe: 1.046 ± 0.211
2.963PheGly: 2.963 ± 0.42
0.523PheHis: 0.523 ± 0.182
1.452PheIle: 1.452 ± 0.286
2.556PheLys: 2.556 ± 0.403
1.859PheLeu: 1.859 ± 0.269
0.871PheMet: 0.871 ± 0.201
2.614PheAsn: 2.614 ± 0.46
1.394PhePro: 1.394 ± 0.258
0.871PheGln: 0.871 ± 0.186
1.336PheArg: 1.336 ± 0.259
2.091PheSer: 2.091 ± 0.342
3.079PheThr: 3.079 ± 0.59
2.033PheVal: 2.033 ± 0.361
0.232PheTrp: 0.232 ± 0.095
1.568PheTyr: 1.568 ± 0.376
0.0PheXaa: 0.0 ± 0.0
Gly
6.448GlyAla: 6.448 ± 1.037
0.407GlyCys: 0.407 ± 0.15
5.17GlyAsp: 5.17 ± 0.581
3.95GlyGlu: 3.95 ± 0.45
3.253GlyPhe: 3.253 ± 0.392
6.564GlyGly: 6.564 ± 0.934
0.929GlyHis: 0.929 ± 0.261
3.079GlyIle: 3.079 ± 0.408
3.892GlyLys: 3.892 ± 0.406
4.415GlyLeu: 4.415 ± 0.553
2.265GlyMet: 2.265 ± 0.512
3.95GlyAsn: 3.95 ± 0.685
1.801GlyPro: 1.801 ± 0.367
2.73GlyGln: 2.73 ± 0.442
2.614GlyArg: 2.614 ± 0.458
5.112GlySer: 5.112 ± 1.17
6.39GlyThr: 6.39 ± 1.346
5.286GlyVal: 5.286 ± 0.612
0.697GlyTrp: 0.697 ± 0.203
3.776GlyTyr: 3.776 ± 0.527
0.0GlyXaa: 0.0 ± 0.0
His
0.813HisAla: 0.813 ± 0.209
0.29HisCys: 0.29 ± 0.131
0.871HisAsp: 0.871 ± 0.33
1.046HisGlu: 1.046 ± 0.287
0.929HisPhe: 0.929 ± 0.229
0.988HisGly: 0.988 ± 0.273
0.116HisHis: 0.116 ± 0.081
0.697HisIle: 0.697 ± 0.216
1.394HisLys: 1.394 ± 0.335
1.452HisLeu: 1.452 ± 0.347
0.813HisMet: 0.813 ± 0.264
1.104HisAsn: 1.104 ± 0.281
0.755HisPro: 0.755 ± 0.232
0.813HisGln: 0.813 ± 0.252
0.813HisArg: 0.813 ± 0.217
0.755HisSer: 0.755 ± 0.288
0.755HisThr: 0.755 ± 0.164
0.988HisVal: 0.988 ± 0.222
0.465HisTrp: 0.465 ± 0.173
0.407HisTyr: 0.407 ± 0.136
0.0HisXaa: 0.0 ± 0.0
Ile
4.473IleAla: 4.473 ± 0.675
0.407IleCys: 0.407 ± 0.192
3.834IleAsp: 3.834 ± 0.642
3.95IleGlu: 3.95 ± 0.482
1.394IlePhe: 1.394 ± 0.317
3.253IleGly: 3.253 ± 0.585
1.278IleHis: 1.278 ± 0.245
2.44IleIle: 2.44 ± 0.4
3.021IleLys: 3.021 ± 0.462
2.265IleLeu: 2.265 ± 0.393
1.568IleMet: 1.568 ± 0.242
2.614IleAsn: 2.614 ± 0.466
2.324IlePro: 2.324 ± 0.379
2.672IleGln: 2.672 ± 0.418
2.788IleArg: 2.788 ± 0.422
3.718IleSer: 3.718 ± 0.576
4.066IleThr: 4.066 ± 0.713
3.137IleVal: 3.137 ± 0.515
0.581IleTrp: 0.581 ± 0.174
1.278IleTyr: 1.278 ± 0.245
0.0IleXaa: 0.0 ± 0.0
Lys
4.589LysAla: 4.589 ± 0.572
0.465LysCys: 0.465 ± 0.273
3.834LysAsp: 3.834 ± 0.596
4.473LysGlu: 4.473 ± 0.86
1.743LysPhe: 1.743 ± 0.255
4.299LysGly: 4.299 ± 0.559
1.394LysHis: 1.394 ± 0.35
2.614LysIle: 2.614 ± 0.411
4.879LysLys: 4.879 ± 1.048
4.24LysLeu: 4.24 ± 0.542
2.382LysMet: 2.382 ± 0.425
3.137LysAsn: 3.137 ± 0.655
2.614LysPro: 2.614 ± 0.332
3.311LysGln: 3.311 ± 0.547
3.427LysArg: 3.427 ± 0.584
3.602LysSer: 3.602 ± 0.564
3.66LysThr: 3.66 ± 0.472
3.892LysVal: 3.892 ± 0.669
1.22LysTrp: 1.22 ± 0.246
2.091LysTyr: 2.091 ± 0.334
0.0LysXaa: 0.0 ± 0.0
Leu
6.332LeuAla: 6.332 ± 0.528
0.465LeuCys: 0.465 ± 0.171
4.996LeuAsp: 4.996 ± 0.636
5.344LeuGlu: 5.344 ± 0.733
2.033LeuPhe: 2.033 ± 0.362
4.996LeuGly: 4.996 ± 0.588
1.394LeuHis: 1.394 ± 0.355
2.498LeuIle: 2.498 ± 0.402
4.705LeuLys: 4.705 ± 0.635
4.008LeuLeu: 4.008 ± 0.551
2.207LeuMet: 2.207 ± 0.405
2.846LeuAsn: 2.846 ± 0.351
2.788LeuPro: 2.788 ± 0.438
2.614LeuGln: 2.614 ± 0.434
3.66LeuArg: 3.66 ± 0.497
5.402LeuSer: 5.402 ± 0.555
4.879LeuThr: 4.879 ± 0.692
3.66LeuVal: 3.66 ± 0.459
1.162LeuTrp: 1.162 ± 0.315
2.382LeuTyr: 2.382 ± 0.495
0.0LeuXaa: 0.0 ± 0.0
Met
3.137MetAla: 3.137 ± 0.564
0.232MetCys: 0.232 ± 0.125
2.498MetAsp: 2.498 ± 0.405
1.626MetGlu: 1.626 ± 0.281
0.755MetPhe: 0.755 ± 0.189
1.452MetGly: 1.452 ± 0.395
0.523MetHis: 0.523 ± 0.21
1.162MetIle: 1.162 ± 0.278
1.743MetLys: 1.743 ± 0.349
1.859MetLeu: 1.859 ± 0.28
0.929MetMet: 0.929 ± 0.26
1.626MetAsn: 1.626 ± 0.282
1.046MetPro: 1.046 ± 0.261
2.091MetGln: 2.091 ± 0.465
1.626MetArg: 1.626 ± 0.3
3.137MetSer: 3.137 ± 0.421
2.324MetThr: 2.324 ± 0.338
1.104MetVal: 1.104 ± 0.295
0.29MetTrp: 0.29 ± 0.122
1.336MetTyr: 1.336 ± 0.263
0.0MetXaa: 0.0 ± 0.0
Asn
4.996AsnAla: 4.996 ± 0.576
0.523AsnCys: 0.523 ± 0.178
3.311AsnAsp: 3.311 ± 0.491
3.137AsnGlu: 3.137 ± 0.451
1.046AsnPhe: 1.046 ± 0.208
4.24AsnGly: 4.24 ± 0.796
1.104AsnHis: 1.104 ± 0.279
3.079AsnIle: 3.079 ± 0.408
2.963AsnLys: 2.963 ± 0.466
4.182AsnLeu: 4.182 ± 0.458
1.568AsnMet: 1.568 ± 0.261
2.73AsnAsn: 2.73 ± 0.406
2.498AsnPro: 2.498 ± 0.436
1.568AsnGln: 1.568 ± 0.379
1.626AsnArg: 1.626 ± 0.371
2.614AsnSer: 2.614 ± 0.441
3.079AsnThr: 3.079 ± 0.458
3.195AsnVal: 3.195 ± 0.532
0.465AsnTrp: 0.465 ± 0.204
1.859AsnTyr: 1.859 ± 0.338
0.0AsnXaa: 0.0 ± 0.0
Pro
3.253ProAla: 3.253 ± 0.42
0.116ProCys: 0.116 ± 0.113
3.311ProAsp: 3.311 ± 0.636
2.904ProGlu: 2.904 ± 0.45
0.988ProPhe: 0.988 ± 0.336
1.801ProGly: 1.801 ± 0.394
0.465ProHis: 0.465 ± 0.172
2.091ProIle: 2.091 ± 0.295
2.614ProLys: 2.614 ± 0.544
1.917ProLeu: 1.917 ± 0.315
0.755ProMet: 0.755 ± 0.207
2.265ProAsn: 2.265 ± 0.419
1.046ProPro: 1.046 ± 0.262
1.22ProGln: 1.22 ± 0.277
1.452ProArg: 1.452 ± 0.305
3.137ProSer: 3.137 ± 0.482
2.556ProThr: 2.556 ± 0.594
4.008ProVal: 4.008 ± 0.529
0.465ProTrp: 0.465 ± 0.161
1.568ProTyr: 1.568 ± 0.403
0.0ProXaa: 0.0 ± 0.0
Gln
4.299GlnAla: 4.299 ± 1.08
0.058GlnCys: 0.058 ± 0.059
2.73GlnAsp: 2.73 ± 0.411
3.021GlnGlu: 3.021 ± 0.493
1.626GlnPhe: 1.626 ± 0.433
2.498GlnGly: 2.498 ± 0.323
0.929GlnHis: 0.929 ± 0.25
1.975GlnIle: 1.975 ± 0.343
1.859GlnLys: 1.859 ± 0.322
3.137GlnLeu: 3.137 ± 0.359
1.22GlnMet: 1.22 ± 0.282
2.033GlnAsn: 2.033 ± 0.477
1.51GlnPro: 1.51 ± 0.294
3.079GlnGln: 3.079 ± 0.966
2.265GlnArg: 2.265 ± 0.429
2.672GlnSer: 2.672 ± 0.401
2.963GlnThr: 2.963 ± 0.799
2.265GlnVal: 2.265 ± 0.371
0.929GlnTrp: 0.929 ± 0.267
2.033GlnTyr: 2.033 ± 0.387
0.0GlnXaa: 0.0 ± 0.0
Arg
3.253ArgAla: 3.253 ± 0.565
0.407ArgCys: 0.407 ± 0.162
2.672ArgAsp: 2.672 ± 0.35
2.963ArgGlu: 2.963 ± 0.423
0.988ArgPhe: 0.988 ± 0.264
2.904ArgGly: 2.904 ± 0.407
1.046ArgHis: 1.046 ± 0.275
2.149ArgIle: 2.149 ± 0.356
3.892ArgLys: 3.892 ± 0.73
3.776ArgLeu: 3.776 ± 0.481
1.336ArgMet: 1.336 ± 0.236
1.336ArgAsn: 1.336 ± 0.392
1.626ArgPro: 1.626 ± 0.21
2.498ArgGln: 2.498 ± 0.415
2.382ArgArg: 2.382 ± 0.365
2.556ArgSer: 2.556 ± 0.436
2.324ArgThr: 2.324 ± 0.365
3.021ArgVal: 3.021 ± 0.415
0.639ArgTrp: 0.639 ± 0.245
2.207ArgTyr: 2.207 ± 0.384
0.0ArgXaa: 0.0 ± 0.0
Ser
5.751SerAla: 5.751 ± 0.773
0.29SerCys: 0.29 ± 0.133
3.021SerAsp: 3.021 ± 0.55
2.904SerGlu: 2.904 ± 0.476
3.079SerPhe: 3.079 ± 0.469
6.448SerGly: 6.448 ± 0.755
0.813SerHis: 0.813 ± 0.204
3.369SerIle: 3.369 ± 0.34
3.718SerLys: 3.718 ± 0.526
4.124SerLeu: 4.124 ± 0.616
1.917SerMet: 1.917 ± 0.524
3.021SerAsn: 3.021 ± 0.372
2.149SerPro: 2.149 ± 0.436
2.614SerGln: 2.614 ± 0.434
1.975SerArg: 1.975 ± 0.285
4.415SerSer: 4.415 ± 1.014
6.506SerThr: 6.506 ± 1.295
4.415SerVal: 4.415 ± 0.472
0.581SerTrp: 0.581 ± 0.18
2.498SerTyr: 2.498 ± 0.49
0.0SerXaa: 0.0 ± 0.0
Thr
8.016ThrAla: 8.016 ± 1.343
0.755ThrCys: 0.755 ± 0.259
3.718ThrAsp: 3.718 ± 0.496
3.718ThrGlu: 3.718 ± 0.447
3.485ThrPhe: 3.485 ± 0.484
6.564ThrGly: 6.564 ± 1.214
0.871ThrHis: 0.871 ± 0.211
4.008ThrIle: 4.008 ± 0.472
3.892ThrLys: 3.892 ± 0.586
5.054ThrLeu: 5.054 ± 0.661
1.801ThrMet: 1.801 ± 0.307
4.124ThrAsn: 4.124 ± 0.826
3.079ThrPro: 3.079 ± 0.457
3.369ThrGln: 3.369 ± 0.57
2.788ThrArg: 2.788 ± 0.394
4.415ThrSer: 4.415 ± 0.943
5.46ThrThr: 5.46 ± 1.096
5.054ThrVal: 5.054 ± 0.859
0.581ThrTrp: 0.581 ± 0.223
3.137ThrTyr: 3.137 ± 0.454
0.0ThrXaa: 0.0 ± 0.0
Val
6.448ValAla: 6.448 ± 0.517
0.697ValCys: 0.697 ± 0.223
5.228ValAsp: 5.228 ± 0.628
4.473ValGlu: 4.473 ± 0.618
2.44ValPhe: 2.44 ± 0.409
4.008ValGly: 4.008 ± 0.49
1.046ValHis: 1.046 ± 0.279
3.195ValIle: 3.195 ± 0.55
4.182ValLys: 4.182 ± 0.637
5.344ValLeu: 5.344 ± 0.618
1.917ValMet: 1.917 ± 0.25
3.195ValAsn: 3.195 ± 0.41
2.382ValPro: 2.382 ± 0.399
1.685ValGln: 1.685 ± 0.453
2.963ValArg: 2.963 ± 0.449
4.182ValSer: 4.182 ± 0.395
6.216ValThr: 6.216 ± 0.821
4.821ValVal: 4.821 ± 0.597
0.581ValTrp: 0.581 ± 0.173
2.091ValTyr: 2.091 ± 0.347
0.0ValXaa: 0.0 ± 0.0
Trp
1.22TrpAla: 1.22 ± 0.272
0.349TrpCys: 0.349 ± 0.139
1.685TrpAsp: 1.685 ± 0.353
0.581TrpGlu: 0.581 ± 0.208
0.523TrpPhe: 0.523 ± 0.197
0.349TrpGly: 0.349 ± 0.126
0.349TrpHis: 0.349 ± 0.137
0.407TrpIle: 0.407 ± 0.177
0.581TrpLys: 0.581 ± 0.185
1.046TrpLeu: 1.046 ± 0.265
0.465TrpMet: 0.465 ± 0.175
0.639TrpAsn: 0.639 ± 0.154
0.581TrpPro: 0.581 ± 0.211
0.465TrpGln: 0.465 ± 0.148
0.639TrpArg: 0.639 ± 0.172
1.046TrpSer: 1.046 ± 0.246
0.639TrpThr: 0.639 ± 0.203
0.871TrpVal: 0.871 ± 0.223
0.232TrpTrp: 0.232 ± 0.108
0.581TrpTyr: 0.581 ± 0.219
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.021TyrAla: 3.021 ± 0.448
0.29TyrCys: 0.29 ± 0.114
3.137TyrAsp: 3.137 ± 0.676
2.788TyrGlu: 2.788 ± 0.404
0.929TyrPhe: 0.929 ± 0.201
3.137TyrGly: 3.137 ± 0.446
0.581TyrHis: 0.581 ± 0.187
2.44TyrIle: 2.44 ± 0.398
1.685TyrLys: 1.685 ± 0.38
2.73TyrLeu: 2.73 ± 0.386
0.639TyrMet: 0.639 ± 0.224
1.685TyrAsn: 1.685 ± 0.379
1.22TyrPro: 1.22 ± 0.299
1.801TyrGln: 1.801 ± 0.335
2.265TyrArg: 2.265 ± 0.406
2.265TyrSer: 2.265 ± 0.464
2.73TyrThr: 2.73 ± 0.373
3.253TyrVal: 3.253 ± 0.529
0.697TyrTrp: 0.697 ± 0.237
1.336TyrTyr: 1.336 ± 0.248
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (17216 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski