Amino acid dipeptide frequency for Roseobacter phage CRP-7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
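As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), the overlapping-window counting, and the toy sequences are all assumptions for illustration, not the method used to build this page.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the CRP-7 proteome.
toy = ["MAAK", "AAG"]
freqs = dipeptide_per_mille(toy)
```

Under these assumptions, the frequencies of all observed dipeptides sum to 1000 ‰.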

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
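The arithmetic behind the expected value and the unit conversion can be sketched as follows. The uniform expectation of 1/400 equals 0.25%, which is 2.5 ‰ in the units of the table. The `classify` helper and its simple greater-than/less-than rule are assumptions for illustration; the exact rule the page uses for red and blue marking is not stated.

```python
# Uniform expectation: 1 of 400 standard dipeptides = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

# Three values copied from the table.
examples = {"AlaAla": 11.045, "CysCys": 0.054, "AspPro": 2.775}
labels = {name: classify(v) for name, v in examples.items()}
```

With this rule, AlaAla and AspPro fall above the 2.5 ‰ expectation and CysCys falls below it.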

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell shows the frequency (‰) ± error.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.045 ± 2.129 | 0.272 ± 0.116 | 4.244 ± 0.495 | 5.223 ± 0.659 | 2.992 ± 0.455 | 8.433 ± 0.915 | 1.251 ± 0.259 | 5.06 ± 0.517 | 5.55 ± 0.683 | 7.454 ± 0.967 | 2.285 ± 0.371 | 5.386 ± 0.541 | 2.775 ± 0.476 | 4.026 ± 0.396 | 3.7 ± 0.464 | 6.039 ± 1.416 | 5.223 ± 0.798 | 5.604 ± 0.708 | 0.871 ± 0.208 | 2.72 ± 0.385 | 0.0 ± 0.0 |
| Cys | 0.762 ± 0.243 | 0.054 ± 0.046 | 0.381 ± 0.171 | 0.598 ± 0.218 | 0.326 ± 0.139 | 0.326 ± 0.159 | 0.272 ± 0.111 | 0.272 ± 0.103 | 0.544 ± 0.189 | 0.272 ± 0.137 | 0.163 ± 0.089 | 0.272 ± 0.108 | 0.272 ± 0.116 | 0.054 ± 0.057 | 0.163 ± 0.097 | 0.49 ± 0.144 | 0.435 ± 0.157 | 0.218 ± 0.098 | 0.163 ± 0.104 | 0.109 ± 0.066 | 0.0 ± 0.0 |
| Asp | 5.114 ± 0.595 | 0.707 ± 0.19 | 3.591 ± 0.416 | 3.536 ± 0.554 | 2.176 ± 0.323 | 4.679 ± 0.829 | 0.762 ± 0.214 | 4.353 ± 0.514 | 4.081 ± 0.47 | 5.06 ± 0.49 | 1.904 ± 0.352 | 4.461 ± 0.429 | 2.775 ± 0.491 | 1.578 ± 0.273 | 2.394 ± 0.42 | 3.591 ± 0.44 | 4.244 ± 0.508 | 3.482 ± 0.466 | 0.707 ± 0.227 | 2.503 ± 0.379 | 0.0 ± 0.0 |
| Glu | 6.42 ± 0.619 | 0.435 ± 0.149 | 4.407 ± 0.53 | 7.073 ± 1.068 | 2.503 ± 0.331 | 4.244 ± 0.515 | 1.088 ± 0.286 | 3.972 ± 0.622 | 4.135 ± 0.68 | 6.094 ± 0.666 | 2.339 ± 0.43 | 2.394 ± 0.33 | 1.85 ± 0.362 | 2.448 ± 0.433 | 2.067 ± 0.354 | 3.482 ± 0.507 | 4.135 ± 0.926 | 3.808 ± 0.455 | 1.251 ± 0.273 | 2.775 ± 0.428 | 0.0 ± 0.0 |
| Phe | 2.394 ± 0.398 | 0.381 ± 0.16 | 2.285 ± 0.343 | 2.013 ± 0.319 | 0.979 ± 0.209 | 2.503 ± 0.441 | 0.435 ± 0.157 | 1.795 ± 0.31 | 2.176 ± 0.346 | 2.122 ± 0.355 | 0.925 ± 0.256 | 2.285 ± 0.273 | 0.816 ± 0.256 | 0.762 ± 0.201 | 0.871 ± 0.241 | 2.394 ± 0.347 | 2.067 ± 0.305 | 1.578 ± 0.247 | 0.435 ± 0.165 | 1.088 ± 0.238 | 0.0 ± 0.0 |
| Gly | 6.529 ± 0.903 | 0.272 ± 0.144 | 4.026 ± 0.405 | 4.081 ± 0.37 | 2.503 ± 0.318 | 5.822 ± 0.838 | 0.871 ± 0.181 | 4.407 ± 0.609 | 4.679 ± 0.648 | 5.495 ± 0.675 | 1.306 ± 0.277 | 5.005 ± 1.0 | 1.415 ± 0.367 | 2.666 ± 0.317 | 4.026 ± 0.583 | 5.985 ± 0.622 | 6.855 ± 0.882 | 5.223 ± 0.529 | 0.762 ± 0.226 | 2.339 ± 0.555 | 0.0 ± 0.0 |
| His | 0.598 ± 0.228 | 0.109 ± 0.075 | 1.197 ± 0.229 | 0.707 ± 0.207 | 0.435 ± 0.143 | 1.034 ± 0.304 | 0.163 ± 0.093 | 1.197 ± 0.238 | 0.816 ± 0.215 | 1.741 ± 0.357 | 0.49 ± 0.167 | 0.707 ± 0.169 | 0.49 ± 0.218 | 0.326 ± 0.155 | 0.544 ± 0.194 | 1.469 ± 0.3 | 0.925 ± 0.202 | 0.653 ± 0.19 | 0.326 ± 0.132 | 0.653 ± 0.225 | 0.0 ± 0.0 |
| Ile | 4.788 ± 0.44 | 0.326 ± 0.148 | 4.135 ± 0.65 | 3.917 ± 0.566 | 1.197 ± 0.216 | 4.189 ± 0.456 | 1.034 ± 0.209 | 2.992 ± 0.456 | 4.189 ± 0.58 | 3.754 ± 0.432 | 1.469 ± 0.264 | 4.026 ± 0.458 | 2.938 ± 0.57 | 2.339 ± 0.407 | 2.557 ± 0.402 | 4.516 ± 0.468 | 4.461 ± 0.519 | 3.428 ± 0.409 | 0.816 ± 0.198 | 2.231 ± 0.433 | 0.0 ± 0.0 |
| Lys | 6.094 ± 0.638 | 0.381 ± 0.188 | 4.897 ± 0.562 | 5.93 ± 0.847 | 1.197 ± 0.29 | 3.7 ± 0.558 | 0.979 ± 0.266 | 3.754 ± 0.426 | 4.353 ± 0.726 | 5.822 ± 0.371 | 1.741 ± 0.314 | 2.992 ± 0.477 | 1.632 ± 0.31 | 2.666 ± 0.45 | 3.319 ± 0.586 | 2.775 ± 0.471 | 4.353 ± 0.466 | 4.244 ± 0.502 | 0.979 ± 0.28 | 1.85 ± 0.322 | 0.0 ± 0.0 |
| Leu | 6.583 ± 0.643 | 0.272 ± 0.145 | 4.081 ± 0.453 | 4.57 ± 0.536 | 2.394 ± 0.432 | 6.039 ± 0.787 | 1.469 ± 0.31 | 3.7 ± 0.382 | 5.223 ± 0.51 | 4.516 ± 0.456 | 2.067 ± 0.342 | 5.441 ± 0.599 | 2.231 ± 0.378 | 3.21 ± 0.41 | 3.536 ± 0.357 | 6.42 ± 0.674 | 4.625 ± 0.463 | 4.407 ± 0.58 | 0.598 ± 0.186 | 2.775 ± 0.408 | 0.0 ± 0.0 |
| Met | 2.448 ± 0.44 | 0.163 ± 0.121 | 1.578 ± 0.326 | 2.231 ± 0.496 | 0.816 ± 0.146 | 1.36 ± 0.253 | 0.326 ± 0.166 | 1.251 ± 0.245 | 1.36 ± 0.294 | 2.394 ± 0.401 | 0.653 ± 0.165 | 1.578 ± 0.28 | 0.979 ± 0.278 | 1.088 ± 0.236 | 1.36 ± 0.21 | 2.557 ± 0.278 | 2.231 ± 0.333 | 1.251 ± 0.261 | 0.218 ± 0.113 | 1.143 ± 0.294 | 0.0 ± 0.0 |
| Asn | 5.169 ± 1.06 | 0.326 ± 0.14 | 3.047 ± 0.367 | 3.645 ± 0.462 | 1.904 ± 0.314 | 3.754 ± 0.754 | 0.598 ± 0.178 | 4.57 ± 0.599 | 4.298 ± 0.593 | 4.733 ± 0.551 | 1.36 ± 0.354 | 5.55 ± 1.287 | 2.72 ± 0.386 | 1.469 ± 0.271 | 2.448 ± 0.348 | 4.842 ± 0.688 | 6.094 ± 1.847 | 3.482 ± 0.504 | 1.197 ± 0.27 | 2.666 ± 0.422 | 0.0 ± 0.0 |
| Pro | 3.428 ± 0.488 | 0.381 ± 0.162 | 2.557 ± 0.485 | 3.319 ± 0.471 | 1.251 ± 0.216 | 2.231 ± 0.36 | 0.381 ± 0.156 | 1.904 ± 0.451 | 2.176 ± 0.364 | 2.067 ± 0.322 | 1.143 ± 0.272 | 2.176 ± 0.426 | 1.088 ± 0.333 | 1.578 ± 0.331 | 1.143 ± 0.477 | 3.047 ± 0.376 | 2.72 ± 0.526 | 1.795 ± 0.5 | 0.326 ± 0.12 | 1.469 ± 0.298 | 0.0 ± 0.0 |
| Gln | 3.808 ± 0.401 | 0.109 ± 0.085 | 2.176 ± 0.488 | 2.339 ± 0.434 | 0.653 ± 0.169 | 3.101 ± 0.458 | 0.871 ± 0.225 | 2.612 ± 0.308 | 2.176 ± 0.365 | 2.775 ± 0.321 | 1.034 ± 0.255 | 1.523 ± 0.313 | 1.632 ± 0.368 | 1.959 ± 0.406 | 1.904 ± 0.4 | 2.503 ± 0.31 | 2.775 ± 0.529 | 2.231 ± 0.318 | 0.49 ± 0.152 | 1.034 ± 0.241 | 0.0 ± 0.0 |
| Arg | 3.591 ± 0.545 | 0.218 ± 0.107 | 3.264 ± 0.446 | 2.775 ± 0.472 | 1.197 ± 0.313 | 2.448 ± 0.418 | 0.49 ± 0.161 | 2.013 ± 0.246 | 2.938 ± 0.568 | 3.428 ± 0.379 | 1.197 ± 0.252 | 2.938 ± 0.438 | 1.523 ± 0.322 | 1.795 ± 0.375 | 1.959 ± 0.387 | 3.156 ± 0.377 | 2.013 ± 0.408 | 2.339 ± 0.295 | 0.653 ± 0.177 | 1.687 ± 0.302 | 0.0 ± 0.0 |
| Ser | 6.746 ± 0.947 | 0.598 ± 0.2 | 4.298 ± 0.557 | 4.244 ± 0.647 | 2.557 ± 0.341 | 5.822 ± 0.701 | 1.197 ± 0.257 | 4.625 ± 0.656 | 4.353 ± 0.524 | 4.461 ± 0.408 | 2.013 ± 0.304 | 4.353 ± 0.83 | 2.884 ± 0.295 | 2.938 ± 0.365 | 2.612 ± 0.369 | 6.148 ± 0.922 | 5.495 ± 0.883 | 4.57 ± 0.587 | 0.925 ± 0.23 | 2.775 ± 0.392 | 0.0 ± 0.0 |
| Thr | 6.91 ± 1.267 | 0.272 ± 0.13 | 4.026 ± 0.453 | 4.461 ± 0.454 | 2.231 ± 0.329 | 7.018 ± 1.253 | 0.762 ± 0.22 | 4.461 ± 0.596 | 4.081 ± 0.524 | 4.407 ± 0.474 | 1.85 ± 0.27 | 4.788 ± 0.856 | 3.972 ± 0.499 | 2.666 ± 0.384 | 2.503 ± 0.412 | 5.06 ± 1.178 | 5.822 ± 1.083 | 3.808 ± 0.437 | 0.653 ± 0.17 | 2.448 ± 0.426 | 0.0 ± 0.0 |
| Val | 4.625 ± 0.579 | 0.435 ± 0.154 | 4.135 ± 0.422 | 3.863 ± 0.45 | 1.632 ± 0.263 | 4.244 ± 0.764 | 0.816 ± 0.285 | 3.428 ± 0.51 | 3.645 ± 0.432 | 3.917 ± 0.523 | 1.795 ± 0.36 | 3.536 ± 0.477 | 2.72 ± 0.737 | 1.523 ± 0.246 | 2.339 ± 0.351 | 5.386 ± 0.604 | 4.189 ± 0.659 | 3.373 ± 0.401 | 0.598 ± 0.197 | 2.013 ± 0.373 | 0.0 ± 0.0 |
| Trp | 1.088 ± 0.225 | 0.218 ± 0.102 | 0.707 ± 0.157 | 0.925 ± 0.278 | 0.435 ± 0.14 | 0.871 ± 0.25 | 0.054 ± 0.058 | 0.598 ± 0.195 | 0.871 ± 0.23 | 0.653 ± 0.212 | 0.326 ± 0.109 | 0.653 ± 0.155 | 0.326 ± 0.122 | 0.653 ± 0.187 | 0.598 ± 0.18 | 1.197 ± 0.289 | 0.544 ± 0.18 | 1.034 ± 0.235 | 0.163 ± 0.095 | 0.49 ± 0.208 | 0.0 ± 0.0 |
| Tyr | 2.013 ± 0.321 | 0.218 ± 0.106 | 2.72 ± 0.462 | 1.469 ± 0.343 | 0.925 ± 0.226 | 2.448 ± 0.369 | 0.762 ± 0.168 | 2.339 ± 0.315 | 2.013 ± 0.383 | 2.992 ± 0.426 | 0.871 ± 0.224 | 3.808 ± 0.598 | 1.143 ± 0.26 | 1.959 ± 0.305 | 1.578 ± 0.319 | 2.557 ± 0.387 | 2.992 ± 0.368 | 1.687 ± 0.366 | 0.272 ± 0.113 | 1.251 ± 0.269 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 72 proteins (18381 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
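A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is taken as the error. The function names, the use of the sample standard deviation as the error measure, the fixed seed, and the toy sequences are assumptions for illustration; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Frequency of one dipeptide (per mille) across the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences (one-letter codes), not the CRP-7 proteome.
toy = ["MAAK", "AAG", "MKKL", "GAAAL"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so neighbouring residues stay paired as they are in the real proteins.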

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski