Amino acid dipepetide frequency for Synechococcus phage S-CBM2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.33AlaAla: 5.33 ± 0.605
0.446AlaCys: 0.446 ± 0.119
4.086AlaAsp: 4.086 ± 0.264
3.659AlaGlu: 3.659 ± 0.318
2.229AlaPhe: 2.229 ± 0.225
5.869AlaGly: 5.869 ± 0.509
0.929AlaHis: 0.929 ± 0.165
3.603AlaIle: 3.603 ± 0.29
3.38AlaLys: 3.38 ± 0.642
4.049AlaLeu: 4.049 ± 0.311
1.17AlaMet: 1.17 ± 0.188
4.012AlaAsn: 4.012 ± 0.351
2.674AlaPro: 2.674 ± 0.254
2.507AlaGln: 2.507 ± 0.225
2.804AlaArg: 2.804 ± 0.291
5.275AlaSer: 5.275 ± 0.451
5.757AlaThr: 5.757 ± 0.419
4.272AlaVal: 4.272 ± 0.32
0.687AlaTrp: 0.687 ± 0.123
2.173AlaTyr: 2.173 ± 0.18
0.0AlaXaa: 0.0 ± 0.0
Cys
0.743CysAla: 0.743 ± 0.153
0.13CysCys: 0.13 ± 0.056
0.65CysAsp: 0.65 ± 0.106
0.631CysGlu: 0.631 ± 0.128
0.334CysPhe: 0.334 ± 0.103
0.669CysGly: 0.669 ± 0.125
0.204CysHis: 0.204 ± 0.061
0.594CysIle: 0.594 ± 0.113
0.576CysLys: 0.576 ± 0.139
0.613CysLeu: 0.613 ± 0.127
0.26CysMet: 0.26 ± 0.075
0.371CysAsn: 0.371 ± 0.092
0.483CysPro: 0.483 ± 0.121
0.353CysGln: 0.353 ± 0.085
0.52CysArg: 0.52 ± 0.112
0.52CysSer: 0.52 ± 0.109
0.501CysThr: 0.501 ± 0.097
0.594CysVal: 0.594 ± 0.114
0.13CysTrp: 0.13 ± 0.054
0.427CysTyr: 0.427 ± 0.1
0.0CysXaa: 0.0 ± 0.0
Asp
4.272AspAla: 4.272 ± 0.287
0.706AspCys: 0.706 ± 0.146
4.439AspAsp: 4.439 ± 0.373
3.975AspGlu: 3.975 ± 0.32
3.027AspPhe: 3.027 ± 0.265
5.535AspGly: 5.535 ± 0.367
1.114AspHis: 1.114 ± 0.169
3.9AspIle: 3.9 ± 0.298
3.38AspLys: 3.38 ± 0.27
5.145AspLeu: 5.145 ± 0.389
1.151AspMet: 1.151 ± 0.215
3.269AspAsn: 3.269 ± 0.279
2.749AspPro: 2.749 ± 0.226
2.08AspGln: 2.08 ± 0.226
2.804AspArg: 2.804 ± 0.275
4.309AspSer: 4.309 ± 0.358
3.473AspThr: 3.473 ± 0.225
4.03AspVal: 4.03 ± 0.282
0.984AspTrp: 0.984 ± 0.15
3.343AspTyr: 3.343 ± 0.261
0.0AspXaa: 0.0 ± 0.0
Glu
3.102GluAla: 3.102 ± 0.3
0.594GluCys: 0.594 ± 0.112
3.733GluAsp: 3.733 ± 0.33
4.179GluGlu: 4.179 ± 0.57
3.139GluPhe: 3.139 ± 0.246
4.142GluGly: 4.142 ± 0.284
0.91GluHis: 0.91 ± 0.145
4.662GluIle: 4.662 ± 0.3
4.383GluLys: 4.383 ± 0.474
5.256GluLeu: 5.256 ± 0.358
1.504GluMet: 1.504 ± 0.233
3.139GluAsn: 3.139 ± 0.3
1.523GluPro: 1.523 ± 0.236
2.377GluGln: 2.377 ± 0.235
2.712GluArg: 2.712 ± 0.273
3.677GluSer: 3.677 ± 0.284
3.064GluThr: 3.064 ± 0.256
4.365GluVal: 4.365 ± 0.299
1.17GluTrp: 1.17 ± 0.181
2.47GluTyr: 2.47 ± 0.2
0.0GluXaa: 0.0 ± 0.0
Phe
2.247PheAla: 2.247 ± 0.192
0.371PheCys: 0.371 ± 0.068
3.77PheAsp: 3.77 ± 0.345
2.47PheGlu: 2.47 ± 0.229
1.727PhePhe: 1.727 ± 0.162
3.306PheGly: 3.306 ± 0.311
0.724PheHis: 0.724 ± 0.146
2.73PheIle: 2.73 ± 0.233
2.712PheLys: 2.712 ± 0.246
3.306PheLeu: 3.306 ± 0.312
0.613PheMet: 0.613 ± 0.106
2.73PheAsn: 2.73 ± 0.289
1.523PhePro: 1.523 ± 0.166
1.263PheGln: 1.263 ± 0.175
1.857PheArg: 1.857 ± 0.185
3.12PheSer: 3.12 ± 0.261
3.269PheThr: 3.269 ± 0.323
2.804PheVal: 2.804 ± 0.264
0.446PheTrp: 0.446 ± 0.093
1.802PheTyr: 1.802 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
5.646GlyAla: 5.646 ± 0.454
0.761GlyCys: 0.761 ± 0.139
4.829GlyAsp: 4.829 ± 0.339
4.365GlyGlu: 4.365 ± 0.306
2.823GlyPhe: 2.823 ± 0.23
7.726GlyGly: 7.726 ± 0.782
0.947GlyHis: 0.947 ± 0.145
4.476GlyIle: 4.476 ± 0.341
3.826GlyLys: 3.826 ± 0.476
4.365GlyLeu: 4.365 ± 0.307
1.393GlyMet: 1.393 ± 0.201
5.367GlyAsn: 5.367 ± 0.646
1.987GlyPro: 1.987 ± 0.301
3.176GlyGln: 3.176 ± 0.352
3.324GlyArg: 3.324 ± 0.284
6.222GlySer: 6.222 ± 0.526
7.912GlyThr: 7.912 ± 0.713
5.275GlyVal: 5.275 ± 0.314
1.133GlyTrp: 1.133 ± 0.17
3.064GlyTyr: 3.064 ± 0.387
0.0GlyXaa: 0.0 ± 0.0
His
0.854HisAla: 0.854 ± 0.136
0.241HisCys: 0.241 ± 0.067
0.724HisAsp: 0.724 ± 0.136
0.799HisGlu: 0.799 ± 0.136
0.706HisPhe: 0.706 ± 0.131
1.096HisGly: 1.096 ± 0.163
0.334HisHis: 0.334 ± 0.092
0.836HisIle: 0.836 ± 0.121
0.799HisLys: 0.799 ± 0.163
1.096HisLeu: 1.096 ± 0.14
0.353HisMet: 0.353 ± 0.096
0.743HisAsn: 0.743 ± 0.126
1.021HisPro: 1.021 ± 0.164
0.483HisGln: 0.483 ± 0.099
0.836HisArg: 0.836 ± 0.132
0.669HisSer: 0.669 ± 0.106
1.096HisThr: 1.096 ± 0.202
0.891HisVal: 0.891 ± 0.134
0.241HisTrp: 0.241 ± 0.067
0.743HisTyr: 0.743 ± 0.131
0.0HisXaa: 0.0 ± 0.0
Ile
4.94IleAla: 4.94 ± 0.328
0.557IleCys: 0.557 ± 0.148
4.773IleAsp: 4.773 ± 0.331
4.792IleGlu: 4.792 ± 0.347
2.136IlePhe: 2.136 ± 0.187
4.327IleGly: 4.327 ± 0.295
0.873IleHis: 0.873 ± 0.139
3.845IleIle: 3.845 ± 0.447
3.603IleLys: 3.603 ± 0.25
5.126IleLeu: 5.126 ± 0.396
1.263IleMet: 1.263 ± 0.16
4.086IleAsn: 4.086 ± 0.379
2.99IlePro: 2.99 ± 0.236
2.303IleGln: 2.303 ± 0.198
3.176IleArg: 3.176 ± 0.313
4.847IleSer: 4.847 ± 0.438
5.219IleThr: 5.219 ± 0.637
3.807IleVal: 3.807 ± 0.278
0.594IleTrp: 0.594 ± 0.128
2.247IleTyr: 2.247 ± 0.213
0.0IleXaa: 0.0 ± 0.0
Lys
3.027LysAla: 3.027 ± 0.554
0.427LysCys: 0.427 ± 0.101
2.99LysAsp: 2.99 ± 0.304
4.142LysGlu: 4.142 ± 0.571
2.934LysPhe: 2.934 ± 0.269
3.102LysGly: 3.102 ± 0.388
0.799LysHis: 0.799 ± 0.174
4.439LysIle: 4.439 ± 0.307
4.773LysLys: 4.773 ± 0.832
4.699LysLeu: 4.699 ± 0.472
1.783LysMet: 1.783 ± 0.272
3.343LysAsn: 3.343 ± 0.357
1.727LysPro: 1.727 ± 0.282
2.024LysGln: 2.024 ± 0.235
2.359LysArg: 2.359 ± 0.33
3.845LysSer: 3.845 ± 0.417
3.399LysThr: 3.399 ± 0.298
4.309LysVal: 4.309 ± 0.309
0.65LysTrp: 0.65 ± 0.147
2.47LysTyr: 2.47 ± 0.238
0.0LysXaa: 0.0 ± 0.0
Leu
4.55LeuAla: 4.55 ± 0.404
0.761LeuCys: 0.761 ± 0.178
5.089LeuAsp: 5.089 ± 0.377
4.903LeuGlu: 4.903 ± 0.367
3.102LeuPhe: 3.102 ± 0.223
4.755LeuGly: 4.755 ± 0.381
1.096LeuHis: 1.096 ± 0.196
4.309LeuIle: 4.309 ± 0.336
5.07LeuLys: 5.07 ± 0.554
4.959LeuLeu: 4.959 ± 0.346
1.282LeuMet: 1.282 ± 0.197
4.922LeuAsn: 4.922 ± 0.35
3.176LeuPro: 3.176 ± 0.286
2.749LeuGln: 2.749 ± 0.273
3.083LeuArg: 3.083 ± 0.254
5.423LeuSer: 5.423 ± 0.27
5.795LeuThr: 5.795 ± 0.468
4.792LeuVal: 4.792 ± 0.308
0.631LeuTrp: 0.631 ± 0.088
3.102LeuTyr: 3.102 ± 0.269
0.019LeuXaa: 0.019 ± 0.02
Met
1.282MetAla: 1.282 ± 0.207
0.26MetCys: 0.26 ± 0.085
1.189MetAsp: 1.189 ± 0.202
0.966MetGlu: 0.966 ± 0.167
0.669MetPhe: 0.669 ± 0.119
1.096MetGly: 1.096 ± 0.162
0.39MetHis: 0.39 ± 0.109
1.56MetIle: 1.56 ± 0.227
1.356MetLys: 1.356 ± 0.234
1.616MetLeu: 1.616 ± 0.274
0.483MetMet: 0.483 ± 0.127
1.244MetAsn: 1.244 ± 0.19
1.04MetPro: 1.04 ± 0.197
1.003MetGln: 1.003 ± 0.159
0.687MetArg: 0.687 ± 0.106
1.56MetSer: 1.56 ± 0.234
1.151MetThr: 1.151 ± 0.171
0.966MetVal: 0.966 ± 0.143
0.241MetTrp: 0.241 ± 0.071
0.891MetTyr: 0.891 ± 0.138
0.0MetXaa: 0.0 ± 0.0
Asn
3.584AsnAla: 3.584 ± 0.344
0.576AsnCys: 0.576 ± 0.102
3.009AsnAsp: 3.009 ± 0.251
2.934AsnGlu: 2.934 ± 0.287
2.73AsnPhe: 2.73 ± 0.228
5.052AsnGly: 5.052 ± 0.534
1.077AsnHis: 1.077 ± 0.183
4.402AsnIle: 4.402 ± 0.387
2.73AsnLys: 2.73 ± 0.253
4.829AsnLeu: 4.829 ± 0.315
0.91AsnMet: 0.91 ± 0.14
3.25AsnAsn: 3.25 ± 0.374
2.879AsnPro: 2.879 ± 0.299
2.136AsnGln: 2.136 ± 0.224
2.674AsnArg: 2.674 ± 0.223
4.439AsnSer: 4.439 ± 0.42
4.903AsnThr: 4.903 ± 0.533
4.643AsnVal: 4.643 ± 0.315
0.799AsnTrp: 0.799 ± 0.149
2.879AsnTyr: 2.879 ± 0.203
0.0AsnXaa: 0.0 ± 0.0
Pro
2.34ProAla: 2.34 ± 0.219
0.409ProCys: 0.409 ± 0.094
2.749ProAsp: 2.749 ± 0.23
2.712ProGlu: 2.712 ± 0.317
1.579ProPhe: 1.579 ± 0.156
2.99ProGly: 2.99 ± 0.332
0.52ProHis: 0.52 ± 0.105
2.154ProIle: 2.154 ± 0.216
2.006ProLys: 2.006 ± 0.315
2.229ProLeu: 2.229 ± 0.237
0.631ProMet: 0.631 ± 0.143
2.117ProAsn: 2.117 ± 0.234
1.932ProPro: 1.932 ± 0.298
1.616ProGln: 1.616 ± 0.214
1.356ProArg: 1.356 ± 0.131
3.102ProSer: 3.102 ± 0.266
3.492ProThr: 3.492 ± 0.228
2.73ProVal: 2.73 ± 0.281
0.557ProTrp: 0.557 ± 0.125
1.746ProTyr: 1.746 ± 0.188
0.019ProXaa: 0.019 ± 0.02
Gln
2.08GlnAla: 2.08 ± 0.181
0.371GlnCys: 0.371 ± 0.091
1.727GlnAsp: 1.727 ± 0.181
2.433GlnGlu: 2.433 ± 0.255
2.024GlnPhe: 2.024 ± 0.212
2.637GlnGly: 2.637 ± 0.344
0.594GlnHis: 0.594 ± 0.116
2.674GlnIle: 2.674 ± 0.194
1.932GlnLys: 1.932 ± 0.25
3.194GlnLeu: 3.194 ± 0.227
0.91GlnMet: 0.91 ± 0.165
2.563GlnAsn: 2.563 ± 0.288
1.356GlnPro: 1.356 ± 0.167
1.449GlnGln: 1.449 ± 0.211
1.709GlnArg: 1.709 ± 0.175
2.656GlnSer: 2.656 ± 0.22
2.544GlnThr: 2.544 ± 0.331
2.322GlnVal: 2.322 ± 0.241
0.576GlnTrp: 0.576 ± 0.122
1.653GlnTyr: 1.653 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
1.69ArgAla: 1.69 ± 0.261
0.279ArgCys: 0.279 ± 0.069
2.73ArgAsp: 2.73 ± 0.252
2.544ArgGlu: 2.544 ± 0.305
2.024ArgPhe: 2.024 ± 0.208
3.12ArgGly: 3.12 ± 0.261
0.52ArgHis: 0.52 ± 0.102
3.417ArgIle: 3.417 ± 0.228
2.786ArgLys: 2.786 ± 0.358
3.306ArgLeu: 3.306 ± 0.219
1.151ArgMet: 1.151 ± 0.169
2.414ArgAsn: 2.414 ± 0.205
1.634ArgPro: 1.634 ± 0.177
1.579ArgGln: 1.579 ± 0.173
2.229ArgArg: 2.229 ± 0.246
2.712ArgSer: 2.712 ± 0.219
2.062ArgThr: 2.062 ± 0.225
3.12ArgVal: 3.12 ± 0.31
0.687ArgTrp: 0.687 ± 0.151
2.34ArgTyr: 2.34 ± 0.235
0.0ArgXaa: 0.0 ± 0.0
Ser
5.776SerAla: 5.776 ± 0.457
0.52SerCys: 0.52 ± 0.133
4.179SerAsp: 4.179 ± 0.318
3.696SerGlu: 3.696 ± 0.281
3.38SerPhe: 3.38 ± 0.293
6.928SerGly: 6.928 ± 0.502
0.966SerHis: 0.966 ± 0.147
4.903SerIle: 4.903 ± 0.36
3.696SerLys: 3.696 ± 0.31
5.46SerLeu: 5.46 ± 0.298
1.3SerMet: 1.3 ± 0.223
4.587SerAsn: 4.587 ± 0.323
2.544SerPro: 2.544 ± 0.225
2.452SerGln: 2.452 ± 0.225
2.21SerArg: 2.21 ± 0.169
6.24SerSer: 6.24 ± 0.626
5.386SerThr: 5.386 ± 0.503
4.643SerVal: 4.643 ± 0.439
0.706SerTrp: 0.706 ± 0.097
3.064SerTyr: 3.064 ± 0.249
0.0SerXaa: 0.0 ± 0.0
Thr
5.59ThrAla: 5.59 ± 0.459
0.52ThrCys: 0.52 ± 0.13
4.012ThrAsp: 4.012 ± 0.323
3.677ThrGlu: 3.677 ± 0.272
3.027ThrPhe: 3.027 ± 0.294
7.262ThrGly: 7.262 ± 0.613
0.966ThrHis: 0.966 ± 0.179
5.367ThrIle: 5.367 ± 0.459
3.473ThrLys: 3.473 ± 0.274
5.72ThrLeu: 5.72 ± 0.74
1.189ThrMet: 1.189 ± 0.179
4.55ThrAsn: 4.55 ± 0.567
3.139ThrPro: 3.139 ± 0.245
2.916ThrGln: 2.916 ± 0.244
2.507ThrArg: 2.507 ± 0.246
6.017ThrSer: 6.017 ± 0.573
6.259ThrThr: 6.259 ± 0.803
5.535ThrVal: 5.535 ± 0.529
0.873ThrTrp: 0.873 ± 0.17
3.046ThrTyr: 3.046 ± 0.273
0.0ThrXaa: 0.0 ± 0.0
Val
4.773ValAla: 4.773 ± 0.364
0.65ValCys: 0.65 ± 0.127
4.922ValAsp: 4.922 ± 0.317
4.105ValGlu: 4.105 ± 0.295
3.194ValPhe: 3.194 ± 0.26
4.847ValGly: 4.847 ± 0.386
0.52ValHis: 0.52 ± 0.101
4.142ValIle: 4.142 ± 0.321
3.956ValLys: 3.956 ± 0.347
4.29ValLeu: 4.29 ± 0.266
1.207ValMet: 1.207 ± 0.174
4.067ValAsn: 4.067 ± 0.377
2.712ValPro: 2.712 ± 0.252
2.637ValGln: 2.637 ± 0.206
2.916ValArg: 2.916 ± 0.224
4.365ValSer: 4.365 ± 0.362
6.073ValThr: 6.073 ± 0.603
5.089ValVal: 5.089 ± 0.443
0.761ValTrp: 0.761 ± 0.106
2.377ValTyr: 2.377 ± 0.261
0.0ValXaa: 0.0 ± 0.0
Trp
0.65TrpAla: 0.65 ± 0.131
0.223TrpCys: 0.223 ± 0.067
0.891TrpAsp: 0.891 ± 0.119
0.799TrpGlu: 0.799 ± 0.144
0.409TrpPhe: 0.409 ± 0.085
1.077TrpGly: 1.077 ± 0.235
0.26TrpHis: 0.26 ± 0.076
0.78TrpIle: 0.78 ± 0.124
0.817TrpLys: 0.817 ± 0.143
0.891TrpLeu: 0.891 ± 0.159
0.316TrpMet: 0.316 ± 0.074
0.836TrpAsn: 0.836 ± 0.14
0.204TrpPro: 0.204 ± 0.069
0.631TrpGln: 0.631 ± 0.119
0.52TrpArg: 0.52 ± 0.116
0.836TrpSer: 0.836 ± 0.152
0.854TrpThr: 0.854 ± 0.154
0.706TrpVal: 0.706 ± 0.132
0.223TrpTrp: 0.223 ± 0.056
0.576TrpTyr: 0.576 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.433TyrAla: 2.433 ± 0.223
0.501TyrCys: 0.501 ± 0.101
3.194TyrAsp: 3.194 ± 0.262
2.322TyrGlu: 2.322 ± 0.201
1.597TyrPhe: 1.597 ± 0.198
3.083TyrGly: 3.083 ± 0.309
0.817TyrHis: 0.817 ± 0.158
2.712TyrIle: 2.712 ± 0.184
1.987TyrLys: 1.987 ± 0.234
3.417TyrLeu: 3.417 ± 0.292
0.761TyrMet: 0.761 ± 0.127
2.823TyrAsn: 2.823 ± 0.318
1.69TyrPro: 1.69 ± 0.158
1.672TyrGln: 1.672 ± 0.173
2.062TyrArg: 2.062 ± 0.232
2.786TyrSer: 2.786 ± 0.216
3.436TyrThr: 3.436 ± 0.324
2.712TyrVal: 2.712 ± 0.275
0.409TyrTrp: 0.409 ± 0.099
2.062TyrTyr: 2.062 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.019XaaSer: 0.019 ± 0.02
0.019XaaThr: 0.019 ± 0.02
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 197 proteins (53844 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski