Amino acid dipepetide frequency for Cyanophage P-RSM6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.267AlaAla: 5.267 ± 0.527
0.372AlaCys: 0.372 ± 0.099
4.233AlaAsp: 4.233 ± 0.354
4.185AlaGlu: 4.185 ± 0.317
2.424AlaPhe: 2.424 ± 0.207
5.396AlaGly: 5.396 ± 0.465
1.083AlaHis: 1.083 ± 0.155
3.975AlaIle: 3.975 ± 0.319
3.635AlaLys: 3.635 ± 0.352
4.492AlaLeu: 4.492 ± 0.214
1.309AlaMet: 1.309 ± 0.15
3.829AlaAsn: 3.829 ± 0.307
2.537AlaPro: 2.537 ± 0.206
2.44AlaGln: 2.44 ± 0.17
2.666AlaArg: 2.666 ± 0.248
4.928AlaSer: 4.928 ± 0.365
5.8AlaThr: 5.8 ± 0.597
3.991AlaVal: 3.991 ± 0.266
0.695AlaTrp: 0.695 ± 0.099
2.521AlaTyr: 2.521 ± 0.254
0.0AlaXaa: 0.0 ± 0.0
Cys
0.533CysAla: 0.533 ± 0.101
0.081CysCys: 0.081 ± 0.037
0.872CysAsp: 0.872 ± 0.114
0.727CysGlu: 0.727 ± 0.143
0.469CysPhe: 0.469 ± 0.112
0.42CysGly: 0.42 ± 0.092
0.21CysHis: 0.21 ± 0.066
0.646CysIle: 0.646 ± 0.133
0.549CysLys: 0.549 ± 0.105
0.614CysLeu: 0.614 ± 0.113
0.081CysMet: 0.081 ± 0.042
0.614CysAsn: 0.614 ± 0.118
0.436CysPro: 0.436 ± 0.112
0.259CysGln: 0.259 ± 0.085
0.372CysArg: 0.372 ± 0.096
0.792CysSer: 0.792 ± 0.126
0.776CysThr: 0.776 ± 0.124
0.598CysVal: 0.598 ± 0.13
0.145CysTrp: 0.145 ± 0.062
0.469CysTyr: 0.469 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
4.993AspAla: 4.993 ± 0.365
0.646AspCys: 0.646 ± 0.134
4.653AspAsp: 4.653 ± 0.352
4.282AspGlu: 4.282 ± 0.349
3.021AspPhe: 3.021 ± 0.216
5.671AspGly: 5.671 ± 0.428
1.115AspHis: 1.115 ± 0.154
4.783AspIle: 4.783 ± 0.243
3.506AspLys: 3.506 ± 0.387
5.073AspLeu: 5.073 ± 0.326
1.583AspMet: 1.583 ± 0.146
3.555AspAsn: 3.555 ± 0.313
3.619AspPro: 3.619 ± 0.34
2.165AspGln: 2.165 ± 0.157
2.795AspArg: 2.795 ± 0.202
4.023AspSer: 4.023 ± 0.266
4.54AspThr: 4.54 ± 0.361
4.379AspVal: 4.379 ± 0.336
1.05AspTrp: 1.05 ± 0.135
3.215AspTyr: 3.215 ± 0.273
0.0AspXaa: 0.0 ± 0.0
Glu
3.458GluAla: 3.458 ± 0.275
0.598GluCys: 0.598 ± 0.105
3.926GluAsp: 3.926 ± 0.27
4.96GluGlu: 4.96 ± 0.434
3.183GluPhe: 3.183 ± 0.24
4.508GluGly: 4.508 ± 0.316
1.341GluHis: 1.341 ± 0.217
4.411GluIle: 4.411 ± 0.256
4.217GluLys: 4.217 ± 0.597
4.96GluLeu: 4.96 ± 0.31
1.325GluMet: 1.325 ± 0.193
3.151GluAsn: 3.151 ± 0.237
2.23GluPro: 2.23 ± 0.224
2.391GluGln: 2.391 ± 0.248
2.504GluArg: 2.504 ± 0.21
3.409GluSer: 3.409 ± 0.264
3.49GluThr: 3.49 ± 0.343
4.589GluVal: 4.589 ± 0.289
0.856GluTrp: 0.856 ± 0.124
2.65GluTyr: 2.65 ± 0.228
0.0GluXaa: 0.0 ± 0.0
Phe
2.068PheAla: 2.068 ± 0.157
0.355PheCys: 0.355 ± 0.072
3.829PheAsp: 3.829 ± 0.278
2.65PheGlu: 2.65 ± 0.216
1.583PhePhe: 1.583 ± 0.14
3.134PheGly: 3.134 ± 0.276
0.727PheHis: 0.727 ± 0.131
2.537PheIle: 2.537 ± 0.214
2.036PheLys: 2.036 ± 0.154
3.118PheLeu: 3.118 ± 0.26
0.921PheMet: 0.921 ± 0.159
2.569PheAsn: 2.569 ± 0.237
1.874PhePro: 1.874 ± 0.199
1.583PheGln: 1.583 ± 0.154
1.648PheArg: 1.648 ± 0.145
3.102PheSer: 3.102 ± 0.221
3.765PheThr: 3.765 ± 0.348
2.375PheVal: 2.375 ± 0.3
0.436PheTrp: 0.436 ± 0.102
1.68PheTyr: 1.68 ± 0.191
0.0PheXaa: 0.0 ± 0.0
Gly
5.445GlyAla: 5.445 ± 0.652
0.566GlyCys: 0.566 ± 0.11
5.267GlyAsp: 5.267 ± 0.334
4.411GlyGlu: 4.411 ± 0.277
3.086GlyPhe: 3.086 ± 0.231
7.998GlyGly: 7.998 ± 0.782
1.309GlyHis: 1.309 ± 0.162
3.732GlyIle: 3.732 ± 0.229
4.427GlyLys: 4.427 ± 0.452
4.556GlyLeu: 4.556 ± 0.332
1.583GlyMet: 1.583 ± 0.201
4.863GlyAsn: 4.863 ± 0.478
1.923GlyPro: 1.923 ± 0.229
2.811GlyGln: 2.811 ± 0.232
2.731GlyArg: 2.731 ± 0.24
6.188GlySer: 6.188 ± 0.461
6.867GlyThr: 6.867 ± 0.601
4.653GlyVal: 4.653 ± 0.329
1.212GlyTrp: 1.212 ± 0.132
3.377GlyTyr: 3.377 ± 0.303
0.0GlyXaa: 0.0 ± 0.0
His
0.792HisAla: 0.792 ± 0.115
0.21HisCys: 0.21 ± 0.069
1.293HisAsp: 1.293 ± 0.139
0.986HisGlu: 0.986 ± 0.168
0.646HisPhe: 0.646 ± 0.136
1.357HisGly: 1.357 ± 0.136
0.566HisHis: 0.566 ± 0.118
1.179HisIle: 1.179 ± 0.177
1.26HisLys: 1.26 ± 0.205
1.196HisLeu: 1.196 ± 0.165
0.355HisMet: 0.355 ± 0.102
0.953HisAsn: 0.953 ± 0.129
1.179HisPro: 1.179 ± 0.16
0.662HisGln: 0.662 ± 0.093
0.969HisArg: 0.969 ± 0.193
1.131HisSer: 1.131 ± 0.121
1.632HisThr: 1.632 ± 0.281
1.018HisVal: 1.018 ± 0.162
0.323HisTrp: 0.323 ± 0.092
0.889HisTyr: 0.889 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
4.169IleAla: 4.169 ± 0.306
0.533IleCys: 0.533 ± 0.091
4.993IleAsp: 4.993 ± 0.314
4.152IleGlu: 4.152 ± 0.292
2.246IlePhe: 2.246 ± 0.175
4.249IleGly: 4.249 ± 0.341
0.856IleHis: 0.856 ± 0.147
3.7IleIle: 3.7 ± 0.358
4.443IleLys: 4.443 ± 0.259
4.362IleLeu: 4.362 ± 0.268
0.84IleMet: 0.84 ± 0.15
4.233IleAsn: 4.233 ± 0.325
2.779IlePro: 2.779 ± 0.251
2.31IleGln: 2.31 ± 0.21
2.537IleArg: 2.537 ± 0.187
4.362IleSer: 4.362 ± 0.355
5.122IleThr: 5.122 ± 0.486
4.282IleVal: 4.282 ± 0.277
0.824IleTrp: 0.824 ± 0.133
2.165IleTyr: 2.165 ± 0.21
0.0IleXaa: 0.0 ± 0.0
Lys
4.055LysAla: 4.055 ± 0.394
0.614LysCys: 0.614 ± 0.105
3.748LysAsp: 3.748 ± 0.31
4.459LysGlu: 4.459 ± 0.522
2.504LysPhe: 2.504 ± 0.257
3.894LysGly: 3.894 ± 0.486
1.147LysHis: 1.147 ± 0.169
4.411LysIle: 4.411 ± 0.321
4.766LysLys: 4.766 ± 0.648
3.942LysLeu: 3.942 ± 0.307
1.6LysMet: 1.6 ± 0.233
3.215LysAsn: 3.215 ± 0.38
2.133LysPro: 2.133 ± 0.218
2.44LysGln: 2.44 ± 0.241
2.31LysArg: 2.31 ± 0.316
3.425LysSer: 3.425 ± 0.276
3.684LysThr: 3.684 ± 0.371
4.136LysVal: 4.136 ± 0.316
0.759LysTrp: 0.759 ± 0.127
3.167LysTyr: 3.167 ± 0.251
0.0LysXaa: 0.0 ± 0.0
Leu
4.282LeuAla: 4.282 ± 0.273
0.905LeuCys: 0.905 ± 0.121
5.59LeuAsp: 5.59 ± 0.366
4.976LeuGlu: 4.976 ± 0.348
2.763LeuPhe: 2.763 ± 0.262
4.136LeuGly: 4.136 ± 0.359
1.503LeuHis: 1.503 ± 0.216
3.942LeuIle: 3.942 ± 0.225
5.154LeuLys: 5.154 ± 0.384
4.734LeuLeu: 4.734 ± 0.355
1.26LeuMet: 1.26 ± 0.178
4.734LeuAsn: 4.734 ± 0.277
2.924LeuPro: 2.924 ± 0.238
2.407LeuGln: 2.407 ± 0.193
3.021LeuArg: 3.021 ± 0.218
4.896LeuSer: 4.896 ± 0.198
5.493LeuThr: 5.493 ± 0.432
3.829LeuVal: 3.829 ± 0.279
0.695LeuTrp: 0.695 ± 0.123
3.07LeuTyr: 3.07 ± 0.274
0.0LeuXaa: 0.0 ± 0.0
Met
1.406MetAla: 1.406 ± 0.177
0.129MetCys: 0.129 ± 0.054
1.131MetAsp: 1.131 ± 0.152
1.244MetGlu: 1.244 ± 0.192
0.856MetPhe: 0.856 ± 0.146
1.099MetGly: 1.099 ± 0.18
0.42MetHis: 0.42 ± 0.097
1.083MetIle: 1.083 ± 0.132
1.874MetLys: 1.874 ± 0.296
1.373MetLeu: 1.373 ± 0.176
0.517MetMet: 0.517 ± 0.112
1.293MetAsn: 1.293 ± 0.161
1.018MetPro: 1.018 ± 0.165
0.969MetGln: 0.969 ± 0.137
0.953MetArg: 0.953 ± 0.129
1.761MetSer: 1.761 ± 0.214
1.697MetThr: 1.697 ± 0.171
1.05MetVal: 1.05 ± 0.135
0.291MetTrp: 0.291 ± 0.066
0.808MetTyr: 0.808 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
4.072AsnAla: 4.072 ± 0.379
0.856AsnCys: 0.856 ± 0.124
3.652AsnAsp: 3.652 ± 0.213
3.264AsnGlu: 3.264 ± 0.23
2.714AsnPhe: 2.714 ± 0.22
4.427AsnGly: 4.427 ± 0.329
1.325AsnHis: 1.325 ± 0.171
4.007AsnIle: 4.007 ± 0.373
3.086AsnLys: 3.086 ± 0.294
4.572AsnLeu: 4.572 ± 0.328
1.083AsnMet: 1.083 ± 0.172
3.619AsnAsn: 3.619 ± 0.315
3.183AsnPro: 3.183 ± 0.259
2.795AsnGln: 2.795 ± 0.194
2.65AsnArg: 2.65 ± 0.23
3.538AsnSer: 3.538 ± 0.347
3.926AsnThr: 3.926 ± 0.437
4.152AsnVal: 4.152 ± 0.352
0.759AsnTrp: 0.759 ± 0.119
2.488AsnTyr: 2.488 ± 0.192
0.0AsnXaa: 0.0 ± 0.0
Pro
2.779ProAla: 2.779 ± 0.262
0.501ProCys: 0.501 ± 0.136
2.698ProAsp: 2.698 ± 0.213
2.828ProGlu: 2.828 ± 0.274
1.648ProPhe: 1.648 ± 0.211
2.989ProGly: 2.989 ± 0.305
0.84ProHis: 0.84 ± 0.133
2.537ProIle: 2.537 ± 0.226
2.375ProLys: 2.375 ± 0.257
2.957ProLeu: 2.957 ± 0.279
0.792ProMet: 0.792 ± 0.116
2.488ProAsn: 2.488 ± 0.24
2.165ProPro: 2.165 ± 0.264
1.325ProGln: 1.325 ± 0.13
1.81ProArg: 1.81 ± 0.256
3.054ProSer: 3.054 ± 0.253
3.038ProThr: 3.038 ± 0.256
2.521ProVal: 2.521 ± 0.233
0.646ProTrp: 0.646 ± 0.108
1.583ProTyr: 1.583 ± 0.203
0.0ProXaa: 0.0 ± 0.0
Gln
2.246GlnAla: 2.246 ± 0.204
0.291GlnCys: 0.291 ± 0.083
2.246GlnAsp: 2.246 ± 0.209
2.149GlnGlu: 2.149 ± 0.246
1.697GlnPhe: 1.697 ± 0.168
2.763GlnGly: 2.763 ± 0.21
0.727GlnHis: 0.727 ± 0.1
2.569GlnIle: 2.569 ± 0.241
2.02GlnLys: 2.02 ± 0.195
3.134GlnLeu: 3.134 ± 0.215
0.889GlnMet: 0.889 ± 0.141
1.987GlnAsn: 1.987 ± 0.208
1.099GlnPro: 1.099 ± 0.133
1.664GlnGln: 1.664 ± 0.194
1.147GlnArg: 1.147 ± 0.146
2.779GlnSer: 2.779 ± 0.208
2.214GlnThr: 2.214 ± 0.154
2.634GlnVal: 2.634 ± 0.238
0.711GlnTrp: 0.711 ± 0.094
2.068GlnTyr: 2.068 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
2.31ArgAla: 2.31 ± 0.174
0.485ArgCys: 0.485 ± 0.103
2.601ArgAsp: 2.601 ± 0.207
2.407ArgGlu: 2.407 ± 0.249
1.777ArgPhe: 1.777 ± 0.174
2.844ArgGly: 2.844 ± 0.204
0.711ArgHis: 0.711 ± 0.121
3.021ArgIle: 3.021 ± 0.232
2.731ArgLys: 2.731 ± 0.326
2.989ArgLeu: 2.989 ± 0.226
1.066ArgMet: 1.066 ± 0.162
2.359ArgAsn: 2.359 ± 0.229
1.309ArgPro: 1.309 ± 0.162
1.47ArgGln: 1.47 ± 0.165
2.149ArgArg: 2.149 ± 0.28
2.488ArgSer: 2.488 ± 0.201
2.246ArgThr: 2.246 ± 0.202
2.682ArgVal: 2.682 ± 0.202
0.549ArgTrp: 0.549 ± 0.1
1.939ArgTyr: 1.939 ± 0.199
0.0ArgXaa: 0.0 ± 0.0
Ser
5.073SerAla: 5.073 ± 0.395
0.452SerCys: 0.452 ± 0.077
4.185SerAsp: 4.185 ± 0.284
3.765SerGlu: 3.765 ± 0.269
3.296SerPhe: 3.296 ± 0.221
7.255SerGly: 7.255 ± 0.617
1.131SerHis: 1.131 ± 0.15
4.055SerIle: 4.055 ± 0.308
3.425SerLys: 3.425 ± 0.318
4.508SerLeu: 4.508 ± 0.261
1.632SerMet: 1.632 ± 0.194
4.411SerAsn: 4.411 ± 0.335
2.569SerPro: 2.569 ± 0.256
2.133SerGln: 2.133 ± 0.25
2.488SerArg: 2.488 ± 0.201
4.96SerSer: 4.96 ± 0.47
5.122SerThr: 5.122 ± 0.409
4.007SerVal: 4.007 ± 0.25
0.856SerTrp: 0.856 ± 0.14
2.714SerTyr: 2.714 ± 0.219
0.0SerXaa: 0.0 ± 0.0
Thr
5.316ThrAla: 5.316 ± 0.561
0.776ThrCys: 0.776 ± 0.158
4.298ThrAsp: 4.298 ± 0.459
3.716ThrGlu: 3.716 ± 0.227
3.635ThrPhe: 3.635 ± 0.349
6.834ThrGly: 6.834 ± 0.706
1.179ThrHis: 1.179 ± 0.155
5.413ThrIle: 5.413 ± 0.544
3.458ThrLys: 3.458 ± 0.306
5.768ThrLeu: 5.768 ± 0.511
1.373ThrMet: 1.373 ± 0.197
4.508ThrAsn: 4.508 ± 0.524
3.312ThrPro: 3.312 ± 0.288
2.65ThrGln: 2.65 ± 0.241
2.375ThrArg: 2.375 ± 0.257
5.235ThrSer: 5.235 ± 0.388
5.784ThrThr: 5.784 ± 0.689
5.251ThrVal: 5.251 ± 0.561
0.905ThrTrp: 0.905 ± 0.163
2.908ThrTyr: 2.908 ± 0.305
0.0ThrXaa: 0.0 ± 0.0
Val
4.427ValAla: 4.427 ± 0.323
0.566ValCys: 0.566 ± 0.126
5.057ValAsp: 5.057 ± 0.452
3.813ValGlu: 3.813 ± 0.252
2.343ValPhe: 2.343 ± 0.155
4.621ValGly: 4.621 ± 0.397
1.131ValHis: 1.131 ± 0.141
3.845ValIle: 3.845 ± 0.349
3.781ValLys: 3.781 ± 0.225
4.136ValLeu: 4.136 ± 0.275
1.406ValMet: 1.406 ± 0.175
3.862ValAsn: 3.862 ± 0.329
2.747ValPro: 2.747 ± 0.295
2.133ValGln: 2.133 ± 0.215
2.488ValArg: 2.488 ± 0.235
4.702ValSer: 4.702 ± 0.351
5.8ValThr: 5.8 ± 0.657
3.975ValVal: 3.975 ± 0.32
0.679ValTrp: 0.679 ± 0.115
2.391ValTyr: 2.391 ± 0.236
0.0ValXaa: 0.0 ± 0.0
Trp
0.695TrpAla: 0.695 ± 0.104
0.129TrpCys: 0.129 ± 0.044
0.969TrpAsp: 0.969 ± 0.164
0.808TrpGlu: 0.808 ± 0.144
0.501TrpPhe: 0.501 ± 0.099
0.695TrpGly: 0.695 ± 0.155
0.275TrpHis: 0.275 ± 0.082
0.759TrpIle: 0.759 ± 0.113
1.018TrpLys: 1.018 ± 0.156
0.921TrpLeu: 0.921 ± 0.146
0.436TrpMet: 0.436 ± 0.105
1.099TrpAsn: 1.099 ± 0.124
0.42TrpPro: 0.42 ± 0.097
0.598TrpGln: 0.598 ± 0.093
0.485TrpArg: 0.485 ± 0.112
0.646TrpSer: 0.646 ± 0.107
0.759TrpThr: 0.759 ± 0.132
0.953TrpVal: 0.953 ± 0.164
0.226TrpTrp: 0.226 ± 0.07
0.695TrpTyr: 0.695 ± 0.133
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.391TyrAla: 2.391 ± 0.169
0.598TyrCys: 0.598 ± 0.109
3.425TyrAsp: 3.425 ± 0.262
2.375TyrGlu: 2.375 ± 0.219
1.6TyrPhe: 1.6 ± 0.17
2.714TyrGly: 2.714 ± 0.218
1.002TyrHis: 1.002 ± 0.202
2.553TyrIle: 2.553 ± 0.248
2.65TyrLys: 2.65 ± 0.214
2.908TyrLeu: 2.908 ± 0.269
0.905TyrMet: 0.905 ± 0.137
2.828TyrAsn: 2.828 ± 0.18
2.117TyrPro: 2.117 ± 0.26
1.826TyrGln: 1.826 ± 0.184
2.052TyrArg: 2.052 ± 0.228
2.617TyrSer: 2.617 ± 0.257
2.941TyrThr: 2.941 ± 0.269
2.828TyrVal: 2.828 ± 0.206
0.469TyrTrp: 0.469 ± 0.084
1.858TyrTyr: 1.858 ± 0.159
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 221 proteins (61893 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski