Amino acid dipepetide frequency for Cyanophage P-RSM1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.074AlaAla: 6.074 ± 0.495
0.408AlaCys: 0.408 ± 0.101
4.032AlaAsp: 4.032 ± 0.271
3.907AlaGlu: 3.907 ± 0.367
2.753AlaPhe: 2.753 ± 0.272
6.607AlaGly: 6.607 ± 0.439
1.012AlaHis: 1.012 ± 0.118
4.032AlaIle: 4.032 ± 0.235
4.067AlaLys: 4.067 ± 0.389
4.44AlaLeu: 4.44 ± 0.315
1.598AlaMet: 1.598 ± 0.2
3.818AlaAsn: 3.818 ± 0.341
2.611AlaPro: 2.611 ± 0.21
2.593AlaGln: 2.593 ± 0.197
2.522AlaArg: 2.522 ± 0.207
5.346AlaSer: 5.346 ± 0.436
5.896AlaThr: 5.896 ± 0.621
4.333AlaVal: 4.333 ± 0.379
0.568AlaTrp: 0.568 ± 0.107
2.717AlaTyr: 2.717 ± 0.212
0.0AlaXaa: 0.0 ± 0.0
Cys
0.515CysAla: 0.515 ± 0.121
0.124CysCys: 0.124 ± 0.061
0.551CysAsp: 0.551 ± 0.119
0.515CysGlu: 0.515 ± 0.112
0.497CysPhe: 0.497 ± 0.101
0.551CysGly: 0.551 ± 0.106
0.195CysHis: 0.195 ± 0.072
0.426CysIle: 0.426 ± 0.094
0.604CysLys: 0.604 ± 0.089
0.746CysLeu: 0.746 ± 0.14
0.266CysMet: 0.266 ± 0.082
0.426CysAsn: 0.426 ± 0.109
0.373CysPro: 0.373 ± 0.091
0.408CysGln: 0.408 ± 0.101
0.426CysArg: 0.426 ± 0.109
0.675CysSer: 0.675 ± 0.099
0.639CysThr: 0.639 ± 0.11
0.515CysVal: 0.515 ± 0.107
0.213CysTrp: 0.213 ± 0.071
0.355CysTyr: 0.355 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
4.884AspAla: 4.884 ± 0.356
0.764AspCys: 0.764 ± 0.138
4.387AspAsp: 4.387 ± 0.345
3.73AspGlu: 3.73 ± 0.355
3.037AspPhe: 3.037 ± 0.257
6.021AspGly: 6.021 ± 0.439
0.906AspHis: 0.906 ± 0.168
3.818AspIle: 3.818 ± 0.319
3.374AspLys: 3.374 ± 0.322
5.221AspLeu: 5.221 ± 0.309
1.368AspMet: 1.368 ± 0.172
3.907AspAsn: 3.907 ± 0.267
3.144AspPro: 3.144 ± 0.28
2.398AspGln: 2.398 ± 0.214
2.7AspArg: 2.7 ± 0.24
4.103AspSer: 4.103 ± 0.262
4.369AspThr: 4.369 ± 0.297
4.138AspVal: 4.138 ± 0.261
0.941AspTrp: 0.941 ± 0.149
3.374AspTyr: 3.374 ± 0.238
0.0AspXaa: 0.0 ± 0.0
Glu
3.676GluAla: 3.676 ± 0.287
0.604GluCys: 0.604 ± 0.135
3.943GluAsp: 3.943 ± 0.354
4.582GluGlu: 4.582 ± 0.51
2.948GluPhe: 2.948 ± 0.25
3.872GluGly: 3.872 ± 0.29
1.101GluHis: 1.101 ± 0.177
4.014GluIle: 4.014 ± 0.321
3.783GluLys: 3.783 ± 0.537
4.742GluLeu: 4.742 ± 0.333
1.385GluMet: 1.385 ± 0.197
2.913GluAsn: 2.913 ± 0.265
1.669GluPro: 1.669 ± 0.172
2.149GluGln: 2.149 ± 0.185
2.273GluArg: 2.273 ± 0.213
3.765GluSer: 3.765 ± 0.356
4.085GluThr: 4.085 ± 0.318
4.458GluVal: 4.458 ± 0.268
0.941GluTrp: 0.941 ± 0.142
2.842GluTyr: 2.842 ± 0.232
0.0GluXaa: 0.0 ± 0.0
Phe
2.842PheAla: 2.842 ± 0.204
0.337PheCys: 0.337 ± 0.091
3.588PheAsp: 3.588 ± 0.263
2.717PheGlu: 2.717 ± 0.241
1.705PhePhe: 1.705 ± 0.206
3.037PheGly: 3.037 ± 0.214
0.622PheHis: 0.622 ± 0.135
2.486PheIle: 2.486 ± 0.269
2.628PheLys: 2.628 ± 0.219
3.055PheLeu: 3.055 ± 0.291
1.012PheMet: 1.012 ± 0.161
2.771PheAsn: 2.771 ± 0.236
1.758PhePro: 1.758 ± 0.183
1.634PheGln: 1.634 ± 0.154
1.492PheArg: 1.492 ± 0.135
3.019PheSer: 3.019 ± 0.258
3.73PheThr: 3.73 ± 0.403
2.628PheVal: 2.628 ± 0.198
0.426PheTrp: 0.426 ± 0.075
1.652PheTyr: 1.652 ± 0.162
0.0PheXaa: 0.0 ± 0.0
Gly
6.234GlyAla: 6.234 ± 0.61
0.852GlyCys: 0.852 ± 0.161
4.849GlyAsp: 4.849 ± 0.366
3.889GlyGlu: 3.889 ± 0.251
3.286GlyPhe: 3.286 ± 0.287
7.761GlyGly: 7.761 ± 0.707
1.261GlyHis: 1.261 ± 0.197
4.422GlyIle: 4.422 ± 0.291
4.191GlyLys: 4.191 ± 0.414
4.849GlyLeu: 4.849 ± 0.344
1.705GlyMet: 1.705 ± 0.226
4.476GlyAsn: 4.476 ± 0.387
2.149GlyPro: 2.149 ± 0.179
2.895GlyGln: 2.895 ± 0.259
3.144GlyArg: 3.144 ± 0.263
6.98GlySer: 6.98 ± 0.594
7.459GlyThr: 7.459 ± 0.628
5.079GlyVal: 5.079 ± 0.353
0.959GlyTrp: 0.959 ± 0.144
3.286GlyTyr: 3.286 ± 0.275
0.0GlyXaa: 0.0 ± 0.0
His
0.817HisAla: 0.817 ± 0.161
0.32HisCys: 0.32 ± 0.081
1.172HisAsp: 1.172 ± 0.19
0.852HisGlu: 0.852 ± 0.115
0.764HisPhe: 0.764 ± 0.147
1.083HisGly: 1.083 ± 0.17
0.231HisHis: 0.231 ± 0.073
1.048HisIle: 1.048 ± 0.153
0.995HisLys: 0.995 ± 0.177
1.066HisLeu: 1.066 ± 0.127
0.32HisMet: 0.32 ± 0.084
0.71HisAsn: 0.71 ± 0.13
0.87HisPro: 0.87 ± 0.164
0.551HisGln: 0.551 ± 0.109
0.764HisArg: 0.764 ± 0.14
0.995HisSer: 0.995 ± 0.152
0.959HisThr: 0.959 ± 0.127
0.977HisVal: 0.977 ± 0.123
0.231HisTrp: 0.231 ± 0.067
1.03HisTyr: 1.03 ± 0.141
0.0HisXaa: 0.0 ± 0.0
Ile
3.943IleAla: 3.943 ± 0.291
0.693IleCys: 0.693 ± 0.115
4.529IleAsp: 4.529 ± 0.227
3.801IleGlu: 3.801 ± 0.253
2.504IlePhe: 2.504 ± 0.207
3.694IleGly: 3.694 ± 0.286
0.622IleHis: 0.622 ± 0.128
3.215IleIle: 3.215 ± 0.336
3.801IleLys: 3.801 ± 0.296
3.907IleLeu: 3.907 ± 0.288
1.03IleMet: 1.03 ± 0.155
3.854IleAsn: 3.854 ± 0.285
2.735IlePro: 2.735 ± 0.221
2.575IleGln: 2.575 ± 0.256
2.344IleArg: 2.344 ± 0.184
4.618IleSer: 4.618 ± 0.306
5.506IleThr: 5.506 ± 0.582
3.641IleVal: 3.641 ± 0.269
0.728IleTrp: 0.728 ± 0.123
2.06IleTyr: 2.06 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
3.641LysAla: 3.641 ± 0.385
0.533LysCys: 0.533 ± 0.088
3.676LysAsp: 3.676 ± 0.346
4.191LysGlu: 4.191 ± 0.585
2.433LysPhe: 2.433 ± 0.235
3.996LysGly: 3.996 ± 0.44
0.959LysHis: 0.959 ± 0.157
4.138LysIle: 4.138 ± 0.262
5.221LysLys: 5.221 ± 0.621
4.92LysLeu: 4.92 ± 0.423
1.474LysMet: 1.474 ± 0.197
3.144LysAsn: 3.144 ± 0.26
1.829LysPro: 1.829 ± 0.208
2.238LysGln: 2.238 ± 0.247
2.433LysArg: 2.433 ± 0.276
3.499LysSer: 3.499 ± 0.358
3.801LysThr: 3.801 ± 0.303
4.618LysVal: 4.618 ± 0.349
0.799LysTrp: 0.799 ± 0.147
2.771LysTyr: 2.771 ± 0.261
0.0LysXaa: 0.0 ± 0.0
Leu
4.387LeuAla: 4.387 ± 0.293
0.746LeuCys: 0.746 ± 0.146
5.559LeuAsp: 5.559 ± 0.336
4.387LeuGlu: 4.387 ± 0.314
2.806LeuPhe: 2.806 ± 0.242
4.493LeuGly: 4.493 ± 0.347
1.314LeuHis: 1.314 ± 0.178
3.392LeuIle: 3.392 ± 0.239
4.795LeuLys: 4.795 ± 0.42
4.547LeuLeu: 4.547 ± 0.318
1.421LeuMet: 1.421 ± 0.196
4.742LeuAsn: 4.742 ± 0.295
3.144LeuPro: 3.144 ± 0.268
2.735LeuGln: 2.735 ± 0.186
2.913LeuArg: 2.913 ± 0.224
4.92LeuSer: 4.92 ± 0.246
5.523LeuThr: 5.523 ± 0.368
4.387LeuVal: 4.387 ± 0.287
0.568LeuTrp: 0.568 ± 0.118
3.463LeuTyr: 3.463 ± 0.291
0.0LeuXaa: 0.0 ± 0.0
Met
1.474MetAla: 1.474 ± 0.214
0.107MetCys: 0.107 ± 0.05
1.243MetAsp: 1.243 ± 0.183
1.296MetGlu: 1.296 ± 0.22
0.835MetPhe: 0.835 ± 0.152
1.208MetGly: 1.208 ± 0.177
0.497MetHis: 0.497 ± 0.107
1.208MetIle: 1.208 ± 0.162
1.954MetLys: 1.954 ± 0.325
1.421MetLeu: 1.421 ± 0.202
0.622MetMet: 0.622 ± 0.123
1.279MetAsn: 1.279 ± 0.176
0.71MetPro: 0.71 ± 0.118
0.746MetGln: 0.746 ± 0.116
1.012MetArg: 1.012 ± 0.161
2.025MetSer: 2.025 ± 0.252
1.314MetThr: 1.314 ± 0.173
1.154MetVal: 1.154 ± 0.146
0.266MetTrp: 0.266 ± 0.075
0.568MetTyr: 0.568 ± 0.121
0.0MetXaa: 0.0 ± 0.0
Asn
3.801AsnAla: 3.801 ± 0.267
0.551AsnCys: 0.551 ± 0.104
3.41AsnAsp: 3.41 ± 0.225
2.948AsnGlu: 2.948 ± 0.3
2.859AsnPhe: 2.859 ± 0.194
4.618AsnGly: 4.618 ± 0.411
0.959AsnHis: 0.959 ± 0.128
3.907AsnIle: 3.907 ± 0.353
3.037AsnLys: 3.037 ± 0.288
4.564AsnLeu: 4.564 ± 0.336
0.87AsnMet: 0.87 ± 0.136
3.676AsnAsn: 3.676 ± 0.277
3.197AsnPro: 3.197 ± 0.21
2.38AsnGln: 2.38 ± 0.21
2.38AsnArg: 2.38 ± 0.184
3.747AsnSer: 3.747 ± 0.325
4.156AsnThr: 4.156 ± 0.356
4.671AsnVal: 4.671 ± 0.321
0.693AsnTrp: 0.693 ± 0.126
2.433AsnTyr: 2.433 ± 0.227
0.0AsnXaa: 0.0 ± 0.0
Pro
2.327ProAla: 2.327 ± 0.216
0.249ProCys: 0.249 ± 0.072
2.469ProAsp: 2.469 ± 0.202
2.593ProGlu: 2.593 ± 0.24
1.883ProPhe: 1.883 ± 0.188
3.126ProGly: 3.126 ± 0.284
0.799ProHis: 0.799 ± 0.138
2.273ProIle: 2.273 ± 0.2
2.202ProLys: 2.202 ± 0.26
2.344ProLeu: 2.344 ± 0.227
0.71ProMet: 0.71 ± 0.125
2.398ProAsn: 2.398 ± 0.224
1.581ProPro: 1.581 ± 0.247
1.243ProGln: 1.243 ± 0.182
1.439ProArg: 1.439 ± 0.169
3.161ProSer: 3.161 ± 0.21
2.877ProThr: 2.877 ± 0.198
2.664ProVal: 2.664 ± 0.263
0.568ProTrp: 0.568 ± 0.105
1.74ProTyr: 1.74 ± 0.159
0.0ProXaa: 0.0 ± 0.0
Gln
2.327GlnAla: 2.327 ± 0.213
0.373GlnCys: 0.373 ± 0.086
2.06GlnAsp: 2.06 ± 0.166
2.433GlnGlu: 2.433 ± 0.222
1.669GlnPhe: 1.669 ± 0.201
2.54GlnGly: 2.54 ± 0.217
0.675GlnHis: 0.675 ± 0.131
2.753GlnIle: 2.753 ± 0.224
2.451GlnLys: 2.451 ± 0.267
2.895GlnLeu: 2.895 ± 0.23
0.924GlnMet: 0.924 ± 0.147
2.06GlnAsn: 2.06 ± 0.217
1.154GlnPro: 1.154 ± 0.153
1.456GlnGln: 1.456 ± 0.198
1.723GlnArg: 1.723 ± 0.165
2.628GlnSer: 2.628 ± 0.212
2.54GlnThr: 2.54 ± 0.213
2.664GlnVal: 2.664 ± 0.185
0.551GlnTrp: 0.551 ± 0.11
1.776GlnTyr: 1.776 ± 0.172
0.0GlnXaa: 0.0 ± 0.0
Arg
2.309ArgAla: 2.309 ± 0.229
0.337ArgCys: 0.337 ± 0.075
2.38ArgAsp: 2.38 ± 0.215
2.22ArgGlu: 2.22 ± 0.237
1.936ArgPhe: 1.936 ± 0.141
3.072ArgGly: 3.072 ± 0.267
0.586ArgHis: 0.586 ± 0.126
2.682ArgIle: 2.682 ± 0.187
2.557ArgLys: 2.557 ± 0.3
3.339ArgLeu: 3.339 ± 0.247
1.03ArgMet: 1.03 ± 0.183
1.883ArgAsn: 1.883 ± 0.204
1.314ArgPro: 1.314 ± 0.159
1.296ArgGln: 1.296 ± 0.169
1.918ArgArg: 1.918 ± 0.276
2.273ArgSer: 2.273 ± 0.211
2.291ArgThr: 2.291 ± 0.214
3.001ArgVal: 3.001 ± 0.242
0.391ArgTrp: 0.391 ± 0.073
2.309ArgTyr: 2.309 ± 0.248
0.0ArgXaa: 0.0 ± 0.0
Ser
5.488SerAla: 5.488 ± 0.364
0.497SerCys: 0.497 ± 0.111
4.476SerAsp: 4.476 ± 0.37
3.747SerGlu: 3.747 ± 0.272
3.215SerPhe: 3.215 ± 0.271
8.081SerGly: 8.081 ± 0.593
0.941SerHis: 0.941 ± 0.152
4.298SerIle: 4.298 ± 0.295
3.588SerLys: 3.588 ± 0.282
4.387SerLeu: 4.387 ± 0.261
1.439SerMet: 1.439 ± 0.192
4.227SerAsn: 4.227 ± 0.355
2.433SerPro: 2.433 ± 0.214
2.415SerGln: 2.415 ± 0.199
2.007SerArg: 2.007 ± 0.194
6.287SerSer: 6.287 ± 0.551
5.594SerThr: 5.594 ± 0.549
4.849SerVal: 4.849 ± 0.314
0.728SerTrp: 0.728 ± 0.149
2.7SerTyr: 2.7 ± 0.235
0.0SerXaa: 0.0 ± 0.0
Thr
6.34ThrAla: 6.34 ± 0.588
0.48ThrCys: 0.48 ± 0.081
4.245ThrAsp: 4.245 ± 0.375
4.174ThrGlu: 4.174 ± 0.365
3.552ThrPhe: 3.552 ± 0.338
7.406ThrGly: 7.406 ± 0.678
0.959ThrHis: 0.959 ± 0.123
4.813ThrIle: 4.813 ± 0.434
3.836ThrLys: 3.836 ± 0.267
5.665ThrLeu: 5.665 ± 0.387
1.208ThrMet: 1.208 ± 0.194
4.493ThrAsn: 4.493 ± 0.383
3.392ThrPro: 3.392 ± 0.298
2.877ThrGln: 2.877 ± 0.283
2.575ThrArg: 2.575 ± 0.211
5.168ThrSer: 5.168 ± 0.541
6.234ThrThr: 6.234 ± 0.661
5.648ThrVal: 5.648 ± 0.551
0.728ThrTrp: 0.728 ± 0.125
3.321ThrTyr: 3.321 ± 0.28
0.0ThrXaa: 0.0 ± 0.0
Val
4.849ValAla: 4.849 ± 0.339
0.355ValCys: 0.355 ± 0.083
5.328ValAsp: 5.328 ± 0.438
4.618ValGlu: 4.618 ± 0.35
2.522ValPhe: 2.522 ± 0.191
5.062ValGly: 5.062 ± 0.443
0.977ValHis: 0.977 ± 0.173
3.605ValIle: 3.605 ± 0.252
3.712ValLys: 3.712 ± 0.281
4.209ValLeu: 4.209 ± 0.275
1.332ValMet: 1.332 ± 0.193
4.529ValAsn: 4.529 ± 0.274
2.806ValPro: 2.806 ± 0.279
2.788ValGln: 2.788 ± 0.22
2.611ValArg: 2.611 ± 0.191
4.706ValSer: 4.706 ± 0.332
6.198ValThr: 6.198 ± 0.574
4.618ValVal: 4.618 ± 0.349
0.568ValTrp: 0.568 ± 0.101
2.38ValTyr: 2.38 ± 0.196
0.0ValXaa: 0.0 ± 0.0
Trp
0.764TrpAla: 0.764 ± 0.134
0.089TrpCys: 0.089 ± 0.043
0.764TrpAsp: 0.764 ± 0.168
0.604TrpGlu: 0.604 ± 0.12
0.337TrpPhe: 0.337 ± 0.086
0.817TrpGly: 0.817 ± 0.133
0.337TrpHis: 0.337 ± 0.09
0.622TrpIle: 0.622 ± 0.114
0.941TrpLys: 0.941 ± 0.167
0.835TrpLeu: 0.835 ± 0.125
0.302TrpMet: 0.302 ± 0.08
0.835TrpAsn: 0.835 ± 0.127
0.16TrpPro: 0.16 ± 0.047
0.462TrpGln: 0.462 ± 0.099
0.373TrpArg: 0.373 ± 0.089
0.888TrpSer: 0.888 ± 0.124
0.906TrpThr: 0.906 ± 0.131
0.799TrpVal: 0.799 ± 0.11
0.089TrpTrp: 0.089 ± 0.038
0.48TrpTyr: 0.48 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.646TyrAla: 2.646 ± 0.218
0.604TyrCys: 0.604 ± 0.109
3.943TyrAsp: 3.943 ± 0.324
2.54TyrGlu: 2.54 ± 0.277
1.598TyrPhe: 1.598 ± 0.16
2.682TyrGly: 2.682 ± 0.218
0.799TyrHis: 0.799 ± 0.153
2.646TyrIle: 2.646 ± 0.216
2.575TyrLys: 2.575 ± 0.32
3.09TyrLeu: 3.09 ± 0.223
0.906TyrMet: 0.906 ± 0.143
2.717TyrAsn: 2.717 ± 0.236
1.616TyrPro: 1.616 ± 0.201
1.812TyrGln: 1.812 ± 0.189
2.078TyrArg: 2.078 ± 0.23
2.593TyrSer: 2.593 ± 0.212
3.037TyrThr: 3.037 ± 0.319
2.93TyrVal: 2.93 ± 0.223
0.444TyrTrp: 0.444 ± 0.121
2.184TyrTyr: 2.184 ± 0.235
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 212 proteins (56307 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski