Amino acid dipeptide frequency for Synechococcus virus S-PRM1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
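The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes that dipeptides are counted as overlapping pairs within each protein, that pairs never span two proteins, and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Overlapping pairs within one protein (assumption, see lead-in).
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is treated as Xaa.
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


toy = ["MAAGK", "MKAA"]  # hypothetical toy sequences, 7 dipeptides in total
freqs = dipeptide_per_mille(toy)
print(len(freqs))        # number of possible dipeptides including Xaa
print(freqs["AA"])       # AA occurs in 2 of the 7 pairs
print(1000.0 / 400)      # uniform expectation over 400 dipeptides, in per mille
```

With real data the input would be the list of protein sequences of the proteome, and the resulting values could be compared against the 2.5‰ uniform baseline.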

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
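As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla at 5.468‰ and CysCys at 0.09‰) into fractions and compares them with the uniform baseline of 1/400. The helper names are hypothetical, and the simple ratio to 2.5‰ is an assumption about how over- and underrepresentation could be judged, not a statement of how the page colours its cells.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform baseline over 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def enrichment(value):
    """Ratio of an observed per mille value to the uniform baseline."""
    return value / EXPECTED_PER_MILLE


ala_ala = 5.468  # AlaAla, taken from the table
cys_cys = 0.09   # CysCys, taken from the table

for name, value in (("AlaAla", ala_ala), ("CysCys", cys_cys)):
    print(name, per_mille_to_fraction(value), f"{enrichment(value):.4f}")
```

A ratio above 1 means the dipeptide is more frequent than the uniform expectation, and a ratio below 1 means it is less frequent.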

For more information, see the sequence space article on Wikipedia.

| First ↓ / Second → | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.468 ± 0.38 | 0.605 ± 0.12 | 3.9 ± 0.304 | 3.743 ± 0.373 | 2.376 ± 0.246 | 5.222 ± 0.454 | 1.255 ± 0.146 | 4.146 ± 0.265 | 3.855 ± 0.423 | 4.549 ± 0.38 | 1.367 ± 0.173 | 3.698 ± 0.338 | 2.241 ± 0.273 | 2.286 ± 0.27 | 2.801 ± 0.307 | 3.653 ± 0.308 | 5.267 ± 0.667 | 3.429 ± 0.299 | 0.829 ± 0.143 | 1.972 ± 0.201 | 0.0 ± 0.0 |
| Cys | 0.695 ± 0.114 | 0.09 ± 0.047 | 0.807 ± 0.15 | 0.807 ± 0.154 | 0.381 ± 0.089 | 0.672 ± 0.138 | 0.247 ± 0.076 | 0.583 ± 0.097 | 0.672 ± 0.121 | 0.583 ± 0.109 | 0.359 ± 0.12 | 0.471 ± 0.122 | 0.538 ± 0.106 | 0.359 ± 0.095 | 0.56 ± 0.101 | 0.359 ± 0.097 | 0.628 ± 0.134 | 0.829 ± 0.138 | 0.112 ± 0.049 | 0.628 ± 0.099 | 0.0 ± 0.0 |
| Asp | 4.437 ± 0.322 | 0.762 ± 0.134 | 4.863 ± 0.441 | 4.482 ± 0.369 | 2.824 ± 0.259 | 5.042 ± 0.418 | 1.277 ± 0.202 | 3.586 ± 0.257 | 3.429 ± 0.309 | 5.827 ± 0.315 | 1.524 ± 0.184 | 3.115 ± 0.297 | 3.205 ± 0.275 | 1.681 ± 0.189 | 3.048 ± 0.236 | 4.101 ± 0.318 | 4.034 ± 0.299 | 4.482 ± 0.286 | 1.233 ± 0.167 | 3.384 ± 0.287 | 0.0 ± 0.0 |
| Glu | 3.944 ± 0.312 | 0.784 ± 0.138 | 4.46 ± 0.346 | 5.042 ± 0.545 | 3.205 ± 0.309 | 4.146 ± 0.352 | 1.277 ± 0.202 | 4.527 ± 0.388 | 4.886 ± 0.437 | 5.132 ± 0.364 | 1.905 ± 0.28 | 4.012 ± 0.285 | 2.219 ± 0.256 | 2.488 ± 0.276 | 3.093 ± 0.317 | 3.832 ± 0.347 | 3.765 ± 0.313 | 5.087 ± 0.355 | 0.941 ± 0.145 | 2.779 ± 0.27 | 0.0 ± 0.0 |
| Phe | 2.6 ± 0.232 | 0.695 ± 0.146 | 3.048 ± 0.218 | 2.869 ± 0.225 | 1.838 ± 0.241 | 3.25 ± 0.269 | 0.717 ± 0.146 | 2.622 ± 0.216 | 2.913 ± 0.263 | 3.138 ± 0.294 | 1.143 ± 0.169 | 2.42 ± 0.222 | 1.748 ± 0.263 | 1.793 ± 0.195 | 1.591 ± 0.155 | 2.644 ± 0.237 | 2.824 ± 0.374 | 2.981 ± 0.287 | 0.515 ± 0.107 | 1.748 ± 0.192 | 0.0 ± 0.0 |
| Gly | 4.527 ± 0.462 | 0.717 ± 0.119 | 4.527 ± 0.516 | 4.191 ± 0.388 | 3.205 ± 0.297 | 5.401 ± 0.483 | 1.233 ± 0.192 | 5.132 ± 0.515 | 4.28 ± 0.38 | 5.401 ± 0.349 | 1.748 ± 0.217 | 4.168 ± 0.556 | 3.025 ± 0.553 | 2.846 ± 0.328 | 3.16 ± 0.312 | 5.289 ± 0.536 | 6.544 ± 0.706 | 5.401 ± 0.556 | 1.165 ± 0.172 | 3.07 ± 0.275 | 0.0 ± 0.0 |
| His | 0.964 ± 0.156 | 0.291 ± 0.081 | 1.008 ± 0.147 | 1.591 ± 0.178 | 1.053 ± 0.154 | 1.21 ± 0.181 | 0.493 ± 0.103 | 0.964 ± 0.133 | 1.121 ± 0.164 | 1.524 ± 0.223 | 0.202 ± 0.064 | 0.852 ± 0.137 | 0.784 ± 0.162 | 0.471 ± 0.119 | 0.807 ± 0.157 | 0.874 ± 0.172 | 1.053 ± 0.195 | 1.233 ± 0.175 | 0.359 ± 0.088 | 0.874 ± 0.165 | 0.0 ± 0.0 |
| Ile | 4.191 ± 0.334 | 0.605 ± 0.131 | 4.527 ± 0.33 | 4.437 ± 0.359 | 2.443 ± 0.324 | 4.28 ± 0.454 | 0.986 ± 0.157 | 3.362 ± 0.312 | 4.012 ± 0.325 | 4.661 ± 0.33 | 1.345 ± 0.184 | 3.787 ± 0.276 | 2.017 ± 0.256 | 2.264 ± 0.235 | 3.317 ± 0.256 | 4.034 ± 0.322 | 4.594 ± 0.423 | 4.034 ± 0.257 | 0.538 ± 0.11 | 2.308 ± 0.193 | 0.0 ± 0.0 |
| Lys | 2.913 ± 0.296 | 0.784 ± 0.147 | 4.46 ± 0.367 | 4.863 ± 0.483 | 2.689 ± 0.242 | 4.258 ± 0.538 | 0.762 ± 0.117 | 3.9 ± 0.356 | 5.827 ± 0.758 | 5.737 ± 0.391 | 1.681 ± 0.191 | 3.25 ± 0.305 | 2.622 ± 0.321 | 2.084 ± 0.222 | 2.51 ± 0.257 | 3.967 ± 0.328 | 3.922 ± 0.394 | 4.527 ± 0.371 | 0.941 ± 0.149 | 3.205 ± 0.336 | 0.0 ± 0.0 |
| Leu | 4.505 ± 0.341 | 0.538 ± 0.109 | 5.379 ± 0.347 | 5.356 ± 0.403 | 3.07 ± 0.251 | 5.536 ± 0.33 | 1.345 ± 0.191 | 4.012 ± 0.287 | 5.692 ± 0.386 | 5.087 ± 0.471 | 1.502 ± 0.181 | 4.774 ± 0.357 | 2.779 ± 0.274 | 2.734 ± 0.238 | 4.168 ± 0.328 | 4.796 ± 0.314 | 4.751 ± 0.401 | 4.706 ± 0.327 | 1.008 ± 0.179 | 3.227 ± 0.252 | 0.0 ± 0.0 |
| Met | 1.412 ± 0.178 | 0.202 ± 0.062 | 1.21 ± 0.175 | 1.412 ± 0.222 | 1.165 ± 0.178 | 1.3 ± 0.184 | 0.56 ± 0.108 | 1.255 ± 0.218 | 1.748 ± 0.252 | 1.546 ± 0.195 | 0.74 ± 0.112 | 1.569 ± 0.196 | 1.031 ± 0.142 | 0.964 ± 0.141 | 1.053 ± 0.183 | 1.636 ± 0.187 | 1.546 ± 0.229 | 1.098 ± 0.172 | 0.291 ± 0.106 | 0.807 ± 0.131 | 0.0 ± 0.0 |
| Asn | 3.406 ± 0.311 | 0.538 ± 0.111 | 3.519 ± 0.337 | 3.519 ± 0.269 | 2.644 ± 0.26 | 3.81 ± 0.518 | 1.121 ± 0.178 | 3.743 ± 0.434 | 3.025 ± 0.267 | 4.012 ± 0.333 | 0.784 ± 0.158 | 3.048 ± 0.378 | 3.205 ± 0.279 | 1.838 ± 0.184 | 2.488 ± 0.211 | 3.989 ± 0.362 | 3.205 ± 0.328 | 4.079 ± 0.373 | 0.65 ± 0.103 | 2.42 ± 0.247 | 0.0 ± 0.0 |
| Pro | 2.376 ± 0.236 | 0.493 ± 0.11 | 2.6 ± 0.257 | 3.429 ± 0.359 | 1.457 ± 0.155 | 2.936 ± 0.251 | 0.784 ± 0.144 | 2.264 ± 0.305 | 2.42 ± 0.282 | 2.644 ± 0.273 | 0.717 ± 0.135 | 2.196 ± 0.266 | 1.793 ± 0.218 | 1.569 ± 0.207 | 1.277 ± 0.192 | 2.555 ± 0.247 | 3.765 ± 0.517 | 2.913 ± 0.263 | 0.381 ± 0.094 | 1.726 ± 0.203 | 0.0 ± 0.0 |
| Gln | 1.995 ± 0.223 | 0.336 ± 0.083 | 1.838 ± 0.235 | 3.115 ± 0.288 | 1.726 ± 0.217 | 2.712 ± 0.255 | 0.515 ± 0.114 | 2.667 ± 0.245 | 2.689 ± 0.329 | 2.958 ± 0.309 | 1.053 ± 0.177 | 1.658 ± 0.211 | 1.143 ± 0.178 | 1.614 ± 0.2 | 1.703 ± 0.185 | 1.95 ± 0.241 | 1.927 ± 0.211 | 2.353 ± 0.249 | 0.448 ± 0.113 | 1.883 ± 0.239 | 0.0 ± 0.0 |
| Arg | 2.622 ± 0.263 | 0.336 ± 0.09 | 2.353 ± 0.188 | 2.555 ± 0.233 | 2.286 ± 0.24 | 3.16 ± 0.297 | 0.807 ± 0.142 | 3.294 ± 0.292 | 3.339 ± 0.386 | 3.787 ± 0.359 | 1.098 ± 0.159 | 2.039 ± 0.197 | 1.524 ± 0.231 | 1.86 ± 0.228 | 2.555 ± 0.324 | 3.093 ± 0.274 | 2.039 ± 0.276 | 2.958 ± 0.213 | 0.56 ± 0.117 | 2.712 ± 0.245 | 0.0 ± 0.0 |
| Ser | 4.684 ± 0.39 | 0.672 ± 0.132 | 3.698 ± 0.28 | 3.72 ± 0.335 | 2.622 ± 0.25 | 6.006 ± 0.596 | 1.233 ± 0.158 | 3.989 ± 0.304 | 3.944 ± 0.293 | 4.998 ± 0.35 | 1.703 ± 0.197 | 3.384 ± 0.399 | 2.084 ± 0.277 | 2.443 ± 0.189 | 2.689 ± 0.29 | 4.325 ± 0.521 | 4.46 ± 0.527 | 4.393 ± 0.297 | 0.695 ± 0.126 | 2.398 ± 0.226 | 0.0 ± 0.0 |
| Thr | 4.818 ± 0.457 | 0.403 ± 0.101 | 4.437 ± 0.487 | 4.124 ± 0.295 | 3.429 ± 0.417 | 6.432 ± 1.037 | 1.076 ± 0.168 | 4.437 ± 0.395 | 3.25 ± 0.292 | 4.37 ± 0.245 | 0.919 ± 0.183 | 3.586 ± 0.564 | 3.205 ± 0.246 | 2.308 ± 0.264 | 2.846 ± 0.26 | 4.908 ± 0.523 | 5.222 ± 0.581 | 5.558 ± 0.618 | 0.605 ± 0.128 | 2.286 ± 0.262 | 0.0 ± 0.0 |
| Val | 4.28 ± 0.347 | 0.829 ± 0.143 | 4.886 ± 0.318 | 4.93 ± 0.315 | 2.241 ± 0.246 | 5.76 ± 0.628 | 0.941 ± 0.158 | 3.855 ± 0.308 | 4.236 ± 0.412 | 4.482 ± 0.28 | 1.389 ± 0.2 | 3.81 ± 0.269 | 3.025 ± 0.261 | 2.667 ± 0.252 | 2.801 ± 0.244 | 4.684 ± 0.43 | 5.76 ± 0.623 | 5.267 ± 0.41 | 0.471 ± 0.104 | 2.936 ± 0.256 | 0.0 ± 0.0 |
| Trp | 0.74 ± 0.141 | 0.202 ± 0.066 | 0.919 ± 0.161 | 0.829 ± 0.177 | 0.56 ± 0.133 | 0.784 ± 0.152 | 0.336 ± 0.089 | 0.829 ± 0.131 | 0.852 ± 0.131 | 0.896 ± 0.152 | 0.403 ± 0.089 | 0.538 ± 0.113 | 0.359 ± 0.093 | 0.515 ± 0.098 | 0.583 ± 0.135 | 0.829 ± 0.15 | 0.717 ± 0.138 | 1.053 ± 0.159 | 0.224 ± 0.074 | 0.426 ± 0.089 | 0.0 ± 0.0 |
| Tyr | 2.084 ± 0.188 | 0.493 ± 0.12 | 3.9 ± 0.271 | 2.51 ± 0.282 | 1.883 ± 0.224 | 3.138 ± 0.299 | 0.762 ± 0.135 | 2.577 ± 0.243 | 2.712 ± 0.315 | 3.631 ± 0.388 | 0.941 ± 0.142 | 2.667 ± 0.304 | 1.748 ± 0.216 | 1.502 ± 0.169 | 1.927 ± 0.209 | 2.622 ± 0.254 | 2.196 ± 0.262 | 2.869 ± 0.295 | 0.628 ± 0.125 | 1.502 ± 0.178 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 190 proteins (44622 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
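A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on several assumptions that the page does not confirm: that "×100" means 100 resamples, that whole proteins are drawn with replacement, and that the reported "±" value is the standard deviation of the resampled estimates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the spread reflects
    variation between proteins rather than between individual residues.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MAAGK", "MKAA", "AAAA"]  # hypothetical toy sequences
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling at the protein level keeps each protein intact, which is one plausible reason for bootstrapping proteins rather than individual dipeptides.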

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski