Amino acid dipepetide frequency for Synechococcus phage S-SSM5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.603AlaAla: 6.603 ± 0.421
0.541AlaCys: 0.541 ± 0.103
4.192AlaAsp: 4.192 ± 0.265
3.703AlaGlu: 3.703 ± 0.358
3.127AlaPhe: 3.127 ± 0.238
6.655AlaGly: 6.655 ± 0.492
1.066AlaHis: 1.066 ± 0.124
4.332AlaIle: 4.332 ± 0.315
3.581AlaLys: 3.581 ± 0.36
5.153AlaLeu: 5.153 ± 0.271
1.415AlaMet: 1.415 ± 0.226
4.07AlaAsn: 4.07 ± 0.38
2.672AlaPro: 2.672 ± 0.258
2.655AlaGln: 2.655 ± 0.218
2.48AlaArg: 2.48 ± 0.225
5.048AlaSer: 5.048 ± 0.402
6.201AlaThr: 6.201 ± 0.568
4.681AlaVal: 4.681 ± 0.311
0.646AlaTrp: 0.646 ± 0.1
2.428AlaTyr: 2.428 ± 0.23
0.0AlaXaa: 0.0 ± 0.0
Cys
0.541CysAla: 0.541 ± 0.097
0.052CysCys: 0.052 ± 0.03
0.873CysAsp: 0.873 ± 0.142
0.803CysGlu: 0.803 ± 0.147
0.419CysPhe: 0.419 ± 0.111
0.576CysGly: 0.576 ± 0.106
0.21CysHis: 0.21 ± 0.069
0.524CysIle: 0.524 ± 0.129
0.594CysLys: 0.594 ± 0.097
0.699CysLeu: 0.699 ± 0.138
0.349CysMet: 0.349 ± 0.081
0.402CysAsn: 0.402 ± 0.089
0.384CysPro: 0.384 ± 0.104
0.384CysGln: 0.384 ± 0.102
0.349CysArg: 0.349 ± 0.097
0.629CysSer: 0.629 ± 0.105
0.541CysThr: 0.541 ± 0.126
0.437CysVal: 0.437 ± 0.093
0.14CysTrp: 0.14 ± 0.057
0.332CysTyr: 0.332 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
4.961AspAla: 4.961 ± 0.286
0.786AspCys: 0.786 ± 0.132
4.21AspAsp: 4.21 ± 0.302
4.017AspGlu: 4.017 ± 0.308
3.214AspPhe: 3.214 ± 0.233
5.869AspGly: 5.869 ± 0.495
1.048AspHis: 1.048 ± 0.167
3.913AspIle: 3.913 ± 0.301
3.266AspLys: 3.266 ± 0.345
4.751AspLeu: 4.751 ± 0.337
1.45AspMet: 1.45 ± 0.226
3.528AspAsn: 3.528 ± 0.278
3.284AspPro: 3.284 ± 0.243
1.764AspGln: 1.764 ± 0.173
2.306AspArg: 2.306 ± 0.227
4.157AspSer: 4.157 ± 0.257
4.157AspThr: 4.157 ± 0.312
4.087AspVal: 4.087 ± 0.326
1.135AspTrp: 1.135 ± 0.164
3.004AspTyr: 3.004 ± 0.226
0.0AspXaa: 0.0 ± 0.0
Glu
3.493GluAla: 3.493 ± 0.265
0.611GluCys: 0.611 ± 0.126
3.703GluAsp: 3.703 ± 0.286
4.384GluGlu: 4.384 ± 0.573
2.707GluPhe: 2.707 ± 0.222
3.825GluGly: 3.825 ± 0.324
0.838GluHis: 0.838 ± 0.124
4.332GluIle: 4.332 ± 0.29
3.703GluLys: 3.703 ± 0.519
5.013GluLeu: 5.013 ± 0.35
1.188GluMet: 1.188 ± 0.194
2.952GluAsn: 2.952 ± 0.236
1.572GluPro: 1.572 ± 0.168
2.428GluGln: 2.428 ± 0.284
2.62GluArg: 2.62 ± 0.277
3.493GluSer: 3.493 ± 0.255
3.563GluThr: 3.563 ± 0.274
4.803GluVal: 4.803 ± 0.366
1.031GluTrp: 1.031 ± 0.162
2.76GluTyr: 2.76 ± 0.253
0.0GluXaa: 0.0 ± 0.0
Phe
2.742PheAla: 2.742 ± 0.214
0.349PheCys: 0.349 ± 0.089
3.703PheAsp: 3.703 ± 0.226
2.76PheGlu: 2.76 ± 0.242
1.782PhePhe: 1.782 ± 0.178
3.092PheGly: 3.092 ± 0.265
0.576PheHis: 0.576 ± 0.114
2.498PheIle: 2.498 ± 0.258
2.428PheLys: 2.428 ± 0.265
3.266PheLeu: 3.266 ± 0.268
1.066PheMet: 1.066 ± 0.165
2.707PheAsn: 2.707 ± 0.242
1.59PhePro: 1.59 ± 0.21
1.729PheGln: 1.729 ± 0.191
1.467PheArg: 1.467 ± 0.123
3.092PheSer: 3.092 ± 0.262
3.319PheThr: 3.319 ± 0.321
2.934PheVal: 2.934 ± 0.264
0.349PheTrp: 0.349 ± 0.078
1.694PheTyr: 1.694 ± 0.171
0.0PheXaa: 0.0 ± 0.0
Gly
6.306GlyAla: 6.306 ± 0.485
0.734GlyCys: 0.734 ± 0.14
4.699GlyAsp: 4.699 ± 0.35
3.948GlyGlu: 3.948 ± 0.235
3.266GlyPhe: 3.266 ± 0.327
7.983GlyGly: 7.983 ± 0.952
1.205GlyHis: 1.205 ± 0.153
4.332GlyIle: 4.332 ± 0.331
4.262GlyLys: 4.262 ± 0.397
4.664GlyLeu: 4.664 ± 0.287
1.747GlyMet: 1.747 ± 0.306
4.419GlyAsn: 4.419 ± 0.461
2.148GlyPro: 2.148 ± 0.197
2.742GlyGln: 2.742 ± 0.206
2.76GlyArg: 2.76 ± 0.21
6.707GlySer: 6.707 ± 0.658
7.668GlyThr: 7.668 ± 0.88
5.328GlyVal: 5.328 ± 0.361
1.048GlyTrp: 1.048 ± 0.126
3.371GlyTyr: 3.371 ± 0.264
0.0GlyXaa: 0.0 ± 0.0
His
0.803HisAla: 0.803 ± 0.14
0.279HisCys: 0.279 ± 0.078
0.926HisAsp: 0.926 ± 0.125
1.066HisGlu: 1.066 ± 0.163
0.751HisPhe: 0.751 ± 0.127
1.275HisGly: 1.275 ± 0.202
0.576HisHis: 0.576 ± 0.15
0.803HisIle: 0.803 ± 0.125
1.013HisLys: 1.013 ± 0.155
1.083HisLeu: 1.083 ± 0.124
0.507HisMet: 0.507 ± 0.125
0.786HisAsn: 0.786 ± 0.135
0.961HisPro: 0.961 ± 0.152
0.524HisGln: 0.524 ± 0.103
0.716HisArg: 0.716 ± 0.124
0.978HisSer: 0.978 ± 0.106
1.205HisThr: 1.205 ± 0.136
0.978HisVal: 0.978 ± 0.152
0.245HisTrp: 0.245 ± 0.07
0.873HisTyr: 0.873 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
3.686IleAla: 3.686 ± 0.225
0.646IleCys: 0.646 ± 0.136
4.559IleAsp: 4.559 ± 0.335
3.843IleGlu: 3.843 ± 0.276
2.306IlePhe: 2.306 ± 0.173
3.948IleGly: 3.948 ± 0.266
0.769IleHis: 0.769 ± 0.161
3.179IleIle: 3.179 ± 0.237
4.349IleLys: 4.349 ± 0.311
4.402IleLeu: 4.402 ± 0.31
1.031IleMet: 1.031 ± 0.158
4.157IleAsn: 4.157 ± 0.332
2.777IlePro: 2.777 ± 0.255
2.306IleGln: 2.306 ± 0.213
2.358IleArg: 2.358 ± 0.216
4.262IleSer: 4.262 ± 0.368
5.345IleThr: 5.345 ± 0.721
3.79IleVal: 3.79 ± 0.307
0.524IleTrp: 0.524 ± 0.097
2.026IleTyr: 2.026 ± 0.26
0.0IleXaa: 0.0 ± 0.0
Lys
3.668LysAla: 3.668 ± 0.449
0.524LysCys: 0.524 ± 0.118
3.284LysAsp: 3.284 ± 0.302
4.035LysGlu: 4.035 ± 0.541
2.533LysPhe: 2.533 ± 0.282
3.965LysGly: 3.965 ± 0.339
0.961LysHis: 0.961 ± 0.192
3.721LysIle: 3.721 ± 0.323
4.646LysLys: 4.646 ± 0.627
4.961LysLeu: 4.961 ± 0.387
1.764LysMet: 1.764 ± 0.294
2.969LysAsn: 2.969 ± 0.256
2.026LysPro: 2.026 ± 0.301
2.445LysGln: 2.445 ± 0.276
2.55LysArg: 2.55 ± 0.301
3.686LysSer: 3.686 ± 0.397
3.668LysThr: 3.668 ± 0.218
4.419LysVal: 4.419 ± 0.281
0.769LysTrp: 0.769 ± 0.145
2.812LysTyr: 2.812 ± 0.278
0.0LysXaa: 0.0 ± 0.0
Leu
4.961LeuAla: 4.961 ± 0.314
0.803LeuCys: 0.803 ± 0.142
5.223LeuAsp: 5.223 ± 0.325
4.07LeuGlu: 4.07 ± 0.308
2.795LeuPhe: 2.795 ± 0.181
4.751LeuGly: 4.751 ± 0.403
1.502LeuHis: 1.502 ± 0.234
4.052LeuIle: 4.052 ± 0.29
4.646LeuLys: 4.646 ± 0.394
5.048LeuLeu: 5.048 ± 0.388
1.432LeuMet: 1.432 ± 0.214
4.454LeuAsn: 4.454 ± 0.327
2.812LeuPro: 2.812 ± 0.267
2.725LeuGln: 2.725 ± 0.203
3.319LeuArg: 3.319 ± 0.289
5.24LeuSer: 5.24 ± 0.295
5.485LeuThr: 5.485 ± 0.487
4.314LeuVal: 4.314 ± 0.244
0.681LeuTrp: 0.681 ± 0.137
3.319LeuTyr: 3.319 ± 0.247
0.0LeuXaa: 0.0 ± 0.0
Met
1.659MetAla: 1.659 ± 0.232
0.245MetCys: 0.245 ± 0.071
0.943MetAsp: 0.943 ± 0.174
1.345MetGlu: 1.345 ± 0.233
0.873MetPhe: 0.873 ± 0.162
1.24MetGly: 1.24 ± 0.199
0.437MetHis: 0.437 ± 0.087
1.275MetIle: 1.275 ± 0.212
1.974MetLys: 1.974 ± 0.314
1.45MetLeu: 1.45 ± 0.172
0.541MetMet: 0.541 ± 0.14
1.17MetAsn: 1.17 ± 0.164
0.891MetPro: 0.891 ± 0.147
1.013MetGln: 1.013 ± 0.163
0.961MetArg: 0.961 ± 0.167
1.712MetSer: 1.712 ± 0.291
1.467MetThr: 1.467 ± 0.183
1.066MetVal: 1.066 ± 0.146
0.245MetTrp: 0.245 ± 0.073
0.891MetTyr: 0.891 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
4.0AsnAla: 4.0 ± 0.343
0.437AsnCys: 0.437 ± 0.094
3.581AsnAsp: 3.581 ± 0.244
2.812AsnGlu: 2.812 ± 0.237
2.376AsnPhe: 2.376 ± 0.211
4.541AsnGly: 4.541 ± 0.516
0.838AsnHis: 0.838 ± 0.115
3.93AsnIle: 3.93 ± 0.332
2.934AsnLys: 2.934 ± 0.261
4.314AsnLeu: 4.314 ± 0.337
0.891AsnMet: 0.891 ± 0.166
3.441AsnAsn: 3.441 ± 0.292
3.179AsnPro: 3.179 ± 0.212
2.009AsnGln: 2.009 ± 0.181
2.323AsnArg: 2.323 ± 0.211
3.738AsnSer: 3.738 ± 0.321
4.926AsnThr: 4.926 ± 0.382
4.122AsnVal: 4.122 ± 0.32
0.664AsnTrp: 0.664 ± 0.122
2.428AsnTyr: 2.428 ± 0.216
0.0AsnXaa: 0.0 ± 0.0
Pro
2.568ProAla: 2.568 ± 0.264
0.367ProCys: 0.367 ± 0.114
2.585ProAsp: 2.585 ± 0.241
2.742ProGlu: 2.742 ± 0.298
1.764ProPhe: 1.764 ± 0.219
3.004ProGly: 3.004 ± 0.292
0.786ProHis: 0.786 ± 0.113
2.288ProIle: 2.288 ± 0.187
2.218ProLys: 2.218 ± 0.274
2.376ProLeu: 2.376 ± 0.193
0.681ProMet: 0.681 ± 0.137
2.236ProAsn: 2.236 ± 0.182
1.677ProPro: 1.677 ± 0.215
1.24ProGln: 1.24 ± 0.142
1.712ProArg: 1.712 ± 0.193
3.022ProSer: 3.022 ± 0.239
3.074ProThr: 3.074 ± 0.238
2.445ProVal: 2.445 ± 0.209
0.507ProTrp: 0.507 ± 0.126
1.642ProTyr: 1.642 ± 0.224
0.0ProXaa: 0.0 ± 0.0
Gln
2.218GlnAla: 2.218 ± 0.216
0.367GlnCys: 0.367 ± 0.084
1.921GlnAsp: 1.921 ± 0.204
2.62GlnGlu: 2.62 ± 0.24
1.712GlnPhe: 1.712 ± 0.186
2.253GlnGly: 2.253 ± 0.279
0.646GlnHis: 0.646 ± 0.125
2.568GlnIle: 2.568 ± 0.231
2.445GlnLys: 2.445 ± 0.292
3.057GlnLeu: 3.057 ± 0.249
0.838GlnMet: 0.838 ± 0.131
1.991GlnAsn: 1.991 ± 0.161
1.328GlnPro: 1.328 ± 0.152
1.397GlnGln: 1.397 ± 0.181
1.782GlnArg: 1.782 ± 0.18
2.428GlnSer: 2.428 ± 0.209
2.48GlnThr: 2.48 ± 0.24
2.498GlnVal: 2.498 ± 0.233
0.332GlnTrp: 0.332 ± 0.076
1.817GlnTyr: 1.817 ± 0.211
0.0GlnXaa: 0.0 ± 0.0
Arg
2.655ArgAla: 2.655 ± 0.195
0.297ArgCys: 0.297 ± 0.069
2.306ArgAsp: 2.306 ± 0.252
2.428ArgGlu: 2.428 ± 0.281
1.886ArgPhe: 1.886 ± 0.153
2.638ArgGly: 2.638 ± 0.289
0.856ArgHis: 0.856 ± 0.157
2.76ArgIle: 2.76 ± 0.205
2.742ArgLys: 2.742 ± 0.28
3.249ArgLeu: 3.249 ± 0.271
1.17ArgMet: 1.17 ± 0.189
2.026ArgAsn: 2.026 ± 0.185
1.275ArgPro: 1.275 ± 0.157
1.502ArgGln: 1.502 ± 0.138
2.096ArgArg: 2.096 ± 0.309
2.428ArgSer: 2.428 ± 0.202
2.428ArgThr: 2.428 ± 0.253
3.004ArgVal: 3.004 ± 0.267
0.384ArgTrp: 0.384 ± 0.102
2.341ArgTyr: 2.341 ± 0.219
0.0ArgXaa: 0.0 ± 0.0
Ser
5.135SerAla: 5.135 ± 0.344
0.576SerCys: 0.576 ± 0.125
4.279SerAsp: 4.279 ± 0.305
3.528SerGlu: 3.528 ± 0.26
3.371SerPhe: 3.371 ± 0.274
7.93SerGly: 7.93 ± 0.669
1.083SerHis: 1.083 ± 0.136
4.175SerIle: 4.175 ± 0.423
3.459SerLys: 3.459 ± 0.337
4.472SerLeu: 4.472 ± 0.315
1.537SerMet: 1.537 ± 0.213
3.983SerAsn: 3.983 ± 0.297
2.48SerPro: 2.48 ± 0.268
2.358SerGln: 2.358 ± 0.203
2.568SerArg: 2.568 ± 0.207
5.485SerSer: 5.485 ± 0.574
5.537SerThr: 5.537 ± 0.57
4.961SerVal: 4.961 ± 0.369
0.681SerTrp: 0.681 ± 0.132
2.55SerTyr: 2.55 ± 0.175
0.0SerXaa: 0.0 ± 0.0
Thr
7.022ThrAla: 7.022 ± 0.782
0.559ThrCys: 0.559 ± 0.107
4.437ThrAsp: 4.437 ± 0.394
3.598ThrGlu: 3.598 ± 0.276
3.336ThrPhe: 3.336 ± 0.326
7.039ThrGly: 7.039 ± 0.787
0.821ThrHis: 0.821 ± 0.116
5.083ThrIle: 5.083 ± 0.474
3.808ThrLys: 3.808 ± 0.305
5.712ThrLeu: 5.712 ± 0.581
1.432ThrMet: 1.432 ± 0.18
4.681ThrAsn: 4.681 ± 0.489
3.074ThrPro: 3.074 ± 0.241
2.638ThrGln: 2.638 ± 0.237
2.655ThrArg: 2.655 ± 0.202
5.572ThrSer: 5.572 ± 0.504
6.672ThrThr: 6.672 ± 0.742
6.044ThrVal: 6.044 ± 0.654
0.803ThrTrp: 0.803 ± 0.125
2.9ThrTyr: 2.9 ± 0.26
0.0ThrXaa: 0.0 ± 0.0
Val
5.205ValAla: 5.205 ± 0.336
0.367ValCys: 0.367 ± 0.081
5.275ValAsp: 5.275 ± 0.38
4.262ValGlu: 4.262 ± 0.25
2.672ValPhe: 2.672 ± 0.183
5.153ValGly: 5.153 ± 0.4
0.873ValHis: 0.873 ± 0.134
3.459ValIle: 3.459 ± 0.27
3.948ValLys: 3.948 ± 0.284
4.227ValLeu: 4.227 ± 0.299
1.31ValMet: 1.31 ± 0.183
4.122ValAsn: 4.122 ± 0.347
2.9ValPro: 2.9 ± 0.264
2.812ValGln: 2.812 ± 0.207
2.69ValArg: 2.69 ± 0.215
5.362ValSer: 5.362 ± 0.393
6.026ValThr: 6.026 ± 0.521
4.751ValVal: 4.751 ± 0.371
0.821ValTrp: 0.821 ± 0.131
2.393ValTyr: 2.393 ± 0.244
0.0ValXaa: 0.0 ± 0.0
Trp
0.559TrpAla: 0.559 ± 0.089
0.157TrpCys: 0.157 ± 0.065
0.786TrpAsp: 0.786 ± 0.119
0.734TrpGlu: 0.734 ± 0.134
0.454TrpPhe: 0.454 ± 0.109
0.716TrpGly: 0.716 ± 0.112
0.437TrpHis: 0.437 ± 0.1
0.576TrpIle: 0.576 ± 0.091
0.838TrpLys: 0.838 ± 0.145
0.751TrpLeu: 0.751 ± 0.134
0.279TrpMet: 0.279 ± 0.076
0.786TrpAsn: 0.786 ± 0.119
0.262TrpPro: 0.262 ± 0.07
0.332TrpGln: 0.332 ± 0.074
0.576TrpArg: 0.576 ± 0.111
0.803TrpSer: 0.803 ± 0.129
0.961TrpThr: 0.961 ± 0.149
0.891TrpVal: 0.891 ± 0.126
0.122TrpTrp: 0.122 ± 0.041
0.524TrpTyr: 0.524 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.463TyrAla: 2.463 ± 0.199
0.541TyrCys: 0.541 ± 0.105
3.424TyrAsp: 3.424 ± 0.258
2.323TyrGlu: 2.323 ± 0.225
1.799TyrPhe: 1.799 ± 0.185
2.882TyrGly: 2.882 ± 0.206
0.821TyrHis: 0.821 ± 0.153
2.463TyrIle: 2.463 ± 0.215
2.568TyrLys: 2.568 ± 0.272
2.987TyrLeu: 2.987 ± 0.217
0.751TyrMet: 0.751 ± 0.14
2.603TyrAsn: 2.603 ± 0.263
1.642TyrPro: 1.642 ± 0.168
1.694TyrGln: 1.694 ± 0.2
2.183TyrArg: 2.183 ± 0.253
2.306TyrSer: 2.306 ± 0.225
3.162TyrThr: 3.162 ± 0.362
3.074TyrVal: 3.074 ± 0.289
0.419TyrTrp: 0.419 ± 0.115
2.148TyrTyr: 2.148 ± 0.19
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 225 proteins (57251 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski