Amino acid dipeptide frequency for Synechococcus phage S-H25

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
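As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues inside each protein and reports them in per mille. It assumes one-letter amino acid codes and that pairs are not counted across protein boundaries; the function names and the toy sequences are hypothetical, and this is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein sequence."""
    counts = Counter()
    for seq in proteins:
        # A protein of length n contributes n - 1 overlapping dipeptides.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all observed dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes).
toy_proteins = ["MKKA", "AKK"]
freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The table on this page uses three-letter residue names (for example AlaAla), so one-letter pairs such as "AA" would need to be mapped to that notation before comparing.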

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
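The arithmetic behind the uniform expectation can be sketched as follows. The page does not state exactly which baseline is used for the colouring, so the uniform baseline and the helper names here are assumptions made for illustration only; the two example values are taken from the table (AlaAla and CysCys).

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency (in per mille) if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expected frequency (assumed baseline)."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 400 possible dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 when Xaa is included

print(expected_20, expected_21)
print("AlaAla", classify(6.543, expected_20))
print("CysCys", classify(0.073, expected_20))
```

A baseline built from the observed single amino acid frequencies (the product of the two residue frequencies) would be a stricter alternative, since rare residues such as Cys or Trp fall below the uniform value almost by construction.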

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.543 ± 0.542
AlaCys: 0.348 ± 0.098
AlaAsp: 3.977 ± 0.27
AlaGlu: 4.105 ± 0.447
AlaPhe: 2.511 ± 0.253
AlaGly: 6.36 ± 0.544
AlaHis: 0.935 ± 0.118
AlaIle: 4.729 ± 0.296
AlaLys: 4.234 ± 0.505
AlaLeu: 4.619 ± 0.266
AlaMet: 1.558 ± 0.211
AlaAsn: 4.16 ± 0.352
AlaPro: 2.639 ± 0.224
AlaGln: 2.566 ± 0.206
AlaArg: 2.841 ± 0.259
AlaSer: 5.315 ± 0.431
AlaThr: 6.103 ± 0.546
AlaVal: 4.435 ± 0.283
AlaTrp: 0.696 ± 0.101
AlaTyr: 2.181 ± 0.205
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.55 ± 0.089
CysCys: 0.073 ± 0.041
CysAsp: 0.751 ± 0.138
CysGlu: 0.568 ± 0.089
CysPhe: 0.403 ± 0.106
CysGly: 0.495 ± 0.104
CysHis: 0.183 ± 0.064
CysIle: 0.458 ± 0.099
CysLys: 0.733 ± 0.162
CysLeu: 0.605 ± 0.113
CysMet: 0.238 ± 0.079
CysAsn: 0.55 ± 0.122
CysPro: 0.275 ± 0.081
CysGln: 0.275 ± 0.075
CysArg: 0.33 ± 0.093
CysSer: 0.641 ± 0.13
CysThr: 0.495 ± 0.116
CysVal: 0.532 ± 0.118
CysTrp: 0.128 ± 0.062
CysTyr: 0.348 ± 0.083
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.572 ± 0.391
AspCys: 0.696 ± 0.137
AspAsp: 4.674 ± 0.331
AspGlu: 4.124 ± 0.359
AspPhe: 2.914 ± 0.271
AspGly: 5.993 ± 0.492
AspHis: 0.733 ± 0.119
AspIle: 4.014 ± 0.33
AspLys: 3.592 ± 0.308
AspLeu: 4.49 ± 0.318
AspMet: 1.356 ± 0.21
AspAsn: 3.372 ± 0.273
AspPro: 3.372 ± 0.258
AspGln: 1.851 ± 0.201
AspArg: 2.438 ± 0.206
AspSer: 4.014 ± 0.359
AspThr: 4.38 ± 0.26
AspVal: 4.454 ± 0.259
AspTrp: 0.935 ± 0.132
AspTyr: 3.189 ± 0.226
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.501 ± 0.344
GluCys: 0.751 ± 0.15
GluAsp: 4.472 ± 0.354
GluGlu: 4.619 ± 0.547
GluPhe: 3.189 ± 0.277
GluGly: 4.105 ± 0.326
GluHis: 1.063 ± 0.175
GluIle: 3.831 ± 0.306
GluLys: 3.647 ± 0.488
GluLeu: 5.15 ± 0.393
GluMet: 1.814 ± 0.286
GluAsn: 3.152 ± 0.249
GluPro: 1.668 ± 0.174
GluGln: 2.383 ± 0.199
GluArg: 2.639 ± 0.304
GluSer: 3.702 ± 0.282
GluThr: 4.032 ± 0.309
GluVal: 4.545 ± 0.333
GluTrp: 0.861 ± 0.152
GluTyr: 2.731 ± 0.217
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.822 ± 0.251
PheCys: 0.495 ± 0.11
PheAsp: 3.464 ± 0.263
PheGlu: 2.877 ± 0.258
PhePhe: 1.796 ± 0.181
PheGly: 3.042 ± 0.244
PheHis: 0.605 ± 0.137
PheIle: 2.676 ± 0.256
PheLys: 2.346 ± 0.223
PheLeu: 2.822 ± 0.279
PheMet: 1.063 ± 0.171
PheAsn: 2.877 ± 0.23
PhePro: 1.503 ± 0.171
PheGln: 1.759 ± 0.181
PheArg: 1.631 ± 0.174
PheSer: 3.079 ± 0.278
PheThr: 3.006 ± 0.225
PheVal: 2.676 ± 0.268
PheTrp: 0.403 ± 0.094
PheTyr: 1.961 ± 0.179
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.957 ± 0.494
GlyCys: 0.641 ± 0.113
GlyAsp: 5.205 ± 0.431
GlyGlu: 4.344 ± 0.3
GlyPhe: 3.391 ± 0.255
GlyGly: 7.936 ± 0.769
GlyHis: 1.045 ± 0.137
GlyIle: 4.197 ± 0.331
GlyLys: 3.556 ± 0.368
GlyLeu: 4.765 ± 0.294
GlyMet: 1.741 ± 0.277
GlyAsn: 4.417 ± 0.387
GlyPro: 1.998 ± 0.197
GlyGln: 2.767 ± 0.254
GlyArg: 3.006 ± 0.327
GlySer: 6.268 ± 0.552
GlyThr: 7.258 ± 0.718
GlyVal: 5.022 ± 0.319
GlyTrp: 1.246 ± 0.155
GlyTyr: 3.739 ± 0.459
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.715 ± 0.12
HisCys: 0.312 ± 0.073
HisAsp: 0.916 ± 0.143
HisGlu: 0.843 ± 0.176
HisPhe: 0.751 ± 0.143
HisGly: 1.026 ± 0.143
HisHis: 0.275 ± 0.072
HisIle: 0.916 ± 0.135
HisLys: 0.806 ± 0.147
HisLeu: 0.806 ± 0.114
HisMet: 0.532 ± 0.114
HisAsn: 0.715 ± 0.127
HisPro: 0.861 ± 0.149
HisGln: 0.568 ± 0.11
HisArg: 0.477 ± 0.101
HisSer: 0.953 ± 0.124
HisThr: 0.916 ± 0.124
HisVal: 0.953 ± 0.159
HisTrp: 0.22 ± 0.065
HisTyr: 0.733 ± 0.14
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.234 ± 0.295
IleCys: 0.477 ± 0.11
IleAsp: 4.93 ± 0.357
IleGlu: 4.325 ± 0.284
IlePhe: 2.053 ± 0.171
IleGly: 4.527 ± 0.38
IleHis: 0.605 ± 0.107
IleIle: 3.556 ± 0.323
IleLys: 3.922 ± 0.286
IleLeu: 3.885 ± 0.337
IleMet: 0.971 ± 0.159
IleAsn: 4.105 ± 0.263
IlePro: 2.749 ± 0.264
IleGln: 2.731 ± 0.26
IleArg: 2.401 ± 0.212
IleSer: 4.674 ± 0.548
IleThr: 5.15 ± 0.524
IleVal: 4.38 ± 0.372
IleTrp: 0.733 ± 0.108
IleTyr: 2.273 ± 0.211
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.721 ± 0.453
LysCys: 0.623 ± 0.131
LysAsp: 3.482 ± 0.332
LysGlu: 4.032 ± 0.509
LysPhe: 2.474 ± 0.221
LysGly: 3.244 ± 0.354
LysHis: 0.751 ± 0.165
LysIle: 3.721 ± 0.261
LysLys: 4.252 ± 0.685
LysLeu: 4.655 ± 0.357
LysMet: 1.265 ± 0.23
LysAsn: 3.134 ± 0.311
LysPro: 1.796 ± 0.228
LysGln: 2.309 ± 0.308
LysArg: 2.419 ± 0.325
LysSer: 4.124 ± 0.367
LysThr: 3.171 ± 0.253
LysVal: 3.959 ± 0.314
LysTrp: 0.77 ± 0.142
LysTyr: 3.171 ± 0.35
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.362 ± 0.305
LeuCys: 0.605 ± 0.126
LeuAsp: 5.242 ± 0.34
LeuGlu: 4.619 ± 0.314
LeuPhe: 2.932 ± 0.22
LeuGly: 4.747 ± 0.387
LeuHis: 1.081 ± 0.174
LeuIle: 3.922 ± 0.266
LeuLys: 4.71 ± 0.399
LeuLeu: 5.333 ± 0.411
LeuMet: 1.1 ± 0.197
LeuAsn: 4.948 ± 0.439
LeuPro: 2.749 ± 0.245
LeuGln: 2.822 ± 0.248
LeuArg: 3.189 ± 0.217
LeuSer: 5.205 ± 0.284
LeuThr: 5.517 ± 0.434
LeuVal: 4.399 ± 0.245
LeuTrp: 0.586 ± 0.11
LeuTyr: 3.372 ± 0.249
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.704 ± 0.238
MetCys: 0.128 ± 0.051
MetAsp: 1.081 ± 0.159
MetGlu: 1.173 ± 0.214
MetPhe: 0.861 ± 0.16
MetGly: 1.228 ± 0.262
MetHis: 0.403 ± 0.115
MetIle: 1.173 ± 0.2
MetLys: 1.613 ± 0.289
MetLeu: 1.723 ± 0.23
MetMet: 0.586 ± 0.142
MetAsn: 1.301 ± 0.183
MetPro: 0.953 ± 0.181
MetGln: 0.953 ± 0.192
MetArg: 1.191 ± 0.21
MetSer: 1.668 ± 0.248
MetThr: 1.32 ± 0.23
MetVal: 1.063 ± 0.161
MetTrp: 0.312 ± 0.094
MetTyr: 0.751 ± 0.14
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.721 ± 0.287
AsnCys: 0.403 ± 0.082
AsnAsp: 3.409 ± 0.225
AsnGlu: 3.409 ± 0.236
AsnPhe: 2.713 ± 0.247
AsnGly: 4.765 ± 0.612
AsnHis: 1.008 ± 0.137
AsnIle: 3.757 ± 0.354
AsnLys: 2.731 ± 0.282
AsnLeu: 4.82 ± 0.349
AsnMet: 1.008 ± 0.157
AsnAsn: 3.189 ± 0.365
AsnPro: 3.317 ± 0.236
AsnGln: 2.181 ± 0.205
AsnArg: 2.383 ± 0.193
AsnSer: 3.959 ± 0.31
AsnThr: 4.234 ± 0.477
AsnVal: 4.197 ± 0.366
AsnTrp: 0.715 ± 0.127
AsnTyr: 2.566 ± 0.188
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.548 ± 0.203
ProCys: 0.33 ± 0.104
ProAsp: 2.639 ± 0.283
ProGlu: 2.749 ± 0.265
ProPhe: 1.686 ± 0.18
ProGly: 3.061 ± 0.338
ProHis: 0.623 ± 0.104
ProIle: 2.419 ± 0.238
ProLys: 2.126 ± 0.255
ProLeu: 2.108 ± 0.192
ProMet: 0.586 ± 0.132
ProAsn: 2.419 ± 0.214
ProPro: 1.759 ± 0.188
ProGln: 1.521 ± 0.188
ProArg: 1.521 ± 0.163
ProSer: 3.006 ± 0.225
ProThr: 3.244 ± 0.289
ProVal: 2.584 ± 0.213
ProTrp: 0.66 ± 0.105
ProTyr: 1.558 ± 0.162
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.346 ± 0.187
GlnCys: 0.293 ± 0.078
GlnAsp: 1.833 ± 0.167
GlnGlu: 2.438 ± 0.218
GlnPhe: 1.723 ± 0.158
GlnGly: 2.603 ± 0.206
GlnHis: 0.843 ± 0.15
GlnIle: 2.896 ± 0.268
GlnLys: 2.199 ± 0.277
GlnLeu: 3.042 ± 0.222
GlnMet: 0.88 ± 0.189
GlnAsn: 1.888 ± 0.181
GlnPro: 1.155 ± 0.171
GlnGln: 1.43 ± 0.247
GlnArg: 1.576 ± 0.194
GlnSer: 2.566 ± 0.221
GlnThr: 2.089 ± 0.147
GlnVal: 2.786 ± 0.216
GlnTrp: 0.513 ± 0.087
GlnTyr: 1.888 ± 0.192
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.822 ± 0.382
ArgCys: 0.33 ± 0.072
ArgAsp: 2.144 ± 0.163
ArgGlu: 2.419 ± 0.339
ArgPhe: 1.686 ± 0.184
ArgGly: 2.896 ± 0.244
ArgHis: 0.696 ± 0.136
ArgIle: 2.896 ± 0.245
ArgLys: 2.822 ± 0.375
ArgLeu: 3.427 ± 0.224
ArgMet: 1.246 ± 0.22
ArgAsn: 1.741 ± 0.195
ArgPro: 1.393 ± 0.18
ArgGln: 1.613 ± 0.168
ArgArg: 1.961 ± 0.343
ArgSer: 2.163 ± 0.23
ArgThr: 2.566 ± 0.285
ArgVal: 2.877 ± 0.243
ArgTrp: 0.586 ± 0.127
ArgTyr: 2.236 ± 0.21
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.425 ± 0.366
SerCys: 0.458 ± 0.115
SerAsp: 4.435 ± 0.308
SerGlu: 3.556 ± 0.243
SerPhe: 3.207 ± 0.325
SerGly: 6.983 ± 0.598
SerHis: 0.751 ± 0.099
SerIle: 4.894 ± 0.386
SerLys: 3.592 ± 0.325
SerLeu: 4.82 ± 0.267
SerMet: 1.393 ± 0.213
SerAsn: 4.179 ± 0.347
SerPro: 2.456 ± 0.22
SerGln: 2.511 ± 0.21
SerArg: 2.199 ± 0.213
SerSer: 5.315 ± 0.465
SerThr: 5.187 ± 0.421
SerVal: 5.095 ± 0.499
SerTrp: 0.605 ± 0.117
SerTyr: 2.969 ± 0.206
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.957 ± 0.496
ThrCys: 0.458 ± 0.094
ThrAsp: 4.215 ± 0.388
ThrGlu: 4.124 ± 0.286
ThrPhe: 3.629 ± 0.292
ThrGly: 6.818 ± 0.649
ThrHis: 0.88 ± 0.129
ThrIle: 5.113 ± 0.504
ThrLys: 3.336 ± 0.253
ThrLeu: 6.158 ± 0.548
ThrMet: 1.063 ± 0.159
ThrAsn: 4.27 ± 0.53
ThrPro: 3.556 ± 0.241
ThrGln: 2.291 ± 0.195
ThrArg: 3.006 ± 0.227
ThrSer: 5.15 ± 0.362
ThrThr: 6.14 ± 0.663
ThrVal: 5.517 ± 0.517
ThrTrp: 0.733 ± 0.125
ThrTyr: 2.767 ± 0.202
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.058 ± 0.397
ValCys: 0.477 ± 0.092
ValAsp: 5.297 ± 0.418
ValGlu: 4.289 ± 0.232
ValPhe: 2.694 ± 0.209
ValGly: 5.242 ± 0.572
ValHis: 0.861 ± 0.12
ValIle: 4.124 ± 0.348
ValLys: 3.629 ± 0.303
ValLeu: 4.362 ± 0.287
ValMet: 1.338 ± 0.199
ValAsn: 3.867 ± 0.399
ValPro: 2.914 ± 0.278
ValGln: 2.254 ± 0.17
ValArg: 2.548 ± 0.245
ValSer: 4.637 ± 0.344
ValThr: 5.865 ± 0.435
ValVal: 4.71 ± 0.4
ValTrp: 0.751 ± 0.104
ValTyr: 2.969 ± 0.279
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.751 ± 0.121
TrpCys: 0.183 ± 0.059
TrpAsp: 0.916 ± 0.143
TrpGlu: 0.788 ± 0.173
TrpPhe: 0.586 ± 0.117
TrpGly: 0.825 ± 0.136
TrpHis: 0.348 ± 0.077
TrpIle: 0.586 ± 0.097
TrpLys: 0.715 ± 0.139
TrpLeu: 0.678 ± 0.148
TrpMet: 0.385 ± 0.096
TrpAsn: 0.935 ± 0.12
TrpPro: 0.22 ± 0.059
TrpGln: 0.403 ± 0.083
TrpArg: 0.641 ± 0.117
TrpSer: 0.861 ± 0.14
TrpThr: 0.953 ± 0.152
TrpVal: 0.806 ± 0.129
TrpTrp: 0.147 ± 0.055
TrpTyr: 0.458 ± 0.102
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.548 ± 0.173
TyrCys: 0.55 ± 0.1
TyrAsp: 3.079 ± 0.262
TyrGlu: 2.456 ± 0.229
TyrPhe: 1.723 ± 0.203
TyrGly: 2.603 ± 0.224
TyrHis: 0.605 ± 0.128
TyrIle: 2.804 ± 0.197
TyrLys: 2.511 ± 0.255
TyrLeu: 3.171 ± 0.248
TyrMet: 1.045 ± 0.16
TyrAsn: 3.262 ± 0.271
TyrPro: 1.814 ± 0.226
TyrGln: 1.796 ± 0.185
TyrArg: 2.126 ± 0.229
TyrSer: 2.731 ± 0.241
TyrThr: 3.556 ± 0.338
TyrVal: 2.896 ± 0.234
TyrTrp: 0.586 ± 0.125
TyrTyr: 2.126 ± 0.217
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 213 proteins (54563 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
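Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The page does not say which spread statistic is used, so taking the standard deviation over 100 rounds is an assumption here, as are the function names, the fixed seed, and the toy sequences.

```python
import random
from collections import Counter
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all dipeptides in `proteins`."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return stdev(estimates)


# Toy proteome (hypothetical one-letter sequences).
toy_proteome = ["MKKAL", "AKKG", "MGGT", "TTKKA", "GAGA"]
print(pair_per_mille(toy_proteome, "KK"), bootstrap_error(toy_proteome, "KK"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.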

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski