Amino acid dipepetide frequency for Synechococcus phage S-N03

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.722AlaAla: 7.722 ± 0.51
0.422AlaCys: 0.422 ± 0.119
5.028AlaAsp: 5.028 ± 0.36
5.148AlaGlu: 5.148 ± 0.311
2.836AlaPhe: 2.836 ± 0.227
6.596AlaGly: 6.596 ± 0.475
1.146AlaHis: 1.146 ± 0.177
4.082AlaIle: 4.082 ± 0.234
4.746AlaLys: 4.746 ± 0.469
5.993AlaLeu: 5.993 ± 0.426
1.669AlaMet: 1.669 ± 0.183
3.72AlaAsn: 3.72 ± 0.304
3.198AlaPro: 3.198 ± 0.27
2.675AlaGln: 2.675 ± 0.265
3.338AlaArg: 3.338 ± 0.291
5.148AlaSer: 5.148 ± 0.379
5.028AlaThr: 5.028 ± 0.442
5.349AlaVal: 5.349 ± 0.335
0.965AlaTrp: 0.965 ± 0.14
2.413AlaTyr: 2.413 ± 0.253
0.0AlaXaa: 0.0 ± 0.0
Cys
0.362CysAla: 0.362 ± 0.103
0.121CysCys: 0.121 ± 0.056
0.684CysAsp: 0.684 ± 0.118
0.905CysGlu: 0.905 ± 0.129
0.302CysPhe: 0.302 ± 0.073
0.744CysGly: 0.744 ± 0.153
0.342CysHis: 0.342 ± 0.091
0.241CysIle: 0.241 ± 0.067
0.382CysLys: 0.382 ± 0.091
0.744CysLeu: 0.744 ± 0.159
0.181CysMet: 0.181 ± 0.063
0.442CysAsn: 0.442 ± 0.107
0.342CysPro: 0.342 ± 0.099
0.382CysGln: 0.382 ± 0.108
0.342CysArg: 0.342 ± 0.096
0.603CysSer: 0.603 ± 0.101
0.523CysThr: 0.523 ± 0.105
0.825CysVal: 0.825 ± 0.121
0.181CysTrp: 0.181 ± 0.065
0.362CysTyr: 0.362 ± 0.097
0.0CysXaa: 0.0 ± 0.0
Asp
5.188AspAla: 5.188 ± 0.336
0.563AspCys: 0.563 ± 0.101
4.283AspAsp: 4.283 ± 0.357
4.283AspGlu: 4.283 ± 0.269
2.896AspPhe: 2.896 ± 0.259
5.671AspGly: 5.671 ± 0.4
1.026AspHis: 1.026 ± 0.143
3.861AspIle: 3.861 ± 0.291
3.761AspLys: 3.761 ± 0.369
6.154AspLeu: 6.154 ± 0.368
1.327AspMet: 1.327 ± 0.159
3.198AspAsn: 3.198 ± 0.285
3.238AspPro: 3.238 ± 0.361
2.313AspGln: 2.313 ± 0.223
2.976AspArg: 2.976 ± 0.258
3.7AspSer: 3.7 ± 0.337
3.962AspThr: 3.962 ± 0.3
4.444AspVal: 4.444 ± 0.309
0.925AspTrp: 0.925 ± 0.151
3.338AspTyr: 3.338 ± 0.303
0.0AspXaa: 0.0 ± 0.0
Glu
6.033GluAla: 6.033 ± 0.427
0.804GluCys: 0.804 ± 0.122
4.384GluAsp: 4.384 ± 0.345
5.973GluGlu: 5.973 ± 0.542
3.057GluPhe: 3.057 ± 0.273
4.786GluGly: 4.786 ± 0.331
1.006GluHis: 1.006 ± 0.139
3.7GluIle: 3.7 ± 0.274
4.203GluLys: 4.203 ± 0.502
6.134GluLeu: 6.134 ± 0.318
1.468GluMet: 1.468 ± 0.211
2.675GluAsn: 2.675 ± 0.216
2.393GluPro: 2.393 ± 0.356
2.795GluGln: 2.795 ± 0.274
3.057GluArg: 3.057 ± 0.245
3.298GluSer: 3.298 ± 0.342
4.022GluThr: 4.022 ± 0.322
4.404GluVal: 4.404 ± 0.337
0.865GluTrp: 0.865 ± 0.159
2.272GluTyr: 2.272 ± 0.205
0.0GluXaa: 0.0 ± 0.0
Phe
2.856PheAla: 2.856 ± 0.272
0.322PheCys: 0.322 ± 0.097
2.896PheAsp: 2.896 ± 0.254
2.172PheGlu: 2.172 ± 0.26
1.307PhePhe: 1.307 ± 0.184
2.775PheGly: 2.775 ± 0.177
0.724PheHis: 0.724 ± 0.137
2.313PheIle: 2.313 ± 0.237
1.649PheLys: 1.649 ± 0.21
2.896PheLeu: 2.896 ± 0.229
1.146PheMet: 1.146 ± 0.143
1.91PheAsn: 1.91 ± 0.191
1.367PhePro: 1.367 ± 0.17
1.408PheGln: 1.408 ± 0.175
1.85PheArg: 1.85 ± 0.162
2.755PheSer: 2.755 ± 0.291
3.218PheThr: 3.218 ± 0.306
2.956PheVal: 2.956 ± 0.328
0.362PheTrp: 0.362 ± 0.097
1.669PheTyr: 1.669 ± 0.178
0.0PheXaa: 0.0 ± 0.0
Gly
5.731GlyAla: 5.731 ± 0.371
0.463GlyCys: 0.463 ± 0.086
5.731GlyAsp: 5.731 ± 0.428
5.048GlyGlu: 5.048 ± 0.392
3.057GlyPhe: 3.057 ± 0.234
8.285GlyGly: 8.285 ± 0.929
1.066GlyHis: 1.066 ± 0.164
3.499GlyIle: 3.499 ± 0.219
4.585GlyLys: 4.585 ± 0.57
5.55GlyLeu: 5.55 ± 0.343
1.77GlyMet: 1.77 ± 0.23
3.66GlyAsn: 3.66 ± 0.345
2.494GlyPro: 2.494 ± 0.286
2.976GlyGln: 2.976 ± 0.345
3.238GlyArg: 3.238 ± 0.22
5.349GlySer: 5.349 ± 0.433
6.596GlyThr: 6.596 ± 0.523
5.128GlyVal: 5.128 ± 0.398
1.046GlyTrp: 1.046 ± 0.15
3.097GlyTyr: 3.097 ± 0.264
0.0GlyXaa: 0.0 ± 0.0
His
0.704HisAla: 0.704 ± 0.114
0.181HisCys: 0.181 ± 0.063
1.388HisAsp: 1.388 ± 0.214
0.925HisGlu: 0.925 ± 0.138
0.784HisPhe: 0.784 ± 0.114
1.146HisGly: 1.146 ± 0.143
0.583HisHis: 0.583 ± 0.124
0.865HisIle: 0.865 ± 0.132
1.146HisLys: 1.146 ± 0.183
1.589HisLeu: 1.589 ± 0.197
0.342HisMet: 0.342 ± 0.08
0.985HisAsn: 0.985 ± 0.145
0.865HisPro: 0.865 ± 0.121
0.804HisGln: 0.804 ± 0.128
0.965HisArg: 0.965 ± 0.164
1.026HisSer: 1.026 ± 0.128
0.965HisThr: 0.965 ± 0.144
0.845HisVal: 0.845 ± 0.13
0.161HisTrp: 0.161 ± 0.055
0.784HisTyr: 0.784 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
4.183IleAla: 4.183 ± 0.335
0.503IleCys: 0.503 ± 0.103
4.163IleAsp: 4.163 ± 0.306
3.801IleGlu: 3.801 ± 0.308
1.609IlePhe: 1.609 ± 0.173
3.238IleGly: 3.238 ± 0.273
1.026IleHis: 1.026 ± 0.145
2.755IleIle: 2.755 ± 0.28
2.996IleLys: 2.996 ± 0.248
3.56IleLeu: 3.56 ± 0.281
0.965IleMet: 0.965 ± 0.126
2.775IleAsn: 2.775 ± 0.268
2.293IlePro: 2.293 ± 0.227
2.011IleGln: 2.011 ± 0.195
2.474IleArg: 2.474 ± 0.266
3.439IleSer: 3.439 ± 0.338
4.605IleThr: 4.605 ± 0.335
3.479IleVal: 3.479 ± 0.254
0.523IleTrp: 0.523 ± 0.104
2.051IleTyr: 2.051 ± 0.204
0.0IleXaa: 0.0 ± 0.0
Lys
4.806LysAla: 4.806 ± 0.533
0.563LysCys: 0.563 ± 0.107
3.419LysAsp: 3.419 ± 0.306
4.967LysGlu: 4.967 ± 0.633
2.071LysPhe: 2.071 ± 0.208
3.861LysGly: 3.861 ± 0.457
1.327LysHis: 1.327 ± 0.19
2.856LysIle: 2.856 ± 0.235
5.168LysLys: 5.168 ± 0.817
5.028LysLeu: 5.028 ± 0.382
1.689LysMet: 1.689 ± 0.213
2.655LysAsn: 2.655 ± 0.227
2.232LysPro: 2.232 ± 0.223
2.333LysGln: 2.333 ± 0.254
2.916LysArg: 2.916 ± 0.335
2.494LysSer: 2.494 ± 0.248
3.379LysThr: 3.379 ± 0.324
3.921LysVal: 3.921 ± 0.327
0.764LysTrp: 0.764 ± 0.123
2.333LysTyr: 2.333 ± 0.253
0.0LysXaa: 0.0 ± 0.0
Leu
5.892LeuAla: 5.892 ± 0.372
0.825LeuCys: 0.825 ± 0.157
5.209LeuAsp: 5.209 ± 0.343
5.912LeuGlu: 5.912 ± 0.359
2.655LeuPhe: 2.655 ± 0.237
4.424LeuGly: 4.424 ± 0.378
1.569LeuHis: 1.569 ± 0.192
3.801LeuIle: 3.801 ± 0.248
4.806LeuLys: 4.806 ± 0.389
5.571LeuLeu: 5.571 ± 0.452
2.433LeuMet: 2.433 ± 0.251
4.123LeuAsn: 4.123 ± 0.292
3.479LeuPro: 3.479 ± 0.231
3.057LeuGln: 3.057 ± 0.274
3.861LeuArg: 3.861 ± 0.29
4.686LeuSer: 4.686 ± 0.279
6.154LeuThr: 6.154 ± 0.466
4.565LeuVal: 4.565 ± 0.3
0.945LeuTrp: 0.945 ± 0.181
2.755LeuTyr: 2.755 ± 0.244
0.0LeuXaa: 0.0 ± 0.0
Met
2.091MetAla: 2.091 ± 0.215
0.181MetCys: 0.181 ± 0.059
1.267MetAsp: 1.267 ± 0.159
1.267MetGlu: 1.267 ± 0.147
1.086MetPhe: 1.086 ± 0.16
1.589MetGly: 1.589 ± 0.19
0.442MetHis: 0.442 ± 0.098
0.925MetIle: 0.925 ± 0.138
1.951MetLys: 1.951 ± 0.264
1.347MetLeu: 1.347 ± 0.168
0.583MetMet: 0.583 ± 0.116
1.408MetAsn: 1.408 ± 0.211
1.026MetPro: 1.026 ± 0.147
0.925MetGln: 0.925 ± 0.151
1.247MetArg: 1.247 ± 0.176
2.393MetSer: 2.393 ± 0.194
2.071MetThr: 2.071 ± 0.241
1.146MetVal: 1.146 ± 0.14
0.241MetTrp: 0.241 ± 0.075
0.603MetTyr: 0.603 ± 0.118
0.0MetXaa: 0.0 ± 0.0
Asn
3.278AsnAla: 3.278 ± 0.252
0.563AsnCys: 0.563 ± 0.109
2.715AsnAsp: 2.715 ± 0.198
2.554AsnGlu: 2.554 ± 0.238
2.554AsnPhe: 2.554 ± 0.229
4.263AsnGly: 4.263 ± 0.358
0.724AsnHis: 0.724 ± 0.115
2.775AsnIle: 2.775 ± 0.264
2.896AsnLys: 2.896 ± 0.25
4.223AsnLeu: 4.223 ± 0.316
1.066AsnMet: 1.066 ± 0.16
3.318AsnAsn: 3.318 ± 0.361
2.775AsnPro: 2.775 ± 0.289
1.991AsnGln: 1.991 ± 0.189
2.232AsnArg: 2.232 ± 0.257
3.157AsnSer: 3.157 ± 0.274
3.137AsnThr: 3.137 ± 0.307
3.62AsnVal: 3.62 ± 0.3
0.644AsnTrp: 0.644 ± 0.103
2.353AsnTyr: 2.353 ± 0.24
0.0AsnXaa: 0.0 ± 0.0
Pro
3.117ProAla: 3.117 ± 0.269
0.362ProCys: 0.362 ± 0.083
2.655ProAsp: 2.655 ± 0.268
3.218ProGlu: 3.218 ± 0.379
1.528ProPhe: 1.528 ± 0.195
3.519ProGly: 3.519 ± 0.291
0.503ProHis: 0.503 ± 0.104
1.85ProIle: 1.85 ± 0.217
2.494ProLys: 2.494 ± 0.332
2.353ProLeu: 2.353 ± 0.212
0.825ProMet: 0.825 ± 0.125
2.031ProAsn: 2.031 ± 0.226
1.267ProPro: 1.267 ± 0.193
1.468ProGln: 1.468 ± 0.169
1.77ProArg: 1.77 ± 0.227
2.936ProSer: 2.936 ± 0.293
3.982ProThr: 3.982 ± 0.276
3.258ProVal: 3.258 ± 0.263
0.463ProTrp: 0.463 ± 0.09
1.347ProTyr: 1.347 ± 0.202
0.0ProXaa: 0.0 ± 0.0
Gln
2.594GlnAla: 2.594 ± 0.233
0.322GlnCys: 0.322 ± 0.079
2.252GlnAsp: 2.252 ± 0.212
2.333GlnGlu: 2.333 ± 0.223
1.448GlnPhe: 1.448 ± 0.191
3.037GlnGly: 3.037 ± 0.325
0.744GlnHis: 0.744 ± 0.113
2.413GlnIle: 2.413 ± 0.251
2.112GlnLys: 2.112 ± 0.251
3.6GlnLeu: 3.6 ± 0.25
0.985GlnMet: 0.985 ± 0.134
1.689GlnAsn: 1.689 ± 0.217
1.548GlnPro: 1.548 ± 0.184
1.89GlnGln: 1.89 ± 0.211
2.011GlnArg: 2.011 ± 0.228
2.433GlnSer: 2.433 ± 0.221
2.474GlnThr: 2.474 ± 0.24
2.554GlnVal: 2.554 ± 0.231
0.583GlnTrp: 0.583 ± 0.124
1.669GlnTyr: 1.669 ± 0.161
0.0GlnXaa: 0.0 ± 0.0
Arg
3.077ArgAla: 3.077 ± 0.276
0.583ArgCys: 0.583 ± 0.108
2.936ArgAsp: 2.936 ± 0.249
2.815ArgGlu: 2.815 ± 0.256
1.548ArgPhe: 1.548 ± 0.18
3.077ArgGly: 3.077 ± 0.269
0.865ArgHis: 0.865 ± 0.138
2.554ArgIle: 2.554 ± 0.249
3.198ArgLys: 3.198 ± 0.337
3.419ArgLeu: 3.419 ± 0.272
1.307ArgMet: 1.307 ± 0.171
2.534ArgAsn: 2.534 ± 0.279
1.669ArgPro: 1.669 ± 0.185
1.669ArgGln: 1.669 ± 0.194
2.755ArgArg: 2.755 ± 0.251
2.474ArgSer: 2.474 ± 0.234
2.534ArgThr: 2.534 ± 0.238
3.278ArgVal: 3.278 ± 0.263
0.784ArgTrp: 0.784 ± 0.145
2.474ArgTyr: 2.474 ± 0.237
0.0ArgXaa: 0.0 ± 0.0
Ser
5.048SerAla: 5.048 ± 0.399
0.402SerCys: 0.402 ± 0.106
4.384SerAsp: 4.384 ± 0.365
3.921SerGlu: 3.921 ± 0.294
2.453SerPhe: 2.453 ± 0.258
6.496SerGly: 6.496 ± 0.532
0.945SerHis: 0.945 ± 0.133
3.6SerIle: 3.6 ± 0.29
2.996SerLys: 2.996 ± 0.288
4.645SerLeu: 4.645 ± 0.315
1.468SerMet: 1.468 ± 0.195
3.278SerAsn: 3.278 ± 0.265
2.916SerPro: 2.916 ± 0.215
2.333SerGln: 2.333 ± 0.195
2.433SerArg: 2.433 ± 0.209
5.068SerSer: 5.068 ± 0.382
4.565SerThr: 4.565 ± 0.508
4.444SerVal: 4.444 ± 0.317
0.704SerTrp: 0.704 ± 0.134
2.413SerTyr: 2.413 ± 0.241
0.0SerXaa: 0.0 ± 0.0
Thr
5.55ThrAla: 5.55 ± 0.428
0.583ThrCys: 0.583 ± 0.132
4.746ThrAsp: 4.746 ± 0.313
4.082ThrGlu: 4.082 ± 0.341
3.218ThrPhe: 3.218 ± 0.308
6.596ThrGly: 6.596 ± 0.531
0.945ThrHis: 0.945 ± 0.136
3.962ThrIle: 3.962 ± 0.379
3.761ThrLys: 3.761 ± 0.317
5.711ThrLeu: 5.711 ± 0.441
1.408ThrMet: 1.408 ± 0.186
3.72ThrAsn: 3.72 ± 0.394
3.66ThrPro: 3.66 ± 0.343
2.574ThrGln: 2.574 ± 0.278
2.453ThrArg: 2.453 ± 0.329
5.148ThrSer: 5.148 ± 0.446
5.953ThrThr: 5.953 ± 0.535
4.967ThrVal: 4.967 ± 0.353
0.925ThrTrp: 0.925 ± 0.131
2.836ThrTyr: 2.836 ± 0.216
0.0ThrXaa: 0.0 ± 0.0
Val
5.812ValAla: 5.812 ± 0.408
0.664ValCys: 0.664 ± 0.121
4.645ValAsp: 4.645 ± 0.442
4.605ValGlu: 4.605 ± 0.369
2.232ValPhe: 2.232 ± 0.208
4.746ValGly: 4.746 ± 0.35
1.066ValHis: 1.066 ± 0.161
3.66ValIle: 3.66 ± 0.279
3.399ValLys: 3.399 ± 0.265
4.364ValLeu: 4.364 ± 0.306
1.951ValMet: 1.951 ± 0.219
3.358ValAsn: 3.358 ± 0.293
2.896ValPro: 2.896 ± 0.281
2.554ValGln: 2.554 ± 0.273
3.077ValArg: 3.077 ± 0.276
4.907ValSer: 4.907 ± 0.28
5.41ValThr: 5.41 ± 0.455
4.283ValVal: 4.283 ± 0.313
0.925ValTrp: 0.925 ± 0.156
2.634ValTyr: 2.634 ± 0.271
0.0ValXaa: 0.0 ± 0.0
Trp
1.026TrpAla: 1.026 ± 0.173
0.121TrpCys: 0.121 ± 0.051
1.146TrpAsp: 1.146 ± 0.146
0.925TrpGlu: 0.925 ± 0.149
0.402TrpPhe: 0.402 ± 0.081
0.724TrpGly: 0.724 ± 0.126
0.261TrpHis: 0.261 ± 0.082
0.664TrpIle: 0.664 ± 0.111
0.543TrpLys: 0.543 ± 0.101
0.925TrpLeu: 0.925 ± 0.126
0.402TrpMet: 0.402 ± 0.101
0.724TrpAsn: 0.724 ± 0.141
0.181TrpPro: 0.181 ± 0.064
0.664TrpGln: 0.664 ± 0.12
0.623TrpArg: 0.623 ± 0.129
0.905TrpSer: 0.905 ± 0.129
1.046TrpThr: 1.046 ± 0.166
0.845TrpVal: 0.845 ± 0.135
0.322TrpTrp: 0.322 ± 0.081
0.603TrpTyr: 0.603 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.152TyrAla: 2.152 ± 0.202
0.463TyrCys: 0.463 ± 0.093
3.177TyrAsp: 3.177 ± 0.283
2.474TyrGlu: 2.474 ± 0.217
1.508TyrPhe: 1.508 ± 0.161
2.795TyrGly: 2.795 ± 0.286
0.744TyrHis: 0.744 ± 0.12
2.071TyrIle: 2.071 ± 0.212
2.011TyrLys: 2.011 ± 0.226
2.916TyrLeu: 2.916 ± 0.272
0.784TyrMet: 0.784 ± 0.141
2.634TyrAsn: 2.634 ± 0.193
1.106TyrPro: 1.106 ± 0.151
1.87TyrGln: 1.87 ± 0.2
1.931TyrArg: 1.931 ± 0.216
2.594TyrSer: 2.594 ± 0.248
3.177TyrThr: 3.177 ± 0.271
2.876TyrVal: 2.876 ± 0.252
0.724TyrTrp: 0.724 ± 0.106
2.031TyrTyr: 2.031 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 247 proteins (49727 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski