Amino acid dipepetide frequency for Synechococcus phage S-CAM22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.006AlaAla: 7.006 ± 0.587
0.434AlaCys: 0.434 ± 0.11
4.308AlaAsp: 4.308 ± 0.336
4.182AlaGlu: 4.182 ± 0.446
2.697AlaPhe: 2.697 ± 0.238
6.535AlaGly: 6.535 ± 0.517
0.887AlaHis: 0.887 ± 0.15
4.001AlaIle: 4.001 ± 0.27
3.965AlaLys: 3.965 ± 0.407
4.852AlaLeu: 4.852 ± 0.385
1.521AlaMet: 1.521 ± 0.225
4.037AlaAsn: 4.037 ± 0.385
2.679AlaPro: 2.679 ± 0.274
2.643AlaGln: 2.643 ± 0.221
2.788AlaArg: 2.788 ± 0.281
5.431AlaSer: 5.431 ± 0.55
6.119AlaThr: 6.119 ± 0.596
4.833AlaVal: 4.833 ± 0.361
0.579AlaTrp: 0.579 ± 0.109
2.028AlaTyr: 2.028 ± 0.176
0.0AlaXaa: 0.0 ± 0.0
Cys
0.579CysAla: 0.579 ± 0.098
0.036CysCys: 0.036 ± 0.03
0.652CysAsp: 0.652 ± 0.164
0.489CysGlu: 0.489 ± 0.113
0.489CysPhe: 0.489 ± 0.119
0.597CysGly: 0.597 ± 0.139
0.308CysHis: 0.308 ± 0.084
0.453CysIle: 0.453 ± 0.13
0.615CysLys: 0.615 ± 0.121
0.634CysLeu: 0.634 ± 0.102
0.217CysMet: 0.217 ± 0.072
0.489CysAsn: 0.489 ± 0.119
0.29CysPro: 0.29 ± 0.09
0.362CysGln: 0.362 ± 0.098
0.308CysArg: 0.308 ± 0.095
0.489CysSer: 0.489 ± 0.098
0.615CysThr: 0.615 ± 0.138
0.561CysVal: 0.561 ± 0.125
0.109CysTrp: 0.109 ± 0.045
0.308CysTyr: 0.308 ± 0.072
0.0CysXaa: 0.0 ± 0.0
Asp
5.141AspAla: 5.141 ± 0.312
0.833AspCys: 0.833 ± 0.158
5.087AspAsp: 5.087 ± 0.496
4.363AspGlu: 4.363 ± 0.383
2.878AspPhe: 2.878 ± 0.226
6.01AspGly: 6.01 ± 0.422
1.104AspHis: 1.104 ± 0.185
4.164AspIle: 4.164 ± 0.3
3.621AspLys: 3.621 ± 0.384
4.689AspLeu: 4.689 ± 0.326
1.484AspMet: 1.484 ± 0.204
3.566AspAsn: 3.566 ± 0.35
3.168AspPro: 3.168 ± 0.286
2.19AspGln: 2.19 ± 0.213
2.48AspArg: 2.48 ± 0.19
4.327AspSer: 4.327 ± 0.314
4.164AspThr: 4.164 ± 0.333
4.327AspVal: 4.327 ± 0.323
1.14AspTrp: 1.14 ± 0.172
3.385AspTyr: 3.385 ± 0.265
0.0AspXaa: 0.0 ± 0.0
Glu
3.639GluAla: 3.639 ± 0.327
0.652GluCys: 0.652 ± 0.122
4.254GluAsp: 4.254 ± 0.322
4.942GluGlu: 4.942 ± 0.623
3.132GluPhe: 3.132 ± 0.259
4.146GluGly: 4.146 ± 0.285
0.941GluHis: 0.941 ± 0.167
4.345GluIle: 4.345 ± 0.421
3.856GluLys: 3.856 ± 0.48
4.526GluLeu: 4.526 ± 0.366
1.593GluMet: 1.593 ± 0.293
3.114GluAsn: 3.114 ± 0.258
1.792GluPro: 1.792 ± 0.19
2.534GluGln: 2.534 ± 0.268
2.752GluArg: 2.752 ± 0.347
3.928GluSer: 3.928 ± 0.32
3.675GluThr: 3.675 ± 0.352
4.634GluVal: 4.634 ± 0.319
0.851GluTrp: 0.851 ± 0.126
2.715GluTyr: 2.715 ± 0.236
0.0GluXaa: 0.0 ± 0.0
Phe
2.842PheAla: 2.842 ± 0.243
0.507PheCys: 0.507 ± 0.099
3.711PheAsp: 3.711 ± 0.26
2.661PheGlu: 2.661 ± 0.224
1.702PhePhe: 1.702 ± 0.196
3.114PheGly: 3.114 ± 0.189
0.543PheHis: 0.543 ± 0.114
2.679PheIle: 2.679 ± 0.228
2.172PheLys: 2.172 ± 0.216
2.969PheLeu: 2.969 ± 0.256
1.195PheMet: 1.195 ± 0.19
2.679PheAsn: 2.679 ± 0.275
1.647PhePro: 1.647 ± 0.213
1.611PheGln: 1.611 ± 0.203
1.557PheArg: 1.557 ± 0.162
3.222PheSer: 3.222 ± 0.274
3.168PheThr: 3.168 ± 0.325
2.915PheVal: 2.915 ± 0.324
0.344PheTrp: 0.344 ± 0.082
1.575PheTyr: 1.575 ± 0.155
0.0PheXaa: 0.0 ± 0.0
Gly
6.3GlyAla: 6.3 ± 0.568
0.615GlyCys: 0.615 ± 0.108
5.684GlyAsp: 5.684 ± 0.47
4.073GlyGlu: 4.073 ± 0.317
2.951GlyPhe: 2.951 ± 0.231
8.508GlyGly: 8.508 ± 1.452
1.104GlyHis: 1.104 ± 0.155
3.983GlyIle: 3.983 ± 0.304
4.109GlyLys: 4.109 ± 0.395
4.743GlyLeu: 4.743 ± 0.368
1.72GlyMet: 1.72 ± 0.285
4.58GlyAsn: 4.58 ± 0.464
2.118GlyPro: 2.118 ± 0.325
2.896GlyGln: 2.896 ± 0.271
2.498GlyArg: 2.498 ± 0.227
6.734GlySer: 6.734 ± 0.74
7.259GlyThr: 7.259 ± 0.742
5.069GlyVal: 5.069 ± 0.381
1.14GlyTrp: 1.14 ± 0.157
3.53GlyTyr: 3.53 ± 0.284
0.0GlyXaa: 0.0 ± 0.0
His
0.815HisAla: 0.815 ± 0.159
0.163HisCys: 0.163 ± 0.066
0.815HisAsp: 0.815 ± 0.139
0.959HisGlu: 0.959 ± 0.135
0.941HisPhe: 0.941 ± 0.149
1.014HisGly: 1.014 ± 0.157
0.217HisHis: 0.217 ± 0.072
0.815HisIle: 0.815 ± 0.158
0.887HisLys: 0.887 ± 0.204
1.159HisLeu: 1.159 ± 0.17
0.326HisMet: 0.326 ± 0.092
0.706HisAsn: 0.706 ± 0.122
0.833HisPro: 0.833 ± 0.173
0.416HisGln: 0.416 ± 0.108
0.742HisArg: 0.742 ± 0.14
0.742HisSer: 0.742 ± 0.116
1.05HisThr: 1.05 ± 0.165
0.978HisVal: 0.978 ± 0.145
0.217HisTrp: 0.217 ± 0.06
0.76HisTyr: 0.76 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
4.236IleAla: 4.236 ± 0.251
0.579IleCys: 0.579 ± 0.133
4.96IleAsp: 4.96 ± 0.338
4.2IleGlu: 4.2 ± 0.327
2.335IlePhe: 2.335 ± 0.194
4.363IleGly: 4.363 ± 0.343
0.76IleHis: 0.76 ± 0.144
3.421IleIle: 3.421 ± 0.279
3.783IleLys: 3.783 ± 0.322
4.182IleLeu: 4.182 ± 0.313
1.086IleMet: 1.086 ± 0.173
3.747IleAsn: 3.747 ± 0.285
2.643IlePro: 2.643 ± 0.243
2.1IleGln: 2.1 ± 0.184
2.462IleArg: 2.462 ± 0.221
4.671IleSer: 4.671 ± 0.454
5.395IleThr: 5.395 ± 0.663
3.928IleVal: 3.928 ± 0.343
0.597IleTrp: 0.597 ± 0.115
2.136IleTyr: 2.136 ± 0.249
0.0IleXaa: 0.0 ± 0.0
Lys
3.584LysAla: 3.584 ± 0.515
0.724LysCys: 0.724 ± 0.154
3.349LysAsp: 3.349 ± 0.282
4.272LysGlu: 4.272 ± 0.599
2.571LysPhe: 2.571 ± 0.303
3.693LysGly: 3.693 ± 0.388
0.815LysHis: 0.815 ± 0.15
3.82LysIle: 3.82 ± 0.24
4.978LysLys: 4.978 ± 0.818
4.652LysLeu: 4.652 ± 0.358
1.466LysMet: 1.466 ± 0.229
2.679LysAsn: 2.679 ± 0.25
1.901LysPro: 1.901 ± 0.246
2.353LysGln: 2.353 ± 0.352
2.39LysArg: 2.39 ± 0.295
3.874LysSer: 3.874 ± 0.372
3.783LysThr: 3.783 ± 0.26
4.037LysVal: 4.037 ± 0.333
0.724LysTrp: 0.724 ± 0.147
2.788LysTyr: 2.788 ± 0.341
0.0LysXaa: 0.0 ± 0.0
Leu
4.96LeuAla: 4.96 ± 0.346
0.67LeuCys: 0.67 ± 0.141
5.72LeuAsp: 5.72 ± 0.368
4.073LeuGlu: 4.073 ± 0.363
2.697LeuPhe: 2.697 ± 0.234
4.634LeuGly: 4.634 ± 0.39
1.358LeuHis: 1.358 ± 0.173
3.983LeuIle: 3.983 ± 0.295
4.544LeuLys: 4.544 ± 0.416
5.177LeuLeu: 5.177 ± 0.396
1.249LeuMet: 1.249 ± 0.264
4.109LeuAsn: 4.109 ± 0.304
2.607LeuPro: 2.607 ± 0.279
3.023LeuGln: 3.023 ± 0.27
3.204LeuArg: 3.204 ± 0.296
4.743LeuSer: 4.743 ± 0.275
5.702LeuThr: 5.702 ± 0.615
4.164LeuVal: 4.164 ± 0.285
0.652LeuTrp: 0.652 ± 0.114
3.096LeuTyr: 3.096 ± 0.252
0.0LeuXaa: 0.0 ± 0.0
Met
1.412MetAla: 1.412 ± 0.238
0.127MetCys: 0.127 ± 0.046
1.249MetAsp: 1.249 ± 0.229
1.466MetGlu: 1.466 ± 0.248
0.833MetPhe: 0.833 ± 0.147
1.231MetGly: 1.231 ± 0.221
0.344MetHis: 0.344 ± 0.094
1.05MetIle: 1.05 ± 0.208
1.629MetLys: 1.629 ± 0.32
1.665MetLeu: 1.665 ± 0.238
0.543MetMet: 0.543 ± 0.124
1.376MetAsn: 1.376 ± 0.192
1.068MetPro: 1.068 ± 0.204
0.887MetGln: 0.887 ± 0.165
0.978MetArg: 0.978 ± 0.168
1.629MetSer: 1.629 ± 0.258
1.593MetThr: 1.593 ± 0.216
0.996MetVal: 0.996 ± 0.176
0.29MetTrp: 0.29 ± 0.104
0.76MetTyr: 0.76 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
3.928AsnAla: 3.928 ± 0.384
0.543AsnCys: 0.543 ± 0.115
3.403AsnAsp: 3.403 ± 0.219
3.096AsnGlu: 3.096 ± 0.254
2.571AsnPhe: 2.571 ± 0.237
4.616AsnGly: 4.616 ± 0.528
0.815AsnHis: 0.815 ± 0.117
4.037AsnIle: 4.037 ± 0.431
3.041AsnLys: 3.041 ± 0.253
4.634AsnLeu: 4.634 ± 0.389
0.923AsnMet: 0.923 ± 0.178
3.965AsnAsn: 3.965 ± 0.435
2.878AsnPro: 2.878 ± 0.199
2.028AsnGln: 2.028 ± 0.171
1.919AsnArg: 1.919 ± 0.202
3.928AsnSer: 3.928 ± 0.388
4.363AsnThr: 4.363 ± 0.529
3.928AsnVal: 3.928 ± 0.322
0.67AsnTrp: 0.67 ± 0.122
2.1AsnTyr: 2.1 ± 0.221
0.0AsnXaa: 0.0 ± 0.0
Pro
2.697ProAla: 2.697 ± 0.278
0.29ProCys: 0.29 ± 0.077
2.734ProAsp: 2.734 ± 0.253
2.752ProGlu: 2.752 ± 0.262
1.738ProPhe: 1.738 ± 0.201
3.458ProGly: 3.458 ± 0.313
0.597ProHis: 0.597 ± 0.113
2.353ProIle: 2.353 ± 0.204
2.028ProLys: 2.028 ± 0.304
2.082ProLeu: 2.082 ± 0.214
0.525ProMet: 0.525 ± 0.128
2.39ProAsn: 2.39 ± 0.179
1.846ProPro: 1.846 ± 0.196
1.249ProGln: 1.249 ± 0.142
1.358ProArg: 1.358 ± 0.156
2.77ProSer: 2.77 ± 0.288
3.222ProThr: 3.222 ± 0.19
2.299ProVal: 2.299 ± 0.258
0.489ProTrp: 0.489 ± 0.11
1.684ProTyr: 1.684 ± 0.192
0.0ProXaa: 0.0 ± 0.0
Gln
2.1GlnAla: 2.1 ± 0.269
0.253GlnCys: 0.253 ± 0.065
2.082GlnAsp: 2.082 ± 0.188
2.48GlnGlu: 2.48 ± 0.25
1.774GlnPhe: 1.774 ± 0.191
2.209GlnGly: 2.209 ± 0.251
0.579GlnHis: 0.579 ± 0.109
2.462GlnIle: 2.462 ± 0.19
2.19GlnLys: 2.19 ± 0.375
3.023GlnLeu: 3.023 ± 0.264
0.905GlnMet: 0.905 ± 0.133
1.937GlnAsn: 1.937 ± 0.22
1.122GlnPro: 1.122 ± 0.132
1.72GlnGln: 1.72 ± 0.209
1.521GlnArg: 1.521 ± 0.193
2.607GlnSer: 2.607 ± 0.233
2.552GlnThr: 2.552 ± 0.243
2.842GlnVal: 2.842 ± 0.233
0.453GlnTrp: 0.453 ± 0.103
1.81GlnTyr: 1.81 ± 0.181
0.0GlnXaa: 0.0 ± 0.0
Arg
2.643ArgAla: 2.643 ± 0.284
0.344ArgCys: 0.344 ± 0.066
2.064ArgAsp: 2.064 ± 0.166
2.661ArgGlu: 2.661 ± 0.324
1.846ArgPhe: 1.846 ± 0.171
2.824ArgGly: 2.824 ± 0.313
0.634ArgHis: 0.634 ± 0.126
2.896ArgIle: 2.896 ± 0.217
2.408ArgLys: 2.408 ± 0.39
3.222ArgLeu: 3.222 ± 0.244
1.05ArgMet: 1.05 ± 0.209
1.81ArgAsn: 1.81 ± 0.178
1.303ArgPro: 1.303 ± 0.155
1.394ArgGln: 1.394 ± 0.163
2.136ArgArg: 2.136 ± 0.358
2.625ArgSer: 2.625 ± 0.22
2.353ArgThr: 2.353 ± 0.271
2.589ArgVal: 2.589 ± 0.256
0.344ArgTrp: 0.344 ± 0.091
2.046ArgTyr: 2.046 ± 0.229
0.0ArgXaa: 0.0 ± 0.0
Ser
5.521SerAla: 5.521 ± 0.367
0.362SerCys: 0.362 ± 0.092
3.928SerAsp: 3.928 ± 0.271
3.838SerGlu: 3.838 ± 0.282
3.421SerPhe: 3.421 ± 0.307
7.694SerGly: 7.694 ± 0.797
0.634SerHis: 0.634 ± 0.09
4.49SerIle: 4.49 ± 0.481
3.657SerLys: 3.657 ± 0.381
4.544SerLeu: 4.544 ± 0.268
1.376SerMet: 1.376 ± 0.229
4.218SerAsn: 4.218 ± 0.355
3.222SerPro: 3.222 ± 0.36
2.589SerGln: 2.589 ± 0.21
2.408SerArg: 2.408 ± 0.216
5.974SerSer: 5.974 ± 0.66
5.883SerThr: 5.883 ± 0.494
4.942SerVal: 4.942 ± 0.502
0.634SerTrp: 0.634 ± 0.134
2.679SerTyr: 2.679 ± 0.221
0.0SerXaa: 0.0 ± 0.0
Thr
6.028ThrAla: 6.028 ± 0.636
0.398ThrCys: 0.398 ± 0.104
4.598ThrAsp: 4.598 ± 0.418
4.109ThrGlu: 4.109 ± 0.342
3.512ThrPhe: 3.512 ± 0.456
6.861ThrGly: 6.861 ± 0.851
0.996ThrHis: 0.996 ± 0.145
5.431ThrIle: 5.431 ± 0.548
3.82ThrLys: 3.82 ± 0.267
6.01ThrLeu: 6.01 ± 0.456
1.376ThrMet: 1.376 ± 0.193
4.29ThrAsn: 4.29 ± 0.573
2.824ThrPro: 2.824 ± 0.257
2.552ThrGln: 2.552 ± 0.204
2.661ThrArg: 2.661 ± 0.276
5.811ThrSer: 5.811 ± 0.719
5.938ThrThr: 5.938 ± 0.791
6.481ThrVal: 6.481 ± 0.717
0.652ThrTrp: 0.652 ± 0.106
2.788ThrTyr: 2.788 ± 0.214
0.0ThrXaa: 0.0 ± 0.0
Val
4.779ValAla: 4.779 ± 0.421
0.398ValCys: 0.398 ± 0.089
5.413ValAsp: 5.413 ± 0.41
4.435ValGlu: 4.435 ± 0.293
2.734ValPhe: 2.734 ± 0.252
5.141ValGly: 5.141 ± 0.325
0.959ValHis: 0.959 ± 0.165
3.892ValIle: 3.892 ± 0.267
3.783ValLys: 3.783 ± 0.253
4.254ValLeu: 4.254 ± 0.257
1.34ValMet: 1.34 ± 0.197
4.272ValAsn: 4.272 ± 0.335
2.969ValPro: 2.969 ± 0.239
2.172ValGln: 2.172 ± 0.21
2.534ValArg: 2.534 ± 0.225
5.123ValSer: 5.123 ± 0.394
6.083ValThr: 6.083 ± 0.669
4.888ValVal: 4.888 ± 0.368
0.579ValTrp: 0.579 ± 0.087
2.408ValTyr: 2.408 ± 0.274
0.0ValXaa: 0.0 ± 0.0
Trp
0.833TrpAla: 0.833 ± 0.137
0.145TrpCys: 0.145 ± 0.056
0.851TrpAsp: 0.851 ± 0.164
0.634TrpGlu: 0.634 ± 0.163
0.434TrpPhe: 0.434 ± 0.085
0.615TrpGly: 0.615 ± 0.117
0.398TrpHis: 0.398 ± 0.11
0.67TrpIle: 0.67 ± 0.115
0.797TrpLys: 0.797 ± 0.152
0.688TrpLeu: 0.688 ± 0.143
0.344TrpMet: 0.344 ± 0.098
0.688TrpAsn: 0.688 ± 0.128
0.217TrpPro: 0.217 ± 0.066
0.416TrpGln: 0.416 ± 0.089
0.398TrpArg: 0.398 ± 0.106
0.76TrpSer: 0.76 ± 0.114
0.996TrpThr: 0.996 ± 0.146
0.724TrpVal: 0.724 ± 0.143
0.145TrpTrp: 0.145 ± 0.05
0.344TrpTyr: 0.344 ± 0.073
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.48TyrAla: 2.48 ± 0.194
0.525TyrCys: 0.525 ± 0.123
3.259TyrAsp: 3.259 ± 0.268
2.335TyrGlu: 2.335 ± 0.268
1.557TyrPhe: 1.557 ± 0.19
2.353TyrGly: 2.353 ± 0.258
0.561TyrHis: 0.561 ± 0.09
2.643TyrIle: 2.643 ± 0.29
2.571TyrLys: 2.571 ± 0.242
2.661TyrLeu: 2.661 ± 0.253
0.869TyrMet: 0.869 ± 0.149
2.752TyrAsn: 2.752 ± 0.268
1.43TyrPro: 1.43 ± 0.182
1.484TyrGln: 1.484 ± 0.155
2.1TyrArg: 2.1 ± 0.218
2.625TyrSer: 2.625 ± 0.3
3.222TyrThr: 3.222 ± 0.391
3.023TyrVal: 3.023 ± 0.259
0.489TyrTrp: 0.489 ± 0.123
2.118TyrTyr: 2.118 ± 0.201
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 214 proteins (55241 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski