Amino acid dipeptide frequency for Synechococcus phage S-SSM7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
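The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names `expected_per_mille` and `classify` are hypothetical, the threshold is assumed to be exactly the uniform expectation, and the error bars are ignored. The two sample values are the GlyGly and TrpCys entries from the table.

```python
def expected_per_mille(n_residues: int = 20) -> float:
    """Uniform expectation in per mille: 1000 / (number of possible dipeptides)."""
    return 1000.0 / (n_residues ** 2)


def classify(freq_per_mille: float, n_residues: int = 20) -> str:
    """Label a dipeptide as over- or underrepresented (assumed rule, see lead-in)."""
    expected = expected_per_mille(n_residues)
    if freq_per_mille > expected:
        return "over"   # would be marked in red
    if freq_per_mille < expected:
        return "under"  # would be marked in blue
    return "as expected"


print(expected_per_mille())  # 20 x 20 = 400 dipeptides
print(classify(7.57))        # GlyGly
print(classify(0.146))       # TrpCys
```

Passing `n_residues=21` gives the expectation for the 441-dipeptide case that includes Xaa.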

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
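As a rough sketch of how such per mille values could be obtained, the Python snippet below counts overlapping adjacent residue pairs within each protein. Several points are assumptions rather than facts about the database: the function name `dipeptide_per_mille` is hypothetical, sequences are given in one-letter code (the table uses three-letter names), pairs are not counted across protein boundaries, and the denominator is the total number of dipeptides counted; the source does not state its normalization.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2, restricted to a single protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    fraction = freqs[pair] * 1e-3  # per mille -> fraction
    print(pair, round(freqs[pair], 3), round(fraction, 6))
```

Because windows never span two proteins, the pair "AA" does not appear in the toy output even though the first sequence ends with A and the second starts with A.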

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.808 ± 0.315 | 0.518 ± 0.09 | 3.453 ± 0.249 | 3.453 ± 0.279 | 2.497 ± 0.195 | 5.339 ± 0.378 | 1.142 ± 0.143 | 3.599 ± 0.206 | 3.865 ± 0.287 | 4.396 ± 0.266 | 1.474 ± 0.153 | 3.559 ± 0.259 | 2.085 ± 0.177 | 2.152 ± 0.19 | 2.284 ± 0.182 | 4.157 ± 0.332 | 5.153 ± 0.45 | 3.812 ± 0.242 | 0.638 ± 0.087 | 2.338 ± 0.169 | 0.0 ± 0.0 |
| Cys | 0.837 ± 0.126 | 0.385 ± 0.126 | 0.757 ± 0.106 | 0.73 ± 0.115 | 0.518 ± 0.101 | 0.757 ± 0.129 | 0.305 ± 0.07 | 0.677 ± 0.104 | 0.638 ± 0.106 | 0.691 ± 0.119 | 0.292 ± 0.068 | 0.558 ± 0.11 | 0.558 ± 0.095 | 0.372 ± 0.086 | 0.651 ± 0.093 | 0.677 ± 0.13 | 0.89 ± 0.128 | 0.638 ± 0.11 | 0.186 ± 0.061 | 0.545 ± 0.088 | 0.0 ± 0.0 |
| Asp | 4.064 ± 0.208 | 0.916 ± 0.126 | 4.343 ± 0.311 | 4.489 ± 0.292 | 3.373 ± 0.236 | 4.834 ± 0.259 | 1.063 ± 0.137 | 4.303 ± 0.27 | 4.303 ± 0.287 | 5.459 ± 0.222 | 1.74 ± 0.178 | 3.413 ± 0.191 | 3.48 ± 0.315 | 2.125 ± 0.192 | 2.284 ± 0.222 | 3.945 ± 0.201 | 3.679 ± 0.243 | 4.529 ± 0.24 | 1.275 ± 0.139 | 3.838 ± 0.286 | 0.0 ± 0.0 |
| Glu | 3.546 ± 0.255 | 0.823 ± 0.123 | 4.449 ± 0.31 | 5.047 ± 0.428 | 2.882 ± 0.202 | 3.984 ± 0.263 | 1.248 ± 0.138 | 4.502 ± 0.312 | 4.688 ± 0.367 | 4.967 ± 0.381 | 1.341 ± 0.175 | 3.334 ± 0.197 | 2.045 ± 0.205 | 2.404 ± 0.195 | 2.616 ± 0.226 | 4.011 ± 0.311 | 3.068 ± 0.234 | 4.277 ± 0.217 | 0.89 ± 0.113 | 2.856 ± 0.202 | 0.0 ± 0.0 |
| Phe | 2.975 ± 0.211 | 0.385 ± 0.076 | 3.652 ± 0.285 | 2.869 ± 0.178 | 1.793 ± 0.17 | 3.041 ± 0.262 | 0.81 ± 0.103 | 2.537 ± 0.177 | 2.763 ± 0.214 | 2.882 ± 0.182 | 0.81 ± 0.109 | 2.856 ± 0.225 | 1.833 ± 0.174 | 1.687 ± 0.155 | 1.58 ± 0.162 | 3.041 ± 0.25 | 3.254 ± 0.283 | 3.041 ± 0.212 | 0.651 ± 0.094 | 1.846 ± 0.142 | 0.0 ± 0.0 |
| Gly | 5.06 ± 0.368 | 0.903 ± 0.182 | 5.1 ± 0.339 | 3.599 ± 0.26 | 3.095 ± 0.29 | 7.57 ± 0.904 | 1.116 ± 0.15 | 5.485 ± 0.543 | 4.197 ± 0.274 | 5.034 ± 0.305 | 1.913 ± 0.2 | 4.489 ± 0.324 | 1.74 ± 0.162 | 2.656 ± 0.179 | 3.108 ± 0.196 | 5.791 ± 0.454 | 6.561 ± 0.565 | 5.645 ± 0.429 | 1.076 ± 0.145 | 3.32 ± 0.21 | 0.0 ± 0.0 |
| His | 0.97 ± 0.133 | 0.292 ± 0.076 | 1.209 ± 0.122 | 1.009 ± 0.137 | 0.81 ± 0.117 | 1.341 ± 0.166 | 0.584 ± 0.101 | 1.195 ± 0.131 | 1.142 ± 0.143 | 1.248 ± 0.139 | 0.372 ± 0.088 | 0.983 ± 0.12 | 1.023 ± 0.138 | 0.545 ± 0.089 | 0.81 ± 0.129 | 1.036 ± 0.132 | 1.222 ± 0.164 | 1.036 ± 0.122 | 0.319 ± 0.067 | 0.996 ± 0.119 | 0.0 ± 0.0 |
| Ile | 4.131 ± 0.236 | 0.611 ± 0.107 | 5.007 ± 0.279 | 4.303 ± 0.262 | 2.338 ± 0.182 | 4.741 ± 0.388 | 1.036 ± 0.128 | 3.599 ± 0.268 | 4.263 ± 0.282 | 4.542 ± 0.282 | 1.58 ± 0.128 | 3.639 ± 0.244 | 2.63 ± 0.217 | 2.231 ± 0.182 | 2.616 ± 0.201 | 4.436 ± 0.3 | 5.339 ± 0.428 | 3.931 ± 0.208 | 0.518 ± 0.088 | 2.47 ± 0.203 | 0.0 ± 0.0 |
| Lys | 3.493 ± 0.268 | 0.677 ± 0.096 | 4.051 ± 0.234 | 4.795 ± 0.397 | 2.563 ± 0.186 | 3.719 ± 0.301 | 1.235 ± 0.138 | 4.37 ± 0.349 | 5.777 ± 0.65 | 5.047 ± 0.302 | 1.82 ± 0.181 | 3.599 ± 0.239 | 2.856 ± 0.29 | 2.245 ± 0.205 | 2.284 ± 0.202 | 4.396 ± 0.252 | 3.719 ± 0.212 | 4.914 ± 0.319 | 0.77 ± 0.121 | 3.413 ± 0.255 | 0.0 ± 0.0 |
| Leu | 3.918 ± 0.24 | 0.837 ± 0.139 | 5.18 ± 0.273 | 4.994 ± 0.282 | 3.108 ± 0.199 | 4.595 ± 0.294 | 1.527 ± 0.173 | 3.785 ± 0.225 | 5.459 ± 0.339 | 4.489 ± 0.291 | 1.886 ± 0.191 | 4.449 ± 0.273 | 3.108 ± 0.224 | 2.616 ± 0.188 | 2.988 ± 0.2 | 5.299 ± 0.262 | 4.888 ± 0.286 | 4.582 ± 0.274 | 0.704 ± 0.113 | 2.696 ± 0.207 | 0.0 ± 0.0 |
| Met | 1.527 ± 0.144 | 0.213 ± 0.057 | 1.302 ± 0.14 | 1.341 ± 0.173 | 0.97 ± 0.115 | 1.341 ± 0.141 | 0.305 ± 0.084 | 1.421 ± 0.175 | 1.926 ± 0.209 | 1.421 ± 0.142 | 0.691 ± 0.108 | 1.341 ± 0.154 | 1.195 ± 0.154 | 0.77 ± 0.119 | 1.036 ± 0.157 | 1.859 ± 0.193 | 1.82 ± 0.176 | 1.222 ± 0.127 | 0.239 ± 0.056 | 0.797 ± 0.103 | 0.0 ± 0.0 |
| Asn | 3.227 ± 0.265 | 0.584 ± 0.103 | 3.613 ± 0.225 | 3.214 ± 0.235 | 2.616 ± 0.208 | 4.157 ± 0.304 | 1.063 ± 0.119 | 4.17 ± 0.246 | 3.188 ± 0.245 | 4.609 ± 0.29 | 1.076 ± 0.138 | 3.108 ± 0.329 | 2.869 ± 0.204 | 1.952 ± 0.172 | 2.245 ± 0.162 | 3.573 ± 0.239 | 3.945 ± 0.224 | 4.17 ± 0.355 | 0.757 ± 0.105 | 2.431 ± 0.187 | 0.0 ± 0.0 |
| Pro | 2.005 ± 0.159 | 0.452 ± 0.081 | 3.134 ± 0.256 | 3.334 ± 0.305 | 1.873 ± 0.157 | 2.563 ± 0.212 | 0.903 ± 0.119 | 2.351 ± 0.177 | 2.59 ± 0.237 | 2.431 ± 0.182 | 0.903 ± 0.131 | 2.377 ± 0.173 | 1.74 ± 0.211 | 1.594 ± 0.164 | 1.62 ± 0.185 | 3.002 ± 0.241 | 3.201 ± 0.204 | 2.736 ± 0.178 | 0.584 ± 0.097 | 1.514 ± 0.144 | 0.0 ± 0.0 |
| Gln | 1.58 ± 0.131 | 0.398 ± 0.081 | 1.992 ± 0.174 | 2.311 ± 0.181 | 2.205 ± 0.158 | 2.457 ± 0.205 | 0.624 ± 0.098 | 2.444 ± 0.207 | 2.497 ± 0.209 | 2.563 ± 0.184 | 0.943 ± 0.121 | 1.514 ± 0.152 | 1.182 ± 0.131 | 1.647 ± 0.145 | 1.647 ± 0.132 | 2.457 ± 0.209 | 2.47 ± 0.203 | 2.351 ± 0.163 | 0.598 ± 0.103 | 1.514 ± 0.141 | 0.0 ± 0.0 |
| Arg | 2.245 ± 0.185 | 0.677 ± 0.126 | 2.59 ± 0.185 | 2.218 ± 0.2 | 1.926 ± 0.179 | 3.201 ± 0.227 | 0.757 ± 0.102 | 2.696 ± 0.205 | 3.041 ± 0.268 | 3.188 ± 0.231 | 1.036 ± 0.121 | 2.125 ± 0.167 | 1.501 ± 0.142 | 1.155 ± 0.133 | 1.766 ± 0.157 | 2.55 ± 0.194 | 2.085 ± 0.157 | 2.63 ± 0.245 | 0.571 ± 0.092 | 1.58 ± 0.173 | 0.0 ± 0.0 |
| Ser | 4.184 ± 0.245 | 0.584 ± 0.111 | 4.277 ± 0.236 | 3.227 ± 0.272 | 3.254 ± 0.19 | 7.477 ± 0.735 | 1.368 ± 0.141 | 4.516 ± 0.245 | 3.905 ± 0.252 | 4.994 ± 0.268 | 1.381 ± 0.153 | 3.918 ± 0.292 | 2.47 ± 0.163 | 2.47 ± 0.166 | 2.537 ± 0.153 | 4.795 ± 0.455 | 4.954 ± 0.523 | 4.436 ± 0.292 | 0.717 ± 0.1 | 2.856 ± 0.205 | 0.0 ± 0.0 |
| Thr | 5.193 ± 0.401 | 0.717 ± 0.134 | 4.131 ± 0.234 | 3.945 ± 0.213 | 3.573 ± 0.293 | 6.216 ± 0.598 | 0.943 ± 0.114 | 4.755 ± 0.327 | 3.905 ± 0.239 | 4.848 ± 0.274 | 1.049 ± 0.155 | 3.745 ± 0.279 | 3.4 ± 0.27 | 2.165 ± 0.199 | 2.245 ± 0.193 | 5.233 ± 0.443 | 5.432 ± 0.596 | 5.1 ± 0.388 | 0.823 ± 0.109 | 2.723 ± 0.226 | 0.0 ± 0.0 |
| Val | 3.838 ± 0.321 | 0.77 ± 0.111 | 4.808 ± 0.209 | 4.449 ± 0.229 | 2.683 ± 0.174 | 6.163 ± 0.615 | 0.89 ± 0.122 | 4.303 ± 0.232 | 4.396 ± 0.278 | 4.077 ± 0.223 | 1.235 ± 0.143 | 4.051 ± 0.235 | 2.723 ± 0.178 | 2.351 ± 0.182 | 2.55 ± 0.189 | 4.795 ± 0.341 | 5.273 ± 0.428 | 4.569 ± 0.26 | 0.598 ± 0.087 | 2.537 ± 0.244 | 0.0 ± 0.0 |
| Trp | 0.73 ± 0.101 | 0.146 ± 0.044 | 0.89 ± 0.122 | 0.903 ± 0.112 | 0.558 ± 0.104 | 0.916 ± 0.119 | 0.359 ± 0.079 | 0.916 ± 0.116 | 0.651 ± 0.096 | 0.877 ± 0.124 | 0.425 ± 0.078 | 0.77 ± 0.113 | 0.305 ± 0.063 | 0.505 ± 0.088 | 0.704 ± 0.106 | 0.837 ± 0.129 | 0.691 ± 0.096 | 0.691 ± 0.105 | 0.186 ± 0.06 | 0.545 ± 0.09 | 0.0 ± 0.0 |
| Tyr | 2.391 ± 0.16 | 0.77 ± 0.118 | 3.52 ± 0.223 | 2.749 ± 0.182 | 1.673 ± 0.161 | 3.041 ± 0.194 | 0.863 ± 0.131 | 2.338 ± 0.151 | 2.563 ± 0.196 | 3.347 ± 0.271 | 0.757 ± 0.114 | 2.736 ± 0.165 | 2.165 ± 0.202 | 1.753 ± 0.145 | 1.966 ± 0.159 | 2.431 ± 0.175 | 2.497 ± 0.187 | 2.749 ± 0.197 | 0.505 ± 0.097 | 2.377 ± 0.184 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 319 proteins (75294 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
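A protein-level bootstrap of this kind might look roughly like the Python sketch below. It is written under stated assumptions and is not the database's implementation: whole proteins are resampled with replacement, 100 replicates are drawn, and the reported error is taken to be the sample standard deviation of the replicate frequencies (the source does not say which statistic is used). The function name `bootstrap_error`, the `seed` parameter, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation (in per mille) of a dipeptide frequency over resamples."""
    rng = random.Random(seed)
    # Count dipeptides once per protein so each resample only sums counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins, not individual residues or dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy input in one-letter code, not real phage proteins.
toy = ["MKKLA", "AKK", "GGGG"]
print(bootstrap_error(toy, "KK"))
```

Resampling at the protein level keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than between individual positions.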

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski