Amino acid dipepetide frequency for Synechococcus phage S-ShM2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.207AlaAla: 6.207 ± 0.485
0.393AlaCys: 0.393 ± 0.104
4.001AlaAsp: 4.001 ± 0.297
3.522AlaGlu: 3.522 ± 0.276
2.89AlaPhe: 2.89 ± 0.265
6.429AlaGly: 6.429 ± 0.554
0.958AlaHis: 0.958 ± 0.138
4.189AlaIle: 4.189 ± 0.385
3.83AlaLys: 3.83 ± 0.332
4.565AlaLeu: 4.565 ± 0.305
1.453AlaMet: 1.453 ± 0.192
4.018AlaAsn: 4.018 ± 0.313
2.77AlaPro: 2.77 ± 0.245
2.394AlaGln: 2.394 ± 0.232
2.702AlaArg: 2.702 ± 0.231
4.77AlaSer: 4.77 ± 0.416
5.66AlaThr: 5.66 ± 0.544
4.668AlaVal: 4.668 ± 0.321
0.598AlaTrp: 0.598 ± 0.111
2.394AlaTyr: 2.394 ± 0.205
0.0AlaXaa: 0.0 ± 0.0
Cys
0.581CysAla: 0.581 ± 0.09
0.137CysCys: 0.137 ± 0.055
0.804CysAsp: 0.804 ± 0.161
0.496CysGlu: 0.496 ± 0.117
0.513CysPhe: 0.513 ± 0.109
0.616CysGly: 0.616 ± 0.133
0.359CysHis: 0.359 ± 0.086
0.701CysIle: 0.701 ± 0.121
0.633CysLys: 0.633 ± 0.113
0.53CysLeu: 0.53 ± 0.115
0.256CysMet: 0.256 ± 0.074
0.547CysAsn: 0.547 ± 0.12
0.359CysPro: 0.359 ± 0.093
0.359CysGln: 0.359 ± 0.091
0.479CysArg: 0.479 ± 0.101
0.769CysSer: 0.769 ± 0.147
0.445CysThr: 0.445 ± 0.091
0.496CysVal: 0.496 ± 0.12
0.171CysTrp: 0.171 ± 0.065
0.581CysTyr: 0.581 ± 0.105
0.0CysXaa: 0.0 ± 0.0
Asp
5.42AspAla: 5.42 ± 0.414
0.838AspCys: 0.838 ± 0.153
4.634AspAsp: 4.634 ± 0.411
3.967AspGlu: 3.967 ± 0.296
2.667AspPhe: 2.667 ± 0.228
5.642AspGly: 5.642 ± 0.509
1.043AspHis: 1.043 ± 0.12
4.001AspIle: 4.001 ± 0.296
3.403AspLys: 3.403 ± 0.309
4.788AspLeu: 4.788 ± 0.312
1.83AspMet: 1.83 ± 0.199
3.591AspAsn: 3.591 ± 0.29
3.3AspPro: 3.3 ± 0.347
2.394AspGln: 2.394 ± 0.196
2.753AspArg: 2.753 ± 0.238
4.463AspSer: 4.463 ± 0.268
4.514AspThr: 4.514 ± 0.3
4.035AspVal: 4.035 ± 0.286
0.906AspTrp: 0.906 ± 0.14
3.18AspTyr: 3.18 ± 0.235
0.0AspXaa: 0.0 ± 0.0
Glu
3.368GluAla: 3.368 ± 0.224
0.906GluCys: 0.906 ± 0.185
3.608GluAsp: 3.608 ± 0.27
4.651GluGlu: 4.651 ± 0.529
3.044GluPhe: 3.044 ± 0.244
3.898GluGly: 3.898 ± 0.257
0.787GluHis: 0.787 ± 0.128
3.864GluIle: 3.864 ± 0.272
3.539GluLys: 3.539 ± 0.359
4.907GluLeu: 4.907 ± 0.284
1.83GluMet: 1.83 ± 0.27
3.112GluAsn: 3.112 ± 0.222
2.018GluPro: 2.018 ± 0.218
2.531GluGln: 2.531 ± 0.265
2.684GluArg: 2.684 ± 0.298
3.796GluSer: 3.796 ± 0.265
3.916GluThr: 3.916 ± 0.252
4.206GluVal: 4.206 ± 0.265
0.804GluTrp: 0.804 ± 0.134
2.479GluTyr: 2.479 ± 0.226
0.0GluXaa: 0.0 ± 0.0
Phe
2.753PheAla: 2.753 ± 0.219
0.598PheCys: 0.598 ± 0.095
3.522PheAsp: 3.522 ± 0.242
2.377PheGlu: 2.377 ± 0.193
1.949PhePhe: 1.949 ± 0.147
3.385PheGly: 3.385 ± 0.313
0.633PheHis: 0.633 ± 0.129
2.736PheIle: 2.736 ± 0.216
1.812PheLys: 1.812 ± 0.184
3.197PheLeu: 3.197 ± 0.314
0.923PheMet: 0.923 ± 0.134
2.377PheAsn: 2.377 ± 0.212
1.83PhePro: 1.83 ± 0.247
1.453PheGln: 1.453 ± 0.121
1.812PheArg: 1.812 ± 0.192
3.334PheSer: 3.334 ± 0.246
3.078PheThr: 3.078 ± 0.255
2.941PheVal: 2.941 ± 0.245
0.376PheTrp: 0.376 ± 0.087
1.744PheTyr: 1.744 ± 0.143
0.0PheXaa: 0.0 ± 0.0
Gly
6.087GlyAla: 6.087 ± 0.633
0.769GlyCys: 0.769 ± 0.149
4.753GlyAsp: 4.753 ± 0.398
4.138GlyGlu: 4.138 ± 0.333
3.009GlyPhe: 3.009 ± 0.202
8.583GlyGly: 8.583 ± 1.168
0.872GlyHis: 0.872 ± 0.138
4.873GlyIle: 4.873 ± 0.452
3.796GlyLys: 3.796 ± 0.364
4.736GlyLeu: 4.736 ± 0.379
1.778GlyMet: 1.778 ± 0.229
4.411GlyAsn: 4.411 ± 0.365
2.206GlyPro: 2.206 ± 0.284
2.89GlyGln: 2.89 ± 0.212
3.112GlyArg: 3.112 ± 0.25
6.566GlySer: 6.566 ± 0.494
7.13GlyThr: 7.13 ± 0.704
4.89GlyVal: 4.89 ± 0.337
1.094GlyTrp: 1.094 ± 0.165
3.351GlyTyr: 3.351 ± 0.321
0.0GlyXaa: 0.0 ± 0.0
His
0.821HisAla: 0.821 ± 0.139
0.222HisCys: 0.222 ± 0.063
1.18HisAsp: 1.18 ± 0.181
0.923HisGlu: 0.923 ± 0.147
0.701HisPhe: 0.701 ± 0.149
1.094HisGly: 1.094 ± 0.154
0.256HisHis: 0.256 ± 0.083
0.855HisIle: 0.855 ± 0.115
0.804HisLys: 0.804 ± 0.155
1.111HisLeu: 1.111 ± 0.14
0.41HisMet: 0.41 ± 0.088
0.718HisAsn: 0.718 ± 0.138
0.872HisPro: 0.872 ± 0.127
0.496HisGln: 0.496 ± 0.09
0.479HisArg: 0.479 ± 0.097
0.923HisSer: 0.923 ± 0.12
1.111HisThr: 1.111 ± 0.142
0.684HisVal: 0.684 ± 0.086
0.222HisTrp: 0.222 ± 0.058
0.889HisTyr: 0.889 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
4.069IleAla: 4.069 ± 0.245
0.547IleCys: 0.547 ± 0.108
5.13IleAsp: 5.13 ± 0.342
4.531IleGlu: 4.531 ± 0.306
2.496IlePhe: 2.496 ± 0.22
4.463IleGly: 4.463 ± 0.266
0.804IleHis: 0.804 ± 0.147
3.796IleIle: 3.796 ± 0.343
3.642IleLys: 3.642 ± 0.279
4.446IleLeu: 4.446 ± 0.28
1.385IleMet: 1.385 ± 0.21
3.505IleAsn: 3.505 ± 0.229
2.77IlePro: 2.77 ± 0.312
2.719IleGln: 2.719 ± 0.261
2.565IleArg: 2.565 ± 0.245
4.428IleSer: 4.428 ± 0.343
5.352IleThr: 5.352 ± 0.597
3.676IleVal: 3.676 ± 0.325
0.598IleTrp: 0.598 ± 0.117
2.274IleTyr: 2.274 ± 0.263
0.0IleXaa: 0.0 ± 0.0
Lys
3.232LysAla: 3.232 ± 0.349
0.667LysCys: 0.667 ± 0.119
3.317LysAsp: 3.317 ± 0.322
3.898LysGlu: 3.898 ± 0.422
2.753LysPhe: 2.753 ± 0.187
3.266LysGly: 3.266 ± 0.366
0.855LysHis: 0.855 ± 0.146
4.104LysIle: 4.104 ± 0.327
4.582LysLys: 4.582 ± 0.58
4.172LysLeu: 4.172 ± 0.354
1.505LysMet: 1.505 ± 0.217
2.907LysAsn: 2.907 ± 0.289
2.069LysPro: 2.069 ± 0.222
2.325LysGln: 2.325 ± 0.258
2.462LysArg: 2.462 ± 0.272
3.83LysSer: 3.83 ± 0.364
3.061LysThr: 3.061 ± 0.223
4.104LysVal: 4.104 ± 0.275
0.787LysTrp: 0.787 ± 0.13
2.684LysTyr: 2.684 ± 0.313
0.0LysXaa: 0.0 ± 0.0
Leu
4.856LeuAla: 4.856 ± 0.355
0.804LeuCys: 0.804 ± 0.157
6.019LeuAsp: 6.019 ± 0.312
3.881LeuGlu: 3.881 ± 0.317
2.992LeuPhe: 2.992 ± 0.227
4.856LeuGly: 4.856 ± 0.296
1.419LeuHis: 1.419 ± 0.168
3.762LeuIle: 3.762 ± 0.239
4.258LeuLys: 4.258 ± 0.382
4.89LeuLeu: 4.89 ± 0.385
1.539LeuMet: 1.539 ± 0.195
4.292LeuAsn: 4.292 ± 0.292
2.941LeuPro: 2.941 ± 0.327
3.214LeuGln: 3.214 ± 0.227
3.437LeuArg: 3.437 ± 0.253
4.822LeuSer: 4.822 ± 0.298
5.215LeuThr: 5.215 ± 0.456
4.206LeuVal: 4.206 ± 0.258
0.718LeuTrp: 0.718 ± 0.118
2.873LeuTyr: 2.873 ± 0.278
0.0LeuXaa: 0.0 ± 0.0
Met
1.641MetAla: 1.641 ± 0.201
0.085MetCys: 0.085 ± 0.04
1.214MetAsp: 1.214 ± 0.176
1.573MetGlu: 1.573 ± 0.182
0.787MetPhe: 0.787 ± 0.161
1.317MetGly: 1.317 ± 0.194
0.427MetHis: 0.427 ± 0.097
1.128MetIle: 1.128 ± 0.142
1.744MetLys: 1.744 ± 0.261
1.778MetLeu: 1.778 ± 0.203
0.787MetMet: 0.787 ± 0.159
1.436MetAsn: 1.436 ± 0.203
1.077MetPro: 1.077 ± 0.149
0.633MetGln: 0.633 ± 0.129
0.992MetArg: 0.992 ± 0.149
1.727MetSer: 1.727 ± 0.222
1.505MetThr: 1.505 ± 0.221
1.47MetVal: 1.47 ± 0.184
0.342MetTrp: 0.342 ± 0.071
0.821MetTyr: 0.821 ± 0.13
0.0MetXaa: 0.0 ± 0.0
Asn
3.864AsnAla: 3.864 ± 0.369
0.564AsnCys: 0.564 ± 0.093
3.061AsnAsp: 3.061 ± 0.214
2.787AsnGlu: 2.787 ± 0.241
2.684AsnPhe: 2.684 ± 0.219
4.497AsnGly: 4.497 ± 0.394
0.889AsnHis: 0.889 ± 0.114
4.001AsnIle: 4.001 ± 0.261
2.958AsnLys: 2.958 ± 0.25
4.685AsnLeu: 4.685 ± 0.481
0.923AsnMet: 0.923 ± 0.172
3.266AsnAsn: 3.266 ± 0.289
2.941AsnPro: 2.941 ± 0.195
2.206AsnGln: 2.206 ± 0.172
2.804AsnArg: 2.804 ± 0.251
4.087AsnSer: 4.087 ± 0.394
3.556AsnThr: 3.556 ± 0.294
3.83AsnVal: 3.83 ± 0.312
0.787AsnTrp: 0.787 ± 0.111
2.616AsnTyr: 2.616 ± 0.226
0.0AsnXaa: 0.0 ± 0.0
Pro
2.667ProAla: 2.667 ± 0.239
0.427ProCys: 0.427 ± 0.118
2.838ProAsp: 2.838 ± 0.278
2.804ProGlu: 2.804 ± 0.297
1.812ProPhe: 1.812 ± 0.19
3.214ProGly: 3.214 ± 0.25
0.752ProHis: 0.752 ± 0.123
2.189ProIle: 2.189 ± 0.285
2.171ProLys: 2.171 ± 0.281
2.342ProLeu: 2.342 ± 0.231
0.581ProMet: 0.581 ± 0.123
2.052ProAsn: 2.052 ± 0.201
1.966ProPro: 1.966 ± 0.234
1.676ProGln: 1.676 ± 0.186
1.761ProArg: 1.761 ± 0.148
3.044ProSer: 3.044 ± 0.244
3.129ProThr: 3.129 ± 0.252
2.924ProVal: 2.924 ± 0.267
0.462ProTrp: 0.462 ± 0.088
1.641ProTyr: 1.641 ± 0.17
0.0ProXaa: 0.0 ± 0.0
Gln
2.035GlnAla: 2.035 ± 0.178
0.308GlnCys: 0.308 ± 0.07
2.171GlnAsp: 2.171 ± 0.23
2.531GlnGlu: 2.531 ± 0.218
1.778GlnPhe: 1.778 ± 0.215
2.513GlnGly: 2.513 ± 0.227
0.53GlnHis: 0.53 ± 0.1
3.163GlnIle: 3.163 ± 0.281
2.308GlnLys: 2.308 ± 0.318
2.992GlnLeu: 2.992 ± 0.207
0.992GlnMet: 0.992 ± 0.166
2.154GlnAsn: 2.154 ± 0.156
1.214GlnPro: 1.214 ± 0.142
1.864GlnGln: 1.864 ± 0.222
1.795GlnArg: 1.795 ± 0.16
2.565GlnSer: 2.565 ± 0.203
2.445GlnThr: 2.445 ± 0.218
2.565GlnVal: 2.565 ± 0.212
0.547GlnTrp: 0.547 ± 0.089
2.035GlnTyr: 2.035 ± 0.234
0.0GlnXaa: 0.0 ± 0.0
Arg
2.684ArgAla: 2.684 ± 0.211
0.308ArgCys: 0.308 ± 0.078
2.445ArgAsp: 2.445 ± 0.218
2.599ArgGlu: 2.599 ± 0.336
1.864ArgPhe: 1.864 ± 0.182
3.044ArgGly: 3.044 ± 0.204
0.787ArgHis: 0.787 ± 0.121
2.719ArgIle: 2.719 ± 0.226
2.89ArgLys: 2.89 ± 0.359
3.112ArgLeu: 3.112 ± 0.236
1.111ArgMet: 1.111 ± 0.157
2.291ArgAsn: 2.291 ± 0.209
1.624ArgPro: 1.624 ± 0.187
1.522ArgGln: 1.522 ± 0.149
2.394ArgArg: 2.394 ± 0.361
2.736ArgSer: 2.736 ± 0.211
2.496ArgThr: 2.496 ± 0.213
3.334ArgVal: 3.334 ± 0.23
0.564ArgTrp: 0.564 ± 0.109
2.223ArgTyr: 2.223 ± 0.202
0.0ArgXaa: 0.0 ± 0.0
Ser
5.061SerAla: 5.061 ± 0.384
0.479SerCys: 0.479 ± 0.11
4.411SerAsp: 4.411 ± 0.298
3.642SerGlu: 3.642 ± 0.282
3.488SerPhe: 3.488 ± 0.299
7.198SerGly: 7.198 ± 0.675
0.906SerHis: 0.906 ± 0.131
4.617SerIle: 4.617 ± 0.357
4.035SerLys: 4.035 ± 0.333
4.77SerLeu: 4.77 ± 0.248
1.539SerMet: 1.539 ± 0.191
4.463SerAsn: 4.463 ± 0.28
2.377SerPro: 2.377 ± 0.248
2.462SerGln: 2.462 ± 0.215
2.633SerArg: 2.633 ± 0.215
5.882SerSer: 5.882 ± 0.646
5.301SerThr: 5.301 ± 0.408
4.736SerVal: 4.736 ± 0.287
0.804SerTrp: 0.804 ± 0.126
3.044SerTyr: 3.044 ± 0.276
0.0SerXaa: 0.0 ± 0.0
Thr
5.54ThrAla: 5.54 ± 0.568
0.581ThrCys: 0.581 ± 0.116
4.531ThrAsp: 4.531 ± 0.354
3.676ThrGlu: 3.676 ± 0.286
2.924ThrPhe: 2.924 ± 0.243
6.993ThrGly: 6.993 ± 0.65
0.821ThrHis: 0.821 ± 0.115
5.147ThrIle: 5.147 ± 0.48
3.42ThrLys: 3.42 ± 0.277
5.831ThrLeu: 5.831 ± 0.501
1.111ThrMet: 1.111 ± 0.15
4.48ThrAsn: 4.48 ± 0.45
2.89ThrPro: 2.89 ± 0.207
2.496ThrGln: 2.496 ± 0.174
2.582ThrArg: 2.582 ± 0.192
5.301ThrSer: 5.301 ± 0.479
6.429ThrThr: 6.429 ± 0.778
5.215ThrVal: 5.215 ± 0.538
0.838ThrTrp: 0.838 ± 0.129
2.582ThrTyr: 2.582 ± 0.215
0.0ThrXaa: 0.0 ± 0.0
Val
4.411ValAla: 4.411 ± 0.317
0.513ValCys: 0.513 ± 0.114
5.318ValAsp: 5.318 ± 0.538
4.377ValGlu: 4.377 ± 0.294
2.257ValPhe: 2.257 ± 0.162
4.599ValGly: 4.599 ± 0.277
0.735ValHis: 0.735 ± 0.133
3.83ValIle: 3.83 ± 0.304
3.42ValLys: 3.42 ± 0.264
4.463ValLeu: 4.463 ± 0.289
1.368ValMet: 1.368 ± 0.168
3.539ValAsn: 3.539 ± 0.344
3.334ValPro: 3.334 ± 0.3
2.753ValGln: 2.753 ± 0.227
2.719ValArg: 2.719 ± 0.217
5.044ValSer: 5.044 ± 0.334
5.642ValThr: 5.642 ± 0.506
4.531ValVal: 4.531 ± 0.286
0.65ValTrp: 0.65 ± 0.101
2.65ValTyr: 2.65 ± 0.198
0.0ValXaa: 0.0 ± 0.0
Trp
0.667TrpAla: 0.667 ± 0.105
0.171TrpCys: 0.171 ± 0.053
0.701TrpAsp: 0.701 ± 0.105
0.855TrpGlu: 0.855 ± 0.133
0.547TrpPhe: 0.547 ± 0.107
0.735TrpGly: 0.735 ± 0.11
0.308TrpHis: 0.308 ± 0.076
0.701TrpIle: 0.701 ± 0.123
0.872TrpLys: 0.872 ± 0.183
0.684TrpLeu: 0.684 ± 0.13
0.291TrpMet: 0.291 ± 0.064
0.94TrpAsn: 0.94 ± 0.116
0.239TrpPro: 0.239 ± 0.056
0.547TrpGln: 0.547 ± 0.103
0.53TrpArg: 0.53 ± 0.088
1.06TrpSer: 1.06 ± 0.123
0.804TrpThr: 0.804 ± 0.121
0.701TrpVal: 0.701 ± 0.123
0.103TrpTrp: 0.103 ± 0.044
0.393TrpTyr: 0.393 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.445TyrAla: 2.445 ± 0.174
0.513TyrCys: 0.513 ± 0.109
3.71TyrAsp: 3.71 ± 0.292
2.719TyrGlu: 2.719 ± 0.28
1.676TyrPhe: 1.676 ± 0.182
2.702TyrGly: 2.702 ± 0.187
0.633TyrHis: 0.633 ± 0.12
2.821TyrIle: 2.821 ± 0.23
2.462TyrLys: 2.462 ± 0.243
2.992TyrLeu: 2.992 ± 0.272
0.855TyrMet: 0.855 ± 0.147
2.907TyrAsn: 2.907 ± 0.243
1.659TyrPro: 1.659 ± 0.175
1.573TyrGln: 1.573 ± 0.157
2.052TyrArg: 2.052 ± 0.223
2.702TyrSer: 2.702 ± 0.21
2.616TyrThr: 2.616 ± 0.355
2.941TyrVal: 2.941 ± 0.223
0.479TyrTrp: 0.479 ± 0.1
2.137TyrTyr: 2.137 ± 0.206
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 230 proteins (58486 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski