Amino acid dipepetide frequency for Prochlorococcus phage MED4-213

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.054AlaAla: 6.054 ± 0.41
0.579AlaCys: 0.579 ± 0.092
4.229AlaAsp: 4.229 ± 0.314
3.001AlaGlu: 3.001 ± 0.286
2.422AlaPhe: 2.422 ± 0.254
5.159AlaGly: 5.159 ± 0.375
0.965AlaHis: 0.965 ± 0.117
4.036AlaIle: 4.036 ± 0.366
3.983AlaLys: 3.983 ± 0.396
4.562AlaLeu: 4.562 ± 0.342
1.404AlaMet: 1.404 ± 0.181
4.475AlaAsn: 4.475 ± 0.375
2.632AlaPro: 2.632 ± 0.284
2.597AlaGln: 2.597 ± 0.193
2.404AlaArg: 2.404 ± 0.193
5.422AlaSer: 5.422 ± 0.414
7.036AlaThr: 7.036 ± 0.678
4.439AlaVal: 4.439 ± 0.321
0.667AlaTrp: 0.667 ± 0.114
2.667AlaTyr: 2.667 ± 0.229
0.0AlaXaa: 0.0 ± 0.0
Cys
0.719CysAla: 0.719 ± 0.104
0.105CysCys: 0.105 ± 0.039
0.807CysAsp: 0.807 ± 0.146
0.614CysGlu: 0.614 ± 0.11
0.404CysPhe: 0.404 ± 0.082
0.439CysGly: 0.439 ± 0.097
0.281CysHis: 0.281 ± 0.074
0.509CysIle: 0.509 ± 0.099
0.421CysLys: 0.421 ± 0.097
0.526CysLeu: 0.526 ± 0.096
0.211CysMet: 0.211 ± 0.074
0.509CysAsn: 0.509 ± 0.104
0.491CysPro: 0.491 ± 0.1
0.368CysGln: 0.368 ± 0.067
0.351CysArg: 0.351 ± 0.078
0.544CysSer: 0.544 ± 0.112
0.719CysThr: 0.719 ± 0.118
0.579CysVal: 0.579 ± 0.098
0.088CysTrp: 0.088 ± 0.041
0.386CysTyr: 0.386 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
4.948AspAla: 4.948 ± 0.274
0.562AspCys: 0.562 ± 0.098
4.492AspAsp: 4.492 ± 0.414
4.246AspGlu: 4.246 ± 0.413
2.878AspPhe: 2.878 ± 0.206
5.282AspGly: 5.282 ± 0.364
0.965AspHis: 0.965 ± 0.186
4.106AspIle: 4.106 ± 0.299
3.878AspLys: 3.878 ± 0.408
5.036AspLeu: 5.036 ± 0.366
1.667AspMet: 1.667 ± 0.212
4.001AspAsn: 4.001 ± 0.36
3.123AspPro: 3.123 ± 0.311
2.141AspGln: 2.141 ± 0.229
2.492AspArg: 2.492 ± 0.212
4.124AspSer: 4.124 ± 0.238
4.597AspThr: 4.597 ± 0.347
4.352AspVal: 4.352 ± 0.307
0.86AspTrp: 0.86 ± 0.166
3.211AspTyr: 3.211 ± 0.265
0.0AspXaa: 0.0 ± 0.0
Glu
3.281GluAla: 3.281 ± 0.215
0.509GluCys: 0.509 ± 0.098
3.931GluAsp: 3.931 ± 0.363
4.65GluGlu: 4.65 ± 0.542
2.702GluPhe: 2.702 ± 0.246
3.632GluGly: 3.632 ± 0.301
1.369GluHis: 1.369 ± 0.22
4.071GluIle: 4.071 ± 0.306
3.72GluLys: 3.72 ± 0.447
4.72GluLeu: 4.72 ± 0.355
1.492GluMet: 1.492 ± 0.233
2.808GluAsn: 2.808 ± 0.259
1.597GluPro: 1.597 ± 0.164
2.422GluGln: 2.422 ± 0.259
2.281GluArg: 2.281 ± 0.237
3.299GluSer: 3.299 ± 0.241
3.825GluThr: 3.825 ± 0.255
3.825GluVal: 3.825 ± 0.261
0.965GluTrp: 0.965 ± 0.161
2.439GluTyr: 2.439 ± 0.234
0.0GluXaa: 0.0 ± 0.0
Phe
2.72PheAla: 2.72 ± 0.213
0.562PheCys: 0.562 ± 0.107
3.176PheAsp: 3.176 ± 0.244
2.597PheGlu: 2.597 ± 0.213
1.562PhePhe: 1.562 ± 0.178
2.79PheGly: 2.79 ± 0.287
0.755PheHis: 0.755 ± 0.141
2.334PheIle: 2.334 ± 0.262
2.597PheLys: 2.597 ± 0.311
3.106PheLeu: 3.106 ± 0.252
0.877PheMet: 0.877 ± 0.16
2.492PheAsn: 2.492 ± 0.203
1.597PhePro: 1.597 ± 0.197
1.544PheGln: 1.544 ± 0.168
1.579PheArg: 1.579 ± 0.223
3.211PheSer: 3.211 ± 0.271
3.738PheThr: 3.738 ± 0.349
3.176PheVal: 3.176 ± 0.245
0.404PheTrp: 0.404 ± 0.084
1.878PheTyr: 1.878 ± 0.155
0.0PheXaa: 0.0 ± 0.0
Gly
5.72GlyAla: 5.72 ± 0.519
0.579GlyCys: 0.579 ± 0.092
5.387GlyAsp: 5.387 ± 0.445
3.545GlyGlu: 3.545 ± 0.275
2.615GlyPhe: 2.615 ± 0.219
8.072GlyGly: 8.072 ± 0.887
0.895GlyHis: 0.895 ± 0.136
4.685GlyIle: 4.685 ± 0.266
3.966GlyLys: 3.966 ± 0.393
4.843GlyLeu: 4.843 ± 0.296
1.492GlyMet: 1.492 ± 0.192
5.071GlyAsn: 5.071 ± 0.486
1.334GlyPro: 1.334 ± 0.147
2.913GlyGln: 2.913 ± 0.271
2.457GlyArg: 2.457 ± 0.237
6.949GlySer: 6.949 ± 0.665
7.475GlyThr: 7.475 ± 0.871
4.825GlyVal: 4.825 ± 0.388
0.719GlyTrp: 0.719 ± 0.112
3.264GlyTyr: 3.264 ± 0.282
0.0GlyXaa: 0.0 ± 0.0
His
0.79HisAla: 0.79 ± 0.124
0.211HisCys: 0.211 ± 0.053
0.807HisAsp: 0.807 ± 0.132
1.263HisGlu: 1.263 ± 0.179
0.895HisPhe: 0.895 ± 0.132
1.263HisGly: 1.263 ± 0.17
0.439HisHis: 0.439 ± 0.093
1.193HisIle: 1.193 ± 0.151
0.948HisLys: 0.948 ± 0.154
1.105HisLeu: 1.105 ± 0.177
0.439HisMet: 0.439 ± 0.11
0.737HisAsn: 0.737 ± 0.116
1.0HisPro: 1.0 ± 0.149
0.456HisGln: 0.456 ± 0.097
0.667HisArg: 0.667 ± 0.15
1.018HisSer: 1.018 ± 0.15
1.492HisThr: 1.492 ± 0.2
0.93HisVal: 0.93 ± 0.154
0.298HisTrp: 0.298 ± 0.086
0.877HisTyr: 0.877 ± 0.134
0.0HisXaa: 0.0 ± 0.0
Ile
4.036IleAla: 4.036 ± 0.283
0.544IleCys: 0.544 ± 0.109
4.931IleAsp: 4.931 ± 0.289
4.053IleGlu: 4.053 ± 0.258
2.281IlePhe: 2.281 ± 0.217
4.334IleGly: 4.334 ± 0.271
1.07IleHis: 1.07 ± 0.173
3.65IleIle: 3.65 ± 0.269
4.141IleLys: 4.141 ± 0.339
3.79IleLeu: 3.79 ± 0.26
1.035IleMet: 1.035 ± 0.166
3.615IleAsn: 3.615 ± 0.249
2.93IlePro: 2.93 ± 0.269
2.211IleGln: 2.211 ± 0.193
2.316IleArg: 2.316 ± 0.184
4.896IleSer: 4.896 ± 0.333
6.124IleThr: 6.124 ± 0.553
4.457IleVal: 4.457 ± 0.267
0.474IleTrp: 0.474 ± 0.089
2.351IleTyr: 2.351 ± 0.231
0.0IleXaa: 0.0 ± 0.0
Lys
3.58LysAla: 3.58 ± 0.371
0.579LysCys: 0.579 ± 0.095
3.615LysAsp: 3.615 ± 0.367
4.089LysGlu: 4.089 ± 0.462
2.702LysPhe: 2.702 ± 0.211
3.878LysGly: 3.878 ± 0.362
1.298LysHis: 1.298 ± 0.201
4.001LysIle: 4.001 ± 0.3
5.422LysLys: 5.422 ± 0.633
4.422LysLeu: 4.422 ± 0.384
1.737LysMet: 1.737 ± 0.237
3.106LysAsn: 3.106 ± 0.316
2.035LysPro: 2.035 ± 0.252
1.948LysGln: 1.948 ± 0.197
2.579LysArg: 2.579 ± 0.319
3.72LysSer: 3.72 ± 0.344
3.545LysThr: 3.545 ± 0.285
4.211LysVal: 4.211 ± 0.362
0.86LysTrp: 0.86 ± 0.145
2.808LysTyr: 2.808 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
4.527LeuAla: 4.527 ± 0.351
0.755LeuCys: 0.755 ± 0.139
5.72LeuAsp: 5.72 ± 0.346
4.597LeuGlu: 4.597 ± 0.424
2.422LeuPhe: 2.422 ± 0.218
4.176LeuGly: 4.176 ± 0.343
1.123LeuHis: 1.123 ± 0.157
4.246LeuIle: 4.246 ± 0.302
4.825LeuLys: 4.825 ± 0.393
4.545LeuLeu: 4.545 ± 0.371
1.351LeuMet: 1.351 ± 0.195
4.457LeuAsn: 4.457 ± 0.249
2.632LeuPro: 2.632 ± 0.201
2.527LeuGln: 2.527 ± 0.213
3.299LeuArg: 3.299 ± 0.27
5.212LeuSer: 5.212 ± 0.236
5.124LeuThr: 5.124 ± 0.342
4.404LeuVal: 4.404 ± 0.301
0.614LeuTrp: 0.614 ± 0.115
3.141LeuTyr: 3.141 ± 0.273
0.0LeuXaa: 0.0 ± 0.0
Met
1.509MetAla: 1.509 ± 0.202
0.298MetCys: 0.298 ± 0.081
1.492MetAsp: 1.492 ± 0.221
1.193MetGlu: 1.193 ± 0.18
1.018MetPhe: 1.018 ± 0.148
1.018MetGly: 1.018 ± 0.162
0.439MetHis: 0.439 ± 0.103
1.176MetIle: 1.176 ± 0.166
1.737MetLys: 1.737 ± 0.213
1.228MetLeu: 1.228 ± 0.187
0.772MetMet: 0.772 ± 0.162
1.035MetAsn: 1.035 ± 0.179
0.737MetPro: 0.737 ± 0.148
1.123MetGln: 1.123 ± 0.149
0.86MetArg: 0.86 ± 0.121
1.79MetSer: 1.79 ± 0.221
1.439MetThr: 1.439 ± 0.21
1.176MetVal: 1.176 ± 0.14
0.228MetTrp: 0.228 ± 0.061
0.684MetTyr: 0.684 ± 0.109
0.0MetXaa: 0.0 ± 0.0
Asn
4.404AsnAla: 4.404 ± 0.452
0.474AsnCys: 0.474 ± 0.111
3.053AsnAsp: 3.053 ± 0.205
2.913AsnGlu: 2.913 ± 0.264
2.808AsnPhe: 2.808 ± 0.221
5.527AsnGly: 5.527 ± 0.528
1.0AsnHis: 1.0 ± 0.148
4.211AsnIle: 4.211 ± 0.27
3.053AsnLys: 3.053 ± 0.29
4.299AsnLeu: 4.299 ± 0.284
1.211AsnMet: 1.211 ± 0.166
3.738AsnAsn: 3.738 ± 0.4
2.72AsnPro: 2.72 ± 0.22
2.088AsnGln: 2.088 ± 0.186
2.0AsnArg: 2.0 ± 0.167
3.931AsnSer: 3.931 ± 0.426
4.299AsnThr: 4.299 ± 0.44
3.86AsnVal: 3.86 ± 0.244
0.772AsnTrp: 0.772 ± 0.115
2.422AsnTyr: 2.422 ± 0.232
0.0AsnXaa: 0.0 ± 0.0
Pro
2.316ProAla: 2.316 ± 0.223
0.281ProCys: 0.281 ± 0.074
2.685ProAsp: 2.685 ± 0.264
2.509ProGlu: 2.509 ± 0.244
1.79ProPhe: 1.79 ± 0.163
2.229ProGly: 2.229 ± 0.188
0.649ProHis: 0.649 ± 0.128
1.878ProIle: 1.878 ± 0.183
2.035ProLys: 2.035 ± 0.255
2.474ProLeu: 2.474 ± 0.19
0.649ProMet: 0.649 ± 0.136
2.158ProAsn: 2.158 ± 0.21
1.597ProPro: 1.597 ± 0.215
1.228ProGln: 1.228 ± 0.16
1.246ProArg: 1.246 ± 0.184
2.878ProSer: 2.878 ± 0.31
3.58ProThr: 3.58 ± 0.291
3.404ProVal: 3.404 ± 0.318
0.597ProTrp: 0.597 ± 0.114
1.737ProTyr: 1.737 ± 0.207
0.0ProXaa: 0.0 ± 0.0
Gln
2.246GlnAla: 2.246 ± 0.229
0.298GlnCys: 0.298 ± 0.093
2.053GlnAsp: 2.053 ± 0.181
2.053GlnGlu: 2.053 ± 0.234
1.93GlnPhe: 1.93 ± 0.204
2.667GlnGly: 2.667 ± 0.316
0.684GlnHis: 0.684 ± 0.109
2.615GlnIle: 2.615 ± 0.206
2.281GlnLys: 2.281 ± 0.205
3.404GlnLeu: 3.404 ± 0.249
0.755GlnMet: 0.755 ± 0.145
2.141GlnAsn: 2.141 ± 0.233
1.439GlnPro: 1.439 ± 0.245
1.579GlnGln: 1.579 ± 0.284
1.649GlnArg: 1.649 ± 0.155
2.422GlnSer: 2.422 ± 0.187
2.351GlnThr: 2.351 ± 0.232
2.299GlnVal: 2.299 ± 0.222
0.597GlnTrp: 0.597 ± 0.187
1.702GlnTyr: 1.702 ± 0.201
0.0GlnXaa: 0.0 ± 0.0
Arg
2.544ArgAla: 2.544 ± 0.222
0.456ArgCys: 0.456 ± 0.095
2.351ArgAsp: 2.351 ± 0.217
2.334ArgGlu: 2.334 ± 0.246
1.948ArgPhe: 1.948 ± 0.174
2.386ArgGly: 2.386 ± 0.187
0.632ArgHis: 0.632 ± 0.117
2.685ArgIle: 2.685 ± 0.23
2.527ArgLys: 2.527 ± 0.362
2.737ArgLeu: 2.737 ± 0.263
0.948ArgMet: 0.948 ± 0.134
2.0ArgAsn: 2.0 ± 0.214
1.351ArgPro: 1.351 ± 0.164
1.404ArgGln: 1.404 ± 0.193
1.86ArgArg: 1.86 ± 0.217
2.527ArgSer: 2.527 ± 0.261
2.281ArgThr: 2.281 ± 0.258
2.597ArgVal: 2.597 ± 0.203
0.368ArgTrp: 0.368 ± 0.076
1.825ArgTyr: 1.825 ± 0.213
0.0ArgXaa: 0.0 ± 0.0
Ser
5.387SerAla: 5.387 ± 0.317
0.667SerCys: 0.667 ± 0.11
4.211SerAsp: 4.211 ± 0.255
3.141SerGlu: 3.141 ± 0.237
3.264SerPhe: 3.264 ± 0.251
8.563SerGly: 8.563 ± 0.955
1.018SerHis: 1.018 ± 0.13
4.825SerIle: 4.825 ± 0.305
3.562SerLys: 3.562 ± 0.344
4.861SerLeu: 4.861 ± 0.293
1.228SerMet: 1.228 ± 0.17
4.352SerAsn: 4.352 ± 0.427
2.843SerPro: 2.843 ± 0.243
2.562SerGln: 2.562 ± 0.192
2.176SerArg: 2.176 ± 0.201
5.703SerSer: 5.703 ± 0.47
6.036SerThr: 6.036 ± 0.63
4.668SerVal: 4.668 ± 0.34
0.649SerTrp: 0.649 ± 0.128
2.913SerTyr: 2.913 ± 0.227
0.0SerXaa: 0.0 ± 0.0
Thr
5.966ThrAla: 5.966 ± 0.576
0.526ThrCys: 0.526 ± 0.108
5.054ThrAsp: 5.054 ± 0.308
3.65ThrGlu: 3.65 ± 0.215
4.211ThrPhe: 4.211 ± 0.323
6.914ThrGly: 6.914 ± 0.78
1.07ThrHis: 1.07 ± 0.181
5.685ThrIle: 5.685 ± 0.554
3.58ThrLys: 3.58 ± 0.334
5.668ThrLeu: 5.668 ± 0.391
1.246ThrMet: 1.246 ± 0.179
4.931ThrAsn: 4.931 ± 0.482
3.352ThrPro: 3.352 ± 0.296
2.948ThrGln: 2.948 ± 0.255
2.772ThrArg: 2.772 ± 0.216
6.317ThrSer: 6.317 ± 0.609
7.458ThrThr: 7.458 ± 0.819
6.37ThrVal: 6.37 ± 0.713
0.93ThrTrp: 0.93 ± 0.132
3.369ThrTyr: 3.369 ± 0.484
0.0ThrXaa: 0.0 ± 0.0
Val
4.843ValAla: 4.843 ± 0.371
0.509ValCys: 0.509 ± 0.12
4.72ValAsp: 4.72 ± 0.316
3.86ValGlu: 3.86 ± 0.33
2.685ValPhe: 2.685 ± 0.219
5.457ValGly: 5.457 ± 0.36
0.948ValHis: 0.948 ± 0.137
3.825ValIle: 3.825 ± 0.245
4.194ValLys: 4.194 ± 0.369
4.317ValLeu: 4.317 ± 0.247
1.211ValMet: 1.211 ± 0.162
3.843ValAsn: 3.843 ± 0.286
2.544ValPro: 2.544 ± 0.198
2.737ValGln: 2.737 ± 0.205
2.246ValArg: 2.246 ± 0.173
5.264ValSer: 5.264 ± 0.42
6.738ValThr: 6.738 ± 0.726
4.124ValVal: 4.124 ± 0.292
0.404ValTrp: 0.404 ± 0.091
2.562ValTyr: 2.562 ± 0.224
0.0ValXaa: 0.0 ± 0.0
Trp
0.684TrpAla: 0.684 ± 0.105
0.158TrpCys: 0.158 ± 0.048
0.807TrpAsp: 0.807 ± 0.145
0.825TrpGlu: 0.825 ± 0.149
0.456TrpPhe: 0.456 ± 0.129
0.526TrpGly: 0.526 ± 0.084
0.351TrpHis: 0.351 ± 0.085
0.632TrpIle: 0.632 ± 0.105
0.737TrpLys: 0.737 ± 0.13
0.737TrpLeu: 0.737 ± 0.133
0.368TrpMet: 0.368 ± 0.093
0.614TrpAsn: 0.614 ± 0.093
0.175TrpPro: 0.175 ± 0.058
0.509TrpGln: 0.509 ± 0.139
0.562TrpArg: 0.562 ± 0.088
0.632TrpSer: 0.632 ± 0.115
0.948TrpThr: 0.948 ± 0.126
0.702TrpVal: 0.702 ± 0.113
0.193TrpTrp: 0.193 ± 0.058
0.509TrpTyr: 0.509 ± 0.097
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.457TyrAla: 2.457 ± 0.236
0.421TyrCys: 0.421 ± 0.097
3.439TyrAsp: 3.439 ± 0.266
2.404TyrGlu: 2.404 ± 0.245
1.702TyrPhe: 1.702 ± 0.204
2.492TyrGly: 2.492 ± 0.252
0.877TyrHis: 0.877 ± 0.14
2.878TyrIle: 2.878 ± 0.247
2.544TyrLys: 2.544 ± 0.237
3.316TyrLeu: 3.316 ± 0.253
0.842TyrMet: 0.842 ± 0.144
2.702TyrAsn: 2.702 ± 0.218
1.79TyrPro: 1.79 ± 0.238
1.86TyrGln: 1.86 ± 0.297
1.965TyrArg: 1.965 ± 0.194
2.772TyrSer: 2.772 ± 0.226
3.211TyrThr: 3.211 ± 0.297
2.632TyrVal: 2.632 ± 0.24
0.439TyrTrp: 0.439 ± 0.082
2.176TyrTyr: 2.176 ± 0.212
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 215 proteins (56990 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski