Amino acid dipeptide frequency for Gordonia phage GMA2

In addition to single amino acid frequencies, one can also calculate amino acid dipeptide frequencies, that is, how often each ordered pair of adjacent amino acids occurs.
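As a minimal sketch of how such frequencies could be computed (not necessarily how Proteome-pI does it), the Python snippet below counts overlapping adjacent residue pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice of denominator are assumptions made for illustration.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumptions: pairs are counted with overlap, never span two proteins,
    and are normalized by the total number of counted pairs.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every adjacent residue pair in one protein
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input (one-letter codes), not GMA2 data
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

Here the toy input contains five pairs (MA, AA, AL, AA, AK), so `freqs["AA"]` is two fifths of the total expressed in per mille.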

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
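The comparison against the uniform expectation can be sketched as follows. This assumes the red/blue marking is a plain comparison with the 1/400 baseline; the page does not state any further threshold, and the function names are hypothetical.

```python
def uniform_expected_per_mille(n_amino_acids=20):
    """Expected dipeptide frequency (per mille) if all pairs were equally likely."""
    return 1000.0 / (n_amino_acids ** 2)

def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed simple comparison)."""
    if observed_per_mille > expected_per_mille:
        return "overrepresented"
    if observed_per_mille < expected_per_mille:
        return "underrepresented"
    return "as expected"

expected = uniform_expected_per_mille()  # 1/400 = 0.25% = 2.5 per mille

# Two values taken from the table: AlaAla = 9.866, CysCys = 0.129 (per mille)
labels = {name: classify(value, expected)
          for name, value in {"AlaAla": 9.866, "CysCys": 0.129}.items()}
```

Passing `n_amino_acids=21` gives the baseline for the 441-pair case that includes Xaa.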

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.866 ± 0.792
AlaCys: 0.938 ± 0.237
AlaAsp: 5.758 ± 0.518
AlaGlu: 5.823 ± 0.49
AlaPhe: 3.105 ± 0.299
AlaGly: 6.405 ± 0.549
AlaHis: 1.585 ± 0.33
AlaIle: 4.691 ± 0.416
AlaLys: 5.305 ± 0.527
AlaLeu: 8.314 ± 0.582
AlaMet: 2.329 ± 0.252
AlaAsn: 3.429 ± 0.417
AlaPro: 4.594 ± 0.471
AlaGln: 3.267 ± 0.377
AlaArg: 5.532 ± 0.492
AlaSer: 7.052 ± 0.495
AlaThr: 5.92 ± 0.469
AlaVal: 6.729 ± 0.551
AlaTrp: 1.941 ± 0.215
AlaTyr: 2.685 ± 0.267
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.97 ± 0.202
CysCys: 0.129 ± 0.065
CysAsp: 0.938 ± 0.213
CysGlu: 0.615 ± 0.164
CysPhe: 0.323 ± 0.107
CysGly: 0.938 ± 0.209
CysHis: 0.194 ± 0.083
CysIle: 0.129 ± 0.06
CysLys: 0.356 ± 0.104
CysLeu: 0.485 ± 0.121
CysMet: 0.226 ± 0.078
CysAsn: 0.162 ± 0.072
CysPro: 0.453 ± 0.145
CysGln: 0.162 ± 0.067
CysArg: 0.518 ± 0.143
CysSer: 0.647 ± 0.179
CysThr: 0.615 ± 0.173
CysVal: 0.615 ± 0.156
CysTrp: 0.129 ± 0.087
CysTyr: 0.453 ± 0.128
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.214 ± 0.595
AspCys: 0.55 ± 0.137
AspAsp: 5.079 ± 0.446
AspGlu: 4.626 ± 0.429
AspPhe: 2.135 ± 0.268
AspGly: 4.561 ± 0.44
AspHis: 1.65 ± 0.23
AspIle: 3.364 ± 0.312
AspLys: 3.526 ± 0.462
AspLeu: 5.596 ± 0.452
AspMet: 1.65 ± 0.241
AspAsn: 2.038 ± 0.265
AspPro: 3.785 ± 0.395
AspGln: 2.135 ± 0.29
AspArg: 3.138 ± 0.358
AspSer: 5.046 ± 0.419
AspThr: 3.882 ± 0.347
AspVal: 4.626 ± 0.448
AspTrp: 1.423 ± 0.188
AspTyr: 2.556 ± 0.35
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.338 ± 0.415
GluCys: 0.518 ± 0.151
GluAsp: 4.108 ± 0.418
GluGlu: 4.496 ± 0.414
GluPhe: 2.232 ± 0.226
GluGly: 3.364 ± 0.307
GluHis: 1.391 ± 0.258
GluIle: 3.817 ± 0.458
GluLys: 4.205 ± 0.343
GluLeu: 5.305 ± 0.475
GluMet: 1.747 ± 0.275
GluAsn: 2.75 ± 0.284
GluPro: 2.75 ± 0.269
GluGln: 2.297 ± 0.297
GluArg: 3.623 ± 0.361
GluSer: 5.208 ± 0.448
GluThr: 4.205 ± 0.423
GluVal: 3.914 ± 0.361
GluTrp: 1.229 ± 0.179
GluTyr: 1.844 ± 0.231
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.459 ± 0.318
PheCys: 0.291 ± 0.093
PheAsp: 2.329 ± 0.322
PheGlu: 2.459 ± 0.293
PhePhe: 0.938 ± 0.183
PheGly: 2.75 ± 0.331
PheHis: 0.485 ± 0.119
PheIle: 1.488 ± 0.26
PheLys: 1.456 ± 0.252
PheLeu: 2.167 ± 0.312
PheMet: 0.873 ± 0.179
PheAsn: 1.294 ± 0.232
PhePro: 1.682 ± 0.229
PheGln: 0.938 ± 0.14
PheArg: 1.553 ± 0.235
PheSer: 2.459 ± 0.291
PheThr: 2.038 ± 0.268
PheVal: 2.232 ± 0.213
PheTrp: 0.647 ± 0.156
PheTyr: 0.55 ± 0.153
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.082 ± 0.483
GlyCys: 0.776 ± 0.155
GlyAsp: 4.885 ± 0.503
GlyGlu: 4.432 ± 0.375
GlyPhe: 2.847 ± 0.348
GlyGly: 4.755 ± 0.522
GlyHis: 1.294 ± 0.244
GlyIle: 4.173 ± 0.512
GlyLys: 3.332 ± 0.311
GlyLeu: 6.146 ± 0.761
GlyMet: 1.488 ± 0.292
GlyAsn: 1.747 ± 0.276
GlyPro: 3.267 ± 0.415
GlyGln: 2.167 ± 0.251
GlyArg: 3.397 ± 0.354
GlySer: 5.92 ± 0.538
GlyThr: 4.302 ± 0.457
GlyVal: 4.982 ± 0.544
GlyTrp: 1.132 ± 0.192
GlyTyr: 2.232 ± 0.286
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.714 ± 0.293
HisCys: 0.129 ± 0.068
HisAsp: 1.553 ± 0.212
HisGlu: 1.1 ± 0.226
HisPhe: 0.55 ± 0.16
HisGly: 1.197 ± 0.194
HisHis: 0.356 ± 0.108
HisIle: 0.679 ± 0.163
HisLys: 1.003 ± 0.213
HisLeu: 1.617 ± 0.274
HisMet: 0.679 ± 0.154
HisAsn: 0.453 ± 0.132
HisPro: 1.262 ± 0.213
HisGln: 0.518 ± 0.126
HisArg: 1.65 ± 0.252
HisSer: 1.52 ± 0.302
HisThr: 1.229 ± 0.191
HisVal: 1.456 ± 0.242
HisTrp: 0.356 ± 0.1
HisTyr: 0.744 ± 0.188
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.273 ± 0.347
IleCys: 0.809 ± 0.158
IleAsp: 3.332 ± 0.383
IleGlu: 3.655 ± 0.427
IlePhe: 1.456 ± 0.195
IleGly: 3.3 ± 0.467
IleHis: 0.97 ± 0.21
IleIle: 2.264 ± 0.271
IleLys: 2.911 ± 0.538
IleLeu: 3.558 ± 0.366
IleMet: 0.744 ± 0.186
IleAsn: 1.617 ± 0.212
IlePro: 2.232 ± 0.273
IleGln: 1.844 ± 0.288
IleArg: 3.461 ± 0.329
IleSer: 4.594 ± 0.389
IleThr: 3.461 ± 0.342
IleVal: 3.623 ± 0.411
IleTrp: 0.55 ± 0.14
IleTyr: 1.003 ± 0.189
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.755 ± 0.541
LysCys: 0.485 ± 0.162
LysAsp: 3.17 ± 0.414
LysGlu: 3.3 ± 0.363
LysPhe: 1.456 ± 0.311
LysGly: 2.814 ± 0.493
LysHis: 1.294 ± 0.219
LysIle: 2.976 ± 0.388
LysLys: 4.238 ± 0.646
LysLeu: 4.011 ± 0.319
LysMet: 1.423 ± 0.236
LysAsn: 2.006 ± 0.304
LysPro: 3.364 ± 0.314
LysGln: 1.876 ± 0.297
LysArg: 3.235 ± 0.352
LysSer: 4.011 ± 0.456
LysThr: 3.979 ± 0.341
LysVal: 2.976 ± 0.257
LysTrp: 0.712 ± 0.128
LysTyr: 1.682 ± 0.269
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.667 ± 0.574
LeuCys: 0.712 ± 0.155
LeuAsp: 6.179 ± 0.557
LeuGlu: 5.37 ± 0.438
LeuPhe: 2.135 ± 0.317
LeuGly: 6.243 ± 0.504
LeuHis: 1.359 ± 0.252
LeuIle: 3.591 ± 0.315
LeuLys: 4.496 ± 0.476
LeuLeu: 6.049 ± 0.574
LeuMet: 1.973 ± 0.26
LeuAsn: 3.203 ± 0.33
LeuPro: 4.076 ± 0.332
LeuGln: 2.523 ± 0.233
LeuArg: 4.917 ± 0.475
LeuSer: 6.696 ± 0.401
LeuThr: 5.014 ± 0.313
LeuVal: 5.305 ± 0.423
LeuTrp: 1.617 ± 0.243
LeuTyr: 1.941 ± 0.334
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.909 ± 0.23
MetCys: 0.162 ± 0.085
MetAsp: 1.391 ± 0.193
MetGlu: 0.744 ± 0.18
MetPhe: 0.615 ± 0.141
MetGly: 0.873 ± 0.155
MetHis: 0.421 ± 0.11
MetIle: 1.391 ± 0.206
MetLys: 0.97 ± 0.2
MetLeu: 2.297 ± 0.302
MetMet: 0.291 ± 0.093
MetAsn: 1.197 ± 0.191
MetPro: 1.229 ± 0.198
MetGln: 1.035 ± 0.172
MetArg: 1.747 ± 0.209
MetSer: 2.361 ± 0.282
MetThr: 3.105 ± 0.362
MetVal: 1.294 ± 0.222
MetTrp: 0.453 ± 0.149
MetTyr: 0.647 ± 0.128
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.623 ± 0.369
AsnCys: 0.291 ± 0.12
AsnAsp: 2.782 ± 0.352
AsnGlu: 1.747 ± 0.246
AsnPhe: 1.359 ± 0.209
AsnGly: 2.426 ± 0.364
AsnHis: 0.615 ± 0.154
AsnIle: 1.876 ± 0.311
AsnLys: 1.682 ± 0.256
AsnLeu: 2.847 ± 0.331
AsnMet: 0.744 ± 0.135
AsnAsn: 1.391 ± 0.211
AsnPro: 2.491 ± 0.272
AsnGln: 1.294 ± 0.21
AsnArg: 2.426 ± 0.408
AsnSer: 2.394 ± 0.241
AsnThr: 2.135 ± 0.232
AsnVal: 2.426 ± 0.273
AsnTrp: 0.647 ± 0.162
AsnTyr: 0.906 ± 0.169
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.917 ± 0.525
ProCys: 0.291 ± 0.09
ProAsp: 3.332 ± 0.371
ProGlu: 4.496 ± 0.382
ProPhe: 1.456 ± 0.249
ProGly: 4.141 ± 0.424
ProHis: 0.97 ± 0.182
ProIle: 2.361 ± 0.283
ProLys: 2.879 ± 0.343
ProLeu: 3.526 ± 0.314
ProMet: 1.229 ± 0.187
ProAsn: 2.297 ± 0.319
ProPro: 2.329 ± 0.358
ProGln: 1.003 ± 0.172
ProArg: 2.135 ± 0.266
ProSer: 3.526 ± 0.4
ProThr: 2.976 ± 0.354
ProVal: 3.267 ± 0.308
ProTrp: 0.744 ± 0.187
ProTyr: 1.035 ± 0.189
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.138 ± 0.302
GlnCys: 0.194 ± 0.077
GlnAsp: 1.714 ± 0.219
GlnGlu: 1.423 ± 0.216
GlnPhe: 1.132 ± 0.166
GlnGly: 1.844 ± 0.308
GlnHis: 0.582 ± 0.159
GlnIle: 1.812 ± 0.203
GlnLys: 1.617 ± 0.241
GlnLeu: 2.976 ± 0.319
GlnMet: 1.003 ± 0.18
GlnAsn: 1.391 ± 0.249
GlnPro: 1.229 ± 0.213
GlnGln: 1.585 ± 0.315
GlnArg: 2.135 ± 0.227
GlnSer: 2.232 ± 0.264
GlnThr: 1.973 ± 0.257
GlnVal: 2.297 ± 0.294
GlnTrp: 0.647 ± 0.133
GlnTyr: 0.906 ± 0.19
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.985 ± 0.539
ArgCys: 0.647 ± 0.154
ArgAsp: 4.044 ± 0.426
ArgGlu: 4.238 ± 0.414
ArgPhe: 1.585 ± 0.2
ArgGly: 4.076 ± 0.346
ArgHis: 1.456 ± 0.191
ArgIle: 2.847 ± 0.319
ArgLys: 3.364 ± 0.384
ArgLeu: 5.111 ± 0.434
ArgMet: 1.585 ± 0.229
ArgAsn: 2.135 ± 0.298
ArgPro: 2.556 ± 0.318
ArgGln: 2.006 ± 0.262
ArgArg: 4.141 ± 0.52
ArgSer: 4.658 ± 0.318
ArgThr: 3.041 ± 0.404
ArgVal: 3.526 ± 0.336
ArgTrp: 1.359 ± 0.244
ArgTyr: 1.52 ± 0.233
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.408 ± 0.536
SerCys: 0.744 ± 0.145
SerAsp: 5.661 ± 0.456
SerGlu: 4.399 ± 0.443
SerPhe: 2.07 ± 0.27
SerGly: 6.696 ± 0.535
SerHis: 1.617 ± 0.28
SerIle: 3.267 ± 0.314
SerLys: 4.141 ± 0.435
SerLeu: 6.47 ± 0.487
SerMet: 1.909 ± 0.295
SerAsn: 3.235 ± 0.351
SerPro: 3.429 ± 0.296
SerGln: 1.812 ± 0.254
SerArg: 4.658 ± 0.43
SerSer: 5.661 ± 0.483
SerThr: 4.852 ± 0.422
SerVal: 5.564 ± 0.418
SerTrp: 1.812 ± 0.294
SerTyr: 1.747 ± 0.203
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.278 ± 0.642
ThrCys: 0.388 ± 0.1
ThrAsp: 4.496 ± 0.479
ThrGlu: 3.72 ± 0.407
ThrPhe: 2.07 ± 0.311
ThrGly: 5.014 ± 0.55
ThrHis: 1.035 ± 0.201
ThrIle: 2.976 ± 0.335
ThrLys: 3.138 ± 0.274
ThrLeu: 5.499 ± 0.502
ThrMet: 1.488 ± 0.21
ThrAsn: 1.65 ± 0.264
ThrPro: 3.558 ± 0.364
ThrGln: 1.65 ± 0.226
ThrArg: 4.173 ± 0.419
ThrSer: 4.432 ± 0.418
ThrThr: 4.044 ± 0.398
ThrVal: 5.208 ± 0.494
ThrTrp: 1.294 ± 0.192
ThrTyr: 1.65 ± 0.241
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.661 ± 0.542
ValCys: 0.582 ± 0.153
ValAsp: 4.496 ± 0.411
ValGlu: 4.917 ± 0.455
ValPhe: 2.07 ± 0.235
ValGly: 4.496 ± 0.396
ValHis: 1.747 ± 0.214
ValIle: 4.367 ± 0.341
ValLys: 3.3 ± 0.353
ValLeu: 5.402 ± 0.475
ValMet: 1.844 ± 0.273
ValAsn: 2.394 ± 0.287
ValPro: 2.717 ± 0.3
ValGln: 2.394 ± 0.244
ValArg: 3.752 ± 0.345
ValSer: 4.917 ± 0.542
ValThr: 4.949 ± 0.41
ValVal: 4.723 ± 0.402
ValTrp: 1.456 ± 0.286
ValTyr: 1.844 ± 0.266
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.326 ± 0.217
TrpCys: 0.194 ± 0.072
TrpAsp: 1.488 ± 0.289
TrpGlu: 0.97 ± 0.229
TrpPhe: 0.582 ± 0.12
TrpGly: 1.165 ± 0.213
TrpHis: 0.421 ± 0.101
TrpIle: 1.488 ± 0.216
TrpLys: 0.906 ± 0.184
TrpLeu: 1.391 ± 0.213
TrpMet: 0.421 ± 0.128
TrpAsn: 0.712 ± 0.183
TrpPro: 0.776 ± 0.18
TrpGln: 0.518 ± 0.112
TrpArg: 1.359 ± 0.173
TrpSer: 1.52 ± 0.183
TrpThr: 1.326 ± 0.192
TrpVal: 1.229 ± 0.194
TrpTrp: 0.291 ± 0.1
TrpTyr: 0.712 ± 0.166
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.685 ± 0.276
TyrCys: 0.226 ± 0.087
TyrAsp: 1.876 ± 0.227
TyrGlu: 1.812 ± 0.291
TyrPhe: 0.97 ± 0.184
TyrGly: 2.297 ± 0.298
TyrHis: 0.323 ± 0.125
TyrIle: 1.1 ± 0.222
TyrLys: 0.938 ± 0.19
TyrLeu: 2.297 ± 0.252
TyrMet: 0.453 ± 0.1
TyrAsn: 1.003 ± 0.133
TyrPro: 1.132 ± 0.175
TyrGln: 0.809 ± 0.149
TyrArg: 2.329 ± 0.293
TyrSer: 2.297 ± 0.255
TyrThr: 1.714 ± 0.233
TyrVal: 2.038 ± 0.263
TyrTrp: 0.356 ± 0.113
TyrTyr: 0.809 ± 0.171
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 126 proteins (30914 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
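A protein-level bootstrap of this kind can be sketched roughly as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The source states only that 100 bootstrap rounds were run at the protein level; using the sample standard deviation as the reported error, the function names, and the fixed seed are assumptions.

```python
import random
from collections import Counter
from statistics import stdev

def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample proteins (not residues), keeping the proteome size constant
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not GMA2 data
proteins = ["MAAL", "AAK", "MKAA"]
error = bootstrap_error(proteins, "AA")
```

Resampling at the protein level keeps each protein's internal composition intact, so the estimate reflects variation between proteins rather than between individual residues.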

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski